February 19, 2019
On 19.02.2019 19:19, Kagamin wrote:
> On Tuesday, 19 February 2019 at 15:30:22 UTC, Atila Neves wrote:
>> I keep hearing how const is nigh unusable in D, and except for ranges I litter my code with const everywhere, pretty much just as often as I did in C++.
> 
> I once spent a good amount of effort annotating my code with pure and inout, only to run into a compiler bug; then I realized that the annotations weren't really needed, because the collection is inherently mutable anyway (an appender).

I use const all over the place too. And I've made PRs to other libraries to add the const qualifier.
Yes, it sometimes forces me to make a copy of data in order to mutate it - but I'm pretty sure that is the purpose of the qualifier. It helps me catch/prevent bugs. So I don't agree with people who do not use const at all. The const qualifier in D is definitely usable and useful. I can say the same about properties - for example, I use them in metaprogramming to detect what to serialize/process - I skip methods but serialize properties - and for me this is a nice language feature.
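Roughly, the detection part looks like this (a minimal sketch, not my actual code - serialize and the output step are stand-ins):

   import std.traits : isSomeFunction, functionAttributes, FunctionAttribute;

   void serialize(T)(const T obj)
   {
       foreach (name; __traits(allMembers, T))
       {
           static if (isSomeFunction!(__traits(getMember, T, name)))
           {
               // keep properties, skip plain methods
               static if (functionAttributes!(__traits(getMember, T, name))
                          & FunctionAttribute.property)
               {
                   // ... emit name and the property's value here ...
               }
           }
           // plain data fields would be handled here as well
       }
   }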
February 20, 2019
On Tuesday, 19 February 2019 at 16:38:17 UTC, drug wrote:
> I can say the same about properties - for example, I use them in metaprogramming to detect what to serialize/process - I skip methods but serialize properties - and for me this is a nice language feature.

Serialization of arbitrary stuff is a bad practice anyway; it has been the cause of vulnerabilities in serialization libraries. DTOs are the way to go.
February 20, 2019
On 20.02.2019 11:05, Kagamin wrote:
> On Tuesday, 19 February 2019 at 16:38:17 UTC, drug wrote:
>> I can say the same about properties - for example, I use them in metaprogramming to detect what to serialize/process - I skip methods but serialize properties - and for me this is a nice language feature.
> 
> Serialization of arbitrary stuff is a bad practice anyway; it has been the cause of vulnerabilities in serialization libraries. DTOs are the way to go.
Serialization is just an example here. But using properties lets me avoid DTOs except in really complex cases, and lets me reduce maintenance cost. In my case (I'm developing a prototype and change its data structures very often) they work really well.
June 21, 2019
Am I mistaken in saying that we are conflating:
   "anything that is logically const should be declared const"
   // makes perfect sense
   // e.g. the lowest 2 levels, and some branches of the 3rd and 4th levels,
   // of members (and a subset of the overall methods) in a 5-deep type hierarchy are const
with:
   "most code/data should be declared const"
   // no! isn't efficient code all about mutation?
   // no grounds for, e.g.: "ideally, no more than 40% of code should be doing mutation"

> On Wednesday, 13 February 2019 at 16:40:18 UTC, H. S. Teoh wrote:
>> On Wed, Feb 13, 2019 at 11:32:46AM +0000, envoid via Digitalmars-d-learn wrote:
>> Unfortunately, that guarantee also excludes a lot of otherwise useful idioms, like objects that cache data -- a const object cannot cache data because that means it's being mutated, or lazily-initialized objects -- because once the ctor has run, the object can no longer be mutated. Most notably, D's powerful range idiom is pretty much unusable with const because iteration over a range requires mutating the range (though having const *elements* in a range is fine).  This doesn't seem as bad at first glance, but it wreaks havoc on generic code, another thing that D is purportedly good at. It's very hard (and often impossible) to write generic code that works with both const and mutable objects.
>>
>> So ironically, the iron-clad semantics of D's const system turns out to be also its own downfall.
>>
>>
>> T

The point about generic code (reiterated by many) is intriguing on its own; until now, I hadn't explicitly thought about const even for my C++ template library code (what little I have of it). Any pointers to other posts or articles elaborating on this a little bit?

I believe the other points probably matter when const interacts with every other feature (I would have to write some of my "real" code in D to see if I hit them on my own), but there doesn't seem to be anything unusable about them on their own.

The inability to have a const caching object seems correct. The way around would be to have a wrapper that caches (meh). If that is not possible, then maybe caching objects just aren't meant to be const by their nature? Isn't memoize a standard library feature? I should look at it, but I wouldn't expect it to be const.
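(Checking now: std.functional.memoize does exist, and as suspected it caches into mutable state rather than being const. A sketch along the lines of its documentation:)

   import std.functional : memoize;

   ulong fib(ulong n)
   {
       // recursing through memoize caches every intermediate result;
       // the cache itself is mutable state, so none of this is const
       return n < 2 ? n : memoize!fib(n - 2) + memoize!fib(n - 1);
   }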

On Monday, 18 February 2019 at 06:50:32 UTC, Marco de Wild wrote:
>
> I agree that const by nature unfortunately kills lazy initialization.

Lazy initialization - is this the same as postblit? At the cost of copying (justifiable? maybe), doesn't D have a way to copy-construct a const/immutable struct object from a mutable one? If there is a way (or will be - there is a recent posting and a DConf talk about copy constructors), does the copying negate the benefits of lazy initialization?

> However, I don't really understand why const is a problem with ranges. Const elements are not a problem. Iterating over a range consumes it (if I understand correctly). It does not make sense to be able to consume a const object, so from my point of view it's perfectly logical to disallow iterating const ranges. If I'm missing something, please correct me.
> ...

+1.

Or I haven't understood why ranges would ever need to be const.
After all, in C++, what use is:
   std::vector<T>::const_iterator const iter = sequence.begin();
About the only kind of use would be:
   std::vector<T>::const_iterator iter = sequence.begin();
   std::vector<T>::const_iterator const iterEnd = sequence.end();
What are ranges if not an encapsulation of the above functionality?
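The same point in D terms (a small sketch - iota stands in for any range):

   import std.range : iota, isInputRange;

   void f()
   {
       const r = iota(0, 10);
       // r.popFront();  // error: popFront() mutates the range, so it is
       //                // not callable on a const object
       static assert(!isInputRange!(typeof(r)));  // a const range doesn't
                                                  // even qualify as a range
   }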

June 21, 2019
On Fri, Jun 21, 2019 at 06:07:59AM +0000, Yatheendra via Digitalmars-d-learn wrote:
> Am I mistaken in saying that we are conflating:
>    "anything that is logically const should be declared const"

No, not in D.  D does not have logical const; it has "physical" const, which is a strict subset of logical const, and therefore there are some uses of logical const for which D's const is unsuitable.


>    // makes perfect sense
>    // e.g. the lowest 2, and some branches of the 3rd and 4th, levels
>    // of members (and a subset of the overall methods) in a 5-deep type hierarchy are const
> with:
>    "most code/data should be declared const"
>    // no! isn't efficient code all about mutation?
>    // no grounds for, e.g.: "ideally, no more than 40% of code should be
> doing mutation"

Actually, optimizers work best when there is minimal mutation *in the original source*.  The emitted code, of course, is free to use mutation however it wants.  But the trouble with mutation at the source level is that it makes many code analyses very complex, which hinders the optimizer from doing what it might have been able to do in the absence of mutation (or a reduced usage of mutation).  Aliasing is one example that hampers optimizers from emitting optimal code. Aliasing plus mutation makes the analysis so complex that the optimizer has a hard time deciding whether a particular construct can be optimized away or not.

Having minimal mutation in the original source code allows the optimizer to make more assumptions, which in turn leads to better optimizations. It also makes the source code easier to understand.  Paradoxically, having less mutation in the source code means it's easier for the compiler to optimize it into mutation-heavy optimal code -- because it doesn't have to worry about arbitrary mutations in the source code, and therefore can be free(r) to, e.g., eliminate redundant copies, redundant movement of data, etc., which ultimately results in in-place modification of values, i.e., mutation-heavy emitted code.

Conversely, if the source code is heavy on mutations, then the compiler cannot confidently predict the overall effect of the mutations, and therefore is forced to err on the safe side of assuming the worst, i.e., don't apply aggressive optimizations in case the programmer's mutations invalidate said optimizations. The result is less optimal code.
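A trivial illustration of the aliasing problem (a sketch, not from any particular codebase):

	int f(int* a, int* b)
	{
	    *a = 1;
	    *b = 2;     // b might alias a, so *a may now be 2
	    return *a;  // must reload *a; cannot be folded to 1
	}

	int g(ref immutable int x, int* b)
	{
	    *b = 2;     // cannot affect x: immutable data never mutates
	    return x;   // freely cacheable and foldable
	}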


[...]
> The inability to have a const caching object seems correct. The way around would be to have a wrapper that caches (meh). If that is not possible, then maybe caching objects just aren't meant to be const by their nature? Isn't memoize a standard library feature? I should look at it, but I wouldn't expect it to be const.

It's not as simple as it might seem.  Here's the crux of the problem: you have an object that logically never changes (assuming no bugs, of course).  Meaning every time you read it, you get the same value, and so multiple reads can be elided, etc.. I.e., you want to tell the compiler that it's OK to assume this object is const (or immutable).

However, it is expensive to initialize, and you'd like it to be initialized only when it's actually needed, and once initialized you'd like it to be cached so that you don't have to incur the initialization cost again.  However, declaring a const object in D requires initialization, and after initialization it cannot be mutated anymore. This means you cannot declare it const in the first place if you want caching.
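For instance (expensiveInit being a stand-in for the costly part):

	struct Costly { int[] data; }

	Costly expensiveInit();  // the expensive computation

	void f()
	{
	    const Costly c = expensiveInit();  // the cost must be paid here, eagerly
	    // c.data = [1, 2, 3];  // error: cannot modify const expression
	}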

It gets worse, though.  Wrappers only work up to a certain point.  But when you're dealing with generic code, it becomes problematic.  Assume, for instance, that you have a type Costly that's logically const, but lazily initialized (and cached).  Since you can't actually declare it const -- otherwise lazy initialization doesn't work -- you have to declare it mutable.  Or, in this case, declare a wrapper that holds a const reference to it, say something like this:

	struct Payload {
		// lazily-initialized data
	}

	struct Wrapper {
		const(Payload)* impl;
		...
	}

However, what if you're applying some generic algorithms to it?  Generic code generally assumes that, given a type T, if you want to declare a const instance of it, you simply write const(T).  But what do you pass to the generic function? If you pass Wrapper, const(Wrapper) means `impl` cannot be rebound, so lazy initialization fails.  OK, then let's pass const(Payload) directly.  But that means you no longer have a wrapper, so you can't have lazy initialization (Payload must be constructed before you can pass it to the function, thus it must be eagerly initialized at this point).
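To make the first failure concrete (a sketch; the Payload contents and the 42 are arbitrary):

	struct Payload { int value; }

	struct Wrapper
	{
	    const(Payload)* impl;  // rebindable pointer to const data

	    ref const(Payload) get()  // note: cannot be marked const
	    {
	        if (impl is null)
	            impl = new Payload(42);  // lazy init rebinds impl
	        return *impl;
	    }
	}

	void describe(T)(const T t)  // generic code spells read-only as const(T)
	{
	    // t.get();  // error when T == Wrapper: get() must rebind impl,
	    //           // but const(Wrapper) makes impl head-const
	}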

It's an impasse.  Cached / lazily-initialized objects and D's const simply don't mix.  Well, you can try to mix them, but it's like trying to mix water and oil.  They just don't work well together.


T

-- 
Notwithstanding the eloquent discontent that you have just respectfully expressed at length against my verbal capabilities, I am afraid that I must unfortunately bring it to your attention that I am, in fact, NOT verbose.
June 21, 2019
That is a comprehensive reply. No pointers to other material required :-)

On Friday, 21 June 2019 at 16:35:50 UTC, H. S. Teoh wrote:
> On Fri, Jun 21, 2019 at 06:07:59AM +0000, Yatheendra via Digitalmars-d-learn wrote:
> Actually, optimizers work best when there is minimal mutation *in the original source*.  The emitted code, of course, is free to use mutation however it wants.  But the trouble with mutation at the source level is that it makes many code analyses very complex, which hinders the optimizer from doing what it might have been able to do in the absence of mutation (or a reduced usage of mutation).
> [...]

(aside: I hope we don't end up advocating the Haskell/Erlang way or the Clojure way!)

Yes, the hindrances of non-const code are documented (are most programmers listening?). I was only pointing out that mutation being part of the design limits what can be logically const. Is the trade-off clear between (mythical) guaranteed C++-like const at all the points we remember to put it, and guaranteed D-const at the fewer points we manage to put it? Does D-const distort the design (but in return you get all the optimizations possible in that scenario)?

>> The inability to have a const caching object seems correct. The way around would be to have a wrapper that caches (meh). If that is not possible, then maybe caching objects just aren't meant to be const by their nature? Isn't memoize a standard library feature? I should look at it, but I wouldn't expect it to be const.
>
> It's not as simple as it might seem.  Here's the crux of the problem: you have an object that logically never changes (assuming no bugs, of course).  Meaning every time you read it, you get the same value, and so multiple reads can be elided, etc.. I.e., you want to tell the compiler that it's OK to assume this object is const (or immutable).
>
> However, it is expensive to initialize, and you'd like it to be initialized only when it's actually needed, and once initialized you'd like it to be cached so that you don't have to incur the initialization cost again.  However, declaring a const object in D requires initialization, and after initialization it cannot be mutated anymore. This means you cannot declare it const in the first place if you want caching.
>
> It gets worse, though.  Wrappers only work up to a certain point.  But when you're dealing with generic code, it becomes problematic.  Assume, for instance, that you have a type Costly that's logically const, but lazily initialized (and cached).  Since you can't actually declare it const -- otherwise lazy initialization doesn't work -- you have to declare it mutable.  Or, in this case, declare a wrapper that holds a const reference to it, say something like this:
>
> 	struct Payload {
> 		// lazily-initialized data
> 	}
>
> 	struct Wrapper {
> 		const(Payload)* impl;
> 		...
> 	}
>
> However, what if you're applying some generic algorithms to it?
>  Generic code generally assumes that, given a type T, if you want to declare a const instance of it, you simply write const(T).  But what do you pass to the generic function? If you pass Wrapper, const(Wrapper) means `impl` cannot be rebound, so lazy initialization fails.  OK, then let's pass const(Payload) directly.  But that means you no longer have a wrapper, so you can't have lazy initialization (Payload must be constructed before you can pass it to the function, thus it must be eagerly initialized at this point).
>

I should check on std memoize & maybe code something up for understanding before writing more than this - would you mind pointing to an example range algorithm that we would have trouble passing a caching wrapper to?

I hadn't considered pointers as an option. Why wouldn't the following work, if expressible in D?
   struct CostlyComputeResult {
      // ... data fields
      // constructor takes compute results, no postblit
   }

   struct Wrapper {
      const(CostlyComputeResult)* cachee = null;
      // ... data fields storing compute inputs
      // constructor takes compute inputs
      // pointer to function(compute inputs)
      ref const(CostlyComputeResult) get() {
         if (cachee is null) {
            cachee = new CostlyComputeResult(compute(inputs));
         }
         return *cachee;
      }
   }

Hopefully jumping through these hoops is worthwhile. Instead, maybe just wait until the compiler grows a 'cache pure' function qualifier (move constructor required?).
June 21, 2019
On Fri, Jun 21, 2019 at 06:32:33PM +0000, Yatheendra via Digitalmars-d-learn wrote: [...]
>    struct CostlyComputeResult {
>       // ... data fields
>       // constructor takes compute results, no postblit
>    }
> 
>    struct Wrapper {
>       const(CostlyComputeResult)* cachee = null;
>       // ... data fields storing compute inputs
>       // constructor takes compute inputs
>       // pointer to function(compute inputs)
>       ref const(CostlyComputeResult) get() {
>          if (cachee is null) {
>             cachee = new CostlyComputeResult(compute(inputs));
>          }
>          return *cachee;
>       }
>    }
> 
> Hopefully jumping through these hoops is worthwhile. Instead, maybe just wait until the compiler grows a 'cache pure' function qualifier (move constructor required?).

The problem with this is that you cannot use const(Wrapper).

In particular, if you have a function that wants to document that it does not mutate its argument, you cannot write:

	auto func(in Wrapper data) { ... }

because const(Wrapper) does not allow lazy initialization.  Basically you have to resort to convention (e.g., name it ReadOnly or something similar) rather than actually mark it const.  This generally isn't a big problem if you're using func in isolation, but as soon as you need to compose func with other const code, you quickly find yourself in a Gordian knot of const incompatibilities that percolate throughout the entire call chain, because D's const is transitive.

I.e., you want func to interact with other functions that trade in const data, but you cannot, because of the constant(!) need to keep Wrapper mutable.  The non-constness of Wrapper will then percolate up the call chain, "tainting" all functions that call it so that they cannot be marked const, even though *logically* they are const.  This mix is already bad enough (try it on a non-trivial codebase sometime and see for yourself), but once you add generic functions to the mix, the whole thing simply becomes unusable -- because generic functions expect to write const types as const(T), and that breaks if T is Wrapper.

OK, so you can try to make it mutable as a workaround.  But then that breaks const-ness attribute inference, so the generic function becomes non-const, which in turn recursively causes its callers to be non-const, etc.  Somewhere at the top of the call chain you'll have a const method that wants to call a const function, passing some higher-level data structure that eventually contains Wrapper somewhere deep down -- and it simply doesn't work without making the *entire* structure mutable, due to the infectiousness of const.

So as soon as you use Wrapper in any data structure of arbitrary complexity, the entire thing must be mutable -- otherwise const percolates all the way down to Wrapper and the caching doesn't work anymore.
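Concretely (a sketch, reusing the Wrapper from your snippet):

	struct Config
	{
	    Wrapper cache;
	    // ... other fields ...
	}

	int lookup(const ref Config cfg)  // wants to advertise read-only
	{
	    // auto r = cfg.cache.get();  // error: const is transitive, so
	    //                            // cfg.cache is const(Wrapper) and
	    //                            // get() is not callable on it
	    return 0;
	}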

tl;dr: using a wrapper works fine for relatively simple cases.  But as soon as you add any meaningful complexity to it, the scheme quickly becomes either impractically convoluted, or outright impossible to use without a hard cast to cast away const (thereby invoking the lovely UB).


T

-- 
INTEL = Only half of "intelligence".
June 22, 2019
It feels disingenuous to want to call a caching object even "logically" const. There has to be a scaffolding-based but hopefully generic compromise. I haven't yet tested this belief, but I believe "physical" const is of good use wherever it can be applied.

On Friday, 21 June 2019 at 23:39:20 UTC, H. S. Teoh wrote:
> The problem with this is that you cannot use const(Wrapper).
>
> In particular, if you have a function that wants to document that it does not mutate its argument, you cannot write:
>
> 	auto func(in Wrapper data) { ... }
>
> because const(Wrapper) does not allow lazy initialization.
> ...

IMHO, in parameters are a more important scenario than const ranges (with the same constraints, of course).

Just for the heck of it, I'll try to get a snippet "working", but I can already see out parameters snaking all through the call chain!
June 22, 2019
On Saturday, 22 June 2019 at 05:10:14 UTC, Yatheendra wrote:
> It feels disingenuous to want to call a caching object even "logically" const. There has to be a scaffolding-based but hopefully generic compromise. I haven't yet tested this belief, but I believe "physical" const is of good use wherever it can be applied.
>
> On Friday, 21 June 2019 at 23:39:20 UTC, H. S. Teoh wrote:
>> The problem with this is that you cannot use const(Wrapper).
>>
>> In particular, if you have a function that wants to document that it does not mutate its argument, you cannot write:
>>
>> 	auto func(in Wrapper data) { ... }
>>
>> because const(Wrapper) does not allow lazy initialization.
>> ...
> ...

"physical" const has to be applicable & good in many/most other use-cases than caching (citation needed). Somehow, wanting to call mutating code on logically const values sounds to be the wrong want.

Lazy initialization sounds like it would make a good DIP :-) Generate (once) on (first) read, just like copy (once) on (first) write. But there are other ways.

Heavy computation called at most once: bite the bullet and eagerly construct an immutable value ahead of time. "physical" const might offer just enough optimization opportunity to offset biting the bullet.

Called more than once: same thing.
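E.g. (a sketch - computeOnce stands in for the heavy work):

   struct CostlyComputeResult { int value; }

   CostlyComputeResult computeOnce(int input) pure
   {
       return CostlyComputeResult(input * 2);  // stand-in for heavy work
   }

   void f()
   {
       immutable result = computeOnce(21);  // pay eagerly, exactly once
       // every later read of result is guaranteed identical
   }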
June 23, 2019
On Wednesday, 13 February 2019 at 16:40:18 UTC, H. S. Teoh wrote:
> So ironically, the iron-clad semantics of D's const system turns out to be also its own downfall.
>
>

Such things are not ironic. There is always a trade-off. You get nothing for free in this universe; physics tells us this. Conservation laws apply to energy, and everything is energy. Hence your computer cannot violate these laws, nor can the D specification (whatever it ends up meaning), nor can the D const system, so to speak...

That is, any time something is inversely related to something else, there will be a conservation relationship. If you restrict something too much, then something else in direct opposition becomes too unrestricted.

It's not that D's const system is bad; it's that it creates too much restriction without offering any other option. The usual way to solve such problems is granularity. That way you can choose the right tool for the job.

Maybe D needs different levels of const: constN, where constN can always be cast to constn for n <= N. One would need to define the levels properly to maximize utility and minimize granularity. D probably only needs 3-5 levels to be effective.
