AA with complex keytype?

Manfred Nowak wrote:
<snip>
> Why are classes required to implement opCmp(Object) for AAs to function properly?

Apparently because Walter doesn't like the idea of providing an alternative implementation of AAs for unordered classes, though to my knowledge he hasn't given a truly convincing reason. My library implementation, http://pr.stewartsplace.org.uk/d/sutil/, uses only toHash and opEquals, and thus works on unordered types.

> Currently AAs misbehave if opCmp(Object) is not overridden for a class, but no error is thrown.

Currently? Not under DMD 1.005 when I try it. Please supply a code example that demonstrates this behaviour.

> For the transitivity of an ordering it suffices for opCmp(Object) to always give "==" if the objects are equal, and give "<" if they are not.

So if x != y, then both x < y and y < x? That wouldn't make sense at all.

> It should be easy to implement this as the default opCmp(Object).

Once upon a time, Object.opCmp was set up to compare the memory addresses, but this behaviour was removed to prepare for a copying/compacting GC, and probably partly to eliminate the confusing behaviour it created.

Stewart.
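To make the "only toHash and opEquals" idea concrete, here is a minimal sketch of a chained hash map that never calls opCmp. It is not the sutil source. The names `ChainedMap`, `Entry`, `Point`, `set` and `find` are hypothetical, and the code assumes a current D2 compiler rather than the DMD 1.005 discussed in this thread (D1 would use `hash_t` and an `int` return from opEquals).

```d
// Hypothetical key type: has a hash and equality, but no meaningful ordering.
class Point
{
    int x, y;
    this(int x, int y) { this.x = x; this.y = y; }

    override size_t toHash() @trusted nothrow
    {
        return cast(size_t)(x * 31 + y);
    }

    override bool opEquals(Object o)
    {
        auto p = cast(Point) o;
        return p !is null && p.x == x && p.y == y;
    }
}

// Hypothetical map: each bucket is a plain array searched linearly,
// so keys only need toHash and opEquals.
class ChainedMap
{
    private static struct Entry { Object key; bool value; }
    private Entry[][] buckets;

    this(size_t nBuckets = 16) { buckets = new Entry[][](nBuckets); }

    // Insert a new entry or overwrite the value of an equal key.
    void set(Object key, bool value)
    {
        auto i = key.toHash() % buckets.length;
        foreach (j; 0 .. buckets[i].length)
        {
            if (buckets[i][j].key == key)   // == dispatches to opEquals
            {
                buckets[i][j].value = value;
                return;
            }
        }
        buckets[i] ~= Entry(key, value);
    }

    // Returns a pointer to the stored value, or null if the key is absent.
    bool* find(Object key)
    {
        auto i = key.toHash() % buckets.length;
        foreach (j; 0 .. buckets[i].length)
            if (buckets[i][j].key == key)
                return &buckets[i][j].value;
        return null;
    }
}
```

The trade-off is the one raised in this thread: with a poor toHash, lookups in a bucket degrade to a linear scan.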

Stewart Gordon wrote
>> Currently AAs misbehave if opCmp(Object) is not overridden for
>> a class, but no error is thrown.
>
> Currently? Not under DMD 1.005 as I try it. Please supply a code example that demonstrates this behaviour.

The following code shows that under 1.005 no compile-time error is given.

    class C{
      // Worthless hash: every key collides, so a comparison is forced.
      hash_t toHash() { return 0; }
    }
    void main(){
      bool[C] map;
      map[ new C]= true;
      map[ new C]= false;  // second insert collides with the first
    }

A runtime error shows up only when a comparison is actually needed. This might be far too late. In this example the runtime error is forced by implementing a worthless toHash.

> So if x != y, then both x < y and y < x? That wouldn't make sense at all.

Enough sense for an AA: all colliding elements are put into a linear list and searched in sequence on retrieval. Without a good toHash this would naturally lead to bad runtime behaviour.

-manfred

Manfred Nowak wrote:
>> So if x != y, then both x < y and y < x? That wouldn't make sense
>> at all.
>
> Enough sense for an AA: all colliding elements are put into a linear list and searched in sequence on retrieval. Without a good toHash this would naturally lead to bad runtime behaviour.

If they were put into a linear list you wouldn't even *need* opCmp, so no opCmp would be "enough sense" as well ;).

But the fact is that they are put into a binary tree to keep acceptable behaviour (i.e. O(log N) instead of O(N)) when the hash function sucks. Since hash functions can be user-defined, and not all users are experts at hashing, you need to consider that use case.
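The binary-tree bucket described above can be sketched roughly as follows. This is an illustration only, not the DMD runtime code. The names `Key`, `Node` and `put` are hypothetical, the tree is unbalanced, and the code assumes a current D2 compiler. `put` returns the number of opCmp calls it needed, which makes the cost of a collision visible.

```d
// Hypothetical key type with a genuine ordering.
class Key
{
    int id;
    this(int id) { this.id = id; }

    override int opCmp(Object o)
    {
        auto k = cast(Key) o;
        assert(k !is null, "Key can only be compared with Key");
        return id < k.id ? -1 : (id > k.id ? 1 : 0);
    }
}

// One node of the tree that holds all entries of a single hash bucket.
struct Node
{
    Key key;
    bool value;
    Node* left, right;
}

// Insert key into the bucket tree, or update its value if an equal key
// (opCmp == 0) is already present. Returns the number of opCmp calls made.
size_t put(ref Node* root, Key key, bool value)
{
    size_t comparisons = 0;
    Node** slot = &root;
    while (*slot !is null)
    {
        ++comparisons;
        int c = key.opCmp((*slot).key);
        if (c == 0)
        {
            (*slot).value = value;
            return comparisons;
        }
        // Descend left for smaller keys, right for larger ones.
        slot = c < 0 ? &(*slot).left : &(*slot).right;
    }
    *slot = new Node(key, value);
    return comparisons;
}
```

With keys arriving in a favourable order the depth stays near log N. Because nothing rebalances the tree, keys arriving in sorted order would still produce a chain of depth N.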

Frits van Bommel wrote
> Since hash functions can be user-defined, and not all users are experts at hashing, you need to consider that use case.

For good runtime behaviour, the currently implemented AAs need at least one of (1) or (2), where:

(1a) toHash implements a good distribution and
(1b) opCmp==!opEquals or better

or

(2a) toHash is worse than described by (1a) and
(2b) opCmp implements an ordering suitable for use in unbalanced binary trees

There is no evidence that users who are not experts at hashing will be experts at implementing a suitable ordering, especially if the elements do not have a natural ordering.

-manfred
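For illustration, condition (1b) in its weakest form might look like the sketch below: opCmp returns 0 for equal objects and "<" for everything else. The class name `Tag` is hypothetical, toHash is omitted for brevity, and the code assumes a current D2 compiler. Note that this is not an ordering in the mathematical sense, which is the objection raised earlier in the thread: for unequal a and b, both a < b and b < a hold.

```d
// Hypothetical key type without a natural ordering.
class Tag
{
    string name;
    this(string name) { this.name = name; }

    override bool opEquals(Object o)
    {
        auto t = cast(Tag) o;
        return t !is null && t.name == name;
    }

    // Condition (1b): "==" for equal objects, "<" for all others.
    // In a tree-based bucket every unequal key therefore descends to the
    // same side, so the bucket behaves like a linear list.
    override int opCmp(Object o)
    {
        return opEquals(o) ? 0 : -1;
    }
}
```

This only gives acceptable lookup times when (1a) also holds, that is, when toHash spreads keys well enough that buckets stay short.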