range result in Tuple! and how to convert into assocArray by sort?

May 10, 2022

MichaelBi

May 10, 2022

rikki cattermole

May 10, 2022

May 10, 2022

May 11, 2022

May 10, 2022

May 12, 2022

If I am understanding the problem correctly, this is a super expensive method for doing something pretty simple. Even if it is a bit more code, this won't require memory allocation which in this case wouldn't be cheap (given how big DNA tends to be). string s = "ACGTACGT"; uint[4] counts; foreach(char c; s) { switch(c) { case 'A': case 'a': counts[0]++; break; case 'C': case 'c': counts[1]++; break; case 'G': case 'g': counts[2]++; break; case 'T': case 't': counts[3]++; break; default: assert(0, "Unknown compound"); } } writeln(counts);

On 5/9/22 20:38, rikki cattermole wrote: > this is a super expensive > method for doing something pretty simple. Yes! :) Assuming the data is indeed validated in some way, the following should be even faster. It validates the data after the fact: import std.stdio; import std.range; import std.exception; import std.algorithm; import std.format; const ulong[] alphabet = [ 'A', 'C', 'G', 'T' ]; void main() { string s = "ACGTACGT"; auto counts = new ulong[char.max]; foreach(char c; s) { counts[c]++; } validateCounts(counts); writeln(counts.indexed(alphabet)); } void validateCounts(ulong[] counts) { // The other elements should all be zero. enforce(counts .enumerate .filter!(t => !alphabet.canFind(t.index)) .map!(t => t.value) .sum == 0, format!"There were illegal letters in the data: %s"(counts)); } Ali

On Tuesday, 10 May 2022 at 03:38:08 UTC, rikki cattermole wrote: > If I am understanding the problem correctly, this is a super expensive method for doing something pretty simple. Even if it is a bit more code, this won't require memory allocation which in this case wouldn't be cheap (given how big DNA tends to be). > > string s = "ACGTACGT"; > > uint[4] counts; > > foreach(char c; s) { > switch(c) { > case 'A': > case 'a': > counts[0]++; > break; > case 'C': > case 'c': > counts[1]++; > break; > case 'G': > case 'g': > counts[2]++; > break; > case 'T': > case 't': > counts[3]++; > break; > default: > assert(0, "Unknown compound"); > } > } > > writeln(counts); yes, thanks. understood this. the problem for me now is after learning D, always thinking about using range and function composition...and forgot the basic algorithm :)

On Tuesday, 10 May 2022 at 04:21:04 UTC, Ali Çehreli wrote: > On 5/9/22 20:38, rikki cattermole wrote: > > > [...] > > Yes! :) > > Assuming the data is indeed validated in some way, the following should be even faster. It validates the data after the fact: > > [...] this is cool! thanks for your time and i really like your book Programming in D :)

On 5/9/22 22:12, MichaelBi wrote: > On Tuesday, 10 May 2022 at 04:21:04 UTC, Ali Çehreli wrote: >> On 5/9/22 20:38, rikki cattermole wrote: >> >> > [...] >> >> Yes! :) >> >> Assuming the data is indeed validated in some way, the following >> should be even faster. It validates the data after the fact: >> >> [...] > > this is cool! I've been meaning to write about a bug in my code, which would likely cause zero issues, and which you've probably already fixed. ;) BAD: auto counts = new ulong[char.max]; GOOD: auto counts = new ulong[char.max - char.min + 1]; FINE: auto counts = new ulong[256]; > thanks for your time and i really like your book > Programming in D :) Yay! :) Ali