Thread overview
byCodePoint for a range of chars
May 20, 2014
John Colvin
May 20, 2014
Justin Whear
May 20, 2014
monarch_dodra
May 20, 2014
monarch_dodra
May 20, 2014
John Colvin
May 20, 2014
Given a range with element type char, what's the best way of iterating over it by code-point, without filling an array first?

Related to this: What's the status of std.utf and std.encoding? The comments in std.encoding say that some functions supersede their std.utf counterparts.
May 20, 2014
On Tue, 20 May 2014 17:59:07 +0000, John Colvin wrote:

> Given a range with element type char, what's the best way of iterating over it by code-point, without filling an array first?
> 
> Related to this: What's the status of std.utf and std.encoding? The comments in std.encoding say that some functions supersede their std.utf counterparts.

Foreach on narrow strings automatically decodes, so it's as simple as:

// assume UTF-8 encoded
char[] myData = ...
foreach (dchar codePoint; myData)
   ...
May 20, 2014
On Tuesday, 20 May 2014 at 17:59:09 UTC, John Colvin wrote:
> Given a range with element type char, what's the best way of iterating over it by code-point, without filling an array first?
>
> Related to this: What's the status of std.utf and std.encoding? The comments in std.encoding say that some functions supersede their std.utf counterparts.

FWI, Walter just wrote "byDchar", that does what you want:
https://github.com/D-Programming-Language/phobos/pull/2043

It's about to be merged.
May 20, 2014
On Tuesday, 20 May 2014 at 18:06:09 UTC, Justin Whear wrote:
> Foreach on narrow strings automatically decodes, so it's as simple as:
>
> // assume UTF-8 encoded
> char[] myData = ...
> foreach (dchar codePoint; myData)
>    ...

I think the point of his question is if you have an actual non-array range of chars, in which case foreach does NOT decode. It simply iterates.
May 20, 2014
On Tuesday, 20 May 2014 at 19:58:17 UTC, monarch_dodra wrote:
> On Tuesday, 20 May 2014 at 17:59:09 UTC, John Colvin wrote:
>> Given a range with element type char, what's the best way of iterating over it by code-point, without filling an array first?
>>
>> Related to this: What's the status of std.utf and std.encoding? The comments in std.encoding say that some functions supersede their std.utf counterparts.
>
> FWI, Walter just wrote "byDchar", that does what you want:
> https://github.com/D-Programming-Language/phobos/pull/2043
>
> It's about to be merged.

Sweet! I was vaguely aware of this but didn't know what the progress was. Thanks :)

In the mean time I hand-rolled my own, but I'm sure Walter's is infinitely better so I'll swap over ASAP.