Thread overview |
---|
March 27, 2019 How to decode UTF-8 text? | ||||
---|---|---|---|---|
| ||||
Hello, I have got some text with UTF-8. For example this part: > <title>ΠαÏάλληλη αναζήÏηÏη</title> How to decode it to get this result? > <title>Παράλληλη αναζήτηση</title> I have tried functions like "decode", "byUTF", "to!wchar"... but no success. Input string is correct - checked it with "https://www.browserling.com/tools/utf8-decode". |
March 27, 2019 Re: How to decode UTF-8 text? | ||||
---|---|---|---|---|
| ||||
Posted in reply to Andrey | On Wednesday, 27 March 2019 at 13:39:07 UTC, Andrey wrote: > I have got some text with UTF-8. For example this part: >> <title>ΠαÏάλληλη αναζήÏηÏη</title> This looks like a UTF-8 sequence which has been UTF-8 encoded. > How to decode it to get this result? >> <title>Παράλληλη αναζήτηση</title> Undo the second UTF-8 encoding by transcoding the UTF-8 into an 8-bit character set (latin1, windows-1252 etc.) which you have to guess. > I have tried functions like "decode", "byUTF", "to!wchar"... but no success. > > Input string is correct - checked it with "https://www.browserling.com/tools/utf8-decode". ```decode.d import std.stdio; import std.encoding; void main () { string src = "<title>Î\u00a0αÏάλληλη αναζήÏηÏη</title>"; Latin1String ls; transcode (src, ls); string targ = cast (string) ls; targ.writeln; } ``` $ ./decode <title>Παράλληλη αναζήτηση</title> |
Copyright © 1999-2021 by the D Language Foundation