bytes or chars ?

jda jda at his.com
Thu Sep 16 09:44:39 CDT 2004


>>Not true, Robert. UTF-8 characters can also be 3 or 4 bytes in 
>>length (I recall someone posting that in rare cases 5 bytes are 
>>required in certain rare cases).
>AFAIK (from the ICU User's Guide) the maximum possible len of a 
>character in UTF-8 is 4 bytes. Can you give an example when 5 bytes 
>are needed?
>
>--
>Best regards,
>Igor Gomon

Hi Igor,

Sorry, this was something I saw posted on the RB mailing list over a 
year ago, but didn't keep the reference. I think it was a recent 
addition to UTF-8 that dealt with more (unusual) languages being 
supported.

Jon


More information about the Valentina-beta mailing list