develooper Front page | perl.perl5.porters | Postings from August 2011

Re: BOMs as noncharacters

Thread Previous | Thread Next
From:
Karl Williamson
Date:
August 17, 2011 21:01
Subject:
Re: BOMs as noncharacters
Message ID:
4E4C8E66.5050706@khwilliamson.com
On 08/17/2011 03:55 PM, Tom Christiansen wrote:
>> I'm wondering if you are confusing U+FFFE, a non-character code point
>> that is invalid in open interchange, with U+FEFF, the BYTE ORDER MARK,
>> which is used in UTF-16 and UTF-32 to give the endianness of the stream.
>
> Yes, you're right.  I was.  Sorry.
>
>>   Unicode now discourages BOMs, but they are not forbidden, and there is
>> no real use for them in UTF-8, as that encoding does not have endianness.
>
> Really?  They discourage BOMs now?   I wasn't aware of that.  Could you
> please tell me where you noticed that?  Was this in Chapters 3 or 16
> from the Unicode Standard, or was it somewhere else?
>

It may be my turn to be mistaken.  I don't see anything like that in the 
current Standard; perhaps I got the impression that they were frowned 
upon by off-hand remarks in the Unicode mailing list; or perhaps I 
dreamt it all up.


Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About