perl.perl5.porters | Postings from August 2011

Re: BOMs as noncharacters

Karl Williamson
August 17, 2011 21:01
On 08/17/2011 03:55 PM, Tom Christiansen wrote:
>> I'm wondering if you are confusing U+FFFE, a non-character code point
>> that is invalid in open interchange, with U+FEFF, the BYTE ORDER MARK,
>> which is used in UTF-16 and UTF-32 to give the endianness of the stream.
> Yes, you're right.  I was.  Sorry.
>>   Unicode now discourages BOMs, but they are not forbidden, and there is
>> no real use for them in UTF-8, as that encoding does not have endianness.
> Really?  They discourage BOMs now?   I wasn't aware of that.  Could you
> please tell me where you noticed that?  Was this in Chapters 3 or 16
> from the Unicode Standard, or was it somewhere else?

It may be my turn to be mistaken.  I don't see anything like that in the
current Standard; perhaps I got the impression that they were frowned
upon from off-hand remarks on the Unicode mailing list, or perhaps I
dreamt it all up.
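
For anyone else who mixes up the two code points, here is a minimal sketch of
the distinction discussed above. It is written in Python rather than Perl
purely for illustration; the helper name is_noncharacter and the variable
names are hypothetical, not taken from any Perl or Unicode API. It shows that
U+FEFF serializes differently under the two UTF-16 byte orders, that reading
it with the wrong byte order yields the noncharacter U+FFFE, and that the
UTF-8 form is a fixed byte sequence carrying no endianness information.

```python
import codecs

BOM = "\ufeff"  # U+FEFF, BYTE ORDER MARK


def is_noncharacter(cp: int) -> bool:
    """Hypothetical helper: True for the 66 Unicode noncharacters.

    These are U+FDD0..U+FDEF plus the last two code points of every
    plane (U+xxFFFE and U+xxFFFF).
    """
    return 0xFDD0 <= cp <= 0xFDEF or (cp <= 0x10FFFF and (cp & 0xFFFE) == 0xFFFE)


# The same character yields different bytes depending on byte order,
# which is what lets a reader detect the endianness of a stream.
bom_be = BOM.encode("utf-16-be")
bom_le = BOM.encode("utf-16-le")

# Interpreting the big-endian BOM bytes as little-endian gives U+FFFE,
# a noncharacter, so a reader can tell it guessed the wrong order.
misread = int.from_bytes(bom_be, "little")

# In UTF-8 the BOM is always the same three bytes; it marks no byte order.
bom_utf8 = BOM.encode("utf-8")

# The generic "utf-16" codec uses a leading BOM to pick the byte order
# and drops it from the decoded text.
text_be = (codecs.BOM_UTF16_BE + "hi".encode("utf-16-be")).decode("utf-16")
text_le = (codecs.BOM_UTF16_LE + "hi".encode("utf-16-le")).decode("utf-16")

print(bom_be.hex(), bom_le.hex(), hex(misread), bom_utf8.hex())
print(is_noncharacter(0xFEFF), is_noncharacter(misread))
```

The point of the sketch is only that U+FEFF is an ordinary assigned
character while U+FFFE is not, which is why the pair works as a byte-order
signature.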
