perl.perl5.porters

Re: BOMs as noncharacters

Tom Christiansen
August 17, 2011 14:55
> I'm wondering if you are confusing U+FFFE, a non-character code point 
> that is invalid in open interchange, with U+FEFF, the BYTE ORDER MARK, 
> which is used in UTF-16 and UTF-32 to give the endianness of the stream. 

Yes, you're right.  I was.  Sorry.

> Unicode now discourages BOMs, but they are not forbidden, and there is
> no real use for them in UTF-8, as that encoding does not have endianness.

Really?  They discourage BOMs now?  I wasn't aware of that.  Could you
please tell me where you saw that?  Was it in Chapter 3 or Chapter 16
of the Unicode Standard, or somewhere else?
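
For reference, the two code points mixed up above can be told apart with a short sketch. It is in Python rather than Perl, purely for illustration, and the variable names are hypothetical, not taken from this thread. It shows that U+FEFF encoded as big-endian UTF-16 and read back as little-endian yields the noncharacter U+FFFE, which is how a reader detects the wrong byte order, and that in UTF-8 the BOM is a fixed three-byte signature that says nothing about endianness.

```python
import codecs

BOM = "\uFEFF"      # U+FEFF BYTE ORDER MARK
NONCHAR = "\uFFFE"  # U+FFFE, a noncharacter code point

# The BOM serialised in each UTF-16 byte order.
bom_be = BOM.encode("utf-16-be")  # bytes FE FF
bom_le = BOM.encode("utf-16-le")  # bytes FF FE

# Decoding big-endian BOM bytes with the wrong byte order produces
# U+FFFE, so a reader that sees U+FFFE first knows the order is swapped.
swapped = bom_be.decode("utf-16-le")

# UTF-8 has a single byte order, so its BOM is only a signature.
bom_utf8 = BOM.encode("utf-8")    # bytes EF BB BF

# The "utf-8-sig" codec strips a leading signature when decoding.
text = (bom_utf8 + b"hi").decode("utf-8-sig")
```

The `utf-16-be` and `utf-16-le` codecs are used here because they neither emit nor consume a BOM on their own, which keeps the byte swap visible.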


