On 08/17/2011 03:55 PM, Tom Christiansen wrote:
>> I'm wondering if you are confusing U+FFFE, a non-character code point
>> that is invalid in open interchange, with U+FEFF, the BYTE ORDER MARK,
>> which is used in UTF-16 and UTF-32 to give the endianness of the stream.
>
> Yes, you're right. I was. Sorry.
>
>> Unicode now discourages BOMs, but they are not forbidden, and there is
>> no real use for them in UTF-8, as that encoding does not have endianness.
>
> Really? They discourage BOMs now? I wasn't aware of that. Could you
> please tell me where you noticed that? Was this in Chapters 3 or 16
> from the Unicode Standard, or was it somewhere else?
>

It may be my turn to be mistaken. I don't see anything like that in the
current Standard; perhaps I got the impression that they were frowned
upon from off-hand remarks on the Unicode mailing list; or perhaps I
dreamt it all up.
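
For what it's worth, the U+FEFF point quoted above can be illustrated with a small Python sketch. It is not from the thread: `sniff_bom` is a hypothetical helper name, and the only library pieces assumed are the BOM constants in Python's standard `codecs` module. It checks whether a byte string starts with an encoded U+FEFF and, if so, which encoding and byte order that implies.

```python
import codecs

# U+FEFF as it appears at the start of a stream in each encoding form.
# The UTF-32 marks are listed first because the UTF-32 LE mark
# (FF FE 00 00) begins with the same two bytes as the UTF-16 LE mark.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data: bytes):
    """Return (encoding, bom_length) if data starts with a BOM, else (None, 0).

    Hypothetical helper for illustration only.
    """
    for bom, name in BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return None, 0


if __name__ == "__main__":
    sample = "\ufeffhi".encode("utf-16-be")
    print(sniff_bom(sample))
```

In UTF-16 and UTF-32 the result tells a reader which byte order to use. In UTF-8 the mark always has the same three bytes, so it can only act as an encoding signature and says nothing about endianness.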