On Sat, Dec 25, 2010 at 5:07 PM, Eric Brine <ikegami@adaelis.com> wrote: > > On Sat, Dec 25, 2010 at 9:17 AM, David Golden <xdaveg@gmail.com> wrote: >> >> On Karl's proposal, I agree with Zefram that it's headed in the right >> direction. Let's say that we call perl's internal encoding >> "encoding(int72)" for the sake of argument below. Then we have two >> encodings: >> >> :encoding(UTF-8) >> :encoding(int72) > > There's really three. UTF-8 for interchange, UTF-8 for intrachange and > int72. What's the difference between "interchange" and "intrachange"? > warn and let it through (yuck!) > warn and substitute in U+FFFD > warn and recode > recode (no warning) Assuming that "encoding(int72)" allows overlong sequences and that other layers can be added to modify that stream, then such a layer could do any of those. The second choice is probably the best "default" choice. -- DavidThread Previous | Thread Next