> On 24 Apr 2020, at 22:03, Joseph Brenner <doomvox@gmail.com> wrote: > > Thanks, yes I understand unicode and utf-8 reasonably well. > >> So Rakudo has to read the next codepoint to make sure that it isn't a combining codepoint. > >> It is probably faking up the reads to look right when reading ASCII, but failing to do that for wider codepoints. > > I think it'd be the other way around... the idea here would be it's > doing an extra readchar behind the scenes just in-case there's > combining chars involved-- so you're figuring there's some confusion > about the actual point in the file that's being read and the > abstraction that readchars is supplying? What does .tell say before and after the readchars?Thread Previous | Thread Next