On Sun, Feb 13, 2000 at 09:54:43PM +0200, Jarkko Hietaniemi wrote: > Food for thought: should Perl always make its utf8 data to be in the > decomposed form to be canonical? Or, the other way, should it always > try to find the composite form (to be more compact)? No and no. > A canonical form would make searching the data rather easier. I think there is a Consortium's document on "Levels of internationization support in REx engines". I think there are 3 or 4 levels, and we are on the first one now. IIRC, what you propose is similar to the level 2. I would think that such things should be treated by pessimizers for RExen: "mutate this REx to support composition/decomposition too". IlyaThread Previous | Thread Next