On approximately 10/18/2009 12:53 AM, came the following characters from the keyboard of demerphq: > 2009/10/18 karl williamson <public@khwilliamson.com>: > >> demerphq wrote: >> >>> I have basically fixed the failing new tests. >>> >>> I have not uploaded the fix because these fixes break other tests, and >>> so far I havent been able to review them all for correctness and fix >>> them or their underlying breakage. >>> >>> Also, I start to doubt the tenability of this path. Making \d mean >>> [0-9] seems to me to be clear (please speak up if anyone disagree). >>> Making \w have the strict [A-Za-z0-9_] behavior by default is looking >>> less sensible than it seemed at first. >>> >> Please elaborate, cause it still seems very sensible to me. >> > > The branch i pushed fully implements the agreed apon behaviour. > > Look at the number of failing tests, and tests I had to patch. > > That gives an idea of the impact on our community, in particular the > ones who don't use English as a native language. I think i > unrealistically underestimated how much impact this would have. Having a working Unicode semantics, and an understandable ASCII semantics, and not cluttering the code to support 4 different cases, still makes it sound worthwhile to me. But I speak English, although I do write programs that process non-English text.Thread Previous