> C<\w> matches a single I<word> character: an alphanumeric character > (that is, an alphabetic character, or a digit), or the underscore (C<_>). > What is considered a word character depends on the internal encoding > of the string. If it's in UTF-8 format, C<\w> matches those characters > that considered word characters in the Unicode database. That is, it > not only matches ASCII letters, but also Thai letters, Greek letters, etc. I think you something out. :-) --tom