perl.fwp http://www.nntp.perl.org/group/perl.fwp/ ... Copyright 1998-2008 perl.org Mon, 06 Oct 2008 22:06:24 +0000 ask@perl.org Re: Interpolated code blocks may not create lexicals by Ryan Yagatich Richard Hartmann wrote:<br/>&gt; On Sat, Sep 27, 2008 at 00:04, Michael G Schwern &lt;schwern@pobox.com&gt; wrote:<br/>&gt; <br/>&gt;&gt; David Nicol wrote:<br/>&gt;&gt; <br/>&gt;&gt;&gt; the &quot;fun with perl&quot; mailing list doesn&#39;t still exist, does it?<br/>&gt;&gt;&gt; <br/>&gt;&gt; Yes, yes it does.<br/>&gt;&gt; <br/>&gt;<br/>&gt; Lies! It does not!<br/>&gt;<br/>&gt;<br/>&gt; Richard<br/>&gt;<br/>&gt; <br/>Oh Noes!<br/><br/>Regards,<br/>Ryan Yagatich<br/><br/>--<br/> \|||/<br/> ---------0oo------( o o )------oo0----------<br/> Ryan Yagatich (_) ryagatich@gmail.com<br/> http://www.linkedin.com/in/yagatich<br/> -----------------------------------------<br/><br/><br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/09/msg4108.html Mon, 29 Sep 2008 05:01:33 +0000 Re: Interpolated code blocks may not create lexicals by Richard Hartmann On Sat, Sep 27, 2008 at 00:04, Michael G Schwern &lt;schwern@pobox.com&gt; wrote:<br/>&gt; David Nicol wrote:<br/>&gt;&gt; the &quot;fun with perl&quot; mailing list doesn&#39;t still exist, does it?<br/>&gt;<br/>&gt; Yes, yes it does.<br/><br/>Lies! It does not!<br/><br/><br/>Richard<br/> http://www.nntp.perl.org/group/perl.fwp/2008/09/msg4107.html Sun, 28 Sep 2008 16:07:09 +0000 Re: Interpolated code blocks may not create lexicals by Michael G Schwern David Nicol wrote:<br/>&gt; the &quot;fun with perl&quot; mailing list doesn&#39;t still exist, does it?<br/><br/>Yes, yes it does. It&#39;s just pining for the fjords.<br/><br/><br/>-- <br/>&quot;Clutter and overload are not an attribute of information,<br/> they are failures of design&quot;<br/> -- Edward Tufte<br/> http://www.nntp.perl.org/group/perl.fwp/2008/09/msg4106.html Fri, 26 Sep 2008 15:04:57 +0000 Re: Interpolated code blocks may not create lexicals by David Nicol On 9/25/08, Eirik Berg Hanssen &lt;Eirik-Berg.Hanssen@allverden.no&gt; wrote:<br/>&gt; print &lt;&lt;STR;<br/>&gt; I like @{<br/>&gt; my $rand = rand();<br/>&gt; [$rand &lt; 0.5 ? &#39;pie&#39; : &#39;beer&#39;]<br/>&gt; }<br/>&gt; STR<br/>&gt;<br/>&gt;<br/>&gt; You already have a block; no need for a do-block within it.<br/>&gt;<br/>&gt;<br/>&gt; (The rest, others already covered.)<br/><br/>the &quot;fun with perl&quot; mailing list doesn&#39;t still exist, does it?<br/><br/>there&#39;s also<br/><br/> my $BeerOrPie;<br/> sub BeerOrPie::TIESCALAR { bless \(my $x), &#39;BeerOrPie&#39; };<br/> sub BeerOrPie::FETCH { rand() &lt; 0.5 ? &#39;pie&#39; : &#39;beer&#39;; };<br/> tie $BeerOrPie, &#39;BeerOrPie&#39;;<br/> print &quot;I like $BeerOrPie\n&quot;;<br/><br/>-- <br/>&quot;checks and balances&quot; is not financial jargon<br/> http://www.nntp.perl.org/group/perl.fwp/2008/09/msg4105.html Thu, 25 Sep 2008 16:04:42 +0000 Re: YN golf by shmem From the keyboard of Uri Guttman [31.03.08,17:50]:<br/><br/>&gt; <br/>&gt; someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>&gt; possible combos of YN (5 chars so 32 answers). he got some basic<br/>&gt; multiline answers but i think it makes for a great golf problem.<br/>&gt; <br/>&gt; here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>&gt; even squeezed out the white spaces but it does work.<br/>&gt; <br/>&gt; golf away!<br/>&gt; <br/>&gt; uri<br/>&gt; <br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0 .. 31&#39;<br/><br/>from mtve:<br/><br/> perl -le &#39;y/12/NY/-5||print for 0..2x5&#39;<br/><br/>(can&#39;t do any better...)<br/><br/>0--gg-<br/><br/>-- <br/>_($_=&quot; &quot;x(1&lt;&lt;5).&quot;?\n&quot;.q&middot;/)Oo. G&deg;\ /<br/> /\_&macr;/(q /<br/>---------------------------- \__(m.====&middot;.(_(&quot;always off the crowd&quot;)).&quot;&middot;<br/>&quot;);sub _{s./.($e=&quot;&#39;Itrs `mnsgdq Gdbj O`qkdq&quot;)=~y/&quot;-y/#-z/;$e.e &amp;&amp; print}<br/> http://www.nntp.perl.org/group/perl.fwp/2008/04/msg4104.html Fri, 11 Apr 2008 06:10:30 +0000 RE: YN golf by Allen, Greg <br/>Magoo! <br/><br/>-----Original Message-----<br/>From: John W. Krahn [mailto:krahnj@telus.net] <br/>Sent: Tuesday, April 01, 2008 7:17 AM<br/>To: fwp@perl.org<br/>Subject: Re: YN golf<br/><br/>Uri Guttman wrote:<br/>&gt; someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>&gt; possible combos of YN (5 chars so 32 answers). he got some basic<br/>&gt; multiline answers but i think it makes for a great golf problem.<br/>&gt; <br/>&gt; here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>&gt; even squeezed out the white spaces but it does work.<br/>&gt; <br/>&gt; golf away!<br/>&gt; <br/>&gt; uri<br/>&gt; <br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr),<br/>0 .. 31&#39;<br/><br/> perl -le&#39;y/01/NY/&amp;&amp;print,for+map+unpack(b5,chr),0..31&#39;<br/><br/><br/><br/>John<br/>-- <br/>Perl isn&#39;t a toolbox, but a small machine shop where you<br/>can special-order certain sorts of tools at low cost and<br/>in short order. -- Larry Wall<br/>- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -<br/><br/>This message is intended only for the personal and confidential use of the designated recipient(s) named above. If you are not the intended recipient of this message you are hereby notified that any review, dissemination, distribution or copying of this message is strictly prohibited. This communication is for information purposes only and should not be regarded as an offer to sell or as a solicitation of an offer to buy any financial product, an official confirmation of any transaction, or as an official statement of Lehman Brothers. Email transmission cannot be guaranteed to be secure or error-free. Therefore, we do not represent that this information is complete or accurate and it should not be relied upon as such. All information is subject to change without notice.<br/><br/><br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/04/msg4103.html Tue, 01 Apr 2008 03:37:14 +0000 Re: YN golf by John W. Krahn Uri Guttman wrote:<br/>&gt; someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>&gt; possible combos of YN (5 chars so 32 answers). he got some basic<br/>&gt; multiline answers but i think it makes for a great golf problem.<br/>&gt; <br/>&gt; here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>&gt; even squeezed out the white spaces but it does work.<br/>&gt; <br/>&gt; golf away!<br/>&gt; <br/>&gt; uri<br/>&gt; <br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0 .. 31&#39;<br/><br/> perl -le&#39;y/01/NY/&amp;&amp;print,for+map+unpack(b5,chr),0..31&#39;<br/><br/><br/><br/>John<br/>-- <br/>Perl isn&#39;t a toolbox, but a small machine shop where you<br/>can special-order certain sorts of tools at low cost and<br/>in short order. -- Larry Wall<br/> http://www.nntp.perl.org/group/perl.fwp/2008/04/msg4102.html Tue, 01 Apr 2008 02:19:31 +0000 Re: YN golf by Philippe Bruhat On Mon, Mar 31, 2008 at 11:37:05PM -0500, Chris Dolan wrote:<br/>&gt; On Mar 31, 2008, at 6:54 PM, Philippe Bruhat (BooK) wrote:<br/>&gt;&gt; On Mon, Mar 31, 2008 at 04:34:58PM -0700, Rick Klement wrote:<br/>&gt;&gt;&gt;<br/>&gt;&gt;&gt; perl -le &#39;print for glob&quot;{Y,N}&quot;x5&#39;<br/>&gt;&gt;&gt;<br/>&gt;&gt;<br/>&gt;&gt; Of course you have to run this in a directory that doesn&#39;t contain<br/>&gt;&gt; any file matching /^[YN]{5}$/.<br/>&gt;<br/>&gt; Not true. The {} notation doesn&#39;t care whether files of that name <br/>&gt; actual exist. I tested like so on Mac:<br/><br/>I was pretty sure I tried the same manipulation to obtain all permutations<br/>of some series of strings, and that it didn&#39;t work as expected when one of<br/>the permutations actually existed as a file in the current directory.<br/><br/>After a quick check, you are right and I was too. :-)<br/>What I tried was:<br/><br/> $ touch YYYYY<br/> $ perl -le &#39;print for glob&quot;[YN]&quot;x5&#39;<br/> YYYYY<br/><br/>I computed my permutations with square brackets (which is one character<br/>shorter, but more fragile).<br/><br/>-- <br/> Philippe Bruhat (BooK)<br/><br/> To flaunt your strength is to make it your weakness.<br/> (Moral from Groo The Wanderer #25 (Epic))<br/> http://www.nntp.perl.org/group/perl.fwp/2008/04/msg4101.html Tue, 01 Apr 2008 00:10:47 +0000 Re: YN golf by Chris Dolan On Mar 31, 2008, at 6:54 PM, Philippe Bruhat (BooK) wrote:<br/>&gt; On Mon, Mar 31, 2008 at 04:34:58PM -0700, Rick Klement wrote:<br/>&gt;&gt;<br/>&gt;&gt; perl -le &#39;print for glob&quot;{Y,N}&quot;x5&#39;<br/>&gt;&gt;<br/>&gt;<br/>&gt; Of course you have to run this in a directory that doesn&#39;t contain<br/>&gt; any file matching /^[YN]{5}$/.<br/><br/>Not true. The {} notation doesn&#39;t care whether files of that name <br/>actual exist. I tested like so on Mac:<br/><br/>% touch YYYYY<br/>% perl -le &#39;print for glob&quot;{Y,N}&quot;x5&#39;<br/>YYYYY<br/>YYYYN<br/>YYYNY<br/>YYYNN<br/>YYNYY<br/>YYNYN<br/>YYNNY<br/>YYNNN<br/>YNYYY<br/>YNYYN<br/>YNYNY<br/>YNYNN<br/>YNNYY<br/>YNNYN<br/>YNNNY<br/>YNNNN<br/>NYYYY<br/>NYYYN<br/>NYYNY<br/>NYYNN<br/>NYNYY<br/>NYNYN<br/>NYNNY<br/>NYNNN<br/>NNYYY<br/>NNYYN<br/>NNYNY<br/>NNYNN<br/>NNNYY<br/>NNNYN<br/>NNNNY<br/>NNNNN<br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4100.html Mon, 31 Mar 2008 21:37:16 +0000 Re: YN golf by Zed Lopez On Mon, Mar 31, 2008 at 05:50:24PM -0500, Uri Guttman wrote:<br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0 .. 31&#39;<br/><br/>Also, I&#39;m sure, easily bested:<br/><br/>perl -e &#39;for(0..31){$_=sprintf&quot;%05b\n&quot;,$_;y/01/YN/;print}<br/><br/>marginally shorter not-to-hoyle hybrid: <br/>seq 0 31|perl -pe &#39;$_=sprintf&quot;%05b\n&quot;,$_;y/01/YN/&#39;<br/><br/>--<br/>zed at-sign apricot dot com http://www.MemeMachineGo.com/<br/>Zed Lopez PO Box 12546 Berkeley CA 94712<br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4099.html Mon, 31 Mar 2008 17:28:20 +0000 Re: [spam?] YN golf by Josh Goldberg Knocking a few chars off yours for perl -le &#39;print<br/>join$/,map{y/01/NY/;$_}map unpack(&quot;b5&quot;,chr),0..31&#39;<br/><br/>and shrunk a few more by removing the join and using sprintf:<br/><br/>perl -le &#39;print map{y/10 /YN/;$_}map{sprintf&quot;%5b$/&quot;,$_}0..31&#39;<br/><br/>On Mon, Mar 31, 2008 at 3:50 PM, Uri Guttman &lt;uri@stemsystems.com&gt; wrote:<br/><br/>&gt;<br/>&gt; someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>&gt; possible combos of YN (5 chars so 32 answers). he got some basic<br/>&gt; multiline answers but i think it makes for a great golf problem.<br/>&gt;<br/>&gt; here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>&gt; even squeezed out the white spaces but it does work.<br/>&gt;<br/>&gt; golf away!<br/>&gt;<br/>&gt; uri<br/>&gt;<br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0<br/>&gt; .. 31&#39;<br/>&gt;<br/>&gt; --<br/>&gt; Uri Guttman ------ uri@stemsystems.com -------- http://www.sysarch.com--<br/>&gt; ----- Perl Code Review , Architecture, Development, Training, Support<br/>&gt; ------<br/>&gt; --------- Free Perl Training --- http://perlhunter.com/college.html---------<br/>&gt; --------- Gourmet Hot Cocoa Mix ---- http://bestfriendscocoa.com---------<br/>&gt;<br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4098.html Mon, 31 Mar 2008 17:28:19 +0000 Re: YN golf by Philippe Bruhat On Mon, Mar 31, 2008 at 04:34:58PM -0700, Rick Klement wrote:<br/>&gt; <br/>&gt; perl -le &#39;print for glob&quot;{Y,N}&quot;x5&#39;<br/>&gt; <br/><br/>Of course you have to run this in a directory that doesn&#39;t contain<br/>any file matching /^[YN]{5}$/.<br/><br/>-- <br/> Philippe Bruhat (BooK)<br/><br/> The learned man makes a mistake but once... but the truly stupid keep<br/> practicing until they get it right.<br/> (Moral from Groo The Wanderer #75 (Epic))<br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4097.html Mon, 31 Mar 2008 16:54:50 +0000 Re: YN golf by Rick Klement Uri Guttman wrote:<br/>&gt; <br/>&gt; someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>&gt; possible combos of YN (5 chars so 32 answers). he got some basic<br/>&gt; multiline answers but i think it makes for a great golf problem.<br/>&gt; <br/>&gt; here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>&gt; even squeezed out the white spaces but it does work.<br/>&gt; <br/>&gt; golf away!<br/>&gt; <br/>&gt; uri<br/>&gt; <br/>&gt; perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0 .. 31&#39;<br/>&gt; <br/>&gt; --<br/>&gt; Uri Guttman ------ uri@stemsystems.com -------- http://www.sysarch.com --<br/>&gt; ----- Perl Code Review , Architecture, Development, Training, Support ------<br/>&gt; --------- Free Perl Training --- http://perlhunter.com/college.html ---------<br/>&gt; --------- Gourmet Hot Cocoa Mix ---- http://bestfriendscocoa.com ---------<br/><br/>perl -le &#39;print for glob&quot;{Y,N}&quot;x5&#39;<br/><br/>-- <br/>Rick Klement<br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4096.html Mon, 31 Mar 2008 16:35:10 +0000 YN golf by Uri Guttman <br/>someone posted (and it wasn&#39;t homework) for an easy way to get all<br/>possible combos of YN (5 chars so 32 answers). he got some basic<br/>multiline answers but i think it makes for a great golf problem.<br/><br/>here is my first pass which i am sure can be easily bested. i haven&#39;t<br/>even squeezed out the white spaces but it does work.<br/><br/>golf away!<br/><br/>uri<br/><br/>perl -le &#39;print join &quot;\n&quot;, map {tr/01/NY/; $_} map unpack( &quot;b5&quot;, chr), 0 .. 31&#39;<br/><br/>-- <br/>Uri Guttman ------ uri@stemsystems.com -------- http://www.sysarch.com --<br/>----- Perl Code Review , Architecture, Development, Training, Support ------<br/>--------- Free Perl Training --- http://perlhunter.com/college.html ---------<br/>--------- Gourmet Hot Cocoa Mix ---- http://bestfriendscocoa.com ---------<br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4095.html Mon, 31 Mar 2008 15:50:56 +0000 Re: Name the secret operator by Michael G Schwern Philippe Bruhat (BooK) wrote:<br/>&gt; Hi,<br/>&gt; <br/>&gt; while doing some research for a talk on Perl secret operators, I tried<br/>&gt; to find who first coined the term &quot;secret operator&quot;.<br/>&gt; <br/>&gt; I found a post from Greg Allen on February 2004 on this very list<br/>&gt; (http://groups.google.com/group/perl.fwp/msg/e62668a760de1652),<br/>&gt; and then a post by Abigail on comp.lang.perl.misc on January 2003<br/>&gt; (http://groups.google.com/group/comp.lang.perl.misc/msg/22cfcd81a1521ec4).<br/>&gt; <br/>&gt; Does anyone know of an earlier occurence of the term?<br/><br/>Simon Cozens uses the term &quot;double-secret operator&quot; (a reference to Animal <br/>House) in a perl.com article on June 6, 2003. But it&#39;s not really an <br/>FWP-style secret operator and it&#39;s after Abigail anyway.<br/>http://www.perl.com/pub/a/2003/06/06/regexps.html<br/><br/><br/>-- <br/>Stabbing you in the face for your own good.<br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4094.html Thu, 20 Mar 2008 12:32:20 +0000 Name the secret operator by Philippe Bruhat Hi,<br/><br/>while doing some research for a talk on Perl secret operators, I tried<br/>to find who first coined the term &quot;secret operator&quot;.<br/><br/>I found a post from Greg Allen on February 2004 on this very list<br/>(http://groups.google.com/group/perl.fwp/msg/e62668a760de1652),<br/>and then a post by Abigail on comp.lang.perl.misc on January 2003<br/>(http://groups.google.com/group/comp.lang.perl.misc/msg/22cfcd81a1521ec4).<br/><br/>Does anyone know of an earlier occurence of the term?<br/><br/><br/>Now that I have seen Abigail&#39;s post on clpm, I want to know its name.<br/>It is the longest secret operator I&#39;ve seen, and also the only one<br/>that must be on three lines (not counting the content).<br/><br/>&lt;&lt;m=~m&gt;&gt; <br/> (commented out code and pod goes here)<br/>m<br/>;<br/><br/>I have a few ideas for names, but they don&#39;t fit very well, and do not<br/>describe the &quot;m\n;&quot; part of the operator. <br/><br/>-- <br/> Philippe Bruhat (BooK)<br/><br/> Putting beauty before brains is the surest way to wind up with neither.<br/> (Moral from Groo The Wanderer #24 (Epic))<br/> http://www.nntp.perl.org/group/perl.fwp/2008/03/msg4093.html Wed, 19 Mar 2008 17:28:34 +0000 Re: regex of the month (decade?) by Craig S. Cottingham On Jan 14, 2008, at 16:42, David Landgren wrote:<br/><br/>&gt; What&#39;s next, &quot;welcome to the re&auml;l world?&quot;<br/><br/>Well, no, because the &quot;a&quot; isn&#39;t pronounced as a separate vowel. On <br/>the other hand, we may start seeing references to El Camino Re&auml;l in <br/>Silicon Valley.<br/><br/>--<br/>Craig S. Cottingham<br/>craig.cottingham@gmail.com<br/><br/><br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4092.html Mon, 14 Jan 2008 23:44:04 +0000 Re: regex of the month (decade?) by Peter Makholm David Landgren &lt;david@landgren.net&gt; writes:<br/><br/>&gt; At that point, a vote was taken, and the result was that some other<br/>&gt; accented character like &yacute; or something made it in at the expense of<br/>&gt; &#156;. What did get in were the decidedly less useful &AElig; and &aelig; ligatures.<br/><br/>Quite interesting discussion over all, but also quite off topic, so I<br/>joined until now. Which is most usefull of the french oe-ligature or<br/>the scandinavian letter &aelig; I don&#39;t know.<br/><br/>But part of the explanation should be that the scandinavian delegates<br/>pushed for &#39;&aelig;&#39; to be accepted as a full letter and not just an<br/>ligature. This succedded and therefore &#39;&aelig;&#39; got included in iso-8859-1,<br/>and I&#39;m quite sure that iso-8859-15 makes the same distinction.<br/><br/>In english &#39;&aelig;&#39; is still considered a ligature.<br/><br/><br/>But as a non-ascii using european I prefere iso-8859-1(5) to<br/>unicode. Much easier to work with, but I havn&#39;t really had the need to<br/>mix different alphabets. <br/><br/><br/>&gt; but never saw it in use. Then last week I read two articles on two<br/>&gt; different web sites that used this convention. I found it quite<br/>&gt; jarring. What&#39;s next, &quot;welcome to the re&auml;l world?&quot;<br/><br/>But only if the pronouncation &#39;re-al&#39; would become widespread.<br/><br/>//Makholm<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4091.html Mon, 14 Jan 2008 21:29:38 +0000 Re: Umlaut by David Landgren sebb wrote:<br/>&gt; On 14/01/2008, John Douglas Porter &lt;johndporter@yahoo.com&gt; wrote:<br/>&gt;&gt; From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/>&gt;&gt;&gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally<br/>&gt;&gt; used in<br/>&gt;&gt;&gt; English, oftentimes replaced by a hyphen. In English, the dieresis<br/>&gt;&gt; is used on<br/>&gt;&gt;&gt; a second identical vowel to indicate a change in pronunciation of<br/>&gt;&gt; that vowel<br/>&gt;&gt;&gt; or indicate it is pronounced in a separate syllable. It is sometimes<br/>&gt;&gt; referred<br/>&gt;&gt;&gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo;<br/>&gt;&gt; diphthong. &raquo;<br/>&gt;&gt;&gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/>&gt; <br/>&gt; Also na&iuml;f and na&iuml;ve - non-identical vowels.<br/><br/>But isn&#39;t that because these two words are straight lifts from French, <br/>where they have this exact spelling? It&#39;s not an English thing in this case.<br/><br/>David<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4090.html Mon, 14 Jan 2008 14:49:56 +0000 Re: regex of the month (decade?) by David Landgren Chris Dolan wrote:<br/>&gt; On Jan 11, 2008, at 8:01 AM, David Landgren wrote:<br/>&gt; <br/>&gt;&gt; The benchmark may be flawed, since my appreciation of Unicode is <br/>&gt;&gt; little more than &quot;things went downhill after 7-bit ASCII&quot;.<br/>&gt; <br/>&gt; Haven&#39;t I read that you live in Paris? I figured that anyone who lives <br/>&gt; in a country whose dominant language was not fully expressible in ASCII <br/>&gt; would love Unicode.<br/><br/>I do, but then again French is fully expressible in Latin-1... except <br/>for the oe ligature (&oelig;).<br/><br/>I worked with a French programmer a few years back who spent much of his <br/>career working in government computing circles. Apparently when the <br/>national European computer organisations were thrashing out what <br/>characters should go where in the 128..255 high ASCII slots, his <br/>colleague at the time, who was representing France in one of the <br/>discussions, was out of the meeting having a coffee.<br/><br/>At that point, a vote was taken, and the result was that some other <br/>accented character like &yacute; or something made it in at the expense of &oelig;. <br/>What did get in were the decidedly less useful &AElig; and &aelig; ligatures.<br/><br/>&gt; On a major tangent, have others noticed the resurgence of the umlaut in <br/>&gt; printed English? I keep seeing things like co&ouml;peration or co&ouml;rdinates <br/>&gt; -- particularly in Technology Review, but in other publications on <br/>&gt; occasion too. Is that because it&#39;s *supposed* to be spelled that way, <br/>&gt; but ASCII and the typewriter have suppressed that spelling for my lifetime?<br/><br/>Funny you should mention that. I read about this first maybe twenty, <br/>twenty-five years ago proposed as &quot;the way things should really be&quot; but <br/>never saw it in use. Then last week I read two articles on two different <br/>web sites that used this convention. I found it quite jarring. What&#39;s <br/>next, &quot;welcome to the re&auml;l world?&quot;<br/><br/>David<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4089.html Mon, 14 Jan 2008 14:44:33 +0000 Re: Umlaut by Paul Johnson On Mon, Jan 14, 2008 at 05:16:42PM +0100, Georg Moritz wrote:<br/><br/>&gt; &gt;From the keyboard of sebb [14.01.08,12:21]:<br/><br/>&gt; &gt; The rule seems to be: second vowel of a pair=dieresis, otherwise umlaut.<br/>&gt; &gt; <br/>&gt; &gt; Any counter-examples?<br/>&gt; <br/>&gt; yup, two examples:<br/>&gt; <br/>&gt; German:<br/>&gt; &quot;ge&auml;rgert&quot; (been angry)<br/>&gt; - here the second vowel is an umlaut<br/>&gt; <br/>&gt; Quenya - the elven-tongue:<br/>&gt; &quot;&aacute;men anta s&iacute;ra ilaur&euml;a massamma&quot; (give us today the daily our-bread)<br/>&gt; - here the diaeresis is noted on the first vowel at &euml;a<br/><br/>Don&#39;t forget the burning of the B&ouml;&ouml;gg in Z&uuml;rich at S&auml;chsil&uuml;&uuml;te.<br/>http://de.wikipedia.org/wiki/Sechsel%C3%A4uten<br/><br/>-- <br/>Paul Johnson - paul@pjcj.net<br/>http://www.pjcj.net<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4088.html Mon, 14 Jan 2008 13:42:08 +0000 Re: Umlaut by Keith Ivey sebb wrote:<br/>&gt; The rule seems to be: second vowel of a pair=dieresis, otherwise umlaut.<br/><br/>I&#39;d call the symbol in &quot;Bront&euml;&quot; a dieresis, not an umlaut. Maybe: when <br/>the symbol indicates the vowel is to be pronounced further forward in <br/>the mouth, it&#39;s an umlaut; when it indicates the vowel is to be <br/>pronounced on its own in a normal way, it&#39;s a dieresis.<br/><br/>-- <br/>Keith C. Ivey &lt;keith@iveys.org&gt;<br/>Washington, DC<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4087.html Mon, 14 Jan 2008 08:16:53 +0000 Re: Umlaut by Georg Moritz From the keyboard of sebb [14.01.08,12:21]:<br/><br/>&gt; On 14/01/2008, John Douglas Porter &lt;johndporter@yahoo.com&gt; wrote:<br/>&gt; &gt;<br/>&gt; &gt; From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/>&gt; &gt; &gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally<br/>&gt; &gt; used in<br/>&gt; &gt; &gt; English, oftentimes replaced by a hyphen. In English, the dieresis<br/>&gt; &gt; is used on<br/>&gt; &gt; &gt; a second identical vowel to indicate a change in pronunciation of<br/>&gt; &gt; that vowel<br/>&gt; &gt; &gt; or indicate it is pronounced in a separate syllable. It is sometimes<br/>&gt; &gt; referred<br/>&gt; &gt; &gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo;<br/>&gt; &gt; diphthong. &raquo;<br/>&gt; &gt; &gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/>&gt; &gt;<br/>&gt; <br/>&gt; Also na&iuml;f and na&iuml;ve - non-identical vowels.<br/>&gt; <br/>&gt; &gt; I want to clarify (only because I myself was confused at first)<br/>&gt; &gt; that an umlaut can be used IN a diphthong, but does not have<br/>&gt; &gt; any function in MAKING a diphthong. For example, the<br/>&gt; &gt; German diphthong &quot;au&quot; becomes &quot;&auml;u&quot; due to umlaut, (or &quot;vowel<br/>&gt; &gt; shifting&quot;).<br/><br/>Correct (unless I&#39;m mistaken too ;-)<br/><br/>&gt; In summary, umlaut and dieresis/diaeresis (also trema) are both<br/>&gt; diacritical marks.<br/>&gt; <br/>&gt; Same symbol, but different meaning, where the meaning of the symbol<br/>&gt; depends on the context.<br/>&gt; <br/>&gt; The rule seems to be: second vowel of a pair=dieresis, otherwise umlaut.<br/>&gt; <br/>&gt; Any counter-examples?<br/><br/>yup, two examples:<br/><br/>German:<br/>&quot;ge&auml;rgert&quot; (been angry)<br/>- here the second vowel is an umlaut<br/><br/>Quenya - the elven-tongue:<br/>&quot;&aacute;men anta s&iacute;ra ilaur&euml;a massamma&quot; (give us today the daily our-bread)<br/>- here the diaeresis is noted on the first vowel at &euml;a<br/><br/>;-)<br/><br/>0--gg-<br/><br/>&gt; &gt; Unless I am mistaken.<br/>&gt; <br/>&gt; Same here...<br/>&gt; <br/>&gt; &gt; --<br/>&gt; &gt; John Douglas Porter<br/>&gt; &gt;<br/>&gt; &gt;<br/>&gt; &gt;<br/>&gt; &gt;<br/>&gt; &gt; ____________________________________________________________________________________<br/>&gt; &gt; Never miss a thing. Make Yahoo your home page.<br/>&gt; &gt; http://www.yahoo.com/r/hs<br/>&gt; &gt;<br/>&gt; <br/><br/>-- <br/>_($_=&quot; &quot;x(1&lt;&lt;5).&quot;?\n&quot;.q&middot;/)Oo. G&deg;\ /<br/> /\_&macr;/(q /<br/>---------------------------- \__(m.====&middot;.(_(&quot;always off the crowd&quot;)).&quot;&middot;<br/>&quot;);sub _{s,/,($e=&quot;&#39;Itrs `mnsgdq Gdbj O`qkdq&quot;)=~y/&quot;-y/#-z/;$e,e &amp;&amp; print}<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4086.html Mon, 14 Jan 2008 08:16:53 +0000 Re: Umlaut by sebb On 14/01/2008, John Douglas Porter &lt;johndporter@yahoo.com&gt; wrote:<br/>&gt;<br/>&gt; From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/>&gt; &gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally<br/>&gt; used in<br/>&gt; &gt; English, oftentimes replaced by a hyphen. In English, the dieresis<br/>&gt; is used on<br/>&gt; &gt; a second identical vowel to indicate a change in pronunciation of<br/>&gt; that vowel<br/>&gt; &gt; or indicate it is pronounced in a separate syllable. It is sometimes<br/>&gt; referred<br/>&gt; &gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo;<br/>&gt; diphthong. &raquo;<br/>&gt; &gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/>&gt;<br/><br/>Also na&iuml;f and na&iuml;ve - non-identical vowels.<br/><br/>&gt; I want to clarify (only because I myself was confused at first)<br/>&gt; that an umlaut can be used IN a diphthong, but does not have<br/>&gt; any function in MAKING a diphthong. For example, the<br/>&gt; German diphthong &quot;au&quot; becomes &quot;&auml;u&quot; due to umlaut, (or &quot;vowel<br/>&gt; shifting&quot;).<br/>&gt;<br/><br/>In summary, umlaut and dieresis/diaeresis (also trema) are both<br/>diacritical marks.<br/><br/>Same symbol, but different meaning, where the meaning of the symbol<br/>depends on the context.<br/><br/>The rule seems to be: second vowel of a pair=dieresis, otherwise umlaut.<br/><br/>Any counter-examples?<br/><br/>&gt; Unless I am mistaken.<br/><br/>Same here...<br/><br/>&gt; --<br/>&gt; John Douglas Porter<br/>&gt;<br/>&gt;<br/>&gt;<br/>&gt;<br/>&gt; ____________________________________________________________________________________<br/>&gt; Never miss a thing. Make Yahoo your home page.<br/>&gt; http://www.yahoo.com/r/hs<br/>&gt;<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4085.html Mon, 14 Jan 2008 04:22:06 +0000 Re: Umlaut by John Douglas Porter <br/>From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/>&gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally<br/>used in<br/>&gt; English, oftentimes replaced by a hyphen. In English, the dieresis<br/>is used on<br/>&gt; a second identical vowel to indicate a change in pronunciation of<br/>that vowel<br/>&gt; or indicate it is pronounced in a separate syllable. It is sometimes<br/>referred<br/>&gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo;<br/>diphthong. &raquo;<br/>&gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/><br/>I want to clarify (only because I myself was confused at first)<br/>that an umlaut can be used IN a diphthong, but does not have<br/>any function in MAKING a diphthong. For example, the<br/>German diphthong &quot;au&quot; becomes &quot;&auml;u&quot; due to umlaut, (or &quot;vowel<br/>shifting&quot;).<br/><br/>Unless I am mistaken.<br/><br/>-- <br/>John Douglas Porter<br/><br/><br/><br/><br/> ____________________________________________________________________________________<br/>Never miss a thing. Make Yahoo your home page. <br/>http://www.yahoo.com/r/hs<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4084.html Sun, 13 Jan 2008 19:12:16 +0000 Re: regex of the month (decade?) by Yanick Champoux Michael G Schwern wrote:<br/>&gt; &gt; Yanick Champoux wrote:<br/>&gt; <br/>&gt; &gt; *dieresis* or *di&aelig;resis [..]<br/>&gt; <br/>&gt; Really they just want to be more metal. Soon it will be &iexcl;K&ouml;mpUs&euml;rV.D&oslash;&oslash;M!<br/>&gt; <br/><br/> And to thing that, all those years, I laughed at Spinal Tap&#39;s claim <br/>to be avant-garde and visionary geniuses. They were right. All that <br/>time, they were right...<br/><br/>Jo&yuml;,<br/>`/anick<br/><br/><br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4083.html Sun, 13 Jan 2008 16:44:54 +0000 Re: Umlaut by Yanick Champoux Georg Moritz wrote:<br/>&gt; From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/>&gt; <br/>&gt;&gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally used in<br/>&gt;&gt; English, oftentimes replaced by a hyphen. In English, the dieresis is used on<br/>&gt;&gt; a second identical vowel to indicate a change in pronunciation of that vowel<br/>&gt;&gt; or indicate it is pronounced in a separate syllable. It is sometimes referred<br/>&gt;&gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo; diphthong. &raquo;<br/>&gt;&gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/>&gt;&gt; <br/>&gt;<br/>&gt; Actually the term &quot;umlaut&quot; in german denotes a &quot;shifted&quot; vowel. If you do<br/>&gt; a transition from &quot;u&quot; -&gt; &quot;e&quot; biased towards &quot;i&quot; and stopping in the middle,<br/>&gt; you have the &quot;&uuml;&quot;, which can be written as diphtong also: &quot;ue&quot;. The &quot;e&quot; in<br/>&gt; &quot;ue&quot; was often placed above the &quot;u&quot; in old german writing (where the &quot;e&quot;<br/>&gt; was written like &quot;n&quot;, but with a sharp bend instead of a curve before the<br/>&gt; last falling stroke). The four strokes necessary for that &quot;e&quot; were reduced<br/>&gt; to two, and those to dots, hence the two points above the &quot;&uuml;&quot;.<br/>&gt;<br/>&gt; So, the umlaut is a shortened form of a &quot;diphtongy&quot; denoting a shifted vowel,<br/>&gt; and *not* a diaeresis (&quot;ue&quot; is not a diphtong, but an umlaut ;-)<br/>&gt; <br/><br/> Yup, I became aware of that yesterday when I shared my new golden <br/>nugget of trivia with my wife. Being both German *and* a linguist, she <br/>promptly corrected my inaccuracies. But she left out the part about the <br/>&#39;e&#39; mutating into the two little dots that we know and love nowadays, <br/>which I think is the coolest part of the whole story, so thanks for <br/>that! :-)<br/><br/><br/>Joy,<br/>`/anick<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4082.html Sun, 13 Jan 2008 16:22:57 +0000 Re: regex of the month (decade?) by Michael G Schwern Yanick Champoux wrote:<br/>&gt; Chris Dolan wrote:<br/>&gt;&gt; On a major tangent, have others noticed the resurgence of the umlaut<br/>&gt;&gt; in printed English? I keep seeing things like co&ouml;peration or<br/>&gt;&gt; co&ouml;rdinates -- particularly in Technology Review, but in other<br/>&gt;&gt; publications on occasion too. Is that because it&#39;s *supposed* to be<br/>&gt;&gt; spelled that way, but ASCII and the typewriter have suppressed that<br/>&gt;&gt; spelling for my lifetime?<br/>&gt;&gt;<br/>&gt; <br/>&gt; A quick use of Google-fu unearthed a blog entry<br/>&gt; http://www.dwelle.org/archives/2007/01/05/whats-with-all-the-umlauts/,<br/>&gt; which in turn pointed to the page<br/>&gt; http://ourworld.compuserve.com/homepages/profirst/d.htm that says:<br/>&gt; <br/>&gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally used in<br/>&gt; English, oftentimes replaced by a hyphen. In English, the dieresis is<br/>&gt; used on a second identical vowel to indicate a change in pronunciation<br/>&gt; of that vowel or indicate it is pronounced in a separate syllable. It is<br/>&gt; sometimes referred to as an &laquo; umlaut &raquo; when used with a single character<br/>&gt; or in a &laquo; diphthong. &raquo; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration,<br/>&gt; co&ouml;rdination.<br/><br/>Because, ya know, I always get confused about how to pronounce &quot;reelecting&quot;.<br/>Thank god they cleared that right up! And with an easy to understand symbol<br/>that everyone knows about! :P<br/><br/>Really they just want to be more metal. Soon it will be &iexcl;K&ouml;mpUs&euml;rV.D&oslash;&oslash;M!<br/><br/><br/>-- <br/>There will be snacks.<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4081.html Sun, 13 Jan 2008 09:16:56 +0000 Re: regex of the month (decade?) by Eugene van der Pijll Yanick Champoux schreef:<br/>&gt; *dieresis* or *di&aelig;resis<br/>&gt; <br/>&gt; Well I, for one, never knew that such a thing existed. Neato! Too <br/>&gt; bad the name of the mark, though, which is definitively unfortunate.<br/><br/>According to the infallible Wikipedia, this diacritic is also called a<br/>trema. Only if used as a seperation mark, not as an umlaut.<br/><br/>HTH<br/><br/>Eugene<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4080.html Sun, 13 Jan 2008 05:38:33 +0000 Re: regex of the month (decade?) by Aristotle Pagaltzis * Chris Dolan &lt;chris@chrisdolan.net&gt; [2008-01-12 23:55]:<br/>&gt; I figured that anyone who lives in a country whose dominant<br/>&gt; language was not fully expressible in ASCII would love Unicode.<br/><br/>For bonus points, try writing, say, German (fully expressible<br/>with an ISO-8859 charset) and Greek (fully expressible[^1] with<br/>an ISO-8859 charset) in the same document.<br/><br/>[1]: Well, Modern Greek anyway.<br/><br/>Regards,<br/>-- <br/>Aristotle Pagaltzis // &lt;http://plasmasturm.org/&gt;<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4079.html Sat, 12 Jan 2008 23:21:45 +0000 Re: regex of the month (decade?) by Keith Ivey Chris Dolan wrote:<br/>&gt; Haven&#39;t I read that you live in Paris? I figured that anyone who lives <br/>&gt; in a country whose dominant language was not fully expressible in ASCII <br/>&gt; would love Unicode.<br/><br/>&quot;Not fully expressible&quot; seems mild to apply to writing French in ASCII <br/>(which after all has no diacritics). The phrase seems more appropriate <br/>for writing French in ISO-8859-1 (because of the lack of &quot;oe&quot; ligature).<br/><br/>-- <br/>Keith C. Ivey &lt;keith@iveys.org&gt;<br/>Washington, DC<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4078.html Sat, 12 Jan 2008 17:05:24 +0000 Re: regex of the month (decade?) by Dave Mitchell On Sun, Jan 13, 2008 at 12:23:35AM +0100, Georg Moritz wrote:<br/>&gt; Well, that&#39;s sort of quotemeta for the double o - differentiating e.g.<br/>&gt; double-o usage in cool vs. cooperation. I haven&#39;t seen that usage in<br/>&gt; english yet, but it&#39;s used in spanish to mark a vowel as literal, e.g. in <br/>&gt; &quot;Parque G&uuml;ell&quot;. <br/><br/>The only English word I think its commonly seen with is na&iuml;ve,<br/>to indicate that the ai isn&#39;t a digraph.<br/><br/><br/>-- <br/>&quot;But Sidley Park is already a picture, and a most amiable picture too.<br/>The slopes are green and gentle. The trees are companionably grouped at<br/>intervals that show them to advantage. The rill is a serpentine ribbon<br/>unwound from the lake peaceably contained by meadows on which the right<br/>amount of sheep are tastefully arranged.&quot; -- Lady Croom, &quot;Arcadia&quot;<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4077.html Sat, 12 Jan 2008 15:55:45 +0000 Umlaut (was: Re: regex of the month (decade?)) by Georg Moritz From the keyboard of Yanick Champoux [12.01.08,18:50]:<br/><br/>&gt; Chris Dolan wrote:<br/>&gt; &gt; On a major tangent, have others noticed the resurgence of the umlaut in<br/>&gt; &gt; printed English? I keep seeing things like co&ouml;peration or co&ouml;rdinates --<br/>&gt; &gt; particularly in Technology Review, but in other publications on occasion<br/>&gt; &gt; too. Is that because it&#39;s *supposed* to be spelled that way, but ASCII and<br/>&gt; &gt; the typewriter have suppressed that spelling for my lifetime?<br/>&gt; &gt; <br/>&gt; <br/>&gt; A quick use of Google-fu unearthed a blog entry<br/>&gt; http://www.dwelle.org/archives/2007/01/05/whats-with-all-the-umlauts/, which<br/>&gt; in turn pointed to the page<br/>&gt; http://ourworld.compuserve.com/homepages/profirst/d.htm that says:<br/>&gt; <br/>&gt; *dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally used in<br/>&gt; English, oftentimes replaced by a hyphen. In English, the dieresis is used on<br/>&gt; a second identical vowel to indicate a change in pronunciation of that vowel<br/>&gt; or indicate it is pronounced in a separate syllable. It is sometimes referred<br/>&gt; to as an &laquo; umlaut &raquo; when used with a single character or in a &laquo; diphthong. &raquo;<br/>&gt; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, co&ouml;rdination.<br/><br/>Actually the term &quot;umlaut&quot; in german denotes a &quot;shifted&quot; vowel. If you do<br/>a transition from &quot;u&quot; -&gt; &quot;e&quot; biased towards &quot;i&quot; and stopping in the middle,<br/>you have the &quot;&uuml;&quot;, which can be written as diphtong also: &quot;ue&quot;. The &quot;e&quot; in<br/>&quot;ue&quot; was often placed above the &quot;u&quot; in old german writing (where the &quot;e&quot;<br/>was written like &quot;n&quot;, but with a sharp bend instead of a curve before the<br/>last falling stroke). The four strokes necessary for that &quot;e&quot; were reduced<br/>to two, and those to dots, hence the two points above the &quot;&uuml;&quot;.<br/><br/>So, the umlaut is a shortened form of a &quot;diphtongy&quot; denoting a shifted vowel,<br/>and *not* a diaeresis (&quot;ue&quot; is not a diphtong, but an umlaut ;-)<br/><br/>&Ouml;--gg-<br/><br/>-- <br/>_($_=&quot; &quot;x(1&lt;&lt;5).&quot;?\n&quot;.q&middot;/)Oo. G&deg;\ /<br/> /\_&macr;/(q /<br/>---------------------------- \__(m.====&middot;.(_(&quot;always off the crowd&quot;)).&quot;&middot;<br/>&quot;);sub _{s,/,($e=&quot;&#39;Itrs `mnsgdq Gdbj O`qkdq&quot;)=~y/&quot;-y/#-z/;$e,e &amp;&amp; print}<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4076.html Sat, 12 Jan 2008 15:50:27 +0000 Re: regex of the month (decade?) by Georg Moritz From the keyboard of Chris Dolan [12.01.08,16:51]:<br/><br/>&gt; On Jan 11, 2008, at 8:01 AM, David Landgren wrote:<br/>&gt; <br/>&gt; &gt; The benchmark may be flawed, since my appreciation of Unicode is little more<br/>&gt; &gt; than &quot;things went downhill after 7-bit ASCII&quot;.<br/>&gt; <br/>&gt; Haven&#39;t I read that you live in Paris? I figured that anyone who lives in a<br/>&gt; country whose dominant language was not fully expressible in ASCII would love<br/>&gt; Unicode.<br/>&gt; <br/>&gt; On a major tangent, have others noticed the resurgence of the umlaut in<br/>&gt; printed English? I keep seeing things like co&ouml;peration or co&ouml;rdinates --<br/>&gt; particularly in Technology Review, but in other publications on occasion too.<br/>&gt; Is that because it&#39;s *supposed* to be spelled that way, but ASCII and the<br/>&gt; typewriter have suppressed that spelling for my lifetime?<br/><br/>Well, that&#39;s sort of quotemeta for the double o - differentiating e.g.<br/>double-o usage in cool vs. cooperation. I haven&#39;t seen that usage in<br/>english yet, but it&#39;s used in spanish to mark a vowel as literal, e.g. in <br/>&quot;Parque G&uuml;ell&quot;. <br/><br/>0--gg-<br/><br/>-- <br/>_($_=&quot; &quot;x(1&lt;&lt;5).&quot;?\n&quot;.q&middot;/)Oo. G&deg;\ /<br/> /\_&macr;/(q /<br/>---------------------------- \__(m.====&middot;.(_(&quot;always off the crowd&quot;)).&quot;&middot;<br/>&quot;);sub _{s,/,($e=&quot;&#39;Itrs `mnsgdq Gdbj O`qkdq&quot;)=~y/&quot;-y/#-z/;$e,e &amp;&amp; print}<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4075.html Sat, 12 Jan 2008 15:24:21 +0000 Re: regex of the month (decade?) by Yanick Champoux Chris Dolan wrote:<br/>&gt; On a major tangent, have others noticed the resurgence of the umlaut <br/>&gt; in printed English? I keep seeing things like co&ouml;peration or <br/>&gt; co&ouml;rdinates -- particularly in Technology Review, but in other <br/>&gt; publications on occasion too. Is that because it&#39;s *supposed* to be <br/>&gt; spelled that way, but ASCII and the typewriter have suppressed that <br/>&gt; spelling for my lifetime?<br/>&gt;<br/><br/> A quick use of Google-fu unearthed a blog entry <br/>http://www.dwelle.org/archives/2007/01/05/whats-with-all-the-umlauts/, <br/>which in turn pointed to the page <br/>http://ourworld.compuserve.com/homepages/profirst/d.htm that says:<br/><br/>*dieresis* or *di&aelig;resis *A diacritical mark (* &uml; *) optionally used in <br/>English, oftentimes replaced by a hyphen. In English, the dieresis is <br/>used on a second identical vowel to indicate a change in pronunciation <br/>of that vowel or indicate it is pronounced in a separate syllable. It is <br/>sometimes referred to as an &laquo; umlaut &raquo; when used with a single character <br/>or in a &laquo; diphthong. &raquo; Examples: re&euml;lecting, re&euml;ncoding, co&ouml;peration, <br/>co&ouml;rdination.<br/><br/><br/><br/> Well I, for one, never knew that such a thing existed. Neato! Too <br/>bad the name of the mark, though, which is definitively unfortunate.<br/><br/><br/>Jo&yuml;,<br/>`/anick<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4074.html Sat, 12 Jan 2008 15:19:52 +0000 Re: regex of the month (decade?) by Chris Dolan On Jan 11, 2008, at 8:01 AM, David Landgren wrote:<br/><br/>&gt; The benchmark may be flawed, since my appreciation of Unicode is <br/>&gt; little more than &quot;things went downhill after 7-bit ASCII&quot;.<br/><br/>Haven&#39;t I read that you live in Paris? I figured that anyone who <br/>lives in a country whose dominant language was not fully expressible <br/>in ASCII would love Unicode.<br/><br/>On a major tangent, have others noticed the resurgence of the umlaut <br/>in printed English? I keep seeing things like co&ouml;peration or <br/>co&ouml;rdinates -- particularly in Technology Review, but in other <br/>publications on occasion too. Is that because it&#39;s *supposed* to be <br/>spelled that way, but ASCII and the typewriter have suppressed that <br/>spelling for my lifetime?<br/><br/>Chris<br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4073.html Sat, 12 Jan 2008 14:51:16 +0000 Re: regex of the month (decade?) by David Landgren Michael G Schwern wrote:<br/><br/>&gt; And the reverse is now true in this highly inaccurate test...<br/>&gt; <br/>&gt; $ time perl5.8.8 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /[Tt]/ for<br/>&gt; 1..100000&#39;<br/>&gt; <br/>&gt; real 0m5.732s<br/>&gt; user 0m5.565s<br/>&gt; sys 0m0.027s<br/>&gt; <br/>&gt; $ time perl5.8.8 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /t/i for<br/>&gt; 1..100000&#39;<br/>&gt; <br/>&gt; real 0m2.589s<br/>&gt; user 0m2.544s<br/>&gt; sys 0m0.015s<br/><br/>And if I recall my perl510delta correctly, /i should be even faster on <br/>5.10.0. No, hang on, it&#39;s when UTF-8 strings are involved.<br/><br/>% time perl5.8.8 -Mutf8 -Mcharnames=:full -wle &#39;$foo = &quot;e\N{GREEK SMALL <br/>LETTER BETA}&quot; x 5000; $foo .= &quot;T&quot;; $foo =~ /t/i for 1..1000&#39;<br/><br/>real 0m22.855s<br/>user 0m22.827s<br/>sys 0m0.016s<br/><br/>% ./perl -v<br/>This is perl, v5.10.0 DEVEL32604 built for i386-freebsd-thread-multi<br/><br/>% time ./perl -Ilib -Mutf8 -Mcharnames=:full -wle &#39;$foo = &quot;e\N{GREEK <br/>SMALL LETTER BETA}&quot; x 5000; $foo .= &quot;T&quot;; $foo =~ /t/i for 1..1000&#39;<br/><br/>real 0m22.957s<br/>user 0m22.948s<br/>sys 0m0.001s<br/><br/>Well, look on the bright side. It&#39;s no worse.<br/><br/>The benchmark may be flawed, since my appreciation of Unicode is little <br/>more than &quot;things went downhill after 7-bit ASCII&quot;.<br/><br/>David<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4072.html Fri, 11 Jan 2008 06:01:55 +0000 Re: regex of the month (decade?) by Michael G Schwern S&eacute;bastien Aperghis-Tramoni wrote:<br/>&gt; Q: Why do you use /[Tt]/ instead of /t/i? It&#39;s so ugly!<br/>&gt; <br/>&gt; A: Simple - the case-insensitive match switch is very<br/>&gt; inefficient. According to &quot;Mastering Regular Expressions&quot;<br/>&gt; from O&#39;Reilly Press, /[Tt]/ is faster and more space<br/>&gt; efficient than /t/i - by as much as double against long<br/>&gt; strings. //i essentially does a lc() on the string and<br/>&gt; keeps a temporary copy in memory.<br/>&gt; <br/>&gt; When this changes, and it is in the 5.6 development series,<br/>&gt; I will gladly use //i. Believe me, I realize [Tt] is hideously<br/>&gt; ugly.<br/><br/>Looks like it was painfully true for 5.5...<br/><br/>$ time perl5.5.5 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /[Tt]/ for<br/>1..100000&#39;<br/><br/>real 0m4.882s<br/>user 0m4.761s<br/>sys 0m0.026s<br/><br/>$ time perl5.5.5 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /t/i for<br/>1..100000&#39;<br/><br/>real 0m40.656s<br/>user 0m39.587s<br/>sys 0m0.149s<br/><br/><br/>And the reverse is now true in this highly inaccurate test...<br/><br/>$ time perl5.8.8 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /[Tt]/ for<br/>1..100000&#39;<br/><br/>real 0m5.732s<br/>user 0m5.565s<br/>sys 0m0.027s<br/><br/>$ time perl5.8.8 -wle &#39;$foo = &quot;x&quot; x 10000; $foo .= &quot;T&quot;; $foo =~ /t/i for<br/>1..100000&#39;<br/><br/>real 0m2.589s<br/>user 0m2.544s<br/>sys 0m0.015s<br/><br/><br/><br/>-- <br/>&lt;Schwern&gt; What we learned was if you get confused, grab someone and swing<br/> them around a few times<br/> -- Life&#39;s lessons from square dancing<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4071.html Thu, 10 Jan 2008 19:51:06 +0000 Re: regex of the month (decade?) by Sébastien Aperghis-Tramoni Dan Collins wrote:<br/><br/>&gt; On Jan 7, 2008 5:06 PM, Uri Guttman &lt;uri@stemsystems.com&gt; wrote:<br/>&gt;<br/>&gt;&gt; ^([Pp]([Oo][Ss][Tt])?[.\s]*[Oo]([Ff][Ff][Ii][Cc][Ee])?[.\s]*[Bb][Oo]<br/>&gt;&gt; [Xx])|[Pp][Oo]([Bb]|[Xx]|[Dd][Rr][Aa][Ww][Ee][Rr]|[Ss][Tt][Oo][Ff] <br/>&gt;&gt; [Ff]<br/>&gt;&gt; [Ii][Cc][Ee]|[ ][Bb][Xx]|[Bb][Oo][Xx])|[Pp][/][Oo]|[Bb]([Xx]|[Oo] <br/>&gt;&gt; [Xx]|<br/>&gt;&gt; [Uu][Zz][Oo][Nn])|[Aa]([Pp][Aa][Rr][Tt][Aa][Dd][Oo]|[Pp][Tt][Dd][Oo])<br/>&gt;&gt;<br/>&gt;&gt; the challenge: itemize the stupidities. the case issue is only 1! i<br/>&gt;&gt; don&#39;t want to even post the &#39;spec&#39; unless asked for it. i saw this on<br/>&gt;&gt; usenet today.<br/>&gt;<br/>&gt; I don&#39;t even want to know what that&#39;s supposed to do.<br/>&gt; First, and most obviously, that should use /i.<br/><br/>You can also find code like this in HTML::Template. Sam Tregar <br/>explained the reason in the FAQ:<br/><br/> Q: Why do you use /[Tt]/ instead of /t/i? It&#39;s so ugly!<br/><br/> A: Simple - the case-insensitive match switch is very<br/> inefficient. According to &quot;Mastering Regular Expressions&quot;<br/> from O&#39;Reilly Press, /[Tt]/ is faster and more space<br/> efficient than /t/i - by as much as double against long<br/> strings. //i essentially does a lc() on the string and<br/> keeps a temporary copy in memory.<br/><br/> When this changes, and it is in the 5.6 development series,<br/> I will gladly use //i. Believe me, I realize [Tt] is hideously<br/> ugly.<br/><br/> &raquo; http://search.cpan.org/dist/HTML-Template/ <br/>Template.pm#FREQUENTLY_ASKED_QUESTIONS<br/><br/><br/>-- <br/>S&eacute;bastien Aperghis-Tramoni<br/><br/>Close the world, txEn eht nepO.<br/><br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4070.html Thu, 10 Jan 2008 18:32:26 +0000 Re: regex of the month (decade?) by David Landgren Stoll, Steven R. wrote:<br/>&gt; After solving the case sensitivity issue, separating the alternations, and<br/>&gt; solving the un-escaped /, here is what we are left with.<br/>&gt; <br/>&gt; (p(ost)?[.\s]*o(ffice)?[.\s]*box)<br/>&gt; po(b|x|drawer|stoffice|[ ]bx|box)<br/>&gt; p[\/]o<br/>&gt; b(x|ox|uzon)<br/>&gt; a(partado|ptdo)<br/><br/>If we unroll that to<br/><br/>post[.\s]*o(ffice)?[.\s]*box<br/>p[.\s]*o(ffice)?[.\s]*box<br/>pob<br/>pox<br/>podrawer<br/>postoffice<br/>po[ ]bx<br/>pobox<br/>p[\/]o<br/>bx<br/>box<br/>buzon<br/>apartado<br/>aptdo<br/><br/>reassembling it, we obtain<br/><br/>(?:p(?:o(?:st(?:[.\s]*o(ffice)?[.\s]*box|office)|(?: <br/>b)?x|b(?:ox)?|drawer)|[.\s]*o(ffice)?[.\s]*box|\/o)|ap(?:arta|t)do|b(?:uzon|o?x))<br/><br/>I&#39;m happy that they thought to check for &#39;pox&#39; as a shorthand to <br/>postoffice box. I&#39;ll remember to use that next time I need to address <br/>such a letter.<br/><br/>David<br/> http://www.nntp.perl.org/group/perl.fwp/2008/01/msg4069.html Tue, 08 Jan 2008 14:11:12 +0000