develooper Front page | perl.perl5.porters | Postings from January 2010

utf8 growth on the web.

Thread Previous | Thread Next
From:
karl williamson
Date:
January 29, 2010 08:54
Subject:
utf8 growth on the web.
Message ID:
4B63129A.4060800@khwilliamson.com
Google just published this graph of the growth of utf8-encoded web 
pages.  I thought that since perl5 is the "duct tape of the internet", 
people here might be interested.

I don't know how they distinguish between ascii and utf8 web pages.  The 
encodings that web pages say they are in are often wrong, so Google has 
developed an algorithm to (quickly) figure out the correct value, and 
the graph is based on that.

If you don't want to look at the link, the bottom line is that it shows 
a nearly exponential growth rate of utf8, at the expense of every other 
encoding, but particularly ascii and latin1/15 (which had the most to 
lose anyway), with the combination of ascii+utf8 being about 2/3, and 
utf8 alone just under half of all web pages.

http://4734020732036341599-a-1802744773732722657-s-sites.googlegroups.com/site/macchiato/main/growth_of_unicode_on_the_web-1.png?attachauth=ANoY7cpr8IquuulNCyJSN9F0TYmoWJMgwzzfLH4PPyHEPNJkK8LTFxfM43qJ0ptkDAwmabo8YRTer3PlqhcU6acPTmMFOUkCZRUUj3ur36I5DSbnx_b1RjStHvm-igbhuJzHgPQxDNbR9SD_T5N1AY1zKg4NnOU3Ugdy281Ljj_GIX0IIuhvFsAsI3eXKigPPCAb3GLuHB2nmbMQC22vqDWhJ_SQ1-tS4LQ0k2yvxmTcCQG5lYmHVXs%3D&attredirects=0

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About