develooper Front page | perl.libwww | Postings from October 2011

Re: HTML::Tagset - patch proposal for HTML 5 elements.

Thread Previous | Thread Next
From:
James Bromberger
Date:
October 24, 2011 01:41
Subject:
Re: HTML::Tagset - patch proposal for HTML 5 elements.
Message ID:
4EA5118C.4010703@rcpt.to
On 21/05/2011 8:12 AM, Andy Lester wrote:
>> My reason for raising this now is that the HTML5 spec elements are starting to appear on documents on the internet, and with these tags missing from HTML::Tagset, it impacts the effectiveness of other libraries that depend upon it.
> Haven't looked at the patch, but I think we need to talk about high-level thoughts about handling HTML5.  
>
> * Should we have separate sets of tags for HTML 4 and 5?
> * If so, should it be handled be one module, or should it get split into to?
> * Should we have strict and loose definitions of tags, going forward?  One of the things I've run into is tag attributes that are recognized by browsers, but not in the spec.  
> Thoughts?

I'm surprised that 5 months have slipped by and no one is interested in
discussing adding HTML5 entities to HTML::TagSet library, and the large
swathes of modules this will impact and/or (hopefully) improve.

Right now, trying to use HTML::TreeBuilder::XPath is futile when
processing HTML 5 documents, and there are probably other very good
libraries out there that are similarly incapacitated. I'm not saying
using HTML 5 now as a 'draft' is good practice, but browsers are
implementing it, and sites are using it. Last draft was published 25 May
2011, current editors version is dated 21 Oct 2011 (so only 3 days ago).
There's a core set of new elements that look like they are set to stay
(section, canvas, etc), so should HTML::TagSet include them now, and be
updated again for other major changes that come along?

I suggest that the major elements of HTML 5 are added to HTML::Tagset as
is.  At least we get some immediate relief to a burning issue.
HTML::TagSet can then be split at leisure into separate modules
(HTML::TagSet::HTML5, HTML::TagSet::HTML4, HTML::TagSet::XHTML, etc) and
a wrapper on top to combine them (and resolve conflicts between them)
over the next few years if required.


  James
-- 
/Mobile:/ +61 422 166 708, /Email:/ james_AT_rcpt.to


Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About