Tidy HTML

    Harry Fuecks
    Share

    John Coggeshall has posted this slides from the International PHP Conference here.

    At first glance Tidy may seem like nothing more than a nice tool for the pedantic. At second glance I start to think Tidy may be the biggest new piece of functionality in PHP for a long time; one we’re going to be thanking John for again and again.

    Take a look at this slide for example – Tidy is smart enough to be able to extract legacy HTML “styling” tags and convert them to CSS. Perhaps that should be no surprise, because the underlying C library originally began with Dave Raggett (father of the HTML

    tag, among many other things).

    There’s also other nice features like the Word 2000 mode which suggest a tool written for the real world.

    Where Tidy gets more exciting for PHP, IMO, is it enables conversion of HTML to a format ready for XML parsing. To an extent, that means it’s almost irrelevant what the HTML your PHP scripts spit out looks like – Tidy can convert it (see ob_tidyhandler) to XHTML and from there it can be transformed further with XSLT.

    Note there’s also a tutorial on Tidy @ Zend here.