Tidy HTML

Share this article

John Coggeshall has posted this slides from the International PHP Conference here.

At first glance Tidy may seem like nothing more than a nice tool for the pedantic. At second glance I start to think Tidy may be the biggest new piece of functionality in PHP for a long time; one we’re going to be thanking John for again and again.

Take a look at this slide for example – Tidy is smart enough to be able to extract legacy HTML “styling” tags and convert them to CSS. Perhaps that should be no surprise, because the underlying C library originally began with Dave Raggett (father of the HTML

tag, among many other things).

There’s also other nice features like the Word 2000 mode which suggest a tool written for the real world.

Where Tidy gets more exciting for PHP, IMO, is it enables conversion of HTML to a format ready for XML parsing. To an extent, that means it’s almost irrelevant what the HTML your PHP scripts spit out looks like – Tidy can convert it (see ob_tidyhandler) to XHTML and from there it can be transformed further with XSLT.

Note there’s also a tutorial on Tidy @ Zend here.

Harry FuecksHarry Fuecks
View Author

Harry Fuecks is the Engineering Project Lead at Tamedia and formerly the Head of Engineering at Squirro. He is a data-driven facilitator, leader, coach and specializes in line management, hiring software engineers, analytics, mobile, and marketing. Harry also enjoys writing and you can read his articles on SitePoint and Medium.

Read Next
Get the freshest news and resources for developers, designers and digital creators in your inbox each week