<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Snake Soup</title>
	<atom:link href="http://www.sitepoint.com/blogs/2006/06/30/snake-soup/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/</link>
	<description>News, opinion, and fresh thinking for web developers and designers. The official podcast of sitepoint.com.</description>
	<lastBuildDate>Mon, 23 Nov 2009 09:18:42 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Anonymous</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-649695</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Fri, 07 Mar 2008 14:50:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-649695</guid>
		<description>&lt;b&gt;</description>
		<content:encoded><![CDATA[<p>&lt;b&gt;</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Andi</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-36730</link>
		<dc:creator>Andi</dc:creator>
		<pubDate>Wed, 12 Jul 2006 12:48:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-36730</guid>
		<description>Just to let you know: ForumMatrix now has Secunia advisories. WikiMatrix will follow when it&#039;s upgraded to the new software.</description>
		<content:encoded><![CDATA[<p>Just to let you know: ForumMatrix now has Secunia advisories. WikiMatrix will follow when it&#8217;s upgraded to the new software.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: HarryF</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-35341</link>
		<dc:creator>HarryF</dc:creator>
		<pubDate>Wed, 05 Jul 2006 15:14:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-35341</guid>
		<description>&lt;blockquote&gt;
Your code snippet thing doesn’t appear to actually support Python that well either
&lt;/blockquote&gt;

Thankyou ;) Actually - improvements appreciated.</description>
		<content:encoded><![CDATA[<blockquote><p>
Your code snippet thing doesn’t appear to actually support Python that well either
</p></blockquote>
<p>Thankyou ;) Actually &#8211; improvements appreciated.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Hal9k</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34837</link>
		<dc:creator>Hal9k</dc:creator>
		<pubDate>Mon, 03 Jul 2006 19:22:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34837</guid>
		<description>Another Python article! *yay*

Can&#039;t wait till Python 2.5!

It&#039;s such a shame this beautiful language doesn&#039;t get the attention it deserves.</description>
		<content:encoded><![CDATA[<p>Another Python article! *yay*</p>
<p>Can&#8217;t wait till Python 2.5!</p>
<p>It&#8217;s such a shame this beautiful language doesn&#8217;t get the attention it deserves.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: binjured</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34485</link>
		<dc:creator>binjured</dc:creator>
		<pubDate>Sat, 01 Jul 2006 20:24:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34485</guid>
		<description>Are my eyes playing tricks on me?  Python code in a Sitepoint article???  *marks calendar*

Really though, thanks for the info, this may come in handy!  I am just jaded due to the lack of Python coverage and specific forum here at SP ;) Your code snippet thing doesn&#039;t appear to actually support Python that well either :(</description>
		<content:encoded><![CDATA[<p>Are my eyes playing tricks on me?  Python code in a Sitepoint article???  *marks calendar*</p>
<p>Really though, thanks for the info, this may come in handy!  I am just jaded due to the lack of Python coverage and specific forum here at SP ;) Your code snippet thing doesn&#8217;t appear to actually support Python that well either :(</p>]]></content:encoded>
	</item>
	<item>
		<title>By: HarryF</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34282</link>
		<dc:creator>HarryF</dc:creator>
		<pubDate>Fri, 30 Jun 2006 23:31:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34282</guid>
		<description>&lt;blockquote&gt;
Do you happen to know if anything like this is available for PHP? :)
&lt;/blockquote&gt;

No exactly but html_tidy + &quot;something&quot; e.g. PEAR&#039;s XML_Serializer or SimpleXML (for DOM-like data structures) could be used to a similar effect, but you&#039;d be spending more time to searching data structures.

One thing that&#039;s nice about Beautiful soup is the tag search capabilities, for example consider this from the above script;

&lt;code&gt;
imgs = pre.findAll(name=&#039;img&#039;, src=&#039;images/arrright.gif&#039;, limit=5)
if len(imgs) &gt; 0: return page + 1
&lt;/code&gt;

That allowed me to hunt for img tags with attribute src=&#039;images/arrright.gif&#039; - these only exist on MARC if there are more than 30 messages list (&quot;Next page&quot; link basically) [side note - that limit=5 is redundant - should have changed that], so I can use it to check whether I need to fetch any more pages.

Its the search API most of all that makes BeautifulSoup attactive over something html_tidy based.</description>
		<content:encoded><![CDATA[<blockquote><p>
Do you happen to know if anything like this is available for PHP? :)
</p></blockquote>
<p>No exactly but html_tidy + &#8220;something&#8221; e.g. PEAR&#8217;s XML_Serializer or SimpleXML (for DOM-like data structures) could be used to a similar effect, but you&#8217;d be spending more time to searching data structures.</p>
<p>One thing that&#8217;s nice about Beautiful soup is the tag search capabilities, for example consider this from the above script;</p>
<code>
imgs = pre.findAll(name='img', src='images/arrright.gif', limit=5)
if len(imgs) &gt; 0: return page + 1
</code>
<p>That allowed me to hunt for img tags with attribute src=&#8217;images/arrright.gif&#8217; &#8211; these only exist on MARC if there are more than 30 messages list (&#8221;Next page&#8221; link basically) [side note - that limit=5 is redundant - should have changed that], so I can use it to check whether I need to fetch any more pages.</p>
<p>Its the search API most of all that makes BeautifulSoup attactive over something html_tidy based.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Fenrir2</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34182</link>
		<dc:creator>Fenrir2</dc:creator>
		<pubDate>Fri, 30 Jun 2006 15:41:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34182</guid>
		<description>html_tidy</description>
		<content:encoded><![CDATA[<p>html_tidy</p>]]></content:encoded>
	</item>
	<item>
		<title>By: coffee_ninja</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34156</link>
		<dc:creator>coffee_ninja</dc:creator>
		<pubDate>Fri, 30 Jun 2006 13:22:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34156</guid>
		<description>Do you happen to know if anything like this is available for PHP? :)</description>
		<content:encoded><![CDATA[<p>Do you happen to know if anything like this is available for PHP? :)</p>]]></content:encoded>
	</item>
	<item>
		<title>By: HarryF</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34108</link>
		<dc:creator>HarryF</dc:creator>
		<pubDate>Fri, 30 Jun 2006 07:01:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34108</guid>
		<description>Seems that could be done: http://www.crummy.com/software/BeautifulSoup/documentation.html#Replacing%20one%20Element%20with%20Another</description>
		<content:encoded><![CDATA[<p>Seems that could be done: <a href="http://www.crummy.com/software/BeautifulSoup/documentation.html#Replacing%20one%20Element%20with%20Another" rel="nofollow">http://www.crummy.com/software/BeautifulSoup/documentation.html#Replacing%20one%20Element%20with%20Another</a></p>]]></content:encoded>
	</item>
	<item>
		<title>By: Matthew Magain</title>
		<link>http://www.sitepoint.com/blogs/2006/06/30/snake-soup/comment-page-1/#comment-34030</link>
		<dc:creator>Matthew Magain</dc:creator>
		<pubDate>Thu, 29 Jun 2006 23:42:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.sitepoint.com/blogs/?p=1606#comment-34030</guid>
		<description>Even more beautiful would be if it converted

&lt;code&gt;&lt;i&gt;&lt;b&gt;Aargh!&lt;/i&gt;&lt;/b&gt;&lt;/code&gt;

to

&lt;code&gt;&lt;em&gt;
  &lt;strong&gt;
    Aargh!
  &lt;/strong&gt;
&lt;/em&gt;&lt;/code&gt;</description>
		<content:encoded><![CDATA[<p>Even more beautiful would be if it converted</p>
<code>&lt;i&gt;&lt;b&gt;Aargh!&lt;/i&gt;&lt;/b&gt;</code>
<p>to</p>
<code>&lt;em&gt;
  &lt;strong&gt;
    Aargh!
  &lt;/strong&gt;
&lt;/em&gt;</code>]]></content:encoded>
	</item>
</channel>
</rss>
