Scan website for RSS feeds

tehKing · November 23, 2006, 4:08pm

I would like a function that can scan a website for RSS feeds.
I guess the easiest way would be to find the
<link rel="alternate" href="rss.xml" type="application/rss+xml"> tag.

But I don’t know where to start, except maybe with file_get_contents("http://website/");

Help is really appreciated.

Mr_Money · November 23, 2006, 8:29pm

just scrape the page contents with a regexp


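$siteUrl = 'http://foo.com';
$feeds = array();
$matches = array();
// Match each <link> tag whose type is application/rss+xml. [^>]* keeps the match inside
// a single tag, and \1 refers back to the opening quote (in a pattern, $2 is not a backreference).
if( preg_match_all( '/<link\s[^>]*type=([\'"]?)application\/rss\+xml\1[^>]*>/is', file_get_contents($siteUrl), $matches ) ) {
    // Pull the quoted href value out of every matched tag.
    preg_match_all( '/href=([\'"])(.*?)\1/is', implode($matches[0]), $feeds );
}
//$feeds[2] is an array of the feeds on the page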
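
A regexp depends on how each site writes its attributes, so a parser-based variant may hold up better over time. The following is only a minimal sketch, assuming PHP's bundled DOM extension is enabled; findFeedLinks is a made-up name for illustration, and requiring rel="alternate" is an assumption taken from the tag in the first post.

```php
<?php
// Hypothetical helper (not from this thread): returns the href of every
// <link rel="alternate" type="application/rss+xml"> found in $html.
function findFeedLinks(string $html): array
{
    $feeds = array();
    if (trim($html) === '') {
        return $feeds; // DOMDocument::loadHTML() rejects empty input
    }

    $doc = new DOMDocument();
    // Real pages are rarely valid HTML, so collect parser warnings quietly.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    foreach ($doc->getElementsByTagName('link') as $link) {
        $type = strtolower(trim($link->getAttribute('type')));
        $rel  = strtolower($link->getAttribute('rel'));
        $href = trim($link->getAttribute('href'));

        // Assumption: only RSS links marked rel="alternate" count as feeds.
        if ($href !== '' && $type === 'application/rss+xml' && strpos($rel, 'alternate') !== false) {
            $feeds[] = $href;
        }
    }

    return $feeds;
}
```

It could be called as findFeedLinks(file_get_contents($siteUrl)). The hrefs come back exactly as written in the page, so a relative value such as rss.xml would still need to be resolved against $siteUrl.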

Forbes · January 24, 2011, 9:06pm

Mr_Money:

just scrape the page contents with a regexp

Hi!

I know this is an old thread, but it’s still relevant.

I’ve just tried the code and I can’t get it to work with any of the websites I’m running it against.

Might that be due to the passing of time?

If so, could you please post updated regular expressions?

Thanks!
