Extracting metadata (dates) from a list of pages

Hello,

We have about 500 web pages that we need to extract data from. Each page contains the following metadata, but with a different date.

<meta name="dcterms.issued" content="2005-12-23" />

We want to extract the date from each file so that we know when the page was issued.

Any help is appreciated.

Thanks

@tcguy;

That was more tedious than I thought:

http://www.johns-jokes.com/downloads/sp-b/tcguy/

Where do I send my bill? :slight_smile:
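
For readers who cannot reach the link, here is a minimal sketch of the general approach. It is not necessarily the code behind the link above. It assumes Python, only the standard library, and that the pages are saved locally as .html files in one folder. The names IssuedDateParser, extract_issued and extract_from_folder are made up for illustration.

```python
from html.parser import HTMLParser
from pathlib import Path


class IssuedDateParser(HTMLParser):
    """Remembers the content of the first <meta name="dcterms.issued" ...> tag."""

    def __init__(self):
        super().__init__()
        self.issued = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        # Self-closing tags (<meta ... />) also end up here by default.
        if tag != "meta" or self.issued is not None:
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "dcterms.issued":
            self.issued = attr.get("content")


def extract_issued(html):
    """Return the dcterms.issued date string from one page, or None if absent."""
    parser = IssuedDateParser()
    parser.feed(html)
    return parser.issued


def extract_from_folder(folder):
    """Map each file name to its issued date (assumed layout: *.html in one folder)."""
    results = {}
    for path in sorted(Path(folder).glob("*.html")):
        text = path.read_text(encoding="utf-8", errors="replace")
        results[path.name] = extract_issued(text)
    return results
```

Calling extract_from_folder("pages") on a folder named pages (a placeholder) would give a dictionary of file name to date string, with None for any page that lacks the tag. If the pages are only available online, they would need to be downloaded first; that step is left out here.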

Nice one John :slight_smile:

Thank you, I am pleased you like it.

The OP posed the question and never returned :frowning:

I hope others find the code useful.

It was useful to me. I am a new forum user; I joined for this answer.
Please can you help me? I read that Dublin Core released its spec in a (successful) attempt to conform with RDFa Lite.
Does anyone know whether automated systems already exist that extract Schema: metadata from the native attributes?
Otherwise it would be better to drop the name attribute on <meta> and move to the property attribute.
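
This does not answer whether such systems already exist, but as a rough sketch, a small script can read both styles side by side while pages are being migrated. It assumes Python's standard library, and the names MetaCollector and collect_meta are made up for illustration. It only collects the raw attribute values; it is not an RDFa processor and does not resolve prefixes or vocab declarations.

```python
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collects (attribute, key, content) for every <meta> with name or property."""

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Record which attribute carried the key, so both styles can be compared.
        for key_attr in ("name", "property"):
            if attr.get(key_attr) and attr.get("content") is not None:
                self.pairs.append((key_attr, attr[key_attr], attr["content"]))


def collect_meta(html):
    """Return all name/property meta entries found in one page, in document order."""
    parser = MetaCollector()
    parser.feed(html)
    return parser.pairs
```

For example, a page carrying both <meta name="dcterms.issued" ...> and a property-based equivalent (the exact property value, such as dcterms:issued, is an assumption here) would yield one entry for each, which makes it easy to check that the two agree.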