SitePoint Sponsor

User Tag List

Results 1 to 5 of 5
  1. #1
    SitePoint Enthusiast
    Join Date
    Feb 2008
    Posts
    66
    Mentioned
    1 Post(s)
    Tagged
    0 Thread(s)

    Extracting metadata (dates) from list of pages

    Hello,

    We have about 500 web pages that we need to extract data from. Each page contains the following metadata, but with a different date.

    <meta name="dcterms.issued" content="2005-12-23" />

    We want to extract the date from each file so that we know when the page was issued.

    Any help is appreciated.

    Thanks

  2. #2
    SitePoint Mentor bronze trophy
    John_Betong's Avatar
    Join Date
    Aug 2005
    Location
    City of Angels
    Posts
    1,882
    Mentioned
    74 Post(s)
    Tagged
    6 Thread(s)
    Quote Originally Posted by tcguy View Post
    Hello,

    We have about 500 web pages that we need to extract data from. Each page contains the following metadata, but with a different date.

    <meta name="dcterms.issued" content="2005-12-23" />

    We want to extract the date from each file so that we know when the page was issued.

    Any help is appreciated.

    Thanks
    @tcguy ;

    That was more tedious than I thought:

    http://www.johns-jokes.com/downloads/sp-b/tcguy/

    Where do I send my bill
    Learn how to be ready for The New Move to Discourse

    How to make Make Money Now with a *NEW* look

    Be sure to congratulate Wolfshade on earning Member of the Month for August 2014

  3. #3
    SitePoint Guru Webinsane's Avatar
    Join Date
    Oct 2005
    Location
    Montenegro
    Posts
    898
    Mentioned
    1 Post(s)
    Tagged
    0 Thread(s)
    Nice one John
    CUBE SCRIPTS MEDIA
    REAL ESTATE SCRIPT 2.0 | Software for Real Estate Agencies

  4. #4
    SitePoint Mentor bronze trophy
    John_Betong's Avatar
    Join Date
    Aug 2005
    Location
    City of Angels
    Posts
    1,882
    Mentioned
    74 Post(s)
    Tagged
    6 Thread(s)
    Quote Originally Posted by Webinsane View Post
    Nice one John
    Thank you I am pleased you like it.

    The OP posed the question and never returned

    I hope others find the code useful.
    Learn how to be ready for The New Move to Discourse

    How to make Make Money Now with a *NEW* look

    Be sure to congratulate Wolfshade on earning Member of the Month for August 2014

  5. #5
    SitePoint Member
    Join Date
    Oct 2013
    Posts
    1
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by John_Betong View Post
    Thank you I am pleased you like it.

    The OP posed the question and never returned

    I hope others find the code useful.
    It was to me. I am a new forum user, I joined for this answer.
    Please can you help me? I read that Dublin Core released its spec in the (achieved) attempt to conform with RDFa Lite.
    Does anyone know whether automated systems already exist with the purpose of extracting Schema: metadata with the native attributes????
    Otherwise it is better to drop the <meta@name> and move to @property.


Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •