SitePoint Sponsor

User Tag List

Results 1 to 2 of 2
  1. #1
    SitePoint Member
    Join Date
    Feb 2009
    Location
    Australia / Thailand / USA
    Posts
    5
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)

    Advice on moving content.

    One of my clients currently has a site that is rather large with many pages, links to pdfs and external links.

    They are currently using a CMS called MySource Matrix. I am redeveloping the new site (no CMS) on a dedicated server. I do have access to the back end of the current CMS but this thing is impossible to extract anything meaningful from. I also have no root access or any access to the server itself. It's a complete mess.

    So, my question is... Is there any way to extract or build a hierarchy of each and every page (in essence a site map) and also extract all linked PDFs (hopefully maintaining some form of link to the parent page)?

    Is there any method or software package to perform such a task?

    I am desperate. Please, any ideas at all.

    Thanks,
    ozmo

  2. #2
    SitePoint Enthusiast
    Join Date
    Nov 2003
    Location
    Brisbane, QLD
    Posts
    89
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    I am not certain whether this will help in this instance but using wget (a linux command line program) might help. Here is a link to a site for wget.

    http://linuxreviews.org/quicktips/wget/

    Regards,
    Colin
    Colin Burns
    http://www.cmsadvantage.com
    Founder & CEO, cmsadvantage
    The premier CMS for Web & Graphic Designers


Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •