  1. #1
    -PET- (SitePoint Enthusiast)
    Join Date
    Apr 2006
    Location
    Timisoara/Romania
    Posts
    45

    Help me to "think" better on a big website

    Hello boys and girls...but mostly the girls :P

    I have to build a website that gathers XML feeds and shows their items by category, something like Google News.

    This project is pretty big, so I want to think it through from the beginning.

    How would you suggest I structure it? I'm not asking for code, just links and tools to use.

    Thanks

  2. #2
    php_daemon
    Join Date
    Mar 2006
    Posts
    5,284
    You expect to find girls on the php forums?

    It's not too big if you ask me; reading XML data and storing it in a database is straightforward.

    Are there any specific problems you've run into? An undefined format for the XML feeds? Complex calculations on huge amounts of data?
    Saul

  3. #3
    Cups (SitePoint Wizard)
    Join Date
    Oct 2006
    Location
    France, deep rural.
    Posts
    6,869
    I think your biggest problem will be the categorisation. This will involve adding and managing metadata about each feed and each item.

    Who will do that, and what tools will they need? How can you make those tools easier for them to use, and can you semi-automate any of it? Are you going to use a controlled vocabulary to manage the categorisation? Will the categorisation be polyhierarchical? And how will you manage quality control from the outset?

    My 2 cents.
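
To make the controlled-vocabulary and polyhierarchy questions concrete, here is a minimal sketch in Python with SQLite. It is only one possible layout, not something anyone in this thread has settled on. The table names `category` and `category_parent` and the helper functions are hypothetical.

```python
import sqlite3

SCHEMA = """
CREATE TABLE category (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE      -- controlled vocabulary: one row per allowed term
);
CREATE TABLE category_parent (     -- polyhierarchy: a category may have several parents
    child_id  INTEGER NOT NULL REFERENCES category(id),
    parent_id INTEGER NOT NULL REFERENCES category(id),
    PRIMARY KEY (child_id, parent_id)
);
"""


def open_vocabulary(path=":memory:"):
    """Create the (hypothetical) vocabulary tables and return the connection."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def add_category(conn, name, parents=()):
    """Add a term; every parent must already exist in the vocabulary."""
    child_id = conn.execute(
        "INSERT INTO category (name) VALUES (?)", (name,)
    ).lastrowid
    for parent in parents:
        row = conn.execute(
            "SELECT id FROM category WHERE name = ?", (parent,)
        ).fetchone()
        if row is None:
            raise ValueError(f"unknown parent category: {parent}")
        conn.execute(
            "INSERT INTO category_parent (child_id, parent_id) VALUES (?, ?)",
            (child_id, row[0]),
        )
    conn.commit()
    return child_id


def parents_of(conn, name):
    """Return the names of all direct parents of a category, sorted."""
    rows = conn.execute(
        "SELECT p.name FROM category c"
        " JOIN category_parent cp ON cp.child_id = c.id"
        " JOIN category p ON p.id = cp.parent_id"
        " WHERE c.name = ? ORDER BY p.name",
        (name,),
    ).fetchall()
    return [r[0] for r in rows]
```

The `UNIQUE` constraint is what keeps the vocabulary controlled: an editor cannot create a second category with the same name.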

  4. #4
    -PET- (SitePoint Enthusiast)
    Well, the data will be added manually, since each news item must be checked to see which category it fits.

  5. #5
    -PET- (SitePoint Enthusiast)
    Also, I have to figure out a way to make the links look something like this:

    http://www.mysite.com/category-name/...he-toilet.html
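
One way to get links of that shape is to derive a "slug" from the category name and the headline. The sketch below is in Python and only shows the idea. The function names `slugify` and `article_url` are made up for illustration, and the `.html` suffix is copied from the example link above. Mapping such a URL back to the stored article (for instance with the web server's rewrite rules) is a separate step not shown here.

```python
import re
import unicodedata


def slugify(text: str) -> str:
    """Turn a headline into a lowercase, hyphen-separated URL segment."""
    # Decompose accented characters, then drop anything that is not ASCII.
    ascii_text = (
        unicodedata.normalize("NFKD", text)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    # Replace every run of non-alphanumeric characters with a single hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", ascii_text.lower())
    # Remove hyphens left over at either end.
    return slug.strip("-")


def article_url(base: str, category: str, title: str) -> str:
    """Build base/category-slug/title-slug.html (hypothetical URL layout)."""
    return f"{base.rstrip('/')}/{slugify(category)}/{slugify(title)}.html"
```

Two different headlines can produce the same slug, so in practice the slug is usually stored with the article and checked for uniqueness when the item is saved.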

  6. #6
    -PET- (SitePoint Enthusiast)
    Anyway, I don't know how other sites that gather news do it, but I think this has to be done manually: not the parsing, just the adding of the news.

    I was thinking of a page that lists some RSS feeds. I click on a feed and:

    1. my system starts gathering the items
    2. then it checks whether any of the items already exist
    3. it displays the news that I don't have in the database yet
    4. then I can assign each news item to a category, plus other options...
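
The four steps above could be sketched roughly like this, in Python with SQLite. Everything here is an assumption made for illustration: the `news` table, the function names, and the choice of the item's link as the key that decides whether an item "already exists" (a feed's `guid` would be another candidate).

```python
import sqlite3


def open_db(path=":memory:"):
    """Open the database and create the (hypothetical) news table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS news ("
        " link TEXT PRIMARY KEY,"
        " title TEXT NOT NULL,"
        " category TEXT NOT NULL)"
    )
    return conn


def unseen_items(conn, fetched):
    """Steps 2 and 3: keep only the fetched items whose link is not stored yet."""
    result = []
    for item in fetched:
        row = conn.execute(
            "SELECT 1 FROM news WHERE link = ?", (item["link"],)
        ).fetchone()
        if row is None:
            result.append(item)
    return result


def save_item(conn, item, category):
    """Step 4: store an item once the editor has picked its category."""
    conn.execute(
        "INSERT INTO news (link, title, category) VALUES (?, ?, ?)",
        (item["link"], item["title"], category),
    )
    conn.commit()
```

Step 1 (gathering) would produce the `fetched` list of dictionaries; the review page would then show `unseen_items(conn, fetched)` and call `save_item` for each item the editor approves.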

    What do you think?
    Also, how do you suggest I parse the XML? I found a parser, MagpieRSS, but it can't show some of the XML files, and I don't know why...
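
For comparison, here is a minimal parsing sketch using only Python's standard library. It is not MagpieRSS and does not explain why MagpieRSS fails on particular feeds. It assumes plain RSS 2.0 with un-namespaced `item`, `title` and `link` elements, so Atom or RSS 1.0 (RDF) feeds would yield no items. It returns `None` when the document is not well-formed XML, which at least separates "broken feed" from "empty feed". The function name `parse_rss` is made up.

```python
import xml.etree.ElementTree as ET


def parse_rss(xml_text):
    """Return [{'title': ..., 'link': ...}, ...] for an RSS 2.0 document.

    Returns None if the text is not well-formed XML.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None

    items = []
    # RSS 2.0 keeps its entries in <item> elements under <channel>.
    for item in root.iter("item"):
        items.append(
            {
                "title": (item.findtext("title") or "").strip(),
                "link": (item.findtext("link") or "").strip(),
            }
        )
    return items
```

Logging the feeds for which this returns `None` or an empty list is one way to start finding out what the problem feeds have in common.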

