Relationship between Filenames and URLs

I am creating a website with lots of content similar to a newspaper.

For each article, I was planning on using natural English for the file name, e.g.

[I]Rising Gas Prices Stall Recover.html

Lawmakers Far Apart in Budget Standoff.html

How To Keep People From Stealing Your Ideas.html[/I]

What should be the relationship between my Articles and the Filenames and the URL?

I see a lot of websites that have URLs like this…

www.cnn.com/politics/lawmakers-far-apart-in-budget-standoff.html

(I believe that format is driven largely by SEO, but it is also more user-friendly.)

Debbie

I advise you to store the articles inside a database and to use mod_rewrite to rewrite your SEO-friendly URLs.
For example:

www.yoursite.com/123/Rising Gas Prices Stall Recover.html

should rewrite to something like this:

www.yoursite.com?articleid=123
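
To make the mapping concrete, here is a rough sketch of what such a rewrite does, written in Python rather than as an actual mod_rewrite rule. The pattern and the `rewrite` function name are my own assumptions for illustration; in practice the web server would do this step for you.

```python
import re
from typing import Optional

# Hypothetical pattern: "/<numeric id>/<any title>.html".
# Only the numeric ID is used; the title part is there for readers and search engines.
ARTICLE_PATH = re.compile(r"^/(\d+)/[^/]+\.html$")


def rewrite(path: str) -> Optional[str]:
    """Map a friendly path to the internal query-string URL, or None if it does not match."""
    match = ARTICLE_PATH.match(path)
    if match is None:
        return None
    return f"/?articleid={match.group(1)}"


if __name__ == "__main__":
    print(rewrite("/123/rising-gas-prices-stall-recover.html"))
```

Because only the ID is read, the title portion can change later without breaking the lookup.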

In my systems I have a “pagename” field in the database. This way I can leave the ID out of the URL, along with the entity type if you have more than one (e.g. article, product). The pagename stores the unique part of the URL, e.g. ‘rising-gas-prices-stall-recover’.

In your logic you would have to check that pagenames are unique across all your entities, and you may want to set up some kind of pagename forwarding so that an old pagename still leads to the right page after it changes.
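
A minimal sketch of the uniqueness check and forwarding idea, using in-memory dictionaries in place of the database. The `PagenameRegistry` class, its method names, and the "-2" suffix scheme are all assumptions made for the example, not how any particular system does it.

```python
from typing import Dict, Optional, Tuple


class PagenameRegistry:
    """Toy stand-in for a database table with a unique 'pagename' column."""

    def __init__(self) -> None:
        self._pages: Dict[str, Tuple[str, int]] = {}  # pagename -> (entity, id)
        self._forwards: Dict[str, str] = {}           # old pagename -> current pagename

    def register(self, pagename: str, entity: str, entity_id: int) -> str:
        """Store a pagename, appending -2, -3, ... if it is already taken by any entity."""
        candidate = pagename
        suffix = 2
        while candidate in self._pages or candidate in self._forwards:
            candidate = f"{pagename}-{suffix}"
            suffix += 1
        self._pages[candidate] = (entity, entity_id)
        return candidate

    def rename(self, old: str, new: str) -> str:
        """Give a page a new pagename and keep the old one as a forward."""
        entity, entity_id = self._pages.pop(old)
        final = self.register(new, entity, entity_id)
        # Re-point any earlier forwards that targeted the old name.
        for source, target in self._forwards.items():
            if target == old:
                self._forwards[source] = final
        self._forwards[old] = final
        return final

    def resolve(self, pagename: str) -> Optional[Tuple[str, str, int]]:
        """Return (current pagename, entity, id), following a forward if there is one."""
        current = self._forwards.get(pagename, pagename)
        if current not in self._pages:
            return None
        entity, entity_id = self._pages[current]
        return current, entity, entity_id
```

If `resolve` returns a pagename different from the one requested, the site would normally answer with a permanent redirect to the current URL instead of serving the page under the old one.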

@nihcer - www.yoursite.com?articleid=123 is not a search engine friendly URL. That is the basic URL with a variable that most content management systems produce if you don’t turn on URL rewriting to make the URLs search engine friendly. These days Google/Bing/etc. can read most types of URL structure, but it is generally better to have the article keywords/title in the URL, for both SEO and actual people.

@DoubleDee - The common relationship between your article and filenames is that the title (up to a certain length) gets converted to the filename with hyphens (-) instead of spaces. Spaces should not be in filenames on the web, as they can break things and confuse people and search bots. It is also common to have the content category as the preceding folder name, as this is often a good keyword which is useful to have there for SEO, e.g. politics, sport, music, etc.
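
Here is one way the title-to-filename conversion could look. It is only a sketch: the function names, the 60-character limit, and the choice to drop anything that is not a plain letter or digit are my assumptions, so adjust them to taste.

```python
import re


def slugify(title: str, max_length: int = 60) -> str:
    """Lowercase the title and replace runs of other characters with single hyphens."""
    slug = title.lower()
    # Assumption: anything that is not a-z or 0-9 (spaces, punctuation, accents) becomes a hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", slug).strip("-")
    if len(slug) > max_length:
        cut = slug[:max_length]
        # Avoid chopping a word in half: back up to the last complete word.
        if slug[max_length] != "-" and "-" in cut:
            cut = cut.rsplit("-", 1)[0]
        slug = cut.strip("-")
    return slug


def article_url(category: str, title: str) -> str:
    """Build a path with the category as the folder and the slugged title as the filename."""
    return f"/{slugify(category)}/{slugify(title)}.html"


if __name__ == "__main__":
    print(article_url("Politics", "Lawmakers Far Apart in Budget Standoff"))
```

For the example title this gives /politics/lawmakers-far-apart-in-budget-standoff.html, which matches the style of URL in the original question.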

The .html at the end is a hangover from the days of a simple page-based web that was not database driven like it is today. There was a theory that search engines prefer pages that end in .html because then they know it is a page rather than a directory. Modern search engines are far cleverer than that, although the .html does no harm either.

Basically - the best way to go for URL structure is something like:
http://www.example.com/category/the-title-of-your-article-which-probably-contains-keywords/ - this is how WordPress does it, and Google works well with that.
or
http://www.example.com/category/the-title-of-your-article-which-probably-contains-keywords.html - this works just as well.