I get a 404 when a file or the path has non-Latin characters in it. Examples:
http://example.com/Århus/some_image.jpg
http://example.com/beer/øl.jpg
Both of these result in a 404 even though I know the path exists. The access logs say this (for the top one above):
86.30.140.183 - - [18/Apr/2011:10:04:12 -0700] "GET /%C3%85rhus/some_image.jpg HTTP/1.1" 404 516 "http://example.com/%C3%85rhus/some_image.jpg" "Mozilla/5.0 (Windows NT 6.0; rv:2.0) Gecko/20100101 Firefox/4.0"
What I actually see in the browser’s 404 message is this:
http://example.com/Ãrhus/some_image.jpg
which is why it’s resulting in a 404 - the “Å” character is being turned into something else. I tried adding this to my .htaccess file to no avail:
AddDefaultCharset UTF-8
This is only happening on my webserver (shared hosting, so no access to httpd.conf and friends). On XAMPP on Windows I don’t have this problem.
Any ideas how to get Apache to deal with this problem?