Spamproof Your Site

Tweet

Anyone who operates their own Website knows that you need to provide a way for visitors to contact you by email. The big challenge is to provide easy email access to your visitors, without letting junk mail flood your inbox. The techniques described in this article have enabled me to dramatically reduce the amount of junk mail I receive through all my sites.

Preparing and Preempting

You need a couple things before you can really take effective action against spam. Your email software must be capable of filtering incoming email — all the major email applications (such as Eudora, Outlook, and Pegasus) support this functionality. We’ll use multiple email addresses to allow us to filter out spam and identify the source — you can’t combat spam effectively without filtering.

You’ll also need to use a Web host that provides unlimited email aliases or addresses, and/or a catch-all email address. An "alias" is an email address that forwards to some other address (for example, webmaster@domain.com forwarding to your real email address). A "catch-all" email address will forward any emails sent to unknown addresses in your domain.

For my own Websites I use the catch-all, so that every message goes to my real email address. If you have more than a one-person operation, however, multiple email accounts and aliases are pretty much a necessity.

Fighting Back

The first step in fighting back against spammers is to understand where they found your email address. You must diligently protect your email address if you ever hope to stop them. Once your email address falls into the wrong hands, it will be sold on CD-ROM (via junk mail, of course) to thousands of spammers. And once that happens, you’ve lost the fight.

Spam Source #1: Domain Name Registrations

When you register a domain name, you must provide a contact email address. If you give them your real email address, you’ve just given it to everyone, including the spammers. Instead, use a portable email address (like Hotmail) to set up your domain.

If you have multiple domains, you can also use an alias (domains@yourdomain.com) on your primary domain for all registrations. With an alias, you can use your email software to filter out and save any emails that come to that address from your registrar’s domain.

Spam Source #2: Web Forms & Email Newsletters

If you give your real email address on any Web form, or use it to subscribe to an email newsletter, you’re asking for trouble. Instead, create a unique email address for each Website or newsletter. I just use the Website’s domain name for this.

For example, if you subscribe to the SitePoint Tribune as "sitepoint.com@yourdomain.com" and let your catch-all address route it to you, you will always know where the email came from. If that address ever starts receiving junk mail, you can filter it out using your email software.

If you submit to search engines or free-for-all links pages (FFA’s), use a unique email address every time. FFAs, in particular, are famous for flooding the world with junk mail. Once you’ve given an email address to an FFA, you may as well forget about ever using it again.

Spam Source #3: Your Website

The biggest source of email addresses used by spammers is your Website. Most sites list multiple contact addresses — any time an email address appears on your Website in plain text, even if it’s hidden in a form field, you’re opening yourself up to having that email address captured.

To combat this menace, I’ve developed a set of JavaScript snippets that will meet almost every need you have to display your email address to the public, without allowing spambots to see it.

The Big Battle: Secure Your Website From Spambots

Almost every Website operator wants search engine spiders to visit. After all, search engines are the best source of free traffic on the Web. In the event that you don’t want them to visit, they are easily kept at bay with a properly formatted "robots.txt" file.

Unfortunately, there’s another group of spiders out there crawling the Web with an entirey different purpose. These are the spiders that visit site after site to collect email addresses. You may know them as spambots, email harvesters, or any number of other unpublishable names.

When it comes to controlling these rogue spiders, a robots.txt file simply won’t get the job done. In fact, most spam robots ignore robots.txt. But that doesn’t mean you have to give up and just let them have their way. Here are a few techniques that’ll stop these spiders in their tracks!

Technique #1: Use JavaScript To Mask Email Addresses

One of the weaknesses that spiders of all kinds suffer from is an inability to process scripts. Adding a small snippet of JavaScript in place of an email address effectively renders the address invisible to spiders, while leaving it accessible to your visitors with all but the most primitive Web browsers.

In the three examples below, simply substitute your username (the first half of your email address, everything before the @ symbol) and your hostname (everything after the @ symbol). To use the scripts, just insert them into your page’s HTML wherever you need them to be displayed.

Create A Spam-Proof Mailto Link

This snippet of JavaScript creates a clickable link that launches the visitor’s email application, assuming that their system is configured to work with "mailto:" hyperlinks. You can replace the link text with your own message, but see example 2 if you want to display your email address as the link text.

<script language=javascript>  
<!--  
var username = "username";  
var hostname = "yourdomain.com";  
var linktext = "Click Here To Send Me Email";  
document.write("<a href=" + "mail" + "to:" + username +  
"@" + hostname + ">" + linktext + "</a>")  
//-->  
</script>

A Spam-Proof Mailto Link That Shows Your eMail Address

Some visitors won’t be able to use a mailto link. This snippet shows your email address in the link so they can copy and paste, or type it by hand:

<script language=javascript>  
 
<!--  
var username = "username";  
var hostname = "yourdomain.com";  
var linktext = username + "@" + hostname;  
document.write("<a href=" + "mail" + "to:" + username +  
"@" + hostname + ">" + linktext + "</a>")  
//-->  
</script>

Display Your Email Address Without A Mailto Link

Here’s a snippet that displays your email address without a clickable link:

<script language=javascript>  
<!--  
var username = "username";  
var hostname = "yourdomain.com";  
var linktext = username + "@" + hostname;  
document.write(username + "@" + hostname)  
//-->  
</script>

Technique #2: Use A Contact Form

Sometimes, the sheer volume of legitimate email from real visitors can become a burden. In this case, a simple solution is to remove your email address from your site entirely, and use a contact form. There are dozens of free ASP, Perl, and PHP scripts available online that will allow your users to fill in a form, and send you an email. Most hosting providers now offer this service for free to their customers.

A contact form can enable you to deal with a higher volume of mail, by allowing you to pre-sort different types of message. This is easily accomplished by creating a drop-down menu with different options (e.g. customer service, billing, tech support, etc.) that will populate the subject line of the email message, and/or change the email address to which the form is sent.

As many spambots simply read the entire HTML source of the page in search of anything that looks like an email address, your contact form may not protect you, if you include your email address in the form’s HTML (for example, as a hidden field). You can use JavaScript, as shown in the example below, to mask the address, or if you have the skill, you can embed the email address in your form processing script, where nobody can find it.

Masking The Email Address In A Form Field

Instead of simply listing your email address in a form field, use the snippet below to replace the form field that contains your email address.

<script language=javascript>  
<!--  
var username = "username";  
var hostname = "yourdomain.com";  
var linktext = username + "@" + hostname;  
document.write("<input type=hidden name=email  
value=" +username + "@" + hostname" + ">";  
document.write(username + "@" + hostname);  
//-->  
</script>
Advanced Techniques: URL Rewriting

Both the Apache and IIS Web servers have plug-in URL-rewriting modules that can be used to provide additional protection to your site, redirecting queries from known spambots to a blank page, or to another Website. These techniques are beyond the scope of this article, and using them will slow your server down, if only a little.

I hope that this tutorial has given you a clear understanding of how to protect your Website, and your email address, from spammers and spambots.

Free book: Jump Start HTML5 Basics

Grab a free copy of one our latest ebooks! Packed with hints and tips on HTML5's most powerful new features.

No Reader comments