SpamAssassin: A practical guide to integration and configuration

SpamAssassin: A practical guide to integration and configuration

Overview of this book

As a busy administrator, you know Spam is a major distraction in todays network. The effects range from inappropriate content arriving in the mailboxes up to contact email addresses placed on a website being deluged with unsolicited mail, causing valid enquiries and sales leads to be lost and wasting employee time. The perception of the problem of spam is as big as the reality. In response to the growing problem of spam, a number of free and commercial applications and services have been developed to help network administrators and email users combat spam. Its up to you to choose and then get the most out of an antispam solution. Free to use, flexible, and effective, SpamAssassin has become the most popular open source antispam application. Its unique combination of power and flexibility make it the right choice. This book will now help you set up and optimize SpamAssassin for your network.

SpamAssassin

Credits

About the Author

About the Reviewers

Introduction

Free Chapter

Introducing Spam

Defining Spam

The Costs of Spam

Spam and the Law

Summary

Spam and Anti-Spam Techniques

Spamming Techniques

Anti-Spam Techniques

Spam Filtering Services

Anti-Spam Tools

Summary

Open Relays

Email Delivery

Open Relay Tests

MTA Configuration

Summary

Protecting Email Addresses

Websites

Usenet

Trojan Software

Mailing Lists and Archives

Registration for Websites

Employees

Business Cards and Promotional Material

How Spammers Verify Email Addresses

Summary

Detecting Spam

Valid Bulk Email Delivery

Summary

Installing SpamAssassin

Building from Source

Using CPAN

Installing by Hand

Resolving Build Failures

Packaged Distributions

Verifying the Installation

Upgrading

Uninstalling

SpamAssassin Components

Summary

Configuration Files

Rule Files

Summary

Using SpamAssassin

SpamAssassin as a Daemon

SpamAssassin and Procmail

Integrating SpamAssassin into the MTA

Testing and Troubleshooting

Rejecting Spam

Summary

Bayesian Filtering

Scoring

Training

Confirming Operation

Filter Training

Disabling Bayesian Filtering

Summary

Look and Feel

Headers

Reports

Subject Rewriting

Summary

Network Tests

RBLs

SURBLs

Vipul's Razor

Pyzor

DCC

Spamtraps

Summary

Rules

Writing Rules

Using Other Rulesets

Summary

Improving Filtering

Whitelists and Blacklists

The Auto-Whitelist

Resolving Incorrect Classifications

Character Sets and Languages

Summary

Performance

Bottlenecks

Performance Improvement Methodology

Using SQL

Summary

Housekeeping and Reporting

Separating Levels of Spam

Detecting When SpamAssassin Fails

Spam and Ham Reports

Summary

Building an Anti-Spam Gateway

Choosing a PC Platform

Choosing a Linux Distribution

Configuring Postfix

Installing Amavisd-new

Configuring Amavisd-new

Configuring Postfix to Run Amavisd-new

Configuring External Services

Firewall Configuration

Backups

Testing

Going Live

Summary

Email Clients

General Configuration Rules

Microsoft Outlook

Microsoft Outlook Express

Mozilla Thunderbird

Qualcomm Eudora

Summary

Choosing Other Spam Tools

Spam Policies

Evaluating Spam Filters

Configuring the Second Filter

Other Techniques

Summary

Glossary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Statistical Tests

Various statistical techniques can be used to identify spam. These generally involve a training phase, where a database of spam and ham emails is taught to the filter or passed through it to identify typical characteristics of spam and ham. This allows future emails to be identified based on the learning from past emails. The various statistical techniques vary in their choice of tokens and the algorithms they use to predict whether an email is spam or ham. The tokens used are normally words, but can include email headers, HTML markup within emails, and other characters such as punctuation marks.

Statistical filters rely on regular training. They use the knowledge gained in training to estimate the probability that new emails are spam. As spam changes, the filter must adapt in order to continue to detect the spam.

SpamAssassin contains a statistical filter based on Bayesian analysis. This is enabled by default and, if trained properly, aids in the correct recognition of...

SpamAssassin: A practical guide to integration and configuration

SpamAssassin: A practical guide to integration and configuration

Overview of this book

Related Content you might be interested in

Current Title:

SpamAssassin: A practical guide to integration and configuration

Statistical Tests