I’m hurting with spam. I’m looking for help.
I foudnthis facinating descriptio fo how one tool works: SpamBayes: Bayesian anti-spam classifier written in Python.
“The system then uses these clues to examine new messages.
For instance, the word “Nigeria” appears often in spam, so you could use a spam filter which identifies anything with that word in it as spam. But what if your business involves writing a guidebook on Nigerian Wildlife Conservation? Clearly a more flexible approach is necessary. Additionally spammers will adapt their content over time and will no longer use the word “Nigeria” (or the words “Lose Weight Fast”, or any number of other common lines). Ideally the software will be able to adapt as the spam changes.
So, that is what SpamBayes does. It compares the spam and the ham and calculates probabilities. ”