Bayesian Content Filtering and the Art of Statistical Language Classification
Book - 2005
Through considerable research, creative minds have invented clever new ways to fight spam in all its nefarious forms. This landmark title describes, in depth, how next-generation spam filters use statistical filtering to identify and filter spam. Zdziarski explains how spam filtering works and how language classification and machine learning combine to produce remarkably accurate spam filters. Readers gain a complete understanding of the mathematical approaches used in today's spam filters, decoding, tokenization, the use of various algorithms (including Bayesian analysis and Markovian discrimination), and the benefits of using open-source solutions to end spam. Interviews with the creators of many of the best spam filters provide further insight into the anti-spam crusade.
Publisher: San Francisco : No Starch Press, c2005.
Branch Call Number: 005.713 Z39
Characteristics: xx, 287 p. : ill. ; 24 cm.