Considering behavior of sender in spam mail detection
Conference proceedings article
Authors/Editors
Strategic Research Themes
No matching items found.
Publication Details
Author list: Naksomboon S., Charnsripinyo C., Wattanapongsakorn N.
Publisher: Hindawi
Publication year: 2010
Start page: 191
End page: 195
Number of pages: 5
ISBN: 9788988678206
eISSN: 1745-4557
Languages: English-Great Britain (EN-GB)
Abstract
Recently, the number of spam mails is exponentially growing. It affects the costs of organizations and annoying the email recipient. Spammers always try to find the way to avoid filtering out from the email system. At the same time, as an email recipient or network system/administrator, we try to have an effective spam mail filtering technique to catch the spam mails. The problems of spam mail filtering are that each user has different perspective toward spam mails; so there are many types of spam mails, while the challenge is how to detect the various types and forms of spam mails. In this paper, behaviors of spammers are used to customize the filtering rule. The information from the spam messages also can be used to filter spam mails and it can give higher accuracy than the keyword-based method does. We propose a spam classification approach using Random Forest algorithm. Spam Assassin Corpus is selected as a database for classification. It consists of 6,047 email messages, where 4,150 of them are the legitimate messages and the other 1,897 messages are the spam mails.
Keywords
Data classification, Spam Assassin dataset, Spam mail detection