Performance Comparison of ADTree and Naive Bayes Algorithms for Spam Filtering

Abstract : Classification is an important data mining technique and can be used for data filtering in artificial intelligence. Its broad applicability to all kinds of data has led to its use in nearly every field of modern life. Classification helps us group different items according to the features judged interesting and useful. In this paper, we compare two classification methods, Naive Bayes and ADTree, used to detect spam e-mail. This choice is motivated by the fact that the Naive Bayes algorithm is based on probability calculus, while the ADTree algorithm is based on decision trees. The parameter settings of both classifiers are chosen to maximize the true positive rate and minimize the false positive rate. The experimental results present classification accuracy and cost analysis with a view to choosing the optimal classifier for spam detection. We also point out the number of attributes needed to obtain a tradeoff between the number of attributes and classification accuracy.
Document type :
Journal articles

https://hal.archives-ouvertes.fr/hal-01479440
Contributor : Kévin Perrot
Submitted on : Tuesday, February 28, 2017 - 6:42:11 PM
Last modification on : Saturday, October 26, 2019 - 1:35:52 AM

Identifiers

  • HAL Id : hal-01479440, version 1

Citation

Thanh Nguyen, Andrei Doncescu, Pierre Siegel. Performance Comparison of ADTree and Naive Bayes Algorithms for Spam Filtering. International Journal of Mathematical and Computational Sciences, 2016, 10 (5), pp. 269-274. ⟨hal-01479440⟩
