A. M. Cohen and W. R. Et-hersh, A survey of current work in biomedical text mining, Briefings in Bioinformatics, vol.6, issue.1, pp.57-71, 2005.
DOI : 10.1093/bib/6.1.57

D. Hillard, S. Purpura, and J. Et-wilkerson, An active learning framework for classifying political text, Annual Meeting of the Midwest Political Science Association, 2007.

G. V. Cormack and T. R. Et-lynam, Online supervised spam filter evaluation, ACM Transactions on Information Systems, vol.25, issue.3, p.11, 2007.
DOI : 10.1145/1247715.1247717

B. Pang and L. Et-lee, Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, pp.1-135, 2008.

S. Purpura and D. Hillard, Automated classification of congressional legislation, Proceedings of the 2006 national conference on Digital government research , dg.o '06, pp.219-225, 2006.
DOI : 10.1145/1146598.1146660

M. Evans, W. Mcintosh, J. Lin, and C. Et-cates, Recounting the Courts? Applying Automated Content Analysis to Enhance Empirical Legal Research, Journal of Empirical Legal Studies, vol.42, issue.1, pp.1007-1039, 2007.
DOI : 10.1111/j.1740-1461.2007.00113.x

K. Durant and M. Smith, Predicting the Political Sentiment of Web Log Posts Using Supervised Machine Learning Techniques Coupled with Feature Selection, Advances in Web Mining and Web Usage Analysis: 8th International Workshop on Knowledge Discovery on the Web. Webkdd, pp.187-206, 2006.
DOI : 10.1007/978-3-540-77485-3_11

E. Wiener, J. O. Pedersen, and A. S. Et-weigend, A Neural Network Approach to Topic Spotting. Symposium on document analysis and information retrieval, pp.317-332, 1995.

H. Schütze, D. A. Hull, and J. O. Et-pedersen, A comparison of classifiers and document representations for the routing problem, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '95, pp.229-337, 1995.
DOI : 10.1145/215206.215365

Y. Yang and C. G. Et-chute, An example-based mapping method for text categorization and retrieval, ACM Transactions on Information Systems, vol.12, issue.3, pp.252-277, 1994.
DOI : 10.1145/183422.183424

D. D. Lewis and M. Et-ringuette, Comparison of two learning algorithms for text categorization, Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval (SDAIR'94, pp.81-93, 1994.

J. R. Quinlan, Induction of decision trees, Machine Learning, vol.1, issue.1, pp.81-106, 1986.
DOI : 10.1007/BF00116251

C. Apte, F. Damerau, and S. M. Et-weiss, Text mining with decision rules and decision trees, Proceedings of the Conference on Automated Learning and Discovery . Workshop 6: Learning from Text and the Web, 1998.

D. D. Lewis, An evaluation of phrasal and clustered representations on a text categorization task, Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '92, pp.37-50, 1992.
DOI : 10.1145/133160.133172

T. A. Joachims, Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization, Proceedings of ICML-97. 14th International Conference on Machine Learning, 1997.

T. A. Joachims, Text categorization with Support Vector Machines: Learning with many relevant features, proceedings of the European conference on Machine learning, pp.137-142, 1998.
DOI : 10.1007/BFb0026683

R. Schapire, Y. Singer, and A. Singhal, Boosting and Rocchio applied to text filtering, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '98, 1998.
DOI : 10.1145/290941.290996

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.104.8304

R. Iyer, D. Lewis, R. Schapire, Y. Singer, and A. Singhal, Boosting for document routing, Proceedings of the ninth international conference on Information and knowledge management , CIKM '00, 2000.
DOI : 10.1145/354756.354794

M. Lan, C. L. Tan, J. Su, and H. B. Et-low, Text Representations for Text Categorization: A Case Study in Biomedical Domain, 2007 International Joint Conference on Neural Networks, 2007.
DOI : 10.1109/IJCNN.2007.4371361

B. P. Suomela, M. A. Et-andrade, J. F. Fontaine, A. Barbosa-silva, M. Schefer et al., Ranking the whole MEDLINE database according to a large training set using text indexing Enhancing Patent Expertise through Matching with Scientific Documents 21 MedlineRanker: flexible ranking of biomedical literature, BMC Bioinformatics Nucleic Acids Res, vol.6, issue.37, pp.75141-146, 2005.

L. Yin, G. Xu, M. Torii, Z. Niu, J. M. Maisog et al., Document classification for mining host pathogen protein???protein interactions, Artificial Intelligence in Medicine, vol.49, issue.3, pp.155-160, 2010.
DOI : 10.1016/j.artmed.2010.04.003

M. Krallinger, M. Vazquez, F. Leitner, D. Salgado, and A. Valencia, Results of the BioCreative III (Interaction) Article Classification Task, Proceedings of the Third BioCreative Workshop. Bethesda. USA, pp.13-15, 2010.

G. Salton, Automatic processing of foreign language documents, Prentice-Hall. Englewood Cliffs. NJ, 1971.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of the International Conference on New Methods in Language Processing, pp.44-49, 1994.

A. Vincarelli, Indexation de documents manuscrits, Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), pp.49-53, 2006.

G. Salton and C. Et-buckley, Term-weighting approaches in automatic text retrieval. Information Processing Management, pp.513-523, 1988.

K. Spark-jones, A STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL, Journal of Documentation, vol.28, issue.1, pp.11-21, 1972.
DOI : 10.1108/eb026526

F. Sebastiani, A tutorial on automated text categorisation, Proceedings of the 1st Argentinian Symposium on Artificial Intelligence (ASAI'99, pp.7-35, 1999.

Y. Yang and X. Et-liu, A reexamination of text categorization methods, Proceedings of the 22th Annual ACM SIGIR Conference, pp.42-49, 1999.

M. Mordian and A. Et-baarani, KNNBA: k-Nearest Neighbor Based Association Algorithm, 2009.

R. Agrawal and R. Et-srikant, Fast algorithms for mining association rules in large data bases, Proceeding of 20th VLDB Conference, 1994.

I. Tomek, Two modifications of CNN, IEEE Trans. Syst. Man. Cybern. SMG, vol.6, issue.11, pp.769-772, 1976.

M. Kubat and S. Et-matwin, Addressing the curse of imbalanced training sets: onesided selection, ICML, pp.179-186, 1997.