, Journal of Aritificial Intelligence Research, vol.11, pp.95-130

J. Curran, Ensemble methods for automatic thesaurus extraction, Proceedings of the conference on Empirical methods in natural language processing (EMNLP), vol.10, pp.222-229, 2002.

P. Cimano, S. Handschuh, and S. Staab, Towards the self-annotating web, Proceedings of the 13th international conference on World Wide Web, pp.462-471, 2004.

Z. S. Harris, Distributional structure. Word, vol.10, pp.146-162, 1954.

,

S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society of Information Science, vol.41, pp.391-407, 1990.

E. Gabrilovich and S. Markovitch, Computing semantic relatedness using Wikipediabased explicit semantic analysis, Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp.1606-1611, 2007.

M. Yazdani and A. Popescu-belis, Computing text semantic relatedness using the contents and links of a hypertext encyclopedia: extended abstract, Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, pp.3185-3189, 2013.

J. Turian, L. Ratinov, and Y. Bengio, Word representations: a simple and general method for semi-supervised learning, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.384-394, 2010.

M. Baroni, G. Dinu, and G. Kruszewski, , 2014.

, Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.238-247

C. Leacock, G. A. Miller, and M. Chodorow, Using corpus statistics and WordNet relations for sense identification, Journal of Computational Linguistics, vol.24, issue.1, pp.147-165, 1998.

R. Rada, H. Mili, E. Bicknell, and M. Blettner, Development and application of a metric on semantic nets, IEEE Transactions on systems, Man and Cybernetics, vol.19, issue.1, pp.17-30, 1989.

Z. Wu and M. Palmer, Verb semantics and lexical selection, Proceedings of the 32nd Annual Meetings of the Associations for Computational Linguistics, pp.133-138, 1994.

D. C. Howe, RiTa: creativity support for computational literature, Proceedings of the seventh ACM conference on Creativity and cognition (C&C '09), pp.205-210, 2009.

D. Lin, An information-theoric definition of similarity, Proceedings of the 15th international conference on Machine Learning, pp.296-304, 1998.

P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, Proceedings of the 14th International Joint Conference on Artificial Intelligence, vol.1, pp.448-453, 1995.

S. P. Ponzetto and M. Strube, Knowledge derived from Wikipedia for computing semantic relatedness, Journal of Artificial Intelligence Research, vol.30, issue.1, pp.181-212, 2007.

D. Milne and I. H. Witten, Learning to link with Wikipedia, Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp.509-518, 2008.

M. T. Pilehvar and R. Navigli, From senses to texts: An all-in-one graph-based approach for measuring semantic similarity, Journal of Artificial Intelligence, vol.228, pp.95-128, 2015.

G. Salton and M. J. Mcgill, Introduction to modern information retrieval, 1983.

G. Salton, The SMART Retrieval System -Experiments in Automatic Document Processing, 1971.

C. J. Crouch, S. Apte, and H. Bapat, Using the extended vector model for xml retrieval, Proceedings of the First Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp.95-98, 2002.

E. A. Fox, Extending the Boolean and Vector Space Models of information retrieval with p-norm queries and multiple concept types, 1983.

D. Carmel, Y. Maarek, M. Mandelbrod, Y. Mass, and A. Soffer, Searching xml documents via xml fragments, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pp.151-158, 2003.

M. Fuller, E. Mackie, R. Sacks-davis, and R. Wilkinson, Structural answers for a large structured document collection, Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, Pitthsburgh, pp.204-213, 1993.

T. Schileder and H. Meus, Querying and ranking XML documents, Journal of the American Society for Information Science and Technology, vol.53, pp.489-503, 2002.

T. Joachims, A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization, Proceedings of the Fourteenth International Conference on Machine Learning, pp.143-151, 1997.

S. Jaillet, A. Laurent, and M. Teisseire, Sequential patterns for text categorization, Journal of Intelligent Data Analysis, vol.10, pp.199-214, 2006.
URL : https://hal.archives-ouvertes.fr/lirmm-00135010

P. Soucy and G. W. Mineau, A Simple k-NN Algorithm For Text Categorization, Proceedings of IEEE International Conference on Data Mining, pp.647-648, 2001.

A. Hotho, A. Maedche, and S. Staab, Ontology-based Text Document Clustering. KI, vol.16, pp.48-54, 2002.

S. B. Kotsiantis, Supervised Machine Learning: A Review of Classification Techniques, vol.31, pp.249-268, 2007.

Y. Yang and X. Liu, A re-examination of text categorization methods, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pp.42-49, 1999.

T. Joachims, Text categorization with support vector machines: learning with many relevant features, Proceedings of ECML-98, 1998.

, European Conference on Machine Learning, pp.137-142

E. Gabrilovich and S. Markovitch, Feature Generation for Text categorization Using World Knowledge, Proceedings of IJCAI 2005: the Nineteenth International Joint Conference on Artificial Intelligence, pp.1048-1053, 2005.

A. Hotho, S. Staab, and G. Stumme, , 2003.

, Ontologies Improve Text Document Clustering, Proceedings of ICDM:3rd IEEE International Conference on Data Minin, pp.541-544

H. H. Tar, T. T. Soe, and . Nyunt, Ontology-Based Concept Weighting for Text documents, International Conference on Information Communication and Management IPCSIT, vol.16, 2011.

B. Pincemin, Similarites texte-texts expérience d'une application de diffusion ciblée et propositions. In Matemáticas y Tratamiento de Corpus, Actes du 2ème séminaire de l'Ecole interlatine de linguistique appliquée, pp.35-52, 2000.

K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft, When is `nearest neighbor' meaningful, Proceedings of ICDT, International Conference on Database Theory, pp.217-235, 1999.

U. L. Gunasinghe, W. A. Silva, N. H. Silva, A. S. Perera, W. A. Sashika et al., Sentence similarity measuring by vector space model, Proceedings of the 14 th International Conference on Advances in ICT for Emerging Regions (ICTer), pp.185-189, 2014.

Y. Liu, C. Sun, L. Lin, Y. Zhao, and X. Wang, Computing Semantic Text Similarity Using Rich Features, Proceedings of PACLIC: 29th Pacific Asia Conference on Language, Information and Computation, pp.44-52, 2015.

J. Lewis, S. Ossowski, J. Hicks, M. Errami, and H. R. Garner, Text similarity: an alternative way to search MEDLINE, Bioinformatics, vol.22, pp.2298-2304, 2006.

E. Yamamoto, M. Kishida, Y. Takenami, Y. Takeda, and K. Umemura, Dynamic programming matching for large scale information retrieval, Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages, vol.11, pp.100-108, 2003.

W. Ma and T. Suel, Structural Sentence Similarity Estimation for Short Texts, Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, pp.232-237, 2016.

D. Dudognon, G. Hubert, and B. Ralalason, Proxigénéa : Une mesure de similarite conceptuelle, Proceedings of the Colloque Veille Strategique Scientifique et Technologique (VSST 2010), 2010.

M. Baziz, M. Boughanem, H. Prade, and G. Pasi, A Fuzzy Set Approach to Concept-based Information Retrieval, Proceedings of the 4th Conference of the European Society for Fuzzy Logic and Technology and the 11ème Eleventh Rencontres Francophones sur la Logique Floue et ses Applications (Eusflat-LFA 2005 joint Conference), pp.1287-1292, 2005.

K. M. Shenoy, K. C. Shet, and U. D. Acharya, Semantic plagiarism detection system using ontology mapping, Advanced Computing: An International Journal (ACIJ), vol.3, pp.59-62, 2012.

L. Zhang, C. Li, J. Liu, and H. Wang, Graph-Based Text Similarity Measurement by Exploiting Wikipedia as Background Knowledge, International Journal of Computer, Electrical, Automation, Control and Information Engineering, vol.5, pp.1328-1333, 2011.

W. Jin and R. K. Srihari, Graph-based Text Representation and Knowledge Discovery, 2007.

, Proceedings of the 2007 ACM symposium on Applied computing, pp.807-811

P. Wang, H. Zhang, B. Xu, C. Liu, and H. Hao, Short Text Feature Enrichment Using Link Analysis on Topic-Keyword Graph, Proceedings of Natural Language Processing and Chinese Computing, pp.79-90, 2014.

J. Leskovec and J. Shawe-taylor, Semantic text features from small world graphs, Workshop on Subspace, Latent Structure and Feature Selection techniques: Statistical and Optimization perspectives, 2005.

S. Brin, J. Davis, and H. Garcia-molina, Copy detection mechanisms for digital documents, Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, pp.398-409, 1995.

C. Basile, D. Benedetto, E. Caglioti, and M. D. Esposti, An example of mathematical authorship attribution, Journal of Mathematical Physics, vol.49, pp.125211-125212, 2008.

C. Basile, D. Benedetto, E. Caglioti, G. Cristadoro, and M. D. Esposti, A plagiarism detection procedure in three steps: selection, matches and squares. 3rd Workshop on Uncovering Plagiarism, 2009.

B. Stein and S. M. Zu-eissen, Near Similarity Search and Plagiarism Analysis, Proceeding of the 29th Annual Conference of the GfKl Springer, pp.430-437, 2005.

R. Lukashenko, V. Graudina, and J. Grundspenkis, Computer-Based Plagiarism Detection Methods and Tools: An Overview, Proceeding of the 2007 International Conference on Computer Systems and Technologies -CompSysTech'07, 2007.

K. Vani and D. Gupta, Investigating the Impact of Combined Similarity Metrics and POS tagging in Extrinsic Text Plagiarism Detection System, Proceeding of the International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp.1578-1584, 2015.

A. H. Osman, N. Salim, M. S. Binwahlan, H. Hentably, and A. M. Ali, Conceptual similarity and graph-based method for plagiarism detection, Journal of Theoretical and Applied Information Technology, vol.32, issue.2, pp.135-145, 2011.

D. Rusu, B. Fortuna, M. Grobelnik, and D. Mladeni?, Semantic Graphs Derived from Triplets with Application in Document Summarization, vol.33, pp.357-362, 2009.

S. Iltache, C. Comparot, M. Mohammed, and P. J. Charrel, Using domain ontologies for classification and semantic interpretation of documents, Proceedings of ALLDATA 2016: 2nd International Conference on Big Data, Small Data, Linked Data and Open Data, pp.76-81, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01535945

R. Bendaoud, Analyses formelle et relationnelle de concepts pour la construction d'ontologies de domaines à partir de ressources textuelles hétérogènes, vol.1, 2009.

N. Fuhr and K. Grossjohann, XIRQL: a query language for information retrieval in XML documents, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp.172-180, 2001.

E. Omodei, Y. Guo, J. P. Cointet, and T. Poibeau, Actes de la 21ème conférence Traitement Automatique des Langues Naturelles, 2014.

Y. Guo, A. Korhonen, and T. Poibeau, A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents, Proceedings of the 2011 conference on Empirical Methods in Natural Language Processing, pp.273-283, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00666472

B. Magnini and G. Cavaglià, Integrating Subject Field Codes into WordNet, Proceedings of LREC-2000, Second International Conference on Language Resources and Evaluation, pp.1413-1418, 2000.

C. Fellbaum, WordNet: An Electronic Lexical Database, 1998.

K. Toutanova, D. Klein, C. Manning, and Y. Singer, Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network, Proceedings of HLT-NAACL, pp.252-259, 2003.

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann et al., The WEKA Data Mining Software: An Update. SIGKDD Explorations, vol.11, pp.10-18, 2009.