A. Nimaan, P. Nocera, and J. Torres-Moreno, Boîtes à outils TAL pour les langues peu informatisées: le cas du Somali, Journées d'Analyses des Données Textuelles (JADT'06), pp.697-705, 2006.

E. Airio, Word normalization and decompounding in mono- and bilingual IR, Information Retrieval, vol.9, issue.3, pp.249-271, 2006.

J. Atserias, B. Casas, E. Comelles, M. González, L. Padró et al., FreeLing 1.3: Syntactic and semantic services in an open-source NLP library, Fifth International Conference on Language Resources and Evaluation (LREC'06), 2006.

R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval, Addison Wesley, 1999.

A. Ben-Hur, D. Horn, H. T. Siegelmann, and V. Vapnik, Support Vector Clustering, Journal of Machine Learning Research, vol.2, pp.125-137, 2001.

V. Berment, Méthodes pour informatiser des langues et des groupes de langues peu dotées, 2004.

D. Bernhard, Apprentissage non supervisé de familles morphologiques par classification ascendante hiérarchique, TALN'07, vol.1, pp.367-376, 2007.

F. Boudin and J. Torres-Moreno, NEO-CORTEX: A Performant User-Oriented Multi-Document Summarization System, Computational Linguistics and Intelligent Text Processing (CICLing'07), vol.4394, pp.551-562, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01313214

L. Breiman, J. Friedman, R. Olshen, and C. Stone, Classification and Regression Trees, 1984.

M. Cabré-Castellví, Typology of neologisms: a complex task, Alfa (São Paulo), vol.50, issue.2, pp.229-250, 2006.

M. Creutz and K. Lagus, Unsupervised Discovery of Morphemes, 6th Workshop of the ACL Special Interest Group in Computational Phonology (SIGPHON), pp.21-30, 2002.

M. Creutz and K. Lagus, Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0, Publications in Computer and Information Science, 2005.

R. G. Díaz, Lematización en español: una aplicación para la recuperación de información, 2005.

D. P. Lyras, K. N. Sgarbas, and N. D. Fakotakis, Using the Levenshtein Edit Distance for Automatic Lemmatization: A Case Study for Modern Greek and English, 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'07), vol.2, pp.428-435, 2007.

F. Namer, Flemm: Un analyseur Flexionnel de Français à base de règles, Traitement automatique des Langues pour la recherche d'information, pp.523-547, 2000.

C. G. Figuerola, R. G. Díaz, and E. López-de-San-Román, Stemming and n-grams in Spanish: An evaluation of their impact on information retrieval, Journal of Information Science, vol.26, issue.6, pp.461-467, 2000.

A. F. Gelbukh, M. Alexandrov, and S. Han, Detecting inflection patterns in natural language by minimization of morphological model, 9th Iberoamerican Congress on Pattern Recognition (CIARP'04), vol.3287, pp.432-438, 2004.

A. F. Gelbukh and G. Sidorov, Approach to construction of automatic morphological analysis systems for inflective languages with little effort, Computational Linguistics and Intelligent Text Processing (CICLing'03), vol.2, pp.215-220, 2003.

J. A. Goldsmith, Unsupervised Learning of the Morphology of a Natural Language, Computational Linguistics, vol.27, issue.2, pp.153-198, 2001.

N. Grabar and P. Zweigenbaum, Acquisition automatique de connaissances morphologiques sur le vocabulaire médical, TALN'99, pp.175-184, 1999.

H. Hammarström, Unsupervised Learning of Morphology: Survey, Model, Algorithm and Experiments, Master's thesis, 2007.

H. Hammarström, A Naive Theory of Morphology and an Algorithm for Extraction, SIGPHON 2006: ACL Special Interest Group on Computational Phonology, pp.79-88, 2006.

N. Hathout, Acquisition morphologique à partir d'un dictionnaire informatisé, 16ème conférence sur le Traitement Automatique des Langues Naturelles, TALN'09, p.10, 2009.

N. Hathout, Acquisition of morphological families and derivational series from a machine readable dictionary, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00382808

H. Schmid, Probabilistic part-of-speech tagging using decision trees, International Conference on New Methods in Language Processing, 1994.

J. Hertz, A. Krogh, and R. Palmer, Introduction to the Theory of Neural Computation, 1991.

V. Hollink, J. Kamps, C. Monz, and M. de Rijke, Monolingual Document Retrieval for European Languages, Information Retrieval, vol.7, issue.1-2, pp.33-52, 2004.

D. Hull and G. Grefenstette, Stemming algorithms: A case study for detailed evaluation, Journal of the American Society for Information Science, vol.47, issue.1, pp.70-84, 1996.

C. Jacquemin and E. Tzoukermann, NLP for term variant extraction: synergy between morphology, lexicon and syntax, Natural Language Information Retrieval, pp.25-74, 1999.

K. Kettunen, E. Airio, and K. Järvelin, Restricted inflectional form generation in management of morphological keyword variation, Information Retrieval, vol.10, issue.4-5, pp.415-444, 2007.

T. Korenius, J. Laurikkala, K. Järvelin, and M. Juhola, Stemming and lemmatization in the clustering of Finnish text documents, CIKM'04: Thirteenth ACM Conference on Information and Knowledge Management, pp.625-633, 2004.

Y. Lepage, Solving analogies on words: an algorithm, COLING-ACL'98, pp.728-735, 1998.

J. B. Lovins, Development of a Stemming Algorithm, Mechanical Translation and Computational Linguistics, vol.11, pp.23-31, 1968.

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, 1999.

F. Moreau, V. Claveau, and P. Sébillot, Automatic Morphological Query Expansion Using Analogy-Based Machine Learning, Advances in Information Retrieval, vol.4425, pp.222-233, 2007.

C. D. Paice, Another stemmer, SIGIR Forum, vol.24, issue.3, pp.56-61, 1990.

C. D. Paice, Method for Evaluation of Stemming Algorithms Based on Error Counting, Journal of the American Society for Information Science, vol.47, issue.8, pp.632-649, 1996.

M. F. Porter, An algorithm for suffix stripping, Program, vol.40, issue.3, pp.211-218, 2006.

J. R. Quinlan, C4.5: Programs for Machine Learning, 1993.

G. Souvay and J. Pierrel, LGeRM: Lemmatisation des mots en Moyen Français, Traitement Automatique des Langues, vol.50, issue.2, pp.149-172, 2009.

S. Tomlinson, Lexical and Algorithmic Stemming Compared for 9 European Languages with Hummingbird SearchServer, CLEF'03, vol.3237, pp.286-300, 2003.

A. Medina-Urrea, Automatic Discovery of Affixes by means of a Corpus: A Catalog of Spanish Affixes, Journal of Quantitative Linguistics, vol.7, issue.2, pp.97-114, 2000.

V. Vapnik, Principles of Risk Minimization for Learning Theory, Advances in Neural Information Processing Systems, vol.4, pp.831-838, 1991.

V. Vapnik, The Statistical Learning Theory, 1998.

J. Vilares, M. A. Alonso, and M. Vilares, Extraction of complex index terms in non-English IR: A shallow parsing based approach, Information Processing and Management, vol.44, issue.4, pp.1517-1537, 2008.

J. Vilares, D. Cabrero, and M. A. Alonso, Applying productive derivational morphology to term indexing of Spanish texts, Computational Linguistics and Intelligent Text Processing (CICLing'01), pp.336-348, 2001.