Natural Language Processing Method for Multilingual Semantic Indexing

Abstract : This paper deal with multilingual document indexing. We propose an indexing method based on natural language processing techniques. First of all, the most important term of the document are extracted using general characteristics of language and statistical methods. Thus, term extracting stages can be applied to any document whatever the document language is. Secondly, our indexing method uses multilingual ontology in order to find the most relevant concepts representing the document content. Our method can be applied to multilingual corpus containing document written in different languages; This indexing procedure is part of a multilingual document system and untitled SyDoM, that manage XML document.
Document type :
Poster communications
Complete list of metadatas
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Monday, July 17, 2017 - 2:10:37 PM
Last modification on : Friday, January 11, 2019 - 4:35:40 PM


  • HAL Id : hal-01563223, version 1


Catherine Roussey, Sylvie Calabretto, Farah Harrathi. Natural Language Processing Method for Multilingual Semantic Indexing. 12th International Conference on Applications of Natural Language to Information Systems, Jun 2007, CNAM, Paris, France, France. 2007. ⟨hal-01563223⟩



Record views