A proposal for annotation, semantic similarity and classification of textual documents

Emmanuel Nauer 1, 2 Amedeo Napoli 1
1 ORPAILLEUR - Knowledge representation, reasoning
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: In this paper, we present an approach for classifying documents based on the notion of semantic similarity and an effective representation of document content. The content of a document is annotated, and the resulting annotation is represented by a labeled tree whose nodes and edges correspond to concepts in a domain ontology. A reasoning process may be carried out on annotation trees, allowing documents to be compared with one another for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.
Document type: Conference paper

Contributor: Emmanuel Nauer
Submitted on: Tuesday, October 3, 2006 - 2:49:11 PM
Last modification on: Friday, May 24, 2019 - 10:58:05 AM
Long-term archiving on: Tuesday, April 6, 2010 - 1:18:55 AM




Emmanuel Nauer, Amedeo Napoli. A proposal for annotation, semantic similarity and classification of textual documents. The 12th International Conference on Artificial Intelligence: Methodology, Systems, Applications - AIMSA 2006. AI, people and the web, 2006, Varna, Bulgaria. pp.201-212, ⟨10.1007/11861461_22⟩. ⟨hal-00102585⟩


