Ogmios: a scalable NLP platform for annotating large web document collections - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Ogmios: a scalable NLP platform for annotating large web document collections

Résumé

Web semantic access in specific domains calls for specialised search engines with enhanced semantic querying and indexing capacities, which pertain both to information retrieval (IR) and to information extraction (IE). A rich linguistic analysis is required either to identify the relevant semantic units to index and weight them according to linguistic specific statistical distribution, or as the basis of an information extraction process. Recent developments make Natural Language Processing (NLP) techniques reliable enough to process large collections of documents and to enrich them with semantic annotations. This paper focuses on the design and the development of a text processing platform, Ogmios, which has been developed in the ALVIS project. The Ogmios platform exploits existing NLP modules and resources, which may be tuned to specific domains and produces linguistically annotated documents. We show how the three constraints of genericity, domain semantic awareness and performance can be handled all together.
Fichier principal
Vignette du fichier
2007CorpusLinguistics-FP.pdf (276.89 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00619260 , version 1 (06-09-2011)

Identifiants

  • HAL Id : hal-00619260 , version 1

Citer

Thierry Hamon, Julien Derivière, Adeline Nazarenko. Ogmios: a scalable NLP platform for annotating large web document collections. Corpus Linguistics, Jul 2007, Manchester, United Kingdom. pp.195-208. ⟨hal-00619260⟩
183 Consultations
137 Téléchargements

Partager

Gmail Facebook X LinkedIn More