Disambiguation Tools for NooJ - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2008

Disambiguation Tools for NooJ

Résumé

When NooJ performs an automatic lexical analysis of corpora, it recognizes five types of atomic linguistic units (ALUs) and represents them as annotations stored inside each text's annotation structure (TAS). Unfortunately, the massive level of ambiguities generated by each of the five corresponding parsers produces a TAS far too heavy for most corpus linguistics applications. In consequence, most users' queries produce too many incorrect results. In order to provide a working solution for NooJ's lexical parser's behavior, we have implemented a new set of tools specifically designed to deal with unwanted ambiguities in corpora and texts: automatic and semi-automatic tools as well as a manual access to edit the TAS.
Fichier principal
Vignette du fichier
Budapest_2008_Disambiguation_Tools_for_NooJ.pdf (579.03 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00498045 , version 1 (06-07-2010)

Identifiants

  • HAL Id : hal-00498045 , version 1

Citer

Max Silberztein. Disambiguation Tools for NooJ. 2008. ⟨hal-00498045⟩
122 Consultations
230 Téléchargements

Partager

Gmail Facebook X LinkedIn More