Disambiguation Tools for NooJ
Résumé
When NooJ performs an automatic lexical analysis of corpora, it recognizes five types of atomic linguistic units (ALUs) and represents them as annotations stored inside each text's annotation structure (TAS). Unfortunately, the massive level of ambiguities generated by each of the five corresponding parsers produces a TAS far too heavy for most corpus linguistics applications. In consequence, most users' queries produce too many incorrect results. In order to provide a working solution for NooJ's lexical parser's behavior, we have implemented a new set of tools specifically designed to deal with unwanted ambiguities in corpora and texts: automatic and semi-automatic tools as well as a manual access to edit the TAS.
Fichier principal
Budapest_2008_Disambiguation_Tools_for_NooJ.pdf (579.03 Ko)
Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...