Computer aided correction and extension of a syntactic wide-coverage lexicon

Abstract : The effectiveness of parsers based on manually created resources, namely a grammar and a lexicon, rely mostly on the quality of these resources. Thus, increasing the parser coverage and precision usually implies improving these two resources. Their manual improvement is a time consuming and complex task : identifying which resource is the true culprit for a given mistake is not always obvious, as well as finding the mistake and correcting it. Some techniques, like van Noord (2004) or Sagot and Villemonte de La Clergerie (2006), bring a convenient way to automatically identify forms having potentially erroneous entries in a lexicon. We have integrated and extended such techniques in a wider process which, thanks to the grammar ability to tell how these forms could be used as part of correct parses, is able to propose lexical corrections for the identified entries. We present in this paper an implementation of this process and discuss the main results we have obtained on a syntactic wide-coverage French lexicon.
Type de document :
Communication dans un congrès
Coling 2008, Aug 2008, Manchester, United Kingdom. pp 604-611, 2008, CD ROM
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00360918
Contributeur : Jacques Farré <>
Soumis le : jeudi 12 février 2009 - 16:37:28
Dernière modification le : mercredi 12 octobre 2016 - 01:23:53
Document(s) archivé(s) le : mardi 8 juin 2010 - 20:45:28

Fichier

lexFix-coling.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00360918, version 1

Collections

Citation

Lionel Nicolas, Benoît Sagot, Miguel Molinero, Jacques Farré, Eric De La Clergerie. Computer aided correction and extension of a syntactic wide-coverage lexicon. Coling 2008, Aug 2008, Manchester, United Kingdom. pp 604-611, 2008, CD ROM. 〈hal-00360918〉

Partager

Métriques

Consultations de
la notice

227

Téléchargements du document

137