Conference paper, Year: 2012

Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger

Abstract

This paper evaluates the impact of integrating external lexical resources into a CRF-based joint multiword segmenter and part-of-speech tagger. In particular, we present different ways of integrating lexicon-based features into the tagging model. We report an absolute gain of 0.5% in F-measure. Moreover, we show that integrating lexicon-based features significantly compensates for the use of a small training corpus.
Main file
constant-tellier-lrec2012.pdf (62.04 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00790624, version 1 (20-02-2013)

Identifiers

  • HAL Id: hal-00790624, version 1

Cite

Mathieu Constant, Isabelle Tellier. Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger. 8th International Conference on Language Resources and Evaluation (LREC'12), May 2012, Turkey. pp.646-650. ⟨hal-00790624⟩
