Benchmarking of Statistical Dependency Parsers for French - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Benchmarking of Statistical Dependency Parsers for French

Résumé

We compare the performance of three statistical parsing architectures on the problem of deriving typed dependency structures for French. The architectures are based on PCFGs with latent variables, graph-based dependency parsing and transition-based dependency parsing, respectively. We also study the influence of three types of lexical information: lemmas, morphological features, and word clusters. The results show that all three systems achieve competitive performance, with a best labeled attachment score over 88%. All three parsers benefit from the use of automatically derived lemmas, while morphological features seem to be less important. Word clusters have a positive effect primarily on the latent variable parser.
Fichier principal
Vignette du fichier
frdepcompar-Coling10-final.pdf (198.04 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00514815 , version 1 (07-09-2010)

Identifiants

  • HAL Id : hal-00514815 , version 1

Citer

Marie Candito, Joakim Nivre, Pascal Denis, Enrique Henestroza Anguiano. Benchmarking of Statistical Dependency Parsers for French. 23rd International Conference on Computational Linguistics - COLING 2010, Aug 2010, Beijing, China. pp.108-116. ⟨hal-00514815⟩
167 Consultations
190 Téléchargements

Partager

Gmail Facebook X LinkedIn More