Benchmarking of Statistical Dependency Parsers for French

Marie Candito; Joakim Nivre; Pascal Denis; Enrique Henestroza Anguiano

Conference paper. Year: 2010

Marie Candito

Role: Author
PersonId : 13596
IdHAL : marie-candito
IdRef : 153698616

Analyse Linguistique Profonde à Grande Echelle ; Large-scale deep linguistic processing

Joakim Nivre

Role: Author
PersonId : 878440

Uppsala University

Pascal Denis

Role: Author
PersonId : 1744
IdHAL : pascal-denis
IdRef : 031934684

Analyse Linguistique Profonde à Grande Echelle ; Large-scale deep linguistic processing

Enrique Henestroza Anguiano

Role: Author
PersonId : 878441

Analyse Linguistique Profonde à Grande Echelle ; Large-scale deep linguistic processing

Abstract

We compare the performance of three statistical parsing architectures on the problem of deriving typed dependency structures for French. The architectures are based on PCFGs with latent variables, graph-based dependency parsing and transition-based dependency parsing, respectively. We also study the influence of three types of lexical information: lemmas, morphological features, and word clusters. The results show that all three systems achieve competitive performance, with a best labeled attachment score over 88%. All three parsers benefit from the use of automatically derived lemmas, while morphological features seem to be less important. Word clusters have a positive effect primarily on the latent variable parser.

Keywords

statistical parsing; dependency parsing; semi-supervised learning

Domains

Computation and Language [cs.CL]; Document and Text Processing

Main file

frdepcompar-Coling10-final.pdf (198.04 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-00514815

Submitted on: Tuesday, September 7, 2010, 15:36:54

Last modified on: Tuesday, October 25, 2022, 18:58:50

Long-term archiving on: Wednesday, December 8, 2010, 02:29:53

Dates and versions

hal-00514815, version 1 (2010-09-07)

Identifiers

HAL Id: hal-00514815, version 1

Cite

Marie Candito, Joakim Nivre, Pascal Denis, Enrique Henestroza Anguiano. Benchmarking of Statistical Dependency Parsers for French. 23rd International Conference on Computational Linguistics - COLING 2010, Aug 2010, Beijing, China. pp.108-116. ⟨hal-00514815⟩

Collections

UNIV-PARIS7 INRIA INRIA2 CAMPUS-AAR AAI ANR

167 views

190 downloads
