Skip to Main content Skip to Navigation
Conference papers

On statistical parsing of French with supervised and semi-supervised strategies

Abstract : This paper reports preliminary results on grammatical induction for French. We investigate how to best train a parser on the French Treebank (Abeillé and Barrier, 2004), viewing the task as a trade-off between generalizability and interpretability. We compare on French a supervised lexicalized parsing algorithm with a semi-supervised unlexicalized algorithm Petrov et al. (2006) along the lines of Crabbé and Candito (2008). We report the best results known to us on French statistical parsing with the semi-supervised learning algorithm, and the reported experiments can give insights for the task of grammatical learning for a morphologically-rich language, with a relatively limited amount of training data, annotated with a rather flat structure.
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00495290
Contributor : Marie Candito <>
Submitted on : Tuesday, September 7, 2010 - 3:42:21 PM
Last modification on : Friday, March 27, 2020 - 3:21:15 AM
Document(s) archivé(s) le : Wednesday, December 8, 2010 - 2:28:27 AM

File

wkshopEACL2009.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00495290, version 1

Citation

Marie Candito, Benoît Crabbé, Djamé Seddah. On statistical parsing of French with supervised and semi-supervised strategies. EACL 2009 workshop on Computational Linguistic Aspects of Grammatical Inference, Mar 2009, Athens, Greece. pp.49-57. ⟨hal-00495290⟩

Share

Metrics

Record views

446

Files downloads

342