Skip to Main content Skip to Navigation
Conference papers

On statistical parsing of French with supervised and semi-supervised strategies

Abstract : This paper reports preliminary results on grammatical induction for French. We investigate how to best train a parser on the French Treebank (Abeillé and Barrier, 2004), viewing the task as a trade-off between generalizability and interpretability. We compare on French a supervised lexicalized parsing algorithm with a semi-supervised unlexicalized algorithm Petrov et al. (2006) along the lines of Crabbé and Candito (2008). We report the best results known to us on French statistical parsing with the semi-supervised learning algorithm, and the reported experiments can give insights for the task of grammatical learning for a morphologically-rich language, with a relatively limited amount of training data, annotated with a rather flat structure.
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download
Contributor : Marie Candito <>
Submitted on : Tuesday, September 7, 2010 - 3:42:21 PM
Last modification on : Friday, March 27, 2020 - 3:21:15 AM
Document(s) archivé(s) le : Wednesday, December 8, 2010 - 2:28:27 AM


Files produced by the author(s)


  • HAL Id : hal-00495290, version 1


Marie Candito, Benoît Crabbé, Djamé Seddah. On statistical parsing of French with supervised and semi-supervised strategies. EACL 2009 workshop on Computational Linguistic Aspects of Grammatical Inference, Mar 2009, Athens, Greece. pp.49-57. ⟨hal-00495290⟩



Record views


Files downloads