A Framework for Understanding the Role of Morphology in Universal Dependency Parsing

Mathieu Dehouck 1, 2 Pascal Denis 2, 1
1 MAGNET - Machine Learning in Information Networks
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189
Abstract : This paper presents a simple framework for characterizing morphological complexity and how it encodes syntactic information. In particular, we propose a new measure of morphosyntactic complexity in terms of governordependent preferential attachment that explains parsing performance. Through experiments on dependency parsing with data from Universal Dependencies (UD), we show that representations derived from morphological attributes deliver important parsing performance improvements over standard word form embeddings when trained on the same datasets. We also show that the new morphosyntactic complexity measure is predictive of the gains provided by using morphological attributes over plain forms on parsing scores, making it a tool to distinguish languages using morphology as a syntactic marker from others.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01943934
Contributor : Pascal Denis <>
Submitted on : Tuesday, December 4, 2018 - 11:50:07 AM
Last modification on : Friday, March 22, 2019 - 1:37:11 AM

File

doc.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01943934, version 1

Citation

Mathieu Dehouck, Pascal Denis. A Framework for Understanding the Role of Morphology in Universal Dependency Parsing. EMNLP 2018 - Conference on Empirical Methods in Natural Language Processing, Oct 2018, Brussels, Belgium. ⟨hal-01943934⟩

Share

Metrics

Record views

67

Files downloads

66