MarsaGram: an Excursion in the Forests of Parsing Trees - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

MarsaGram: an Excursion in the Forests of Parsing Trees

Philippe Blache
Stéphane Rauzy

Résumé

The question of how to compare languages and more generally the domain of linguistic typology, relies on the study of different linguistic properties or phenomena. Classically, such a comparison is done semi-manually, for example by extracting information from databases such as the WALS. However, it remains difficult to identify precisely regular parameters, available for different languages, that can be used as a basis towards modeling. We propose in this paper, focusing on the question of syntactic typology, a method for automatically extracting such parameters from treebanks, bringing them into a typology perspective. We present the method and the tools for inferring such information and navigating through the treebanks. The approach has been applied to 10 languages of the Universal Dependencies Treebank. We approach is evaluated by showing how automatic classification correlates with language families.
Fichier principal
Vignette du fichier
lrec16-MarsaGram-final.pdf (607.93 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01462181 , version 1 (19-04-2017)

Identifiants

  • HAL Id : hal-01462181 , version 1

Citer

Philippe Blache, Stéphane Rauzy, Grégoire Montcheuil. MarsaGram: an Excursion in the Forests of Parsing Trees. Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia. pp.7. ⟨hal-01462181⟩
111 Consultations
77 Téléchargements

Partager

Gmail Facebook X LinkedIn More