A syntactic component for Vietnamese language processing - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Language Modelling Année : 2015

A syntactic component for Vietnamese language processing

Résumé

This paper presents the development of a grammar and a syntactic parser for the Vietnamese language. We first discuss the construction of a lexicalized tree-adjoining grammar using an automatic extraction approach. We then present the construction and evaluation of a deep syntactic parser based on the extracted grammar. This is a complete system that produces syntactic structures for Vietnamese sentences. A dependency annotation scheme for Vietnamese and an algorithm for extracting dependency structures from derivation trees are also proposed. This is the first Vietnamese parsing system capable of producing both constituency and dependency analyses. It offers encouraging performance: accuracy of 69.33% and 73.21% for constituency and dependency analysis, respectively.
Fichier principal
Vignette du fichier
89-838-1-PB.pdf (314.33 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01255977 , version 1 (25-01-2016)

Identifiants

Citer

Phuong Le-Hong, Azim Roussanaly, Thi Minh Huen Nguyen. A syntactic component for Vietnamese language processing. Journal of Language Modelling, 2015, Journal of Language Modelling, 3 (1), pp.146-184. ⟨10.15398/jlm.v3i1.89⟩. ⟨hal-01255977⟩
171 Consultations
926 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More