Character-level Annotation for Chinese Surface-Syntactic Universal Dependencies

Abstract : This paper presents a new schema to annotate Chinese Treebanks on the character level. The original Universal Dependencies (UD) and Surface-Syntactic Universal Dependencies (SUD) projects provide token-level resources with rich morphosyntactic language details. However, without any commonly accepted word definition for Chinese, the dependency parsing always faces the dilemma of word segmentation. Therefore we present a character-level annotation schema integrated into the existing Universal Dependencies schema as an extension.
Complete list of metadatas

Cited literature [18 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02270535
Contributor : Kim Gerdes <>
Submitted on : Sunday, August 25, 2019 - 11:35:55 PM
Last modification on : Wednesday, September 4, 2019 - 2:30:03 PM

File

syntaxfest.Character-level Ann...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02270535, version 1

Citation

Chuanming Dong, Yixuan Li, Kim Gerdes. Character-level Annotation for Chinese Surface-Syntactic Universal Dependencies. Depling 2019 - International Conference on Dependency Linguistics, Aug 2019, Paris, France. ⟨hal-02270535⟩

Share

Metrics

Record views

38

Files downloads

21