Promoting multiword expressions in A* TAG parsing

Abstract : Multiword expressions (MWEs) are pervasive in natural languages and often have both idiomatic and compositional readings, which leads to high syntactic ambiguity. We show that for some MWE types idiomatic readings are usually the correct ones. We propose a heuristic for an A* parser for Tree Adjoining Grammars which benefits from this knowledge by promoting MWE-oriented analyses. This strategy leads to a substantial reduction in the parsing search space in case of true positive MWE occurrences, while avoiding parsing failures in case of false positives.
Document type :
Conference papers
Complete list of metadatas

Cited literature [26 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01378903
Contributor : Agata Savary <>
Submitted on : Wednesday, October 12, 2016 - 11:15:24 AM
Last modification on : Tuesday, July 2, 2019 - 4:02:04 PM
Long-term archiving on : Friday, January 13, 2017 - 12:11:57 PM

File

coling-promoting-MWEs.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01378903, version 1

Citation

Jakub Waszczuk, Agata Savary, Yannick Parmentier. Promoting multiword expressions in A* TAG parsing. 26th International Conference on Computational Linguistics (COLING 2016), Dec 2016, Osaka, Japan. ⟨hal-01378903⟩

Share

Metrics

Record views

328

Files downloads

128