Word Sense Disambiguation for a Less-Resourced Language: The Example of Arabic

Abstract : Sense-annotated corpora are essential resources for Word Sense Disambiguation (WSD). Most languages have none, or too few to build robust systems. In this article, we present 12 sense-annotated corpora for the Arabic language, automatically built from 12 English corpora. We evaluate the quality of our WSD systems using a newly available Arabic evaluation corpus.

https://hal.archives-ouvertes.fr/hal-01781185
Contributor : Didier Schwab
Submitted on : Sunday, April 29, 2018 - 5:17:28 PM
Last modification on : Monday, February 11, 2019 - 4:36:02 PM
Document(s) archived on : Tuesday, September 25, 2018 - 12:57:08 PM

File

TALN2018_Hajj-Salah-et-al.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01781185, version 1

Citation

Marwa Hadj Salah, Loïc Vial, Hervé Blanchon, Mounir Zrigui, Benjamin Lecouteux, et al. La désambiguïsation lexicale d'une langue moins bien dotée, l'exemple de l'arabe. 25e conférence sur le Traitement Automatique des Langues Naturelles, May 2018, Rennes, France. ⟨hal-01781185⟩
