Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia

Abstract : The increasing availability of parallel bilingual corpora and of automatic methods and tools for their processing makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit various corpora available in several languages in order to build bilingual and trilingual terminologies. Typically, terminology information extracted in French and English is associated with the corresponding units in the Ukrainian corpus thanks to the multilingual transfer. According to the used approaches, precision of the term extraction varies between 0.454 and 0.966, while the quality of the interlingual relations varies between 0.309 and 0.965. The resource built contains 4,588 medical terms in Ukrainian and their 34,267 relations with French and English terms.
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01972746
Contributor : Limsi Publications <>
Submitted on : Monday, January 7, 2019 - 8:22:29 PM
Last modification on : Monday, March 18, 2019 - 4:21:21 PM

Identifiers

  • HAL Id : hal-01972746, version 1

Citation

Thierry Hamon, Natalia Grabar. Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia. International Conference on Terminology and Artificial Intelligence, Pamela Faber and Thierry Poibeau, Jan 2015, Granada, Spain. ⟨hal-01972746⟩

Share

Metrics

Record views

99