Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia

Abstract : The increasing availability of parallel bilingual corpora and of automatic methods and tools for their processing makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit various corpora available in several languages in order to build bilingual and trilingual terminologies. Typically, terminology information extracted in French and English is associated with the corresponding units in the Ukrainian corpus thanks to the multilingual transfer. According to the used approaches, precision of the term extraction varies between 0.454 and 0.966, while the quality of the interlingual relations varies between 0.309 and 0.965. The resource built contains 4,588 medical terms in Ukrainian and their 34,267 relations with French and English terms.
Type de document :
Communication dans un congrès
Pamela Faber and Thierry Poibeau. International Conference on Terminology and Artificial Intelligence, Jan 2015, Granada, Spain
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01972746
Contributeur : Limsi Publications <>
Soumis le : lundi 7 janvier 2019 - 20:22:29
Dernière modification le : mercredi 13 février 2019 - 01:26:45

Identifiants

  • HAL Id : hal-01972746, version 1

Citation

Thierry Hamon, Natalia Grabar. Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia. Pamela Faber and Thierry Poibeau. International Conference on Terminology and Artificial Intelligence, Jan 2015, Granada, Spain. 〈hal-01972746〉

Partager

Métriques

Consultations de la notice

37