Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia

Résumé

The increasing availability of parallel bilingual corpora and of automatic methods and tools for their processing makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit various corpora available in several languages in order to build bilingual and trilingual terminologies. Typically, terminology information extracted in French and English is associated with the corresponding units in the Ukrainian corpus thanks to the multilingual transfer. According to the used approaches, precision of the term extraction varies between 0.454 and 0.966, while the quality of the interlingual relations varies between 0.309 and 0.965. The resource built contains 4,588 medical terms in Ukrainian and their 34,267 relations with French and English terms.
Fichier non déposé

Dates et versions

hal-01972746 , version 1 (07-01-2019)

Identifiants

  • HAL Id : hal-01972746 , version 1

Citer

Thierry Hamon, Natalia Grabar. Acquisition of Medical Terminology for Ukrainian from Parallel Corpora and Wikipedia. International Conference on Terminology and Artificial Intelligence, Pamela Faber and Thierry Poibeau, Jan 2015, Granada, Spain. ⟨hal-01972746⟩
209 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More