Morphological Analysis of Biomedical Terminology with Analogy-Based Alignment

Vincent Claveau 1 Ewa Kijak 1
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In the biomedical domain, many terms are neoclassical compounds (composed of several Greek or Latin roots). The study of their morphology is important for numerous applications since it makes it possible to structure, translate, retrieve them efficiently... In this paper, we propose an original yet fruitful approach to carry out this morphological analysis by relying on Japanese, more precisely on terms written in kanjis, as a pivot language. In order to do so, we have developed a specially crafted alignment algorithm relying on analogy learning. Aligning terms with their kanji-based counterparts provides at the same time a decomposition of the term into morphs, and a kanji label for each morph. Evaluated on a dataset of French terms, our approach yields a precision greater than 70% and shows its relevance compared with existing techniques. We also illustrate the interest of this approach through two direct applications of the produced alignments: translating unknown terms and discovering relationships between morphs for terminological structuring.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00644041
Contributor : Vincent Claveau <>
Submitted on : Wednesday, November 23, 2011 - 2:46:31 PM
Last modification on : Friday, November 16, 2018 - 1:24:58 AM

Identifiers

  • HAL Id : hal-00644041, version 1

Citation

Vincent Claveau, Ewa Kijak. Morphological Analysis of Biomedical Terminology with Analogy-Based Alignment. Recent Advances in Natural Language Processing, 2011, Bulgaria. ⟨hal-00644041⟩

Share

Metrics

Record views

460