Generating and using probabilistic morphological resources for the biomedical domain

Vincent Claveau 1 Ewa Kijak 1
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In most Indo-European languages, many biomedical terms are morphologically rich structures composed of several constituents, mainly of Greek or Latin origin. Interpreting these compounds is key to accessing information. In this paper, we present morphological resources designed to handle these biomedical compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are built automatically using alignment techniques, with Japanese terms written in kanji serving as a pivot language. We show how this alignment information can be used to segment compounds, attach a semantic interpretation to each part, and propose definitions (glosses) of the compounds, among other tasks. Where possible, these tasks are compared with state-of-the-art tools, and the results demonstrate the value of our automatically built probabilistic resources.
Contributor : Vincent Claveau
Submitted on : Tuesday, July 22, 2014 - 12:15:49 PM
Last modification on : Friday, November 16, 2018 - 1:25:30 AM
Long-term archiving on : Tuesday, November 25, 2014 - 10:51:01 AM


Files produced by the author(s)


  • HAL Id : hal-01027778, version 1


Vincent Claveau, Ewa Kijak. Generating and using probabilistic morphological resources for the biomedical domain. 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p. ⟨hal-01027778⟩


