Generating and using probabilistic morphological resources for the biomedical domain

Vincent Claveau 1 Ewa Kijak 1
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds... When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources.
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01027778
Contributor : Vincent Claveau <>
Submitted on : Tuesday, July 22, 2014 - 12:15:49 PM
Last modification on : Friday, November 16, 2018 - 1:25:30 AM
Long-term archiving on : Tuesday, November 25, 2014 - 10:51:01 AM

File

Claveau_Kijak_LREC14.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01027778, version 1

Citation

Vincent Claveau, Ewa Kijak. Generating and using probabilistic morphological resources for the biomedical domain. 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p. ⟨hal-01027778⟩

Share

Metrics

Record views

795

Files downloads

194