Unsupervised and semi-supervised morphological analysis for Information Retrieval in the biomedical domain - HAL open archive
Conference paper, Year: 2012

Unsupervised and semi-supervised morphological analysis for Information Retrieval in the biomedical domain

Vincent Claveau

Abstract

In the biomedical field, specialized terms are the key to accessing information. However, in most Indo-European languages, these terms are morphologically complex. The aim of the presented work is to identify the meaningful components of these terms and to use this analysis to improve biomedical Information Retrieval. We present an approach that combines automatic alignment through a pivot language with analogical learning, and that allows an accurate morphological analysis of terms. These morphological analyses are then used to improve the indexing of medical documents. The experiments reported in this paper show the validity of this approach, with a 10% improvement in MAP over a standard IR system.
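For readers unfamiliar with the metric quoted above, MAP (mean average precision) averages, over all queries, the precision measured at the rank of each relevant document retrieved. The following is a minimal sketch of the standard definition, not the evaluation code used in the paper; the function names, document ids, and query ids are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked result list.

    `ranked` is the list of document ids returned for a query, best first.
    `relevant` is the set of document ids judged relevant for that query.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at the rank where a relevant document is found.
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of the per-query average precision over all judged queries."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(query_id, []), relevant)
        for query_id, relevant in qrels.items()
    )
    return total / len(qrels)


# Hypothetical toy data: two queries with made-up document ids.
example_runs = {"q1": ["d1", "d2", "d3"], "q2": ["d5", "d4"]}
example_qrels = {"q1": {"d1", "d3"}, "q2": {"d4"}}
example_map = mean_average_precision(example_runs, example_qrels)
```

With this definition, comparing two indexing strategies amounts to computing the score once per system on the same queries and relevance judgments and comparing the two values.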
Main file
Claveau_Kijak_Coling2012.pdf (696.9 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00760114, version 1 (03-12-2012)

Identifiers

  • HAL Id: hal-00760114, version 1

Cite

Vincent Claveau. Unsupervised and semi-supervised morphological analysis for Information Retrieval in the biomedical domain. COLING - 24th International Conference on Computational Linguistics, Dec 2012, Mumbai, India. ⟨hal-00760114⟩
210 views
180 downloads
