Extracting food-drug interactions from scientific literature: relation clustering to address lack of data

Tsanta Randriatsitohaina; Thierry Hamon

Communication Dans Un Congrès Année : 2019

Extracting food-drug interactions from scientific literature: relation clustering to address lack of data

(1) , (1, 2)

1
2

Tsanta Randriatsitohaina

Fonction : Auteur
PersonId : 1034329

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Thierry Hamon

Fonction : Auteur
PersonId : 11519
IdHAL : thierry-hamon
ORCID : 0000-0002-1521-4875
IdRef : 069054711

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Université Paris 13

Résumé

Food-Drug Interaction (FDI) occurs when food and drug are taken simultaneously and cause unexpected effect. This paper tackles the problem of mining scientific literature in order to extract these interactions. We consider this problem as a relation extraction task which can be solved with classification method. Since Food-Drug Interactions need a fine-grained description with many relation types, we face the data sparseness and the lack of examples per type of relation. To address this issue, we propose an effective approach for grouping relations sharing similar representation into clusters and reducing the lack of examples. Cluster labels are then used as labels of the dataset given to classifiers for the FDI type identification. Our approach, relying on the extraction of relevant features before, between, and after the entities associated by the relation, improves significantly the performance of the FDI classification. Finally, we contrast an intuitive grouping method based on the definition of the relation types and a unsupervised clustering based on the instances of each relation type.

Mots clés

Food-Drug Interaction Medical text Machine learning Clustering

Domaines

Informatique [cs] Informatique et langage [cs.CL]

Limsi Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02122766

Soumis le : mardi 7 mai 2019-16:44:25

Dernière modification le : samedi 7 octobre 2023-21:36:20

Dates et versions

hal-02122766 , version 1 (07-05-2019)

Identifiants

HAL Id : hal-02122766 , version 1

Citer

Tsanta Randriatsitohaina, Thierry Hamon. Extracting food-drug interactions from scientific literature: relation clustering to address lack of data. International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2019, La Rochelle, France. ⟨hal-02122766⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PARIS13 CNRS LIMSI USPC UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE SORBONNE-PARIS-NORD LISN GS-ENGINEERING GS-COMPUTER-SCIENCE

89 Consultations

0 Téléchargements

Extracting food-drug interactions from scientific literature: relation clustering to address lack of data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager