Supervised learning for the detection of negation and of its scope in French and Brazilian Portuguese biomedical corpora - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Natural Language Engineering Année : 2020

Supervised learning for the detection of negation and of its scope in French and Brazilian Portuguese biomedical corpora

Résumé

Automatic detection of negated content is often a prerequisite in information extraction systems in various domains. In the biomedical domain especially, this task is important because negation plays an important role. In this work, two main contributions are proposed. First, we work with languages which have been poorly addressed up to now: Brazilian Portuguese and French. Thus, we developed new corpora for these two languages which have been manually annotated for marking up the negation cues and their scope. Second, we propose automatic methods based on supervised machine learning approaches for the automatic detection of negation marks and of their scopes. The methods show to be robust in both languages (Brazilian Portuguese and French) and in cross-domain (general and biomedical languages) contexts. The approach is also validated on English data from the state of the art: it yields very good results and outperforms other existing approaches. Besides, the application is accessible and usable online. We assume that, through these issues (new annotated corpora, application accessible online, and cross-domain robustness), the reproducibility of the results and the robustness of the NLP applications will be augmented.
Fichier principal
Vignette du fichier
supervised_learning_for_the_detection_of_negation_and_of_its_scope_in_french_and_brazilian_portuguese_biomedical_corpora.pdf (1.9 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03021033 , version 1 (24-11-2020)

Identifiants

Citer

Clément Dalloux, Vincent Claveau, Natalia Grabar, Lucas Emanuel Silva Oliveira, Claudia Maria Cabral Moro, et al.. Supervised learning for the detection of negation and of its scope in French and Brazilian Portuguese biomedical corpora. Natural Language Engineering, 2020, ⟨10.1017/S1351324920000352⟩. ⟨hal-03021033⟩
90 Consultations
86 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More