A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

Grégor Jouet; Clément Duhart; Jacopo Staiano; Francis Rousseaux; Cyril de Runz

doi:10.1109/IJCNN55064.2022.9892324

Conference paper. Year: 2022

Grégor Jouet

Role: Author
PersonId: 1169195

Bases de données et traitement des langues naturelles

Pôle Universitaire Léonard de Vinci

reciTAL

Clément Duhart

Role: Author
PersonId: 1169196

Pôle Universitaire Léonard de Vinci

Jacopo Staiano

Role: Author
PersonId: 1109226

reciTAL

Francis Rousseaux

Role: Author
PersonId: 178672
IdHAL: francis-rousseaux
ORCID: 0000-0002-9619-0122
IdRef: 110602218

Centre de Recherche en Sciences et Technologies de l'Information et de la Communication - EA 3804

Cyril de Runz

Role: Author
PersonId: 19574
IdHAL: cyril-de-runz
ORCID: 0000-0002-5951-6859
IdRef: 130379832

Bases de données et traitement des langues naturelles

Abstract

The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their "black-box" nature comes at the cost of poor explainability: in particular, several real-world applications require, to varying extents, reliable confidence scores associated with a model's predictions. The relation between a model's accuracy and its confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to current methods.
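To make the notion of calibration concrete, the sketch below computes the Expected Calibration Error (ECE), a commonly used metric that compares average confidence with observed accuracy inside confidence bins. It is an illustration only: this record does not state which calibration metric the paper reports, and the function name, the equal-width binning, and the default of 10 bins are assumptions made for this example.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Illustrative ECE: weighted mean of |accuracy - confidence| per bin.

    confidences: predicted probabilities in [0, 1], one per prediction.
    correct:     1 (or True) where the prediction was right, else 0.
    n_bins:      number of equal-width confidence bins (assumed default).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    total = len(confidences)
    if total == 0:
        return 0.0

    # Per bin: [number of predictions, sum of confidences, number correct].
    bins = [[0, 0.0, 0] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Bins are [lo, hi); a confidence of exactly 1.0 goes in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx][0] += 1
        bins[idx][1] += conf
        bins[idx][2] += int(bool(ok))

    ece = 0.0
    for count, conf_sum, hits in bins:
        if count == 0:
            continue  # empty bins contribute nothing
        accuracy = hits / count
        mean_confidence = conf_sum / count
        ece += (count / total) * abs(accuracy - mean_confidence)
    return ece
```

For instance, four predictions made at confidence 0.9, of which three are correct, give an ECE of about 0.15: the model is overconfident by 15 points. A value of 0 means that confidence matches accuracy in every populated bin.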

Keywords

calibration, NER, uncertainty, noise injection

Domains

Computer Science [cs]

Main file

wcci2022 (15).pdf (310.3 KB)

Origin: Files produced by the author(s)

Contributor: Cyril DE RUNZ

https://hal.science/hal-03792800

Submitted on: Friday, September 30, 2022, 14:07:35

Last modified on: Friday, April 19, 2024, 13:44:22

Dates and versions

hal-03792800, version 1 (30-09-2022)

Identifiers

HAL Id: hal-03792800, version 1
DOI: 10.1109/IJCNN55064.2022.9892324

Cite

Grégor Jouet, Clément Duhart, Jacopo Staiano, Francis Rousseaux, Cyril de Runz. A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models. International Joint Conference on Neural Networks (IJCNN), 2022, Padua, Italy. pp.1-8, ⟨10.1109/IJCNN55064.2022.9892324⟩. ⟨hal-03792800⟩

Collections

UNIV-TOURS CNRS URCA LIBDTLN CRESTIC LIFAT INSA-GROUPE INSA-CVL

44 views

58 downloads
