A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

Résumé

The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their "black-box" nature yields the downside of poor explainability: in particular, several real-world applications require-to varying extents-reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.
Fichier principal
Vignette du fichier
wcci2022 (15).pdf (310.3 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03792800 , version 1 (30-09-2022)

Identifiants

Citer

Grégor Jouet, Clément Duhart, Jacopo Staiano, Francis Rousseaux, Cyril de Runz. A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models. International Joint Conference on Neural Networks (IJCNN), 2022, Padoue, Italy. pp.1-8, ⟨10.1109/IJCNN55064.2022.9892324⟩. ⟨hal-03792800⟩
44 Consultations
58 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More