Unsupervised post-tuning of deep neural networks

Christophe Cerisara 1, Paul Caillon 1, Guillaume Le Berre 1
1 SYNALP - Natural Language Processing: representations, inference and semantics, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: We propose in this work a new unsupervised training procedure that is most effective when applied after supervised training and fine-tuning of deep neural network classifiers. While standard regularization techniques combat overfitting by means unrelated to the target classification loss, such as minimizing the L2 norm or adding noise to the data, the model, or the training process, the proposed unsupervised training loss reduces overfitting by optimizing the true classifier risk. The approach is evaluated on several tasks of increasing difficulty and under varying conditions: unsupervised training, post-tuning and anomaly detection. It is tested both on simple neural networks, such as small multi-layer perceptrons, and on complex Natural Language Processing models, e.g., pretrained BERT embeddings. Experimental results confirm the theory and show that the proposed approach gives the best results in the post-tuning condition, i.e., when applied after supervised training and fine-tuning.
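The abstract only summarizes the procedure, so the sketch below illustrates what "post-tuning" means operationally: a classifier is first trained or fine-tuned with the usual supervised cross-entropy loss, then further updated on unlabeled data with an unsupervised objective. The paper's own loss, an estimate of the true classifier risk, is not reproduced here; the prediction-entropy term below is only a hypothetical placeholder for an unsupervised loss, and all names (SmallMLP, supervised_step, post_tune_step) are illustrative, not taken from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical small classifier, standing in for an MLP or a BERT-based model.
class SmallMLP(nn.Module):
    def __init__(self, dim_in, dim_hidden, n_classes):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim_in, dim_hidden), nn.ReLU(),
            nn.Linear(dim_hidden, n_classes),
        )

    def forward(self, x):
        return self.net(x)

def supervised_step(model, opt, x, y):
    """Standard supervised training / fine-tuning step (cross-entropy)."""
    opt.zero_grad()
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    opt.step()
    return loss.item()

def post_tune_step(model, opt, x_unlabeled):
    """Unsupervised post-tuning step on unlabeled data.

    NOTE: the paper optimizes an estimate of the true classifier risk;
    the prediction-entropy term used here is only a stand-in
    unsupervised loss, not the loss proposed by the authors.
    """
    opt.zero_grad()
    probs = F.softmax(model(x_unlabeled), dim=-1)
    unsup_loss = -(probs * probs.clamp_min(1e-8).log()).sum(dim=-1).mean()
    unsup_loss.backward()
    opt.step()
    return unsup_loss.item()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = SmallMLP(dim_in=32, dim_hidden=64, n_classes=3)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)

    # 1) Supervised training / fine-tuning on labeled data (synthetic here).
    x_lab, y_lab = torch.randn(128, 32), torch.randint(0, 3, (128,))
    for _ in range(50):
        supervised_step(model, opt, x_lab, y_lab)

    # 2) Unsupervised post-tuning on unlabeled data.
    x_unlab = torch.randn(256, 32)
    for _ in range(50):
        post_tune_step(model, opt, x_unlab)
```

The point of the sketch is the ordering: the unsupervised phase runs after the supervised one on the already fine-tuned parameters, which is the regime in which the paper reports the approach works best.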
Document type: Conference papers

https://hal.archives-ouvertes.fr/hal-02022062
Contributor: Christophe Cerisara
Submitted on: Thursday, April 15, 2021 - 3:17:25 PM
Last modification on: Sunday, April 18, 2021 - 3:22:14 AM

File

ijcnn.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-02022062, version 2

Citation

Christophe Cerisara, Paul Caillon, Guillaume Le Berre. Unsupervised post-tuning of deep neural networks. IJCNN, Jul 2021, Virtual Event, United States. ⟨hal-02022062v2⟩

Metrics

Record views: 29
File downloads: 15