Optimal Transport for Deep Joint Transfer Learning

Ying Lu; Liming Chen; Alexandre Saidi

Preprint, Working Paper. Year: 2019

Ying Lu

Role: Author

Faculty of Chemistry

Liming Chen

Role: Author
PersonId : 7562
IdHAL : liming-chen
IdRef : 067400175

Extraction de Caractéristiques et Identification

Alexandre Saidi

Role: Author
PersonId : 7220
IdHAL : alexandre-saidi
IdRef : 183059727

Extraction de Caractéristiques et Identification

Abstract

Training a Deep Neural Network (DNN) from scratch requires a large amount of labeled data. For a classification task where only a small amount of training data is available, a common solution is to fine-tune a DNN that has been pre-trained on related source data. This consecutive training process is time-consuming and does not explicitly consider the relatedness between the source and target tasks. In this paper, we propose a novel method to jointly fine-tune a Deep Neural Network with source data and target data. By adding an Optimal Transport loss (OT loss) between source and target classifier predictions as a constraint on the source classifier, the proposed Joint Transfer Learning Network (JTLN) can effectively learn useful knowledge for target classification from source data. Furthermore, by using different kinds of metrics as the cost matrix for the OT loss, JTLN can incorporate different prior knowledge about the relatedness between target categories and source categories. We carried out experiments with JTLN based on AlexNet on image classification datasets, and the results verify the effectiveness of the proposed JTLN in comparison with standard consecutive fine-tuning. This Joint Transfer Learning with OT loss is general and can also be applied to other kinds of neural networks.
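The abstract does not say how the OT loss is computed, so the following is only a minimal sketch of the general idea: an optimal transport cost between a prediction over source categories and a prediction over target categories, weighted by a cost matrix that encodes category relatedness. It assumes entropy-regularized OT solved with Sinkhorn iterations, which may differ from the authors' formulation. The function name `sinkhorn_ot_loss`, the regularization value, and the example cost matrix are all hypothetical.

```python
import numpy as np

def sinkhorn_ot_loss(p_source, p_target, cost, reg=0.5, n_iters=200):
    """Entropy-regularized OT cost between two discrete distributions.

    Assumption: Sinkhorn iterations are used as the solver; the abstract
    does not specify one. Returns (transport cost, transport plan).
    """
    p_source = np.asarray(p_source, dtype=float)
    p_target = np.asarray(p_target, dtype=float)
    cost = np.asarray(cost, dtype=float)

    kernel = np.exp(-cost / reg)   # Gibbs kernel, shape (n_source, n_target)
    u = np.ones_like(p_source)
    v = np.ones_like(p_target)
    for _ in range(n_iters):
        u = p_source / (kernel @ v)     # rescale rows toward p_source
        v = p_target / (kernel.T @ u)   # rescale columns toward p_target

    plan = u[:, None] * kernel * v[None, :]
    return float(np.sum(plan * cost)), plan

# Hypothetical setup: 3 source categories, 2 target categories.
# cost[i, j] is small when source category i is related to target category j.
cost = np.array([[0.0, 1.0],
                 [0.5, 0.5],
                 [1.0, 0.0]])
p_source = np.array([0.7, 0.2, 0.1])   # softmax output of a source classifier
p_target = np.array([0.6, 0.4])        # softmax output of a target classifier

loss, plan = sinkhorn_ot_loss(p_source, p_target, cost)

# Predictions that agree on related categories are cheap to transport;
# predictions that land on unrelated categories are expensive.
loss_related, _ = sinkhorn_ot_loss([1.0, 0.0, 0.0], [1.0, 0.0], cost)
loss_unrelated, _ = sinkhorn_ot_loss([1.0, 0.0, 0.0], [0.0, 1.0], cost)
```

Swapping in a different cost matrix changes which source and target categories are treated as related, which is how the prior knowledge mentioned in the abstract would enter a loss of this kind.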

Domains

Computer Science [cs] / Artificial Intelligence [cs.AI]

Contributor: Alexandre Saidi

https://hal.science/hal-02075446

Submitted on: Thursday, March 21, 2019, 14:10:10

Last modified on: Wednesday, July 5, 2023, 15:28:04

Dates and versions

hal-02075446 , version 1 (21-03-2019)

Identifiers

HAL Id : hal-02075446 , version 1
ARXIV : 1709.02995

Cite

Ying Lu, Liming Chen, Alexandre Saidi. Optimal Transport for Deep Joint Transfer Learning. 2019. ⟨hal-02075446⟩

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INSA-GROUPE UDL EC_LYON_STRICT
