Deep Geometric Knowledge Distillation with Graphs

Carlos Lassance; Myriam Bontonou; Ghouthi Boukli Hacene; Vincent Gripon; Jian Tang; Antonio Ortega

doi:10.1109/ICASSP40776.2020.9053986

Conference paper, Year: 2020

Carlos Lassance

Role: Author

Lab-STICC_IMTA_CACS_IAS

Département Electronique

Myriam Bontonou

Role: Author
PersonId: 735618
IdHAL: myriam-bontonou
ORCID: 0000-0002-0010-5457

Lab-STICC_IMTA_CACS_IAS

Département Electronique

Ghouthi Boukli Hacene

Role: Author

Lab-STICC_IMTA_CACS_IAS

Département Electronique

Vincent Gripon

Role: Author
PersonId: 21307
IdHAL: vincent-gripon
ORCID: 0000-0002-4353-4542
IdRef: 16122203X

Département Electronique

Lab-STICC_IMTA_CACS_IAS

Jian Tang

Role: Author

Laboratoire de physique des interfaces et des couches minces [Palaiseau]

Antonio Ortega

Role: Author
PersonId: 989486

University of Southern California

Abstract

In most cases, deep learning architectures are trained without regard to the number of operations or the energy consumption. However, some applications, such as embedded systems, are resource-constrained during inference. A popular approach to reducing the size of a deep learning architecture consists of distilling knowledge from a bigger network (teacher) to a smaller one (student). Directly training the student to mimic the teacher representation can be effective, but it requires that both share the same latent space dimensions. In this work, we focus instead on relative knowledge distillation (RKD), which considers the geometry of the respective latent spaces, allowing for dimension-agnostic transfer of knowledge. Specifically, we introduce a graph-based RKD method, in which graphs are used to capture the geometry of latent spaces. Using classical computer vision benchmarks, we demonstrate the ability of the proposed method to efficiently distill knowledge from the teacher to the student, leading to better accuracy for the same budget compared to existing RKD alternatives.
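
As a rough illustration of the idea described in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes cosine similarity as the edge weight of each graph and a squared Frobenius distance between the teacher and student adjacency matrices; both choices, along with the function names `similarity_graph` and `graph_distillation_loss`, are assumptions made for illustration only.

```python
import numpy as np


def similarity_graph(features):
    """Build a batch-by-batch cosine-similarity adjacency matrix.

    `features` has shape (batch_size, latent_dim). The edge weight is an
    assumption made for this sketch, not a detail taken from the abstract.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against zero vectors
    adjacency = unit @ unit.T
    np.fill_diagonal(adjacency, 0.0)  # drop self-loops
    return adjacency


def graph_distillation_loss(teacher_features, student_features):
    """Squared Frobenius distance between the two similarity graphs.

    Hypothetical loss: it compares the geometry of the two latent spaces,
    so the latent dimensions of teacher and student may differ.
    """
    a_teacher = similarity_graph(teacher_features)
    a_student = similarity_graph(student_features)
    return float(np.sum((a_teacher - a_student) ** 2))


# Toy batch of 8 examples: the teacher has 64 latent dimensions, the student 16.
rng = np.random.default_rng(0)
teacher = rng.normal(size=(8, 64))
student = rng.normal(size=(8, 16))
loss = graph_distillation_loss(teacher, student)
```

Because each graph has one node per example in the batch, both adjacency matrices are 8 by 8 here even though the latent widths differ, which is what makes this kind of comparison dimension-agnostic.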

Domains

Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]; Signal and Image Processing [eess.SP]


https://hal.science/hal-02871309

Submitted on: Wednesday, June 17, 2020, 11:11:48

Last modified on: Wednesday, February 7, 2024, 08:57:24

Dates and versions

hal-02871309, version 1 (17-06-2020)

Identifiers

HAL Id: hal-02871309, version 1
DOI: 10.1109/ICASSP40776.2020.9053986

Cite

Carlos Lassance, Myriam Bontonou, Ghouthi Boukli Hacene, Vincent Gripon, Jian Tang, et al. Deep Geometric Knowledge Distillation with Graphs. ICASSP 2020: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp. 8484-8488, ⟨10.1109/ICASSP40776.2020.9053986⟩. ⟨hal-02871309⟩

Collections

UNIV-BREST X INSTITUT-TELECOM CNRS X-PICM X-DEP-PHYS LAB-STICC_UBO LPICM ENIB LAB-STICC IMT-ATLANTIQUE IP_PARIS
