Conference paper. Year: 2020

Efficient Compression Technique for NoC-based Deep Neural Network Accelerators

Abstract

Deep Neural Networks (DNNs) are powerful models widely used in many applications. However, such networks are computation- and memory-intensive, which makes them difficult to implement on hardware-constrained systems that may use a network-on-chip (NoC) as interconnect infrastructure. One way to reduce the traffic between memory and the processing elements is to compress the information before it is exchanged over the network. In particular, our work focuses on reducing the huge number of DNN parameters, i.e., the weights. In this paper, we propose a flexible, low-complexity compression technique that preserves DNN performance, reducing both the memory footprint and the volume of data to be exchanged while requiring few hardware resources. The technique is evaluated on several DNN models, achieving a compression rate close to 80% without significant loss of accuracy on AlexNet, ResNet, and LeNet-5.
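The abstract does not detail the compression scheme itself. As a generic, hedged illustration of how compressing DNN weights can cut memory footprint and on-chip traffic at low hardware cost, the sketch below applies simple uniform 8-bit quantization to a float32 weight tensor (storing 8-bit codes plus one scale yields a 75% size reduction, in the same ballpark as the ~80% rate reported). The function names and the NumPy dependency are my own, not the paper's.

```python
import numpy as np

def quantize_weights(w, num_bits=8):
    """Uniformly quantize float32 weights to signed integer codes.

    Generic illustration only, not the paper's specific scheme.
    Returns the integer codes and the scale needed to dequantize.
    """
    qmax = 2 ** (num_bits - 1) - 1          # e.g. 127 for 8-bit codes
    scale = float(np.max(np.abs(w))) / qmax
    if scale == 0.0:                        # all-zero tensor edge case
        scale = 1.0
    codes = np.round(w / scale).astype(np.int8)
    return codes, scale

def dequantize_weights(codes, scale):
    """Recover approximate float32 weights from the compact codes."""
    return codes.astype(np.float32) * scale

# Example: 32-bit weights stored as 8-bit codes -> 75% smaller payload
rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.05, size=1024).astype(np.float32)
codes, scale = quantize_weights(w)
w_hat = dequantize_weights(codes, scale)

ratio = 1 - codes.nbytes / w.nbytes        # fraction of bytes saved
err = float(np.max(np.abs(w - w_hat)))     # worst-case reconstruction error
```

In an NoC setting, only the small integer codes (plus the per-tensor scale) would travel between memory and the processing elements, which dequantize locally; round-to-nearest bounds the per-weight error by half the quantization step.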
File not deposited

Dates and versions

hal-02903208 , version 1 (20-07-2020)

Identifiers

  • HAL Id : hal-02903208 , version 1

Cite

Jordane Lorandel, Habiba Lahdhiri, Emmanuelle Bourdel, Salvatore Monteleone, Maurizio Palesi. Efficient Compression Technique for NoC-based Deep Neural Network Accelerators. Euromicro Conference on Digital System Design (DSD 2020), Aug 2020, Portoroz, Slovenia. ⟨hal-02903208⟩
