Conference paper, 2017

Ternary Neural Networks for Resource-Efficient AI Applications

Abstract

The computation and storage requirements of Deep Neural Networks (DNNs) are usually high. This issue limits their deployability on ubiquitous computing devices such as smartphones, wearables and autonomous drones. In this paper, we propose ternary neural networks (TNNs) to make deep learning more resource-efficient. We train these TNNs using a teacher-student approach based on a novel, layer-wise greedy methodology. Thanks to our two-stage training procedure, the teacher network is still able to use state-of-the-art methods such as dropout and batch normalization to increase accuracy and reduce training time. Using only ternary weights and activations, the student ternary network learns to mimic the behavior of its teacher network without using any multiplication. Unlike its {-1,1} binary counterparts, a ternary neural network inherently prunes the smaller weights by setting them to zero during training. This makes TNNs sparser and thus more energy-efficient. We design a purpose-built hardware architecture for TNNs and implement it on FPGA and ASIC. We evaluate TNNs on several benchmark datasets and demonstrate up to 3.1× better energy efficiency with respect to the state of the art while also improving accuracy.
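
To illustrate the multiplication-free property mentioned in the abstract, the sketch below shows how restricting weights and activations to {-1, 0, +1} lets a dot product reduce to additions and subtractions, and how small weights are pruned to zero. This is only a minimal illustration; the threshold heuristic (a fraction of the mean absolute weight), the function name ternarize, and the 0.7 factor are assumptions for the example, not the paper's actual training procedure.

    # Minimal illustrative sketch (assumed heuristic, not the paper's method).
    import numpy as np

    def ternarize(weights: np.ndarray, delta_factor: float = 0.7) -> np.ndarray:
        """Map real-valued weights to {-1, 0, +1} with a simple threshold."""
        # Assumed threshold heuristic: a fraction of the mean absolute weight.
        delta = delta_factor * np.mean(np.abs(weights))
        ternary = np.zeros_like(weights)
        ternary[weights > delta] = 1.0    # large positive weights -> +1
        ternary[weights < -delta] = -1.0  # large negative weights -> -1
        return ternary                    # small weights stay 0 (implicit pruning)

    # With ternary weights, a dot product needs no multiplications:
    w = np.random.randn(8)
    t = ternarize(w)
    x = np.sign(np.random.randn(8))       # +/-1 activations for illustration
    y = np.sum(x[t == 1]) - np.sum(x[t == -1])   # equals dot(x, t), add/sub only
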
Main file: paper_TNN_arxiv.pdf (364.62 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01481478, version 1 (02-03-2017)

Identifiers

Cite

Hande Alemdar, Vincent Leroy, Adrien Prost-Boucle, Frédéric Pétrot. Ternary Neural Networks for Resource-Efficient AI Applications. International Joint Conference on Neural Networks, May 2017, Anchorage, United States. ⟨hal-01481478⟩
262 views
1352 downloads

