Accelerating CNN inference on FPGAs: A Survey

Kamel Abdelouahab; Maxime Pelcat; François Berry; Jocelyn Sérot

Preprint, Working Paper. Year: 2018

Kamel Abdelouahab

Role: Author
PersonId : 15782
IdHAL : kamel-abdelouahab
ORCID : 0000-0003-0544-1457
IdRef : 234599979

Institut Pascal

Maxime Pelcat

Role: Author
PersonId : 15780
IdHAL : mpelcat
ORCID : 0000-0002-1158-0915
IdRef : 148709060

Institut d'Électronique et des Technologies du numéRique

Institut Pascal

François Berry

Role: Author
PersonId : 15755
IdHAL : francois-berry
ORCID : 0000-0002-5899-4672
IdRef : 170520552

Institut Pascal

Jocelyn Sérot

Role: Author
PersonId : 15891
IdHAL : jocelyn-serot
IdRef : 138522596

Institut Pascal

Abstract

Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of problems, ranging from speech recognition to image classification and segmentation. The large amount of processing required by CNNs calls for dedicated and tailored hardware support. Moreover, CNN workloads have a streaming nature, well suited to reconfigurable hardware architectures such as FPGAs. The amount and diversity of research on CNN acceleration on FPGAs within the last 3 years demonstrates tremendous industrial and academic interest. This paper surveys the state of the art in CNN inference accelerators on FPGAs. The computational workloads, their parallelism, and the memory accesses involved are analyzed. At the level of neurons, optimizations of the convolutional and fully connected layers are explained and the performance of the different methods is compared. At the network level, approximate computing and datapath optimization methods are covered and state-of-the-art approaches are compared. The methods and tools investigated in this survey represent the recent trends in FPGA CNN inference accelerators and will fuel future advances in efficient hardware deep learning.

Domains

Hardware Architecture [cs.AR]; Computer Vision and Pattern Recognition [cs.CV]

Main file

hal-accelerating-cnn.pdf (3.8 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-01695375

Submitted on: Tuesday, March 13, 2018, 19:36:45

Last modified on: Saturday, April 22, 2023, 04:24:11

Long-term archiving on: Thursday, June 14, 2018, 17:17:38

Dates and versions

hal-01695375, version 1 (29-01-2018)

hal-01695375, version 2 (13-03-2018)

Identifiers

HAL Id: hal-01695375, version 2

Cite

Kamel Abdelouahab, Maxime Pelcat, François Berry, Jocelyn Sérot. Accelerating CNN inference on FPGAs: A Survey. 2018. ⟨hal-01695375v2⟩

Collections

UNIV-NANTES UNIV-RENNES1 PRES_CLERMONT CNRS INSA-RENNES IETR INSTITUT_PASCAL CENTRALESUPELEC UR1-MATH-STIC UR1-UFR-ISTIC IETR-VAADER UNIV-RENNES INSA-GROUPE UR1-MATH-NUM NANTES-UNIVERSITE

1712 views

13561 downloads
