Accelerating CNN inference on FPGAs: A Survey

Abstract : Convolutional Neural Networks (CNNs) are currently adopted to solve an ever greater number of problems, ranging from speech recognition to image classification and segmentation. The large amount of processing required by CNNs calls for dedicated and tailored hardware support methods. Moreover, CNN workloads have a streaming nature, well suited to reconfigurable hardware architectures such as FPGAs. The amount and diversity of research on the subject of CNN FPGA acceleration within the last 3 years demonstrates the tremendous industrial and academic interest. This paper presents a state-of-the-art of CNN inference accelerators over FPGAs. The computational workloads, their parallelism and the involved memory accesses are analyzed. At the level of neurons, optimizations of the convolutional and fully connected layers are explained and the performances of the different methods compared. At the network level, approximate computing and datapath optimization methods are covered and state-of-the-art approaches compared. The methods and tools investigated in this survey represent the recent trends in FPGA CNN inference accelerators and will fuel the future advances on effcient hardware deep learning.
Liste complète des métadonnées

Littérature citée [103 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01695375
Contributeur : Kamel Abdelouahab <>
Soumis le : mardi 13 mars 2018 - 19:36:45
Dernière modification le : mardi 5 février 2019 - 15:58:15
Document(s) archivé(s) le : jeudi 14 juin 2018 - 17:17:38

Fichier

hal-accelerating-cnn.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01695375, version 2

Citation

Kamel Abdelouahab, Maxime Pelcat, François Berry, Jocelyn Sérot. Accelerating CNN inference on FPGAs: A Survey. 2018. 〈hal-01695375v2〉

Partager

Métriques

Consultations de la notice

1034

Téléchargements de fichiers

8853