4University of Utah (201 Presidents Cir, Salt Lake City, UT 84112 - United States)
Abstract : Tiling is a key loop transformation for optimizing tensor computations such as CNNs (Convolutional Neural Networks). Tile optimization involves an explosively large search space for multi-level tiling, including all possible permutations of the tiling loops and all possible valid tile sizes. In this paper, we develop a comprehensive methodology for finding optimized tile configurations with imperfectly nested micro-kernels ("beyond perfect") and outer tile loops optimized via analytical modeling. Experimental results on over 30 CNN benchmarks from three popular DNN pipelines demonstrate the effectiveness of the presented optimization approach by comparing with the Intel oneDNN library.
https://hal.archives-ouvertes.fr/hal-03149553
Contributor : Guillaume Iooss <>
Submitted on : Tuesday, February 23, 2021 - 10:29:42 AM Last modification on : Wednesday, February 24, 2021 - 3:27:31 AM
Nicolas Tollenaere, Auguste Olivry, Guillaume Iooss, Hugo Brunie, Albert Cohen, et al.. Efficient convolution optimisation by composing micro-kernels. 2021. ⟨hal-03149553⟩