A multi-level optimization strategy to improve the performance of the stencil computation

Gauthier Sornet (1, 2), Fabrice Dupros (1), Sylvain Jubertie (2)
(2) PaMDA, LIFO - Laboratoire d'Informatique Fondamentale d'Orléans
Abstract: Stencil computation is an important numerical kernel in scientific computing. Leveraging multicore or manycore parallelism to optimize such operations is a major challenge because of both the high bandwidth demand and the low arithmetic intensity. The situation is worsened by the complexity of current architectures and the potential impact of various mechanisms (cache memory, vectorization, compilation). In this paper, we describe a multi-level optimization strategy that combines manual vectorization, space tiling and stencil composition. A major effort of this study is the comparison of our results with the Pochoir stencil compiler framework. We evaluate our methodology with a set of three different compilers (Intel, Clang and GCC) on two recent generations of Intel multicore platforms. Our results show a good match with the theoretical performance models (i.e. roofline models). We also outperform Pochoir by a factor of 2.5 in the best cases.
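As an illustration of two notions named in the abstract, the following minimal Python sketch shows a space-tiled stencil sweep next to an untiled one, together with the standard roofline bound (attainable performance is the minimum of peak compute and memory bandwidth times arithmetic intensity). It is not the authors' implementation: the 5-point averaging stencil, the function names `stencil_naive`, `stencil_tiled` and `roofline`, and the tile size are all assumptions chosen for readability, and the paper's manual vectorization and stencil composition are not covered here.

```python
def stencil_naive(grid):
    """One sweep of a 5-point averaging stencil over the interior of a 2D grid."""
    n, m = len(grid), len(grid[0])
    out = [row[:] for row in grid]  # boundary values are copied unchanged
    for i in range(1, n - 1):
        for j in range(1, m - 1):
            out[i][j] = 0.2 * (grid[i][j] + grid[i - 1][j] + grid[i + 1][j]
                               + grid[i][j - 1] + grid[i][j + 1])
    return out


def stencil_tiled(grid, tile=4):
    """Same sweep, but the interior is visited tile by tile (space tiling).

    Visiting small blocks keeps the working set compact, which is the idea
    behind cache-oriented tiling; the result is identical to the naive sweep.
    The tile size is a hypothetical value, not one taken from the paper.
    """
    n, m = len(grid), len(grid[0])
    out = [row[:] for row in grid]
    for ii in range(1, n - 1, tile):          # loop over tile origins (rows)
        for jj in range(1, m - 1, tile):      # loop over tile origins (columns)
            for i in range(ii, min(ii + tile, n - 1)):
                for j in range(jj, min(jj + tile, m - 1)):
                    out[i][j] = 0.2 * (grid[i][j] + grid[i - 1][j] + grid[i + 1][j]
                                       + grid[i][j - 1] + grid[i][j + 1])
    return out


def roofline(peak_gflops, bandwidth_gbs, flops_per_byte):
    """Roofline bound in GFLOP/s for a kernel with the given arithmetic intensity."""
    return min(peak_gflops, bandwidth_gbs * flops_per_byte)
```

With hypothetical machine figures of 100 GFLOP/s peak and 50 GB/s bandwidth, `roofline(100.0, 50.0, 0.5)` is bounded by memory traffic, which is the regime the abstract describes for low-arithmetic-intensity stencils.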
Document type:
Conference paper
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, Jun 2017, Zurich, Switzerland. 2017
https://hal-brgm.archives-ouvertes.fr/hal-01500637
Contributor: Fabrice Dupros
Submitted on: Monday, April 3, 2017 - 15:13:31
Last modified on: Wednesday, May 16, 2018 - 12:14:01

Identifiers

  • HAL Id: hal-01500637, version 1

Citation

Gauthier Sornet, Fabrice Dupros, Sylvain Jubertie. A multi-level optimization strategy to improve the performance of the stencil computation. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, Jun 2017, Zurich, Switzerland. 2017. 〈hal-01500637〉
