Enabling the UCD-SPH code on the Xeon Phi

Christian Lalanne; Ashkan Rafiee; Denys Dutykh; Michael Lysaght; Frédéric Dias

Report, Year: 2014


Christian Lalanne

Role: Author
PersonId : 950912

Irish Centre for High-End Computing

Ashkan Rafiee

Role: Author
PersonId : 942177

Carnegie Wave Energy Ltd

School of Mathematical Sciences [Dublin]

Denys Dutykh

Role: Corresponding author
PersonId : 702
IdHAL : denys-dutykh
ORCID : 0000-0001-5247-2788
IdRef : 17232081X

School of Mathematical Sciences [Dublin]

Laboratoire de Mathématiques

Michael Lysaght

Role: Author
PersonId : 950913

Irish Centre for High-End Computing

Frédéric Dias

Role: Author
PersonId : 836633

School of Mathematical Sciences [Dublin]

Abstract

This white paper reports on our efforts to enable an SPH-based Fortran code on the Intel Xeon Phi. As a result of the work described here, the two most computationally intensive subroutines of the UCD-SPH code (rates and shepard_beta) were refactored and parallelised with OpenMP for the first time, enabling the code to run on multi-core and many-core shared-memory systems. On a two-processor Sandy Bridge Xeon E5 machine, this parallelisation achieved speedups of up to 4.3x for the rates subroutine and 6.0x for the shepard_beta subroutine, giving overall speedups of up to 4.2x. The code was subsequently enabled and refactored to execute in different modes on the Intel Xeon Phi co-processor, achieving speedups of up to 2.8x for the rates subroutine and up to 3.8x for the shepard_beta subroutine, and overall speedups of up to 2.7x compared to the original unoptimised code. To explore the capabilities of auto-vectorisation, the shepard_beta subroutine was refactored further, which resulted in speedups of up to 6.4x relative to its original unoptimised version. The development and testing phases of the project were carried out on the PRACE EURORA machine.

Subject areas

Fluid mechanics [physics.class-ph]; Numerical analysis [cs.NA]; Parallel, distributed, and shared computing [cs.DC]; Computational physics [physics.comp-ph]; Numerical analysis [math.NA]; Fluid dynamics [physics.flu-dyn]

Main file

WP131.pdf (644.69 KB)

Origin: Files produced by the author(s)

Contributor: Denys Dutykh

https://hal.science/hal-00927227

Submitted on: Wednesday, 22 January 2014, 14:39:18

Last modified on: Thursday, 4 April 2024, 20:59:55

Long-term archiving on: Wednesday, 23 April 2014, 04:41:43

Dates and versions

hal-00927227, version 1 (12-01-2014)

hal-00927227, version 2 (22-01-2014)

Identifiers

HAL Id: hal-00927227, version 2

Cite

Christian Lalanne, Ashkan Rafiee, Denys Dutykh, Michael Lysaght, Frédéric Dias. Enabling the UCD-SPH code on the Xeon Phi. 2014. ⟨hal-00927227v2⟩

Collections

UNIV-SAVOIE UGA CNRS LAMA TDS-MACS LARA

319 views

212 downloads
