More Instruction Level Parallelism Explains the Actual Efficiency of Compensated Algorithms

Philippe Langlois; Nicolas Louvet

Pré-Publication, Document De Travail Année : 2007

More Instruction Level Parallelism Explains the Actual Efficiency of Compensated Algorithms

(1) , (1)

Philippe Langlois

Fonction : Auteur
PersonId : 3635
IdHAL : philippe-langlois
IdRef : 104061731

Laboratoire de Physique Appliquée et d'Automatique

Nicolas Louvet

Fonction : Auteur

Laboratoire de Physique Appliquée et d'Automatique

Résumé

The compensated Horner algorithm and the Horner algorithm with double-double arithmetic improve the accuracy of polynomial evaluation in IEEE-754 floating point arithmetic. Both yield a polynomial evaluation as accurate as if it was computed with the classic Horner algorithm in twice the working precision. Both algorithms also share the same low-level computation of the floating point rounding errors and cost a similar number of floating point operations. We report numerical experiments to exhibit that the compensated algorithm runs at least twice as fast as the double-double one on modern processors. We propose to explain such efficiency by identifying more instruction level parallelism in the compensated implementation. Such property also applies to other compensated algorithms for summation, dot product and triangular linear system solving. More generally this paper illustrates how this kind of performance analysis may be useful to highlight the actual efficiency of numerical algorithms.

Mots clés

Accurate polynomial evaluation Horner algorithm compensated Horner algorithm floating point arithmetic IEEE-754 standard instruction level parallelism performance evaluation. performance evaluation

Domaines

Logiciel mathématique [cs.MS]

Fichier principal

hal.pdf (223.59 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Philippe Langlois : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00165020

Soumis le : mardi 24 juillet 2007-14:57:29

Dernière modification le : vendredi 26 janvier 2024-12:55:51

Archivage à long terme le : jeudi 8 avril 2010-23:55:36

Dates et versions

hal-00165020 , version 1 (24-07-2007)

Identifiants

HAL Id : hal-00165020 , version 1

Citer

Philippe Langlois, Nicolas Louvet. More Instruction Level Parallelism Explains the Actual Efficiency of Compensated Algorithms. 2007. ⟨hal-00165020⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PERP

255 Consultations

332 Téléchargements

More Instruction Level Parallelism Explains the Actual Efficiency of Compensated Algorithms

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager