Computing floating-point logarithms with fixed-point operations

Julien Le Maire; Nicolas Brunie; Florent de Dinechin; Jean-Michel Muller

Conference paper, Year: 2016

Julien Le Maire

Role: Author

École normale supérieure de Lyon

Nicolas Brunie

Role: Author
PersonId : 765150
IdRef : 178343544

Kalray

Florent de Dinechin

Role: Author
PersonId : 5437
IdHAL : florent-de-dinechin
ORCID : 0000-0003-4927-3301
IdRef : 060154012

Software and Cognitive radio for telecommunications

CITI Centre of Innovation in Telecommunications and Integration of services

Jean-Michel Muller

Role: Author
PersonId : 4171
IdHAL : jean-michel-muller
ORCID : 0000-0003-3588-0047
IdRef : 029762871

Arithmetic and Computing

Laboratoire de l'Informatique du Parallélisme

Abstract

Elementary functions from the mathematical library input and output floating-point numbers. However, it is possible to implement them purely using integer/fixed-point arithmetic. This option was not attractive between 1985 and 2005, because mainstream processor hardware supported 64-bit floating-point but only 32-bit integers. Besides, conversions between floating-point and integer were costly. This has changed in recent years, in particular with the generalization of native 64-bit integer support. The purpose of this article is therefore to reevaluate the relevance of computing floating-point functions in fixed-point. For this, several variants of the double-precision logarithm function are implemented and evaluated. Formulating the problem as a fixed-point one is easy after the range has been (classically) reduced. Then, 64-bit integers provide slightly more accuracy than the 53-bit mantissa of double precision, which helps speed up the evaluation. Finally, multi-word arithmetic, critical for accurate implementations, is much faster in fixed-point and is natively supported by recent compilers. Novel techniques of argument reduction and rounding test are introduced in this context. Thanks to all this, a purely integer implementation of the correctly rounded double-precision logarithm outperforms the previous state of the art, with the worst-case execution time reduced by a factor of 5. This work also introduces variants of the logarithm that input a floating-point number and output the result in fixed-point. These are shown to be both more accurate and more efficient than the traditional floating-point functions for some applications.
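Two of the abstract's points can be illustrated with a minimal C sketch: a 64-bit integer holds more significand bits than a double, and multi-word products are cheap in integer arithmetic. This is not the authors' implementation. The helper names `split_double` and `mul_hi` are hypothetical, and `unsigned __int128` is a GCC/Clang extension assumed to be available on a 64-bit target.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper (not from the paper): split a positive, normal
 * double x into an unbiased exponent *e and a 64-bit fixed-point
 * significand m such that x = m * 2^(e - 63), with 2^63 <= m < 2^64.
 * In other words, m encodes a value in [1, 2) with 63 fractional bits. */
static uint64_t split_double(double x, int *e)
{
    uint64_t bits;
    memcpy(&bits, &x, sizeof bits);        /* reinterpret without aliasing issues */

    uint64_t biased = bits >> 52;          /* sign bit (bit 11 here) must be 0 */
    assert(biased >= 1 && biased <= 2046); /* reject negative, zero, subnormal, inf, NaN */

    *e = (int)biased - 1023;               /* remove the IEEE 754 exponent bias */

    uint64_t frac = bits & ((UINT64_C(1) << 52) - 1);
    /* Restore the implicit leading 1 and left-align the 53-bit
     * significand; the 11 low bits are spare precision. */
    return (UINT64_C(1) << 63) | (frac << 11);
}

/* High 64 bits of the exact 128-bit product a * b. If a and b each
 * carry 63 fractional bits, the result carries 62 fractional bits. */
static uint64_t mul_hi(uint64_t a, uint64_t b)
{
    return (uint64_t)(((unsigned __int128)a * b) >> 64);
}
```

For example, `split_double(0.75, &e)` yields `e == -1` and `m == 0xC000000000000000` (1.5 in this format), and `mul_hi(m, m)` returns `0x9000000000000000`, which is 2.25 with 62 fractional bits.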

Keywords

floating-point, fixed-point, elementary function, logarithm, correct rounding

Domains

Computer arithmetic

Main file

2015-FixFloat.pdf (269.68 KB)

Origin: Files produced by the author(s)

Contributor: Florent de Dinechin

https://hal.science/hal-01227877

Submitted on: Thursday, November 12, 2015, 11:10:25

Last modified on: Wednesday, July 26, 2023, 16:48:08

Long-term archiving on: Friday, April 28, 2017, 08:30:28

Dates and versions

hal-01227877, version 1 (12-11-2015)

Identifiers

HAL Id: hal-01227877, version 1

Cite

Julien Le Maire, Nicolas Brunie, Florent de Dinechin, Jean-Michel Muller. Computing floating-point logarithms with fixed-point operations. 23rd IEEE Symposium on Computer Arithmetic, IEEE, Jul 2016, Santa Clara, United States. ⟨hal-01227877⟩

Collections

ENS-LYON CNRS INRIA UNIV-LYON1 INSA-LYON INRIA2 CITI INSA-GROUPE UDL ANR

495 views

7560 downloads
