How to square floats accurately and efficiently on the ST231 integer processor

Claude-Pierre Jeannerod 1, *, Jingyan Jourdan-Lu 1, 2, Christophe Monat 2, Guillaume Revy 3
* Corresponding author
1 ARIC - Arithmetic and Computing
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
2 Compilation Expertise Center
ST-GRENOBLE - STMicroelectronics [Grenoble]
3 DALI - Digits, Architectures et Logiciels Informatiques
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, UPVD - Université de Perpignan Via Domitia
Abstract: We consider the problem of computing IEEE floating-point squares by means of integer arithmetic. We show how the specific properties of squaring can be exploited in order to design and implement algorithms that have much lower latency than those for general multiplication, while still guaranteeing correct rounding. Our algorithm descriptions are parameterized by the floating-point format, aim at high instruction-level parallelism (ILP) exposure, and cover all rounding modes. We show further that their C implementation for the binary32 format yields efficient code for targets like the ST231 VLIW integer processor from STMicroelectronics, with a latency at least 1.75x smaller than that of general multiplication in the same context.
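To make the setting concrete, here is a minimal C sketch of squaring a binary32 value with integer operations only. It is not the authors' algorithm: it covers round-to-nearest-even only, makes no attempt at ILP exposure or ST231-specific tuning, and the names `sqr_rn_bits` and `sqr_rn` are hypothetical. It does illustrate a few ways squaring is simpler than general multiplication: the result is never negative, only one operand has to be unpacked, and any subnormal input has a square below 2^-252, which rounds to +0 in this rounding mode.

```c
#include <stdint.h>
#include <string.h>

/* Square a binary32 value given by its bit pattern, rounding to nearest even.
   Integer operations only. Illustrative sketch, not the paper's algorithm. */
uint32_t sqr_rn_bits(uint32_t xb)
{
    uint32_t e = (xb >> 23) & 0xFFu;    /* biased exponent */
    uint32_t m = xb & 0x7FFFFFu;        /* trailing significand bits */

    if (e == 0xFFu)                     /* NaN -> quiet NaN, infinity -> +infinity */
        return m ? (xb | 0x00400000u) : 0x7F800000u;
    if (e == 0)                         /* zero or subnormal: x^2 < 2^-252 rounds to +0 */
        return 0;

    uint64_t sig = m | 0x800000u;       /* 24-bit significand with implicit 1 */
    uint64_t p = sig * sig;             /* exact product, in [2^46, 2^48) */
    int32_t r = 2 * ((int32_t)e - 127); /* exponent of x^2 before normalization */

    if (p >> 47) r += 1;                /* squared significand lies in [2, 4) */
    else p <<= 1;                       /* now p in [2^47, 2^48), x^2 = p * 2^(r-47) */

    if (r > 127) return 0x7F800000u;    /* overflow to +infinity */

    unsigned s;                         /* number of low bits of p to round away */
    uint32_t expo;                      /* exponent field minus 1; q carries the implicit 1 */
    if (r >= -126) {                    /* normal result: keep 24 bits */
        s = 24;
        expo = (uint32_t)(r + 126) << 23;
    } else {                            /* subnormal result: count units of 2^-149 */
        s = (unsigned)(-r - 102);
        expo = 0;
        if (s > 48) return 0;           /* below half the smallest subnormal */
    }

    uint32_t q = (uint32_t)(p >> s);        /* truncated significand */
    uint64_t rem = p & ((1ULL << s) - 1);   /* discarded bits */
    uint64_t half = 1ULL << (s - 1);
    uint32_t up = (rem > half) || (rem == half && (q & 1u)); /* ties to even */

    /* A rounding carry out of q propagates into the exponent field,
       which also handles overflow to infinity at the top of the range. */
    return expo + q + up;
}

/* Convenience wrapper operating on float values. */
float sqr_rn(float x)
{
    uint32_t b;
    memcpy(&b, &x, sizeof b);
    b = sqr_rn_bits(b);
    memcpy(&x, &b, sizeof x);
    return x;
}
```

For example, `sqr_rn(-3.0f)` returns `9.0f`. Supporting the directed rounding modes would only change how `up` is computed from `rem`, plus the overflow and underflow return values.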
Document type:
Preprint, Working paper
2010


https://hal-ens-lyon.archives-ouvertes.fr/ensl-00532829
Contributor: Claude-Pierre Jeannerod
Submitted on: Friday, November 19, 2010 - 07:00:22
Last modified on: Wednesday, January 20, 2016 - 15:07:15
Document(s) archived on: Friday, October 26, 2012 - 16:00:13

File

sqr.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : ensl-00532829, version 1

Citation

Claude-Pierre Jeannerod, Jingyan Jourdan-Lu, Christophe Monat, Guillaume Revy. How to square floats accurately and efficiently on the ST231 integer processor. 2010. <ensl-00532829>

Metrics

Record views: 660

Document downloads: 230