On Resiliency in Krylov Solvers

Abstract : In this talk we will discuss possible numerical remedies to survive data loss in some numerical linear algebra solvers namely Krylov subspace linear solvers and some widely used eigensolvers. Assuming that a separate mechanism ensures fault detection, we propose numerical algorithms to extract relevant information from available data after a fault. After data extraction, well chosen part of missing data is regenerated through interpolation strategies to constitute meaningful inputs to numerically restart. We will also present some preliminary investigations to address soft error detection again at the application level in the conjugate gradient framework.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01162618
Contributor : Luc Giraud <>
Submitted on : Thursday, June 11, 2015 - 9:09:31 AM
Last modification on : Wednesday, September 18, 2019 - 1:14:33 AM

Identifiers

  • HAL Id : hal-01162618, version 1

Citation

Emmanuel Agullo, Luc Giraud, Pablo Salas, Emrullah Fatih Yetkin, Mawussi Zounon. On Resiliency in Krylov Solvers. PACS'15: Plateform for Advanced Scientific Computing Conference, Jun 2015, Zurich, Switzerland. ⟨hal-01162618⟩

Share

Metrics

Record views

457