A Backward/Forward Recovery Approach for the Preconditioned Conjugate Gradient Method
Massimiliano Fasi, Julien Langou, Yves Robert, Bora Uçar. Abstract: Several recent papers have introduced a periodic verification mechanism to detect silent errors in iterative solvers. Chen (2013, pp. 167–176) has shown how to combine such a verification mechanism (a stability test checking the…
From Detection To Optimization: Impact Of Soft Errors On High-Performance Computing Applications
By Jon Cameron Calhoun. Dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Science in the Graduate College of the University of Illinois at Urbana-Champaign, 2017. Abstract: As high-performance computing (HPC) continues…
Modeling Soft-Error Propagation in Programs
Abstract: As technology scales to lower feature sizes, devices become more susceptible to soft errors. Soft errors can lead to silent data corruptions (SDCs), seriously compromising the reliability of a system. Traditional hardware-only techniques to avoid SDCs are energy hungry, and…