Fault Tolerance for Iterative Methods in High-performance Computing
Author | : Dingwen Tao |
Publisher | : |
Total Pages | : 154 |
Release | : 2018 |
ISBN-10 | : 0438429516 |
ISBN-13 | : 9780438429512 |
Rating | : 4/5 (512 Downloads) |
Download or read book Fault Tolerance for Iterative Methods in High-performance Computing written by Dingwen Tao and published by . This book was released on 2018 with total page 154 pages. Available in PDF, EPUB and Kindle. Book excerpt: Iterative methods are commonly used approaches to solve large, sparse linear systems, which are fundamental operations for many modern scientific simulations. When the large-scale iterative methods are running with a large number of ranks in parallel, they are anticipated to be more susceptible to soft errors in both logic circuits and memory subsystems and fail-stop errors in the entire system, considering large component counts and lower power margins of emerging high-performance computing (HPC) platforms.