Trade-offs in transient fault recovery schemes for redundant multithreaded processors

12Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

CMOS downscaling trends, manifested in the use of smaller transistor feature sizes and lower supply voltages, make microprocessors more and more vulnerable to transient errors with each new technology generation. One architectural approach to detecting and recovering from such errors is to execute two copies of the same program and then compare the results. While comparing only the store instructions is sufficient for error detection, register values also have to be compared to support fault recovery. In this paper, we propose novel checkpoint-assisted mechanisms for efficient fault recovery that dramatically reduce the number of register values to be compared for detecting soft errors and perform comprehensive investigation of these and other existing recovery schemes from the standpoint of performance, power and design complexity. © 2006 Springer-Verlag.

Cite

CITATION STYLE

APA

Sharkey, J., Abu-Ghazeleh, N., Ponomarev, D., Ghose, K., & Aggarwal, A. (2006). Trade-offs in transient fault recovery schemes for redundant multithreaded processors. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4297 LNCS, pp. 135–147). https://doi.org/10.1007/11945918_18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free