Low-cost prediction-based fault protection strategy

3Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Increasing failures from transient faults necessitates the costefficient protection mechanism that will be always activated. Thus, we propose a novel prediction-based transient fault protection strategy as a low-cost software-only technique. Instead of re-executing expensive computations for validation, an output prediction is used to cheaply determine an approximate value for a sequence of computation. When actual computation and prediction agree within a predefined acceptable range, the computation is assumed faultfree, and expensive re-computation can be skipped. With our approach, a significant reduction in dynamic instruction counts is possible. Missed faults may occur, but their occurrences can be explicitly kept to a small amount with a proper acceptable range. For evaluation, we build an automatic compilation system, called RSkip, that transforms a program into a resilient executable with the prediction-based protection scheme. Prior instruction replication work shows 2.33× execution time compared to the unreliable execution over nine compute-intensive benchmarks. With a control for the loss in protection rate, RSkip can reduce the protection overhead to 1.27× by skipping redundant computation in our target loops at a rate of 81.10%.

References Powered by Scopus

The PARSEC benchmark suite: Characterization and architectural implications

2771Citations
N/AReaders
Get full text

Rodinia: A benchmark suite for heterogeneous computing

2443Citations
N/AReaders
Get full text

SWIFT: Software implemented fault tolerance

598Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A design methodology for fault-tolerant computing using astrocyte neural networks

12Citations
N/AReaders
Get full text

Path Sensitive Signatures for Control Flow Error Detection

8Citations
N/AReaders
Get full text

Automatic Feature Isolation in Network Protocol Software Implementations

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Park, S., Li, S., Zhang, Z., & Mahlke, S. (2020). Low-cost prediction-based fault protection strategy. In CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization (pp. 30–42). Association for Computing Machinery, Inc. https://doi.org/10.1145/3368826.3377920

Readers over time

‘20‘22‘2502468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 7

88%

Professor / Associate Prof. 1

13%

Readers' Discipline

Tooltip

Computer Science 6

75%

Engineering 2

25%

Save time finding and organizing research with Mendeley

Sign up for free
0