This paper presents Software Implemented Fault Tolerance1 (SIFT) for hypercubes which is implemented by means of a software layer. It is written in each node of the nCube parallel computer. The SIFT utilizes an error detection application software and fast reconfiguration algorithm for avoiding faulty nodes. The Balance Spanning Tree (BST) is used for embedding tree-based algorithms into the hypercube topology. Any single faulty node in the hypercube can be tolerated by the software layer. More than 90% of the multiple faults can be tolerated without backtracking. The SIFT approach has been successfully implemented for a quadtree data compression algorithm for 64×64, 128×128 compressible and uncompressible data. The experiments were run on 4 and 16 node nCubes. The time overhead (reconfiguration and recompu-tation time) incurred by the injected fault was experimentally estimated. The coverage factor, provided by the error-detection software, has been estimated by means of nSOFIT for the quadtree data compression algorithm. © Springer-Verlag Berlin Heidelberg 1999.
CITATION STYLE
Avresky, D. R., & Geoghegan, S. (1999). Software implemented fault tolerance in hypercube. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1685 LNCS, pp. 515–518). Springer Verlag. https://doi.org/10.1007/3-540-48311-x_71
Mendeley helps you to discover research relevant for your work.