Adaptive fault tolerance for spacecraft

7Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper describes the design and implementation of a software infrastructure for real-time fault tolerance for applications on long duration deep space missions. The infrastructure has advanced capabilities for Adaptive Fault Tolerance (AFT), i.e., the ability to change the recovery strategy based on the failure history, available resources, and the operating environment. The AFT technology can accommodate adaptive or fixed recovery strategies. Adaptive fault tolerance allows the recovery strategy to be changed on the basis of the mission phase, failure history, and environment. For example, during a phase where power consumption must be minimized, there would be only one processor in operation. Thus, the recovery strategy would be to restart and retry. On the other hand, if the mission phase were in a time-critical mode (e.g., orbital insertion, encounter, etc.), then, multiple processors would be running, and the recovery strategy would be to switch from a leader copy to a follower copy of the control software. In a fixed recovery strategy, there is a specified redundant resource which is committed when certain failure conditions occur. The most obvious example of a fixed recovery strategy is to switch over to the standby processor in the event of any failure of the active processor.

Cite

CITATION STYLE

APA

Hecht, M., Hecht, H., & Shokri, E. (2000). Adaptive fault tolerance for spacecraft. In IEEE Aerospace Conference Proceedings (Vol. 5, pp. 521–533). https://doi.org/10.1109/aero.2000.878526

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free