Abstract
In scientific computing, systems often manage computations that require continuous acquisition of of satellite data and the management of large databases, as well as the execution of analysis software and simulation models (e.g. Monte Carlo or molecular dynamics cell simulations) which may require several weeks of continuous run. These systems, consequently, should ensure the continuity of operation even in case of serious faults. HAVmS (High Availability Virtual machine System) is a highly available, "fault tolerant" system with zero downtime in case of fault. It is based on the use of Virtual Machines and implemented by two servers with similar characteristics. HAVmS, thanks to the developed software solutions, is unique in its kind since it automatically failbacks once faults have been fixed. The system has been designed to be used both with professional or inexpensive hardware and supports the simultaneous execution of multiple services such as: web, mail, computing and administrative services, uninterrupted computing, data base management. Finally the system is cost effective adopting exclusively open source solutions, is easily manageable and for general use.
Author supplied keywords
Cite
CITATION STYLE
Federici, M., Gaibisso, C., & Martino, B. L. (2014). HAVmS: Highly available virtual machine computer system fault tolerant with automatic failback and close to zero downtime. In Frascati Workshop 2013 - 10th International Workshop on Multifrequency Behaviour of High Energy Cosmic Sources (pp. 278–282). International Workshop on Multifrequency Behaviour of High Energy Cosmic Sources. https://doi.org/10.14311/APP.2014.01.0278
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.