Airavata Metascheduler: A Reliable, Fault Tolerant, and Resource-Aware Job Scheduling Service

Citations of this article
Mendeley users who have this article in their library.
Get full text


Software-as-a-service science gateways provide user interfaces and middleware for accessing scientific software deployed on remote high-performance computing resources and clusters. Selecting the resource to use for a particular job submission may be left to the user, who may need more information to make good choices when selecting from multiple options. To address this problem, we have designed and developed an extensible, scalable metascheduling system that can provide automated scheduling capabilities based on resource availability and other characteristics. We develop a system model based on queuing theory to guide our implementation and provide a basis for analysis. In particular, we derive an efficiency metric from these considerations. We implement the metascheduling system within the open-source Apache Airavata framework for science gateways as a supplemental service for guiding the job submission capabilities. We measure efficiency in representative scenarios, observing efficiencies of greater than 70% even in scenarios with high input rates and low job acceptance rates.




Ranawaka, I., Abeysinghe, E., Wannipurage, D., De Silva, D., Brookes, E., Marru, S., … Pierce, M. (2023). Airavata Metascheduler: A Reliable, Fault Tolerant, and Resource-Aware Job Scheduling Service. In PEARC 2023 - Computing for the common good: Practice and Experience in Advanced Research Computing (pp. 35–42). Association for Computing Machinery, Inc.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free