ScELA: Scalable and extensible launching architecture for clusters

17Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

As cluster sizes head into tens of thousands, current job launch mechanisms do not scale as they are limited by resource constraints as well as performance bottlenecks. The job launch process includes two phases - spawning of processes on processors and information exchange between processes for job initialization. Implementations of various programming models follow distinct protocols for the information exchange phase. We present the design of a scalable, extensible and high-performance job launch architecture for very large scale parallel computing. We present implementations of this architecture which achieve a speedup of more than 700% in launching a simple Hello World MPI application on 10,240 processor cores and also scale to more than 3 times the number of processor cores compared to prior solutions. © 2008 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Sridhar, J. K., Koop, M. J., Perkins, J. L., & Panda, D. K. (2008). ScELA: Scalable and extensible launching architecture for clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5374 LNCS, pp. 323–335). Springer Verlag. https://doi.org/10.1007/978-3-540-89894-8_30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free