Abstract
In this paper, we present a structure for monitoring a large set of computational clusters. We illustrate methods for scaling a monitor network composed of many clusters while keeping processing requirements low. A design for presenting high-level web-based summaries of the monitor network is provided, along with a generalization to a distributed, multiple-resolution monitoring tree. Emphasis is placed on scalability, fast query response, fault tolerance, and grid compatibility. Experimental evidence is presented that demonstrates the performance of our design.
Sacerdoti, F. D., Katz, M. J., Massie, M. L., & Culler, D. E. (2003). Wide area cluster monitoring with Ganglia. In Proceedings - IEEE International Conference on Cluster Computing, ICCC (Vol. 2003-January, pp. 289–298). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CLUSTR.2003.1253327