The increasing complexity of current and future very large computing systems, with a rapidly growing number of cores and nodes, demands considerable human effort for their administration and maintenance. Existing monitoring tools are neither scalable nor capable of reducing the overwhelming flow of information to only the essential, high-value information. Current management tools lack the scalability and the capability to process huge amounts of information intelligently by relating data and information from various sources to one another in order to make the right decisions on error and fault handling. To solve these problems, we present a solution designed within the TIMaCS project: a hierarchical, scalable, policy-based monitoring and management framework.
Volk, E., Buchholz, J., Wesner, S., Koudela, D., Schmidt, M., Fallenbeck, N., … Jeutter, A. (2011). Towards Intelligent Management of Very Large Computing Systems. In Competence in High Performance Computing 2010 (pp. 191–204). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-24025-6_16