This paper presents an application-level non-blocking multicast scheme for dynamic DAG scheduling on large-scale distributed-memory systems. To achieve scalability, the scheme takes into account both the network topology and the space requirements of the routing tables. Specifically, we prove that the scheme is deadlock-free and takes at most log N steps to complete. Each routing table selects which neighbors to store based on topology IDs and occupies only O(log N) space. Although the scheme is built upon MPI point-to-point operations, experimental results show that it performs significantly better than the simple flat-tree method and is comparable to the vendor's collective MPI operations. © 2009 Springer Berlin Heidelberg.
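The log N step bound and the O(log N) routing table can be illustrated with a generic binomial-tree multicast. The sketch below is not the paper's scheme: it assumes plain integer ranks in place of topology IDs, simulates the rounds in a single process instead of using MPI, and the names `routing_table` and `multicast_rounds` are hypothetical.

```python
def routing_table(rank, n):
    """Neighbors at power-of-two offsets from `rank` (assumed layout).

    The table holds about log2(n) entries, which is the kind of
    O(log N) space bound the abstract describes.
    """
    table = []
    step = 1
    while step < n:
        table.append((rank + step) % n)
        step *= 2
    return table


def multicast_rounds(root, n):
    """Simulate a binomial-tree multicast from `root` to all n ranks.

    Returns a list of rounds; each round is a list of (src, dst) sends.
    In every round, each rank that already holds the message forwards it
    to one neighbor, so the number of informed ranks roughly doubles.
    """
    have = {root}
    rounds = []
    step = 1
    while step < n:
        sends = []
        for src in sorted(have):
            rel = (src - root) % n      # position relative to the root
            dst_rel = rel + step        # partner for this round
            if dst_rel < n:
                sends.append((src, (dst_rel + root) % n))
        have.update(dst for _, dst in sends)
        rounds.append(sends)
        step *= 2
    return rounds


if __name__ == "__main__":
    for i, sends in enumerate(multicast_rounds(root=0, n=8), start=1):
        print(f"round {i}: {sends}")
```

With 8 ranks the simulation finishes in three rounds, and every send goes to a rank listed in the sender's routing table. A flat tree, by contrast, would have the root issue all 7 sends itself.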
CITATION STYLE
Song, F., Dongarra, J., & Moore, S. (2009). A scalable non-blocking multicast scheme for distributed DAG scheduling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5544 LNCS, pp. 195–204). https://doi.org/10.1007/978-3-642-01970-8_20