This paper deals with parallelization and implementation aspects of PDE based image processing models for large cluster environments with distributed memory. As an example we focus on nonlinear isotropic diffusion filtering which we discretize by means of an additive operator splitting (AOS). We start by decomposing the algorithm into small modules that shall be parallelized separately. For this purpose image partitioning strategies are discussed and their impact on the communication pattern and volume is analyzed. Based on the results we develop an algorithmic implementation with excellent scaling properties on massively connected low latency networks. Test runs on a high-end Myrinet cluster yield almost linear speedup factors up to 209 for 256 processors. This results in typical denoising times of 0.5 seconds for five iterations on a 256 × 256 × 128 data cube. © Springer-Verlag Berlin Heidelberg 2002.
CITATION STYLE
Bruhn, A., Jakob, T., Fischer, M., Kohlberger, T., Weickert, J., Brüning, U., & Schnorr, C. (2002). Designing 3-D nonlinear diffusion filters for high performance cluster computing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2449 LNCS, pp. 290–297). Springer Verlag. https://doi.org/10.1007/3-540-45783-6_35
Mendeley helps you to discover research relevant for your work.