For computational purposes such as the computation of adjoint, applying the reverse mode of automatic differentiation, or debugging one may require the values computed during the evaluation of a function in reverse order. The näive approach is to store all information needed for the reversal and to read this information backwards during the reversal. This technique leads to an enormous memory requirement, which is proportional to the computing time. The paper presents an approach to reducing the memory requirement without increasing the wall clock time by using parallel computers. During the parallel computation, only a fixed and small number of memory pads called checkpoints is stored. The data needed for the reversal is recomputed piecewise by starting the evaluation procedure from the checkpoints. We explain how this technique can be used on a parallel computer with distributed memory. Different implementation strategies will be shown, and some details with respect to resource-optimality are discussed. © Springer-Verlag Berlin Heidelberg 2002.
CITATION STYLE
Lehmann, U., & Walther, A. (2002). The implementation and testing of time-minimal and resource-optimal parallel reversal schedules. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2330 LNCS, pp. 1049–1058). Springer Verlag. https://doi.org/10.1007/3-540-46080-2_110
Mendeley helps you to discover research relevant for your work.