Fast matrix multiplication algorithms are of practical use only if the leading coefficient of their arithmetic complexity is sufficiently small. Many algorithms with low asymptotic cost have large leading coefficients, and are thus impractical. Karstadt and Schwartz have recently demonstrated a technique that reduces the leading coefficient by introducing fast O(n2 logn) basis transformations, applied to the input and output matrices. We generalize their technique, by allowing larger bases for the transformations while maintaining low overhead. Thus we accelerate several matrix multiplication algorithms, beyond what is known to be possible using the previous technique. Of particular interest are a few new sub-cubic algorithms with leading coefficient 2, matching that of classical matrix multiplication. For example, we obtain an algorithm with arithmetic complexity of 2nlog3 23+o(nlog3 23) compared to 2n3 − n2 of the classical algorithm. Such new algorithms can outperform previous ones (classical included) even on relatively small matrices. We obtain lower bounds matching the coefficient of several of our algorithms, proving them to be optimal.
CITATION STYLE
Beniamini, G., & Schwartz, O. (2019). Faster matrix multiplication via sparse decomposition. In Annual ACM Symposium on Parallelism in Algorithms and Architectures (pp. 11–22). Association for Computing Machinery. https://doi.org/10.1145/3323165.3323188
Mendeley helps you to discover research relevant for your work.