Lossless Compression of Deep Neural Networks

Thiago Serra; Abhinav Kumar; Srikumar Ramalingam

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12296 LNCS, pp. 417–430 (2020)

DOI: 10.1007/978-3-030-58942-4_27

Abstract

Deep neural networks have been successful in many predictive modeling tasks, such as image and language recognition, where large neural networks are often used to obtain good accuracy. Consequently, it is challenging to deploy these networks under limited computational resources, such as in mobile devices. In this work, we introduce an algorithm that removes units and layers of a neural network while not changing the output that is produced, which thus implies a lossless compression. This algorithm, which we denote as LEO (Lossless Expressiveness Optimization), relies on Mixed-Integer Linear Programming (MILP) to identify Rectified Linear Units (ReLUs) with linear behavior over the input domain. By using regularization to induce such behavior, we can benefit from training over a larger architecture than we would later use in the environment where the trained neural network is deployed.
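The idea of removing units whose ReLU never changes regime over the input domain can be illustrated with a small sketch. This is not the authors' LEO algorithm: instead of MILP, it uses plain interval arithmetic, which gives exact pre-activation bounds only for a first hidden layer over a box-shaped input domain. It also removes only units that are always inactive; units that are always active are left in place. All function names, weights, and the domain below are hypothetical and chosen for illustration.

```python
import random

def preactivation_bounds(weights, bias, box):
    """Min and max of w.x + b over a box domain (interval arithmetic).

    Assumption: this replaces the MILP used in the paper and is only exact
    for a unit fed directly by box-constrained inputs.
    """
    lo = hi = bias
    for w, (x_lo, x_hi) in zip(weights, box):
        lo += w * (x_lo if w >= 0 else x_hi)
        hi += w * (x_hi if w >= 0 else x_lo)
    return lo, hi

def forward(W1, b1, W2, b2, x):
    """One hidden ReLU layer followed by a linear output layer."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(W2, b2)]

def prune_inactive(W1, b1, W2, box):
    """Drop hidden units whose pre-activation is never positive on the box.

    Such units always output 0, so removing them (and the matching columns
    of the next layer) leaves the network output unchanged on the domain.
    """
    keep = [j for j, (row, b) in enumerate(zip(W1, b1))
            if preactivation_bounds(row, b, box)[1] > 0]
    W1p = [W1[j] for j in keep]
    b1p = [b1[j] for j in keep]
    W2p = [[row[j] for j in keep] for row in W2]
    return W1p, b1p, W2p, keep

# Hypothetical toy network: 2 inputs in [0, 1]^2, 3 hidden units, 1 output.
box = [(0.0, 1.0), (0.0, 1.0)]
W1 = [[1.0, -1.0], [-1.0, -2.0], [0.5, 0.5]]
b1 = [0.0, -0.5, 1.0]
W2 = [[2.0, 3.0, -1.0]]
b2 = [0.5]

W1p, b1p, W2p, keep = prune_inactive(W1, b1, W2, box)

# Spot-check on random points of the domain that the output is unchanged.
random.seed(0)
max_diff = 0.0
for _ in range(1000):
    x = [random.uniform(lo, hi) for lo, hi in box]
    full = forward(W1, b1, W2, b2, x)
    small = forward(W1p, b1p, W2p, b2, x)
    max_diff = max(max_diff, abs(full[0] - small[0]))
```

In this toy case the second hidden unit has a pre-activation of at most -0.5 on the domain, so it is the only one removed. The third unit is always active, so its ReLU behaves linearly; exploiting that case, as the abstract describes, needs the further steps of the paper and is not attempted here.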

Citation (APA)

Serra, T., Kumar, A., & Ramalingam, S. (2020). Lossless Compression of Deep Neural Networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12296 LNCS, pp. 417–430). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-58942-4_27
