Towards Efficient Verification of Quantized Neural Networks

Abstract

Quantization replaces floating-point arithmetic with integer arithmetic in deep neural network models, enabling more efficient on-device inference that uses less power and memory. In this work, we propose a framework for formally verifying properties of quantized neural networks. Our baseline technique is based on integer linear programming, which guarantees both soundness and completeness. We then show how efficiency can be improved by using gradient-based heuristic search methods and bound-propagation techniques. We evaluate our approach on perception networks quantized with PyTorch. Our results show that we can verify quantized networks with better scalability and efficiency than the previous state of the art.
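
To make the idea of bound propagation over integer arithmetic concrete, the following is a minimal sketch of interval bound propagation through a single quantized layer. It is not the authors' implementation: the quantization scheme (integer weights, rescaling by a right shift, clamping to an unsigned 8-bit range) and the names `affine_bounds` and `requantize_bounds` are assumptions chosen for illustration only.

```python
from typing import List, Tuple

Bounds = Tuple[List[int], List[int]]


def affine_bounds(weights: List[List[int]], bias: List[int],
                  lo: List[int], hi: List[int]) -> Bounds:
    """Propagate elementwise integer intervals [lo, hi] through y = W x + b."""
    out_lo, out_hi = [], []
    for row, b in zip(weights, bias):
        low = high = b
        for w, x_lo, x_hi in zip(row, lo, hi):
            if w >= 0:
                # A non-negative weight preserves the interval orientation.
                low += w * x_lo
                high += w * x_hi
            else:
                # A negative weight swaps which endpoint gives the minimum.
                low += w * x_hi
                high += w * x_lo
        out_lo.append(low)
        out_hi.append(high)
    return out_lo, out_hi


def requantize_bounds(lo: List[int], hi: List[int], shift: int,
                      qmin: int = 0, qmax: int = 255) -> Bounds:
    """Rescale by a right shift (floor division by 2**shift), then clamp.

    Both steps are monotone, so the endpoints can be mapped directly.
    The shift-and-clamp scheme is an assumed, simplified quantizer.
    """
    def q(v: int) -> int:
        return max(qmin, min(qmax, v >> shift))

    return [q(v) for v in lo], [q(v) for v in hi]


# Hypothetical 2x2 layer and input box, chosen only for illustration.
W = [[2, -1], [1, 3]]
b = [4, -8]
pre_lo, pre_hi = affine_bounds(W, b, lo=[0, 10], hi=[5, 20])
post_lo, post_hi = requantize_bounds(pre_lo, pre_hi, shift=1)
print(pre_lo, pre_hi)    # [-16, 22] [4, 57]
print(post_lo, post_hi)  # [0, 11] [2, 28]
```

Bounds of this kind are sound but generally not tight. In a verification pipeline they would typically serve to rule out easy cases or to tighten variable ranges before handing the remaining problem to a complete method such as an integer linear programming solver.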

Cite

Huang, P., Wu, H., Yang, Y., Daukantas, I., Wu, M., Zhang, Y., & Barrett, C. (2024). Towards Efficient Verification of Quantized Neural Networks. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, pp. 21152–21160). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v38i19.30108
