Abstract
Deploying neural networks (NNs) in low-resource domains is challenging because of their high computing, memory, and power requirements. For this reason, NNs are often quantized before deployment, but such an approach degrades their accuracy. Thus, we propose the counterexample-guided neural network quantization refinement (CEG4N) framework, which combines search-based quantization and equivalence checking. The former minimizes computational requirements, while the latter guarantees that the behavior of an NN does not change after quantization. We evaluate CEG4N on a diverse set of benchmarks, including large and small NNs. Our technique successfully quantizes the networks in the chosen evaluation set, while producing models with up to 163% better accuracy than state-of-the-art techniques.
Author supplied keywords
Cite
CITATION STYLE
Matos, J. B. P., De Lima Filho, E. B., Bessa, I., Manino, E., Song, X., & Cordeiro, L. C. (2024). Counterexample Guided Neural Network Quantization Refinement. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 43(4), 1121–1134. https://doi.org/10.1109/TCAD.2023.3335313
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.