NetAdapt: Platform-aware neural network adaptation for mobile applications

Tien-Ju Yang; Andrew Howard; Bo Chen; Xiao Zhang; Alec Go; Mark Sandler; Vivienne Sze; Hartwig Adam

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11214 LNCS 289-304

DOI: 10.1007/978-3-030-01249-6_18

86 Citations

501 Readers

Abstract

This work proposes an algorithm, called NetAdapt, that automatically adapts a pre-trained deep neural network to a mobile platform given a resource budget. While many existing algorithms simplify networks based on the number of MACs or weights, optimizing those indirect metrics does not necessarily reduce the direct metrics, such as latency and energy consumption. To solve this problem, NetAdapt incorporates direct metrics into its adaptation algorithm. These direct metrics are evaluated using empirical measurements, so detailed knowledge of the platform and toolchain is not required. NetAdapt automatically and progressively simplifies a pre-trained network until the resource budget is met while maximizing accuracy. Experimental results show that NetAdapt achieves better accuracy versus latency trade-offs on both mobile CPU and mobile GPU than state-of-the-art automated network simplification algorithms. For image classification on the ImageNet dataset, NetAdapt achieves up to a 1.7× speedup in measured inference latency with equal or higher accuracy on MobileNets (V1 and V2).
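The progressive simplification described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the network is modeled as a hypothetical mapping from layer name to filter count, each candidate removes a fixed number of filters from one layer, and `toy_latency` and `toy_accuracy` are stand-ins for on-device measurement and accuracy evaluation. The sketch shows only the general shape of a loop that shrinks a network until a measured budget is met while keeping the most accurate candidate at each step.

```python
from typing import Callable, Dict, List

# Hypothetical network representation: layer name -> number of filters.
Network = Dict[str, int]


def adapt(
    net: Network,
    budget: float,
    measure: Callable[[Network], float],   # direct metric, e.g. measured latency
    accuracy: Callable[[Network], float],  # accuracy estimate for a candidate
    step: int = 1,                         # filters removed per candidate (assumed)
) -> Network:
    """Greedily shrink `net` until measure(net) <= budget."""
    current = dict(net)
    while measure(current) > budget:
        # Build one candidate per layer, each removing `step` filters
        # from that layer only and keeping at least one filter.
        candidates: List[Network] = []
        for layer, width in current.items():
            if width - step >= 1:
                cand = dict(current)
                cand[layer] = width - step
                candidates.append(cand)
        if not candidates:
            raise ValueError("budget cannot be met with the allowed reductions")
        # Keep the candidate that preserves the most accuracy.
        current = max(candidates, key=accuracy)
    return current


# Toy stand-ins for empirical measurement and evaluation (assumed values).
COST = {"conv1": 2.0, "conv2": 1.0, "conv3": 0.5}        # ms per filter
VALUE = {"conv1": 0.010, "conv2": 0.030, "conv3": 0.005}  # accuracy per filter


def toy_latency(n: Network) -> float:
    return sum(COST[k] * w for k, w in n.items())


def toy_accuracy(n: Network) -> float:
    return sum(VALUE[k] * w for k, w in n.items())


start = {"conv1": 8, "conv2": 8, "conv3": 8}
adapted = adapt(start, budget=20.0, measure=toy_latency, accuracy=toy_accuracy)
print(adapted, toy_latency(adapted))
```

In this toy setup the loop first trims the layer whose filters contribute least to accuracy, then moves to the next cheapest layer in accuracy terms once the first is exhausted. Because the stopping condition uses `measure` rather than a MAC or weight count, swapping in a real on-device timing function would change which network satisfies the budget without changing the loop.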

Citation (APA)

Yang, T. J., Howard, A., Chen, B., Zhang, X., Go, A., Sandler, M., … Adam, H. (2018). NetAdapt: Platform-aware neural network adaptation for mobile applications. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11214 LNCS, pp. 289–304). Springer Verlag. https://doi.org/10.1007/978-3-030-01249-6_18
