Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference

1Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Emerging edge intelligence applications require the server to retrain and update deep neural networks deployed on remote edge nodes to leverage newly collected data samples. Unfortunately, it may be impossible in practice to continuously send fully updated weights to these edge nodes due to the highly constrained communication resource. In this paper, we propose the weight-wise deep partial updating paradigm, which smartly selects a small subset of weights to update in each server-to-edge communication round, while achieving a similar performance compared to full updating. Our method is established through analytically upper-bounding the loss difference between partial updating and full updating, and only updates the weights which make the largest contributions to the upper bound. Extensive experimental results demonstrate the efficacy of our partial updating methodology which achieves a high inference accuracy while updating a rather small number of weights.

Cite

CITATION STYLE

APA

Qu, Z., Liu, C., & Thiele, L. (2022). Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13671 LNCS, pp. 137–153). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-20083-0_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free