Processing systems for deep learning inference on edge devices


Abstract

Deep learning models are displacing traditional approaches in many artificial intelligence tasks. These models achieve better results but demand more computing power and memory, so training and inference are typically run at cloud centers on high-performance platforms. In many applications, however, it is more beneficial, or even required, to run inference at the edge, near the source of data or action requests, avoiding the need to transmit the data to a cloud service and wait for the answer. In many scenarios, transmission of data to the cloud is unreliable or even impossible, or incurs high latency with uncertainty about the round-trip delay of the communication, which is unacceptable for latency-sensitive applications that make real-time decisions. Other factors, such as data security and privacy, also force the data to stay at the edge. For all these reasons, inference is migrating partially or totally to the edge. The problem is that deep learning models are hungry for computation, memory, and energy, resources that are scarce in today’s edge computing devices. Consequently, companies are deploying artificial intelligence devices targeting edge computing, with different markets in mind. In this chapter we describe the current state of deep learning algorithms and models and analyze the state of the art of computing devices and platforms for deploying deep learning at the edge. We describe the existing computing devices for deep learning and compare them using metrics such as performance, power, and flexibility. Several technologies are considered, including GPU, CPU, FPGA, and ASIC. Finally, we explain the trends in computing devices for deep learning at the edge, what has been researched, and what to expect from future devices.
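The latency argument in the abstract can be sketched as a simple budget check: cloud inference pays a network round trip that edge inference avoids. All numbers below are hypothetical, chosen only to illustrate the tradeoff, not measurements from the chapter.

```python
# Illustrative latency-budget check for choosing between cloud and edge
# inference. Function name and all figures are hypothetical examples.

def meets_deadline(inference_ms: float, network_rtt_ms: float,
                   deadline_ms: float) -> bool:
    """Return True if total response time fits the real-time deadline."""
    return inference_ms + network_rtt_ms <= deadline_ms

# Cloud: fast inference on a high-performance platform, but the request
# and response must cross the network.
cloud_ok = meets_deadline(inference_ms=5.0, network_rtt_ms=80.0,
                          deadline_ms=50.0)

# Edge: slower inference on a constrained device, but no round trip.
edge_ok = meets_deadline(inference_ms=30.0, network_rtt_ms=0.0,
                         deadline_ms=50.0)

print(cloud_ok, edge_ok)  # cloud misses the 50 ms deadline, edge meets it
```

Note that the cloud figure is only a mean: as the abstract points out, the round-trip delay is also uncertain, so even a budget that fits on average may be violated in the tail, which is what rules the cloud out for hard real-time decisions.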

Citation (APA)

Véstias, M. (2020). Processing systems for deep learning inference on edge devices. In Internet of Things (pp. 213–240). Springer. https://doi.org/10.1007/978-3-030-44907-0_9
