Efficient Hardware Acceleration Techniques for Deep Learning on Edge Devices: A Comprehensive Performance Analysis

  • Burhanuddin M
N/ACitations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

Implementing deep learning models on edge devices presents significant challenges due to the limited computing power, memory limitations, and processing power of these devices As deep learning models become more complex, cf ensuring proper execution on an edge platform is critical for real-time implementation Hardware -Addresses these challenges by exploring ways to accelerate. The problem lies in the resource-hungry nature of today’s deep learning models, which are typically designed for cloud environments with high computing capacity, making them unsuitable for edge environments with restricted resources. The main objective of this review is to analyze and compare various hardware acceleration strategies, such as graphics processing units (GPUs), tensor processing units (TPUs), field programmable gate arrays (FPGAs), and application-specific integrated circuits (ASICs); . . . . (pruning, quantization, and knowledge storage), memory management, and data flow optimization to improve the performance and energy efficiency of deep learning models on edge devices The results of this comprehensive performance analysis suggests that hardware accelerators can significantly improve throughput and reduce latency while maintaining acceptable levels of power consumption and accuracy. In addition to techniques such as quantization and pruning that are seen to reduce computational load and memory footprint, enabling more efficient deep learning inference on edge platforms, the study highlights trade-offs between speed, consumption power efficiency and model accuracy between for each hardware accelerator are emphasized. The findings suggest that by choosing the right hardware and applying the right optimization techniques, edge devices may be able to optimize deep learning models, meeting the requirements of real-time AI applications in resource-constrained environments handle the role there.

Cite

CITATION STYLE

APA

Burhanuddin, M. A. (2023). Efficient Hardware Acceleration Techniques for Deep Learning on Edge Devices: A Comprehensive Performance Analysis. KHWARIZMIA, 2023, 103–112. https://doi.org/10.70470/khwarizmia/2023/010

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free