Citiussynapse: A deep learning framework for embedded systems

1Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

As embedded systems, such as smartphones with limited resources, have become increas-ingly popular, active research has recently been conducted on performing on-device deep learning in such systems. Therefore, in this study, we propose a deep learning framework that is specialized for embedded systems with limited resources, the operation processing structure of which differs from that of standard PCs. The proposed framework supports an OpenCL-based accelerator engine for accelerator deep learning operations in various embedded systems. Moreover, the parallel processing performance of OpenCL is maximized through an OpenCL kernel that is optimized for embedded GPUs, and the structural characteristics of embedded systems, such as unified memory. Furthermore, an on-device optimizer for optimizing the performance in on-device environments, and model con-verters for compatibility with conventional frameworks, are provided. The results of a performance evaluation show that the proposed on-device framework outperformed conventional methods.

Cite

CITATION STYLE

APA

Hong, S., Cho, H., & Kim, J. S. (2021). Citiussynapse: A deep learning framework for embedded systems. Applied Sciences (Switzerland), 11(23). https://doi.org/10.3390/app112311570

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free