Abstract
Running machine learning inference on tiny devices, known as TinyML, is an emerging research area. It requires generating inference code that uses memory frugally, which standard ML frameworks are ill-suited for. A deployment framework for TinyML must a) be parametric in the number representation, to take advantage of emerging representations such as posits, b) carefully assign high precision to a few tensors so that most tensors can be kept in low precision while still maintaining model accuracy, and c) avoid memory fragmentation. We describe MinUn, the first TinyML framework that holistically addresses these issues to generate efficient code for ARM microcontrollers (e.g., Arduino Uno, Due and STM32H747) that outperforms prior TinyML frameworks.
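Requirement (b) can be illustrated with a toy sketch. This is not MinUn's algorithm, which the abstract does not describe; it is a generic greedy heuristic under stated assumptions. The function `assign_precision`, the tensor names, the sizes, the 8/16-bit widths and the additive accuracy model are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List


def assign_precision(
    tensors: List[str],
    sizes: Dict[str, int],
    accuracy_of: Callable[[Dict[str, int]], float],
    target_accuracy: float,
    low_bits: int = 8,
    high_bits: int = 16,
) -> Dict[str, int]:
    """Start with every tensor in low precision, then repeatedly promote the
    tensor with the best accuracy gain per extra byte until the target
    accuracy is reached or no low-precision tensor is left."""
    bits = {t: low_bits for t in tensors}
    while accuracy_of(bits) < target_accuracy:
        candidates = [t for t in tensors if bits[t] == low_bits]
        if not candidates:
            break  # everything is already high precision
        base = accuracy_of(bits)

        def gain_per_byte(t: str) -> float:
            trial = dict(bits)
            trial[t] = high_bits
            extra_bytes = sizes[t] * (high_bits - low_bits) / 8
            return (accuracy_of(trial) - base) / extra_bytes

        best = max(candidates, key=gain_per_byte)
        bits[best] = high_bits
    return bits


# Hypothetical model: three tensors with element counts and made-up
# accuracy gains obtained when a tensor is kept in high precision.
SIZES = {"conv1": 1000, "conv2": 4000, "fc": 500}
SENSITIVITY = {"conv1": 0.02, "conv2": 0.15, "fc": 0.10}


def toy_accuracy(bits: Dict[str, int]) -> float:
    # Assumed additive model: 0.70 baseline plus the gain of each promoted tensor.
    return 0.70 + sum(SENSITIVITY[t] for t, b in bits.items() if b == 16)


plan = assign_precision(list(SIZES), SIZES, toy_accuracy, target_accuracy=0.90)
print(plan)
```

In this toy setting the heuristic promotes `fc` first (largest gain per byte), then `conv2`, and leaves `conv1` in 8 bits. In a real deployment the accuracy oracle would be an evaluation of the quantized model on a validation set, which is far more expensive than the lookup used here.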
Jaiswal, S., Goli, R. K. K., Kumar, A., Seshadri, V., & Sharma, R. (2023). MinUn: Accurate ML Inference on Microcontrollers. In Proceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES) (pp. 26–39). Association for Computing Machinery. https://doi.org/10.1145/3589610.3596278