A Lightweight Deep Neural Network Using a Mixer-Type Nonlinear Vector Autoregression

0Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The design of a lightweight deep learning model would be an ideal solution for overcoming resource limitations when implementing artificial intelligence in edge sites. In this study, we propose a lightweight deep neural network that uses a Mixer-type architecture based on nonlinear vector autoregression (NVAR), which we refer to as Mixer-type NVAR. We applied overlapping patch embedding to enrich the image input and Sequencer architecture for vertical and horizontal operation inside the Mixer-type NVAR. We utilized a window partition technique and general quadratic positional encoding to increase the performance of the proposed model. Our model achieved a top-1 accuracy of 82.48% for the CIFAR-10 dataset with 0.159 M parameters and 98.36% for MNIST with 0.106 M parameters. Moreover, we evaluated its throughput on a central processing unit, which was 190.1 images per second for CIFAR-10 and 106.7 images per second for the MNIST dataset. These results are competitive with the state-of-the-art convolutional neural network-based model, MLP-Mixer, and the traditional reservoir-computing-based Mixer model with the same tuning of hyperparameters.

Cite

CITATION STYLE

APA

Diana, M., Amin, R., Amagasaki, M., & Kiyama, M. (2023). A Lightweight Deep Neural Network Using a Mixer-Type Nonlinear Vector Autoregression. IEEE Access, 11, 103544–103553. https://doi.org/10.1109/ACCESS.2023.3318873

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free