ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions

3Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

An activation function is a crucial component of a neural network that introduces non-linearity in the network. The state-of-the-art performance of a neural network depends also on the perfect choice of an activation function. We propose two novel non-monotonic smooth trainable activation functions, called ErfAct and Pserf. Experiments suggest that the proposed functions improve the network performance significantly compared to the widely used activations like ReLU, Swish, and Mish. Replacing ReLU by ErfAct and Pserf, we have 5.68% and 5.42% improvement for Top-1 accuracy on Shufflenet V2 (2.0x) network in CIFAR100 dataset, 2.11% and 1.96% improvement for Top-1 accuracy on Shufflenet V2 (2.0x) network in CIFAR10 dataset, 1.0%, and 1.0% improvement on mean average precision (mAP) on SSD300 model in Pascal VOC dataset.

Cite

CITATION STYLE

APA

Biswas, K., Kumar, S., Banerjee, S., & Pandey, A. K. (2022). ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 6097–6105). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i6.20557

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free