ERA: Enhanced Rational Activations


Abstract

Activation functions play a central role in deep learning since they form an essential building block of neural networks. In recent years, the focus has shifted towards investigating new types of activations that outperform the classical Rectified Linear Unit (ReLU) in modern neural architectures. Most recently, rational activation functions (RAFs) have attracted interest because they were shown to perform on par with state-of-the-art activations on image classification. Despite their apparent potential, prior formulations are either not safe, not smooth, or not “true” rational functions, and they only work with careful initialisation. To mitigate these issues, we propose a novel, enhanced rational function, ERA, and investigate how to better accommodate the specific needs of these activations in both network components and the training regime. In addition to being more stable, the proposed function outperforms other standard activations across a range of lightweight network architectures on two different tasks: image classification and 3D human pose and shape reconstruction.
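For intuition, a rational activation function is a learnable elementwise ratio of polynomials, f(x) = P(x)/Q(x). The sketch below illustrates the generic "safe" rational family (in the spirit of Padé Activation Units), where the denominator is bounded away from zero so the function has no poles. It is not the paper's ERA parameterisation: the polynomial degrees (5, 4), the initialisation scale, and the 1 + |Q(x)| denominator are assumptions made purely for illustration.

```python
import torch
import torch.nn as nn

class SafeRationalActivation(nn.Module):
    """Elementwise rational activation f(x) = P(x) / (1 + |Q(x)|).

    A minimal sketch of the generic "safe" rational family; the ERA
    formulation proposed in the paper differs. Degrees and the
    initialisation scale are illustrative assumptions.
    """

    def __init__(self, num_degree: int = 5, den_degree: int = 4):
        super().__init__()
        # Numerator coefficients a_0..a_m and denominator coefficients
        # b_1..b_n (Q has no constant term, so f(0) = a_0).
        self.a = nn.Parameter(0.1 * torch.randn(num_degree + 1))
        self.b = nn.Parameter(0.1 * torch.randn(den_degree))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Horner's scheme for P(x) = a_0 + a_1 x + ... + a_m x^m.
        p = self.a[-1]
        for coeff in reversed(self.a[:-1]):
            p = p * x + coeff
        # Horner's scheme for Q(x) = b_1 x + ... + b_n x^n.
        q = self.b[-1]
        for coeff in reversed(self.b[:-1]):
            q = q * x + coeff
        q = q * x
        # |Q| keeps the denominator >= 1, so the ratio is always safe.
        return p / (1.0 + q.abs())
```

Such a module can stand in for a fixed activation, e.g. `act = SafeRationalActivation(); y = act(torch.linspace(-3, 3, 100))`, with the polynomial coefficients trained jointly with the network weights.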

Cite

APA

Trimmel, M., Zanfir, M., Hartley, R., & Sminchisescu, C. (2022). ERA: Enhanced Rational Activations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13680 LNCS, pp. 722–738). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-20044-1_41
