Operator-Learning-Inspired Modeling of Neural Ordinary Differential Equations

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hidden state, denoted dhdt(t) . People have habitually used conventional neural network architectures, e.g., fully-connected layers followed by non-linear activations. In this paper, however, we present a neural operator-based method to define the time-derivative term. Neural operators were initially proposed to model the differential operator of partial differential equations (PDEs). Since the time-derivative of NODEs can be understood as a special type of the differential operator, our proposed method, called branched Fourier neural operator (BFNO), makes sense. In our experiments with general downstream tasks, our method significantly outperforms existing methods.

Cite

CITATION STYLE

APA

Cho, W., Cho, S., Jin, H., Jeon, J., Lee, K., Hong, S., … Park, N. (2024). Operator-Learning-Inspired Modeling of Neural Ordinary Differential Equations. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, pp. 11543–11551). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v38i10.29036

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free