How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel

Abstract

The neural tangent kernel (NTK) arose from the use of limiting arguments to study the theory of neural networks. NTKs are defined from neural network models in the infinite-width limit trained by gradient descent. Such over-parameterized models achieve good test accuracy in practice, and the success of the NTK highlights not only the importance of describing neural network models in the width limit h → ∞, but also the further development of deep learning theory for gradient flow in the step limit η → 0. NTKs can moreover be applied to a wide range of machine learning models. This review provides a comprehensive overview of the development of NTKs. First, the bias–variance tradeoff in statistics, over-parameterization and gradient descent in deep learning, and the widely used kernel methods are introduced. Second, research on the infinite-width limit of networks and the introduction of the NTK are reviewed, and the development and latest progress of NTK theory are discussed. Finally, work on transferring NTKs to neural networks with other architectures and on applying NTKs to other fields of machine learning is presented.
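For orientation, the kernel itself (the standard definition, stated here for reference rather than quoted from the article) is the inner product of parameter gradients of a network f(x; θ):

Θ(x, x′) = ∇_θ f(x; θ)ᵀ ∇_θ f(x′; θ).

In the width limit h → ∞ this kernel converges to a deterministic limit and, under gradient flow (the step limit η → 0), stays essentially constant during training, so the network's predictions evolve as kernel regression with Θ. A minimal sketch of the corresponding empirical (finite-width) NTK in JAX is given below; the two-layer architecture, width, and 1/√fan-in scaling are illustrative assumptions, not details taken from the article.

import jax
import jax.numpy as jnp

def init_params(key, d_in=10, width=512):
    # Width and 1/sqrt(fan-in) scaling are illustrative assumptions.
    k1, k2 = jax.random.split(key)
    return {
        "W1": jax.random.normal(k1, (d_in, width)) / jnp.sqrt(d_in),
        "W2": jax.random.normal(k2, (width, 1)) / jnp.sqrt(width),
    }

def f(params, x):
    # Scalar-output network f(x; theta).
    h = jnp.tanh(x @ params["W1"])
    return (h @ params["W2"]).squeeze()

def empirical_ntk(params, x1, x2):
    # Theta(x1, x2) = <grad_theta f(x1; theta), grad_theta f(x2; theta)>
    g1 = jax.grad(f)(params, x1)
    g2 = jax.grad(f)(params, x2)
    return sum(
        jnp.vdot(a, b)
        for a, b in zip(jax.tree_util.tree_leaves(g1), jax.tree_util.tree_leaves(g2))
    )

key = jax.random.PRNGKey(0)
params = init_params(key)
x1, x2 = jnp.ones(10), jnp.linspace(-1.0, 1.0, 10)
print(empirical_ntk(params, x1, x2))  # one entry Theta(x1, x2) of the NTK Gram matrix

Evaluating this quantity over all pairs of training inputs gives the Gram matrix that governs the kernel-regression training dynamics described above.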

Citation (APA)

Tan, Y., & Liu, H. (2024). How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel. International Journal of Multimedia Information Retrieval, 13(1). https://doi.org/10.1007/s13735-023-00318-0
