Submodular batch selection for training deep neural networks


Abstract

Mini-batch gradient-descent-based methods are the de facto algorithms for training neural network architectures today. We introduce a mini-batch selection strategy based on submodular function maximization. Our novel submodular formulation captures the informativeness of each sample and the diversity of the whole subset. We design an efficient greedy algorithm that gives high-quality solutions to this NP-hard combinatorial optimization problem. Our extensive experiments on standard datasets show that deep models trained using the proposed batch selection strategy generalize better than Stochastic Gradient Descent as well as a popular baseline sampling strategy across different learning rates, batch sizes, and distance metrics.
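The paper's exact submodular objective is not reproduced here. As an illustration only, the sketch below greedily maximizes a generic monotone submodular objective that combines a per-sample informativeness score (e.g., the current training loss) with a facility-location diversity term over pairwise similarities; both terms, the function names, and the trade-off parameter `lam` are assumptions for this example, not the authors' formulation. The greedy rule is the standard one for monotone submodular maximization and enjoys the usual (1 - 1/e) approximation guarantee.

```python
# Illustrative sketch (assumed objective, not the paper's exact formulation):
# greedily pick a mini-batch that maximizes
#   f(S) = sum_{i in S} informativeness[i]
#        + lam * sum_j max_{i in S} similarity[j, i]   (facility-location diversity)
import numpy as np

def greedy_batch_selection(informativeness, similarity, batch_size, lam=0.5):
    """Select `batch_size` indices by greedy submodular maximization.

    informativeness: (n,) array of per-sample scores, e.g. current loss values.
    similarity:      (n, n) array of pairwise similarities in [0, 1].
    """
    n = informativeness.shape[0]
    selected = []
    # coverage[j] = max similarity of sample j to the current selection
    coverage = np.zeros(n)
    candidates = set(range(n))
    for _ in range(batch_size):
        best_gain, best_i = -np.inf, None
        for i in candidates:
            # Marginal gain of adding i: its own informativeness plus the
            # increase in how well the rest of the pool is covered.
            gain = informativeness[i] + lam * np.maximum(similarity[:, i] - coverage, 0).sum()
            if gain > best_gain:
                best_gain, best_i = gain, i
        selected.append(best_i)
        candidates.remove(best_i)
        coverage = np.maximum(coverage, similarity[:, best_i])
    return selected
```

In a training loop, one would typically recompute `informativeness` (e.g., per-sample losses) and `similarity` (e.g., cosine similarity of feature embeddings) periodically, then call the routine above to form each mini-batch instead of sampling uniformly at random.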

Citation (APA)

Joseph, K. J., Vamshi Teja, R., Singh, K., & Balasubramanian, V. N. (2019). Submodular batch selection for training deep neural networks. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2019-August, pp. 2677–2683). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/372
