Efficient learning with robust gradient descent

Matthew J. Holland; Kazushi Ikeda

Journal ArticleOPEN ACCESS

Efficient learning with robust gradient descent

Machine Learning (2019) 108(8-9) 1523-1560

DOI: 10.1007/s10994-019-05802-5

11Citations

37Readers

Abstract

Minimizing the empirical risk is a popular training strategy, but for learning tasks where the data may be noisy or heavy-tailed, one may require many observations in order to generalize well. To achieve better performance under less stringent requirements, we introduce a procedure which constructs a robust approximation of the risk gradient for use in an iterative learning routine. Using high-probability bounds on the excess risk of this algorithm, we show that our update does not deviate far from the ideal gradient-based update. Empirical tests using both controlled simulations and real-world benchmark data show that in diverse settings, the proposed procedure can learn more efficiently, using less resources (iterations and observations) while generalizing better.

Author supplied keywords

Cite

CITATION STYLE

APA

Holland, M. J., & Ikeda, K. (2019). Efficient learning with robust gradient descent. Machine Learning, 108(8–9), 1523–1560. https://doi.org/10.1007/s10994-019-05802-5

Efficient learning with robust gradient descent

Abstract

Author supplied keywords

Cite

Register to see more suggestions