Online Optimization with Feedback Delay and Nonlinear Switching Cost

0Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We study a variant of online optimization in which the learner receives k-round delayed feedback about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio that is O(L2k), where L is the Lipschitz constant of the nonlinear switching cost. Additionally, we provide lower bounds that illustrate the Lipschitz condition is required and the dependencies on k and L are tight. Finally, via reductions, we show that this setting is closely related to online control problems with delay, nonlinear dynamics, and adversarial disturbances, where iROBD directly offers constant-competitive online policies. This extended abstract is an abridged version of [2].

Cite

CITATION STYLE

APA

Pan, W., Shi, G., Lin, Y., & Wierman, A. (2022). Online Optimization with Feedback Delay and Nonlinear Switching Cost. In Performance Evaluation Review (Vol. 50, pp. 81–82). Association for Computing Machinery. https://doi.org/10.1145/3489048.3522657

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free