Online learning methods for border patrol resource allocation

Richard Klíma; Christopher Kiekintveld; Viliam Lisý

Journal Article

Online learning methods for border patrol resource allocation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8840 340-349

DOI: 10.1007/978-3-319-12601-2_20

11Citations

8Readers

Get full text

Abstract

We introduce a model for border security resource allocation with repeated interactions between attackers and defenders. The defender must learn the optimal resource allocation strategy based on historical apprehension data, balancing exploration and exploitation in the policy. We experiment with several solution methods for this online learning problem including UCB, sliding-window UCB, and EXP3. We test the learning methods against several different classes of attackers including attacker with randomly varying strategies and attackers who react adversarially to the defender’s strategy. We present experimental data to identify the optimal parameter settings for these algorithms and compare the algorithms against the different types of attackers.

Author supplied keywords

Cite

CITATION STYLE

APA

Klíma, R., Kiekintveld, C., & Lisý, V. (2014). Online learning methods for border patrol resource allocation. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8840, 340–349. https://doi.org/10.1007/978-3-319-12601-2_20

Online learning methods for border patrol resource allocation

Abstract

Author supplied keywords

Cite

Register to see more suggestions