Abstract
We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.
Original language | English |
---|---|
Title of host publication | Proceedings of the Thirty-Second Conference on Learning Theory |
Editors | Alina Beygelzimer, Daniel Hsu |
Publisher | PMLR |
Pages | 1562-1578 |
Number of pages | 17 |
State | Published - 2019 |
Externally published | Yes |
Event | 32nd Annual Conference on Learning Theory, COLT 2019 - Phoenix, United States Duration: 25 Jun 2019 → 28 Jun 2019 Conference number: 32 |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
Publisher | PMLR |
Volume | 99 |
ISSN (Electronic) | 2640-3498 |
Conference
Conference | 32nd Annual Conference on Learning Theory, COLT 2019 |
---|---|
Abbreviated title | COLT 2019 |
Country/Territory | United States |
City | Phoenix |
Period | 25/06/19 → 28/06/19 |