Better Algorithms for Stochastic Bandits with Adversarial Corruptions

Anupam Gupta, Tomer Koren, Kunal Talwar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.
Original languageEnglish
Title of host publicationProceedings of the Thirty-Second Conference on Learning Theory
EditorsAlina Beygelzimer, Daniel Hsu
PublisherPMLR
Pages1562-1578
Number of pages17
StatePublished - 2019
Externally publishedYes
Event32nd Annual Conference on Learning Theory, COLT 2019 - Phoenix, United States
Duration: 25 Jun 201928 Jun 2019
Conference number: 32

Publication series

NameProceedings of Machine Learning Research
PublisherPMLR
Volume99
ISSN (Electronic)2640-3498

Conference

Conference32nd Annual Conference on Learning Theory, COLT 2019
Abbreviated titleCOLT 2019
Country/TerritoryUnited States
CityPhoenix
Period25/06/1928/06/19

Fingerprint

Dive into the research topics of 'Better Algorithms for Stochastic Bandits with Adversarial Corruptions'. Together they form a unique fingerprint.

Cite this