Better Algorithms for Stochastic Bandits with Adversarial Corruptions

Anupam Gupta, Tomer Koren, Kunal Talwar

Research output: Contribution to journalConference articlepeer-review

72 Scopus citations

Abstract

We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.

Original languageEnglish
Pages (from-to)1562-1578
Number of pages17
JournalProceedings of Machine Learning Research
Volume99
StatePublished - 2019
Externally publishedYes
Event32nd Conference on Learning Theory, COLT 2019 - Phoenix, United States
Duration: 25 Jun 201928 Jun 2019

Funding

FundersFunder number
National Science FoundationCCF-1536002, CCF-1617790, CCF-1540541

    Fingerprint

    Dive into the research topics of 'Better Algorithms for Stochastic Bandits with Adversarial Corruptions'. Together they form a unique fingerprint.

    Cite this