Differentially Private Multi-Armed Bandits in the Shuffle Model

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

We give an (ε, δ)-differentially private algorithm for the multi-armed bandit (MAB) problem in the shuffle model with a distribution-dependent regret oflog∆aT +klog δ 1 log T , and a distribution-independent regretO P a∈[k]:∆a>0 ε of O √kT log T +klog δ 1 log T , where T is the number of rounds, ∆a is the ε suboptimality gap of the arm a, and k is the total number of arms. Our upper bound almost matches the regret of the best known algorithms for the centralized model, and significantly outperforms the best known algorithm in the local model.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
EditorsMarc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
PublisherNeural information processing systems foundation
Pages24956-24967
Number of pages12
ISBN (Electronic)9781713845393
StatePublished - 2021
Event35th Conference on Neural Information Processing Systems, NeurIPS 2021 - Virtual, Online
Duration: 6 Dec 202114 Dec 2021

Publication series

NameAdvances in Neural Information Processing Systems
Volume30
ISSN (Print)1049-5258

Conference

Conference35th Conference on Neural Information Processing Systems, NeurIPS 2021
CityVirtual, Online
Period6/12/2114/12/21

Funding

FundersFunder number
Yandex Initiative for Machine Learning
European Research Council
Blavatnik Family Foundation
Israel Science Foundation993/17,1595/19,1871/19
German-Israeli Foundation for Scientific Research and Development1367/2017
Horizon 2020 Framework Programme882396
Tel Aviv University

    Fingerprint

    Dive into the research topics of 'Differentially Private Multi-Armed Bandits in the Shuffle Model'. Together they form a unique fingerprint.

    Cite this