TY - JOUR
T1 - Competing Bandits
T2 - The Perils of Exploration Under Competition
AU - Aridor, Guy
AU - Mansour, Yishay
AU - Slivkins, Aleksandrs
AU - Wu, Steven
N1 - Publisher Copyright: © 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2025/2/7
Y1 - 2025/2/7
N2 - Most online platforms learn from interactions with users and engage in exploration: making potentially suboptimal choices to acquire new information. We study the interplay between exploration and competition: how such platforms balance the exploration for learning and competition for users. We consider a stylized duopoly in which two firms face the same multi-armed bandit problem. Users arrive one by one and choose between the two firms, so that each firm makes progress on its bandit problem only if it is chosen. We study whether competition incentivizes the adoption of better algorithms. We find that stark competition disincentivizes exploration, leading to low welfare. However, weaker competition incentivizes better exploration algorithms and increases welfare. We investigate two channels for weakening the competition: stochastic user choice models and a first-mover advantage. Our findings speak to the competition-innovation relationship and the first-mover advantage in the digital economy.
AB - Most online platforms learn from interactions with users and engage in exploration: making potentially suboptimal choices to acquire new information. We study the interplay between exploration and competition: how such platforms balance the exploration for learning and competition for users. We consider a stylized duopoly in which two firms face the same multi-armed bandit problem. Users arrive one by one and choose between the two firms, so that each firm makes progress on its bandit problem only if it is chosen. We study whether competition incentivizes the adoption of better algorithms. We find that stark competition disincentivizes exploration, leading to low welfare. However, weaker competition incentivizes better exploration algorithms and increases welfare. We investigate two channels for weakening the competition: stochastic user choice models and a first-mover advantage. Our findings speak to the competition-innovation relationship and the first-mover advantage in the digital economy.
KW - Multi-armed bandits
KW - competition vs. innovation
KW - exploration
UR - http://www.scopus.com/inward/record.url?scp=86000187129&partnerID=8YFLogxK
U2 - 10.1145/3711831
DO - 10.1145/3711831
M3 - Article
AN - SCOPUS:86000187129
SN - 2167-8375
VL - 13
JO - ACM Transactions on Economics and Computation
JF - ACM Transactions on Economics and Computation
IS - 1
M1 - 3
ER -