TY - GEN
T1 - Learning what's going on
T2 - 16th ACM Conference on Economics and Computation, EC 2015
AU - Blum, Avrim
AU - Mansour, Yishay
AU - Morgenstern, Jamie
PY - 2015/6/15
Y1 - 2015/6/15
N2 - We consider a setting where n buyers, with combinatorial preferences over m items, and a seller, running a priority-based allocation mechanism, repeatedly interact. Our goal, from observing limited information about the results of these interactions, is to reconstruct both the preferences of the buyers and the mechanism of the seller. More specifically, we consider an online setting where at each stage, a subset of the buyers arrive and are allocated items, according to some unknown priority that the seller has among the buyers. Our learning algorithm observes only which buyers arrive and the allocation produced (or some function of the allocation, such as just which buyers received positive utility and which did not), and its goal is to predict the outcome for future subsets of buyers. For this task, the learning algorithm needs to reconstruct both the priority among the buyers and the preferences of each buyer. We derive mistake bound algorithms for additive, unit-demand and single minded buyers. We also consider the case where buyers' utilities for a fixed bundle can change between stages due to different (observed) prices. Our algorithms are efficient both in computation time and in the maximum number of mistakes (both polynomial in the number of buyers and items).
AB - We consider a setting where n buyers, with combinatorial preferences over m items, and a seller, running a priority-based allocation mechanism, repeatedly interact. Our goal, from observing limited information about the results of these interactions, is to reconstruct both the preferences of the buyers and the mechanism of the seller. More specifically, we consider an online setting where at each stage, a subset of the buyers arrive and are allocated items, according to some unknown priority that the seller has among the buyers. Our learning algorithm observes only which buyers arrive and the allocation produced (or some function of the allocation, such as just which buyers received positive utility and which did not), and its goal is to predict the outcome for future subsets of buyers. For this task, the learning algorithm needs to reconstruct both the priority among the buyers and the preferences of each buyer. We derive mistake bound algorithms for additive, unit-demand and single minded buyers. We also consider the case where buyers' utilities for a fixed bundle can change between stages due to different (observed) prices. Our algorithms are efficient both in computation time and in the maximum number of mistakes (both polynomial in the number of buyers and items).
KW - Mechanism design
KW - Mistake-bound learning
UR - http://www.scopus.com/inward/record.url?scp=84962090720&partnerID=8YFLogxK
U2 - 10.1145/2764468.2764492
DO - 10.1145/2764468.2764492
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84962090720
T3 - EC 2015 - Proceedings of the 2015 ACM Conference on Economics and Computation
SP - 601
EP - 618
BT - EC 2015 - Proceedings of the 2015 ACM Conference on Economics and Computation
PB - Association for Computing Machinery, Inc
Y2 - 15 June 2015 through 19 June 2015
ER -