TY - JOUR
T1 - Constrained no-regret learning
AU - Du, Ye
AU - Lehrer, Ehud
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/5
Y1 - 2020/5
N2 - We investigate a dynamic decision making problem with constraints. The decision maker is free to take any action as long as the empirical frequency of the actions played does not violate pre-specified constraints. In a case of violation the decision maker is penalized. We introduce the constrained no-regret learning model. In this model the set of alternative strategies, with which a dynamic decision policy is compared, is the set of stationary mixed actions that satisfy all the constraints. We show that there exists a strategy that satisfies the following properties: (i) it guarantees that after an unavoidable deterministic grace period, there are absolutely no violations; (ii) for an arbitrarily small constant ϵ>0, it achieves a convergence rate of T−[Formula presented], which improves the O(T−[Formula presented]) convergence rate of Mannor et al. (2009).
AB - We investigate a dynamic decision making problem with constraints. The decision maker is free to take any action as long as the empirical frequency of the actions played does not violate pre-specified constraints. In a case of violation the decision maker is penalized. We introduce the constrained no-regret learning model. In this model the set of alternative strategies, with which a dynamic decision policy is compared, is the set of stationary mixed actions that satisfy all the constraints. We show that there exists a strategy that satisfies the following properties: (i) it guarantees that after an unavoidable deterministic grace period, there are absolutely no violations; (ii) for an arbitrarily small constant ϵ>0, it achieves a convergence rate of T−[Formula presented], which improves the O(T−[Formula presented]) convergence rate of Mannor et al. (2009).
KW - Approachability
KW - Constrained no-regret
KW - No-regret strategy
KW - On-line learning algorithm
UR - http://www.scopus.com/inward/record.url?scp=85080118282&partnerID=8YFLogxK
U2 - 10.1016/j.jmateco.2020.02.002
DO - 10.1016/j.jmateco.2020.02.002
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85080118282
SN - 0304-4068
VL - 88
SP - 16
EP - 24
JO - Journal of Mathematical Economics
JF - Journal of Mathematical Economics
ER -