TY - JOUR
T1 - Practical Nonparametric Sampling Strategies for Quantile-Based Ordinal Optimization
AU - Shin, Dongwook
AU - Broadie, Mark
AU - Zeevi, Assaf
N1 - Publisher Copyright:
© 2021 INFORMS.
PY - 2022/3
Y1 - 2022/3
N2 - Given a finite number of stochastic systems, the goal of our problemis to dynamically allocate a finite sampling budget to maximize the probability of selecting the "best"system. Systems are encoded with the probability distributions that govern sample observations, which are unknown and only assumed to belong to a broad family of distributions that need not admit any parametric representation. The best system is defined as the one with the highest quantile value. The objective of maximizing the probability of selecting this best system is not analytically tractable. In lieu of that, we use the rate function for the probability of error relying on large deviations theory. Our point of departure is an algorithmthat naively combines sequential estimation andmyopic optimization. This algorithm is shown to be asymptotically optimal; however, it exhibits poor finite-time performance and does not lead itself to implementation in settings with a large number of systems. To address this, we propose practically implementable variants that retain the asymptotic performance of the former while dramatically improving its finite-time performance.
AB - Given a finite number of stochastic systems, the goal of our problemis to dynamically allocate a finite sampling budget to maximize the probability of selecting the "best"system. Systems are encoded with the probability distributions that govern sample observations, which are unknown and only assumed to belong to a broad family of distributions that need not admit any parametric representation. The best system is defined as the one with the highest quantile value. The objective of maximizing the probability of selecting this best system is not analytically tractable. In lieu of that, we use the rate function for the probability of error relying on large deviations theory. Our point of departure is an algorithmthat naively combines sequential estimation andmyopic optimization. This algorithm is shown to be asymptotically optimal; however, it exhibits poor finite-time performance and does not lead itself to implementation in settings with a large number of systems. To address this, we propose practically implementable variants that retain the asymptotic performance of the former while dramatically improving its finite-time performance.
KW - large deviations theory
KW - ordinal optimization
KW - quantile
KW - tractable procedures
UR - http://www.scopus.com/inward/record.url?scp=85134487013&partnerID=8YFLogxK
U2 - 10.1287/ijoc.2021.1071
DO - 10.1287/ijoc.2021.1071
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85134487013
SN - 1091-9856
VL - 34
SP - 752
EP - 768
JO - INFORMS Journal on Computing
JF - INFORMS Journal on Computing
IS - 2
ER -