TY - JOUR
T1 - The principle of optimality in dynamic programming
T2 - A pedagogical note
AU - Light, Bar
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/11
Y1 - 2024/11
N2 - The principle of optimality is a fundamental aspect of dynamic programming, which states that the optimal solution to a dynamic optimization problem can be found by combining the optimal solutions to its sub-problems. While this principle is generally applicable, it is often only taught for problems with finite or countable state spaces in order to sidestep measure-theoretic complexities. Therefore, it cannot be applied to classic models such as inventory management and dynamic pricing models that have continuous state spaces, and students may not be aware of the possible challenges involved in studying dynamic programming models with general state spaces. To address this, we provide conditions and a self-contained simple proof that establish when the principle of optimality for discounted dynamic programming is valid. These conditions shed light on the difficulties that may arise in the general state space case. We provide examples from the literature that include the relatively involved case of universally measurable dynamic programming and the simple case of finite dynamic programming where our main result can be applied to show that the principle of optimality holds.
AB - The principle of optimality is a fundamental aspect of dynamic programming, which states that the optimal solution to a dynamic optimization problem can be found by combining the optimal solutions to its sub-problems. While this principle is generally applicable, it is often only taught for problems with finite or countable state spaces in order to sidestep measure-theoretic complexities. Therefore, it cannot be applied to classic models such as inventory management and dynamic pricing models that have continuous state spaces, and students may not be aware of the possible challenges involved in studying dynamic programming models with general state spaces. To address this, we provide conditions and a self-contained simple proof that establish when the principle of optimality for discounted dynamic programming is valid. These conditions shed light on the difficulties that may arise in the general state space case. We provide examples from the literature that include the relatively involved case of universally measurable dynamic programming and the simple case of finite dynamic programming where our main result can be applied to show that the principle of optimality holds.
KW - Bellman equation
KW - Dynamic optimization
KW - Dynamic programming
KW - Principle of optimality
UR - http://www.scopus.com/inward/record.url?scp=85202015901&partnerID=8YFLogxK
U2 - 10.1016/j.orl.2024.107164
DO - 10.1016/j.orl.2024.107164
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85202015901
SN - 0167-6377
VL - 57
JO - Operations Research Letters
JF - Operations Research Letters
M1 - 107164
ER -