Abstract
A general discrete decision process is formulated which includes both undiscounted and discounted semi-Markovian decision processes as special cases. A policy-iteration algorithm is presented and shown to converge to an optimal policy. Properties of the coupled functional equations are derived. Primal and dual linear programming formulations of the optimization problem are also given. An application is given to Markov ratio decision process.
Original language | English |
---|---|
Pages (from-to) | 173-186 |
Number of pages | 14 |
Journal | Mathematical Methods of Operations Research |
Volume | 21 |
Issue number | 5 |
DOIs | |
State | Published - Oct 1977 |