Abstract
We study sequential decision problems where the decision maker does not observe the states of nature, but rather receives a noisy signal, whose distribution depends on the current state and on the action that she plays. We do not assume that the decision maker considers the worst-case scenario, but rather has a response correspondence, which maps distributions over signals to subjective best responses. We extend the concept of internal regret-free strategy to this setup and provide an algorithm that generates such a strategy.
Original language | English |
---|---|
Pages (from-to) | 112-138 |
Number of pages | 27 |
Journal | Dynamic Games and Applications |
Volume | 6 |
Issue number | 1 |
DOIs | |
State | Published - 1 Mar 2016 |
Keywords
- Approachability
- Imperfect monitoring
- Internal no regret
- No regret
- response correspondence