Key points are not available for this paper at this time.
This is a study of finite state discrete time discounted Markovian decision process when the states are probabilistically observed. A model of this process is formulated, and an implicit enumeration algorithm is presented which optimizes the total expected discounted reward given the initial state. Several numerical examples are presented.
Satia et al. (Sat,) studied this question.