A survey of solution techniques for the partially observed Markov decision process

Engineering Combinatorics Economics Operations Research/Decision Theory 0211 other engineering and technologies Theory of Computation Industrial and Operations Engineering Business 02 engineering and technology Economics / Management Science Management
DOI: 10.1007/bf02204836 Publication Date: 2005-10-05T14:45:22Z
ABSTRACT
We survey several computational procedures for the partially observed Markov decision process (POMDP) that have been developed since the Monahan survey was published in 1982. The POMDP generalizes the standard, completely observed Markov decision process by permitting the possibility that state observations may be noise-corrupted and/or costly. Several computational procedures presented are convergence accelerating variants of, or approximations to, the Smallwood-Sondik algorithm. Finite-memory suboptimal design results are reported, and new research directions involving heuristic search are discussed.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (29)
CITATIONS (106)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....