Real-Time Decision Making for Large POMDPs

2005, Lecture Notes in Computer Science

https://doi.org/10.1007/11424918_49

Abstract

In this paper, we introduce RTBSS (Real-Time Belief Space Search), an approach to real-time decision making in large POMDPs. The approach is based on a look-ahead search that is run online each time the agent has to make a decision. RTBSS is particularly interesting for large real-time environments, where offline solution methods are not applicable because of their complexity.
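To make the idea of an online look-ahead search in belief space concrete, here is a minimal Python sketch. It is not the RTBSS algorithm itself, whose details the abstract does not give: it is a plain depth-limited expectimax over beliefs, run on the classic two-state "tiger" problem as a stand-in model. The model, the constants, the zero value at leaf beliefs, and the function names (`transition`, `observe`, `reward`, `belief_update`, `lookahead`) are all assumptions made for illustration.

```python
# Hypothetical toy POMDP (the "tiger" problem); all values are illustrative.
STATES = ["tiger-left", "tiger-right"]
ACTIONS = ["listen", "open-left", "open-right"]
OBSERVATIONS = ["hear-left", "hear-right"]
GAMMA = 0.95  # discount factor (assumed)


def transition(s, a):
    """Return P(s' | s, a) as a dict. Opening a door resets the tiger."""
    if a == "listen":
        return {s: 1.0}
    return {sp: 0.5 for sp in STATES}


def observe(a, sp):
    """Return P(o | a, s') as a dict. Listening is correct 85% of the time."""
    if a == "listen":
        correct = "hear-left" if sp == "tiger-left" else "hear-right"
        return {o: (0.85 if o == correct else 0.15) for o in OBSERVATIONS}
    return {o: 0.5 for o in OBSERVATIONS}


def reward(s, a):
    """Immediate reward R(s, a)."""
    if a == "listen":
        return -1.0
    opened_tiger_door = (a == "open-left") == (s == "tiger-left")
    return -100.0 if opened_tiger_door else 10.0


def belief_update(b, a, o):
    """Bayes filter: return (P(o | b, a), posterior belief)."""
    unnorm = {}
    for sp in STATES:
        predicted = sum(b[s] * transition(s, a).get(sp, 0.0) for s in STATES)
        unnorm[sp] = observe(a, sp)[o] * predicted
    p_o = sum(unnorm.values())
    if p_o == 0.0:
        return 0.0, dict(b)  # observation impossible under (b, a)
    return p_o, {sp: v / p_o for sp, v in unnorm.items()}


def lookahead(b, depth):
    """Depth-limited expectimax over beliefs: return (value, best action)."""
    if depth == 0:
        return 0.0, None  # assumption: leaf beliefs are valued at zero
    best_value, best_action = float("-inf"), None
    for a in ACTIONS:
        # Expected immediate reward under the current belief.
        value = sum(b[s] * reward(s, a) for s in STATES)
        # Expected discounted value over the observations that may follow.
        for o in OBSERVATIONS:
            p_o, b_next = belief_update(b, a, o)
            if p_o > 0.0:
                value += GAMMA * p_o * lookahead(b_next, depth - 1)[0]
        if value > best_value:
            best_value, best_action = value, a
    return best_value, best_action


if __name__ == "__main__":
    uniform = {"tiger-left": 0.5, "tiger-right": 0.5}
    print(lookahead(uniform, depth=3))
```

An agent using this scheme would call `lookahead` on its current belief at every decision step, execute the returned action, and then call `belief_update` with the observation it actually receives. The search cost grows as (|A| x |O|)^depth, which is why a practical real-time method has to bound the depth or prune the tree; this sketch does neither beyond the fixed depth.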
