Van den Berg et al., 2012 - Google Patents

Efficient approximate value iteration for continuous Gaussian POMDPs

Van den Berg et al., 2012

Document ID: 14130097962501276761
Author: Van den Berg J; Patil S; Alterovitz R
Publication year: 2012
Publication venue: Proceedings of the AAAI Conference on Artificial Intelligence

External Links

Cited by

Snippet

We introduce a highly efficient method for solving continuous partially-observable Markov decision processes (POMDPs) in which beliefs can be modeled using Gaussian distributions over the state space. Our method enables fast solutions to sequential decision …

Continue reading at ojs.aaai.org (PDF) (other versions)

238000000034 method 0 abstract description 6

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models

Similar Documents

Publication	Publication Date	Title
Van Den Berg et al.	2016	Motion planning under uncertainty using differential dynamic programming in belief space
Montiel et al.	2015	Optimal path planning generation for mobile robots using parallel evolutionary artificial potential field
Van Den Berg et al.	2012	Motion planning under uncertainty using iterative local optimization in belief space
Zhuang et al.	2016	Efficient collision-free path planning for autonomous underwater vehicles in dynamic environments with a hybrid optimization algorithm
Van den Berg et al.	2012	Efficient approximate value iteration for continuous Gaussian POMDPs
Lindemann et al.	2009	Simple and efficient algorithms for computing smooth, collision-free feedback laws over given cell decompositions
Park et al.	2023	Formation reconfiguration control with collision avoidance of nonholonomic mobile robots
Andersson et al.	2015	Model-based reinforcement learning in continuous environments using real-time constrained optimization
Poonganam et al.	2020	Reactive navigation under non-parametric uncertainty through hilbert space embedding of probabilistic velocity obstacles
Han et al.	2021	Stable learning-based tracking control of underactuated balance robots
Gopalakrishnan et al.	2021	Solving chance-constrained optimization under nonparametric uncertainty through hilbert space embedding
Levihn et al.	2013	Planning with movable obstacles in continuous environments with uncertain dynamics
Morere et al.	2018	Continuous state-action-observation POMDPs for trajectory planning with Bayesian optimisation
Nayak et al.	2022	Bidirectional sampling-based motion planning without two-point boundary value solution
Pshikhopov et al.	2022	Trajectory planning algorithms in two-dimensional environment with obstacles
Dunlap et al.	2011	Motion planning for mobile robots via sampling-based model predictive optimization
Rafieisakhaei et al.	2016	Feedback motion planning under non-gaussian uncertainty and non-convex state constraints
Michaux et al.	2025	Can't Touch This: Real-Time, Safe Motion Planning and Control for Manipulators Under Uncertainty
Abdulghafoor et al.	2023	Multi-agent distributed optimal control for tracking large-scale multi-target systems in dynamic environments
Liang et al.	2025	Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models
Snyder et al.	2023	Online learning for obstacle avoidance
Huang et al.	2021	Risk conditioned neural motion planning
Bui et al.	2022	Improving the efficiency of sampling-based motion planners via runtime predictions for motion-planning problems with dynamics
Csomay-Shanklin et al.	2024	Robust agility via learned zero dynamics policies
Bhargava et al.	2025	An omnidirectional mecanum wheel automated guided vehicle control using hybrid modified A* algorithm