Van den Berg et al., 2012 - Google Patents
Efficient approximate value iteration for continuous Gaussian POMDPs
- Document ID
- 14130097962501276761
- Author
- Van den Berg J
- Patil S
- Alterovitz R
- Publication year
- 2012
- Publication venue
- Proceedings of the AAAI Conference on Artificial Intelligence
Snippet
We introduce a highly efficient method for solving continuous partially-observable Markov decision processes (POMDPs) in which beliefs can be modeled using Gaussian distributions over the state space. Our method enables fast solutions to sequential decision …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Van Den Berg et al. | Motion planning under uncertainty using differential dynamic programming in belief space | |
| Montiel et al. | Optimal path planning generation for mobile robots using parallel evolutionary artificial potential field | |
| Van Den Berg et al. | Motion planning under uncertainty using iterative local optimization in belief space | |
| Zhuang et al. | Efficient collision-free path planning for autonomous underwater vehicles in dynamic environments with a hybrid optimization algorithm | |
| Van den Berg et al. | Efficient approximate value iteration for continuous Gaussian POMDPs | |
| Lindemann et al. | Simple and efficient algorithms for computing smooth, collision-free feedback laws over given cell decompositions | |
| Park et al. | Formation reconfiguration control with collision avoidance of nonholonomic mobile robots | |
| Andersson et al. | Model-based reinforcement learning in continuous environments using real-time constrained optimization | |
| Poonganam et al. | Reactive navigation under non-parametric uncertainty through hilbert space embedding of probabilistic velocity obstacles | |
| Han et al. | Stable learning-based tracking control of underactuated balance robots | |
| Gopalakrishnan et al. | Solving chance-constrained optimization under nonparametric uncertainty through hilbert space embedding | |
| Levihn et al. | Planning with movable obstacles in continuous environments with uncertain dynamics | |
| Morere et al. | Continuous state-action-observation POMDPs for trajectory planning with Bayesian optimisation | |
| Nayak et al. | Bidirectional sampling-based motion planning without two-point boundary value solution | |
| Pshikhopov et al. | Trajectory planning algorithms in two-dimensional environment with obstacles | |
| Dunlap et al. | Motion planning for mobile robots via sampling-based model predictive optimization | |
| Rafieisakhaei et al. | Feedback motion planning under non-gaussian uncertainty and non-convex state constraints | |
| Michaux et al. | Can't Touch This: Real-Time, Safe Motion Planning and Control for Manipulators Under Uncertainty | |
| Abdulghafoor et al. | Multi-agent distributed optimal control for tracking large-scale multi-target systems in dynamic environments | |
| Liang et al. | Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models | |
| Snyder et al. | Online learning for obstacle avoidance | |
| Huang et al. | Risk conditioned neural motion planning | |
| Bui et al. | Improving the efficiency of sampling-based motion planners via runtime predictions for motion-planning problems with dynamics | |
| Csomay-Shanklin et al. | Robust agility via learned zero dynamics policies | |
| Bhargava et al. | An omnidirectional mecanum wheel automated guided vehicle control using hybrid modified A* algorithm |