[go: up one dir, main page]

Van den Berg et al., 2012 - Google Patents

Efficient approximate value iteration for continuous Gaussian POMDPs

Van den Berg et al., 2012

View PDF
Document ID
14130097962501276761
Author
Van den Berg J
Patil S
Alterovitz R
Publication year
Publication venue
Proceedings of the AAAI Conference on Artificial Intelligence

External Links

Snippet

We introduce a highly efficient method for solving continuous partially-observable Markov decision processes (POMDPs) in which beliefs can be modeled using Gaussian distributions over the state space. Our method enables fast solutions to sequential decision …
Continue reading at ojs.aaai.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/002Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models

Similar Documents

Publication Publication Date Title
Van Den Berg et al. Motion planning under uncertainty using differential dynamic programming in belief space
Montiel et al. Optimal path planning generation for mobile robots using parallel evolutionary artificial potential field
Van Den Berg et al. Motion planning under uncertainty using iterative local optimization in belief space
Zhuang et al. Efficient collision-free path planning for autonomous underwater vehicles in dynamic environments with a hybrid optimization algorithm
Van den Berg et al. Efficient approximate value iteration for continuous Gaussian POMDPs
Lindemann et al. Simple and efficient algorithms for computing smooth, collision-free feedback laws over given cell decompositions
Park et al. Formation reconfiguration control with collision avoidance of nonholonomic mobile robots
Andersson et al. Model-based reinforcement learning in continuous environments using real-time constrained optimization
Poonganam et al. Reactive navigation under non-parametric uncertainty through hilbert space embedding of probabilistic velocity obstacles
Han et al. Stable learning-based tracking control of underactuated balance robots
Gopalakrishnan et al. Solving chance-constrained optimization under nonparametric uncertainty through hilbert space embedding
Levihn et al. Planning with movable obstacles in continuous environments with uncertain dynamics
Morere et al. Continuous state-action-observation POMDPs for trajectory planning with Bayesian optimisation
Nayak et al. Bidirectional sampling-based motion planning without two-point boundary value solution
Pshikhopov et al. Trajectory planning algorithms in two-dimensional environment with obstacles
Dunlap et al. Motion planning for mobile robots via sampling-based model predictive optimization
Rafieisakhaei et al. Feedback motion planning under non-gaussian uncertainty and non-convex state constraints
Michaux et al. Can't Touch This: Real-Time, Safe Motion Planning and Control for Manipulators Under Uncertainty
Abdulghafoor et al. Multi-agent distributed optimal control for tracking large-scale multi-target systems in dynamic environments
Liang et al. Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models
Snyder et al. Online learning for obstacle avoidance
Huang et al. Risk conditioned neural motion planning
Bui et al. Improving the efficiency of sampling-based motion planners via runtime predictions for motion-planning problems with dynamics
Csomay-Shanklin et al. Robust agility via learned zero dynamics policies
Bhargava et al. An omnidirectional mecanum wheel automated guided vehicle control using hybrid modified A* algorithm