Lee et al., 2017 - Google Patents

Gp-ilqg: Data-driven robust optimal control for uncertain nonlinear dynamical systems

Lee et al., 2017

Document ID: 16718317521497573031
Author: Lee G; Srinivasa S; Mason M
Publication year: 2017
Publication venue: arXiv preprint arXiv:1705.05344

External Links

Cited by

Snippet

As we aim to control complex systems, use of a simulator in model-based reinforcement learning is becoming more common. However, it has been challenging to overcome the Reality Gap, which comes from nonlinear model bias and susceptibility to disturbance. To …

Continue reading at arxiv.org (PDF) (other versions)

238000005183 dynamical system 0 title description 3

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis

Similar Documents

Publication	Publication Date	Title
Lee et al.	2017	Gp-ilqg: Data-driven robust optimal control for uncertain nonlinear dynamical systems
Fan et al.	2020	Deep learning tubes for tube MPC
Lew et al.	2022	Safe active dynamics learning and control: A sequential exploration–exploitation framework
Mukadam et al.	2016	Gaussian process motion planning
Polydoros et al.	2017	Survey of model-based reinforcement learning: Applications on robotics
Fan et al.	2019	A learning framework for high precision industrial assembly
Van Den Berg et al.	2012	Motion planning under uncertainty using iterative local optimization in belief space
Boedecker et al.	2014	Approximate real-time optimal control based on sparse gaussian process models
Venkatraman et al.	2016	Improved learning of dynamics models for control
US20180032868A1 (en)	2018-02-01	Early prediction of an intention of a user's actions
Wiedemann et al.	2018	Multi-agent exploration of spatial dynamical processes under sparsity constraints
Wiedemann et al.	2017	Probabilistic modeling of gas diffusion with partial differential equations for multi-robot exploration and gas source localization
Vinogradska et al.	2018	Numerical quadrature for probabilistic policy search
Akhare et al.	2023	Diffhybrid-uq: uncertainty quantification for differentiable hybrid neural modeling
Jin et al.	2018	Inverse optimal control with incomplete observations
Snyder et al.	2023	Online learning for obstacle avoidance
Menda et al.	2020	Scalable identification of partially observed systems with certainty-equivalent EM
Teng et al.	2025	Riemannian direct trajectory optimization of rigid bodies on matrix lie groups
Liang et al.	2024	Online control-informed learning
Deng et al.	2024	Adaptive gait modeling and optimization for principally kinematic systems
Ponton et al.	2016	Risk sensitive nonlinear optimal control with measurement uncertainty
Cheng	2020	Efficient and principled robot learning: theory and algorithms
Baldauf et al.	2023	Iterative learning-based model predictive control for mobile robots in space applications
Fan et al.	2025	Efficient Estimation of Relaxed Model Parameters for Robust UAV Trajectory Optimization
Osborne et al.	2021	A review of safe online learning for nonlinear control systems