Choi et al., 2020 - Google Patents

Reinforcement learning for safety-critical control under model uncertainty, using control lyapunov functions and control barrier functions

Choi et al., 2020

View PDF

Document ID: 12439371586503751518
Author: Choi J; Castaneda F; Tomlin C; Sreenath K
Publication year: 2020
Publication venue: arXiv preprint arXiv:2004.07584

External Links

Cited by

Snippet

In this paper, the issue of model uncertainty in safety-critical control is addressed with a data- driven approach. For this purpose, we utilize the structure of an input-ouput linearization controller based on a nominal model along with a Control Barrier Function and Control …

Continue reading at arxiv.org (PDF) (other versions)

230000002787 reinforcement 0 title abstract description 14

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0205—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system
- G05B13/021—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system in which a variable is automatically adjusted to optimise the performance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/13—Differential equations

Similar Documents

Publication	Publication Date	Title
Choi et al.	2020	Reinforcement learning for safety-critical control under model uncertainty, using control lyapunov functions and control barrier functions
Liu et al.	2015	Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints
Han et al.	2012	Precise positioning of nonsmooth dynamic systems using fuzzy wavelet echo state networks and dynamic surface sliding mode control
He et al.	2015	Neural network control of a robotic manipulator with input deadzone and output constraint
Zeinali et al.	2010	Adaptive sliding mode control with uncertainty estimator for robot manipulators
Westenbroek et al.	2020	Feedback linearization for uncertain systems via reinforcement learning
Izadbakhsh et al.	2020	Robust adaptive control of robot manipulators using Bernstein polynomials as universal approximator
Wang et al.	2021	Dynamic learning from adaptive neural control for discrete-time strict-feedback systems
Vu et al.	2021	Sliding variable-based online adaptive reinforcement learning of uncertain/disturbed nonlinear mechanical systems
Qin et al.	2021	Adaptive interval type-2 fuzzy fixed-time control for underwater walking robot with error constraints and actuator faults using prescribed performance terminal sliding-mode surfaces
Tang et al.	2018	Adaptive critic design for pure-feedback discrete-time MIMO systems preceded by unknown backlashlike hysteresis
Van Kien et al.	2019	Adaptive fuzzy sliding mode control for nonlinear uncertain SISO system optimized by differential evolution algorithm
Dai et al.	2017	Transverse function approach to practical stabilisation of underactuated surface vessels with modelling uncertainties and unknown disturbances
Wang et al.	2023	Adaptive dynamic programming-based optimal control for nonlinear state constrained systems with input delay
Marvi et al.	2022	Reinforcement learning with safety and stability guarantees during exploration for linear systems
Ben Hazem	2024	Study of Q-learning and deep Q-network learning control for a rotary inverted pendulum system
Modares et al.	2023	Safe reinforcement learning via a model-free safety certifier
Zhu et al.	2023	Robust constraint-following control for uncertain mechanical systems with generalized Udwadia-Kalaba approach
Kolaric et al.	2020	Local policy optimization for trajectory-centric reinforcement learning
Zhu et al.	2025	Cooperative game-theoretic optimization of adaptive robust constraint-following control for fuzzy mechanical systems under inequality constraints
Youssef et al.	2025	Reinforcement learning-enhanced adaptive sliding mode control for nonlinear systems
Yang et al.	2023	Optimized tracking control using reinforcement learning strategy for a class of nonlinear systems
Bałazy et al.	2024	Neural algorithm for optimization of multidimensional object controller parameters
Wan et al.	2019	Adaptive neural globally asymptotic tracking control for a class of uncertain nonlinear systems
Bachhuber et al.	2023	Neural odes for data-driven automatic self-design of finite-time output feedback control for unknown nonlinear dynamics