Learning contraction policies from offline data
Rezazadeh et al., 2022
- Document ID
- 7435163304903492865
- Author
- Rezazadeh N
- Kolarich M
- Kia S
- Mehr N
- Publication year
- 2022
- Publication venue
- IEEE Robotics and Automation Letters
Snippet
This letter proposes a data-driven method for learning convergent control policies from offline data using contraction theory. Contraction theory enables constructing a policy that makes the closed-loop system trajectories inherently convergent towards a unique …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B11/00—Automatic controllers
- G05B11/01—Automatic controllers electric
- G05B11/32—Automatic controllers electric with inputs from more than one sensing element; with outputs to more than one correcting element
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
Similar Documents
| Publication | Title |
|---|---|
| Dai et al. | Lyapunov-stable neural-network control |
| Choi et al. | Reinforcement learning for safety-critical control under model uncertainty, using control Lyapunov functions and control barrier functions |
| Taylor et al. | Episodic learning with control Lyapunov functions for uncertain robotic systems |
| Bartolini et al. | Adaptive second-order sliding mode control with uncertainty compensation |
| Wang et al. | Passive separation approach to adaptive visual tracking for robotic systems |
| Rezazadeh et al. | Learning contraction policies from offline data |
| Peng et al. | Fuzzy adaptive output feedback control for robotic systems based on fuzzy adaptive observer |
| Joshi et al. | Adaptive control using Gaussian-process with model reference generative network |
| Rego et al. | Learning-based robust neuro-control: A method to compute control Lyapunov functions |
| Qu et al. | RL-driven MPPI: Accelerating online control laws calculation with offline policy |
| Morales et al. | LAMDA control approaches applied to trajectory tracking for mobile robots |
| Zhou et al. | Adaptive fuzzy control of uncertain robotic manipulator |
| Taylor et al. | A control Lyapunov perspective on episodic learning via projection to state stability |
| Zhang et al. | Safety-critical control for robotic systems with uncertain model via control barrier function |
| Rigatos | Adaptive fuzzy control for differentially flat MIMO nonlinear dynamical systems |
| Choi et al. | Constraint-guided online data selection for scalable data-driven safety filters in uncertain robotic systems |
| Pan et al. | Self-evolving fuzzy system based inverse dynamics learning control for nonlinear systems with uncertainties |
| Kouw | Information-seeking polynomial NARX model-predictive control through expected free energy minimization |
| Peng et al. | Distributed Consensus-Based Robust Adaptive Formation Control for Nonholonomic Mobile Robots with Partial Known Dynamics |
| Kolaric et al. | Local policy optimization for trajectory-centric reinforcement learning |
| Lopez et al. | Adaptive variants of optimal feedback policies |
| Kamalapurkar et al. | State following (StaF) kernel functions for function approximation part II: Adaptive dynamic programming |
| Lee et al. | A dynamic regret analysis and adaptive regularization algorithm for on-policy robot imitation learning |
| Zhang et al. | Learning-based parameterized barrier function for safety-critical control of unknown systems |
| Parsapour et al. | Recovery-matrix inverse optimal control for deterministic feedforward-feedback controllers |