A unified perspective on multiple shooting in differential dynamic programming
Li et al., 2023
- Document ID
- 17230557539020903603
- Author
- Li H
- Yu W
- Zhang T
- Wensing P
- Publication year
- 2023
- Publication venue
- 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Snippet
Differential Dynamic Programming (DDP) is an efficient computational tool for solving nonlinear optimal control problems. It was originally designed as a single shooting method and thus is sensitive to the initial guess supplied. This work considers the extension of DDP …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Li et al. | A unified perspective on multiple shooting in differential dynamic programming | |
| Taylor et al. | Episodic learning with control lyapunov functions for uncertain robotic systems | |
| Tsounis et al. | Deepgait: Planning and control of quadrupedal gaits using deep reinforcement learning | |
| Farshidian et al. | An efficient optimal planning and control framework for quadrupedal locomotion | |
| Bhardwaj et al. | Differentiable gaussian process motion planning | |
| Ribeiro | Reinforcement learning agents | |
| Lioutikov et al. | Sample-based information-theoretic stochastic optimal control | |
| Levine et al. | Learning complex neural network policies with trajectory optimization | |
| US20210178600A1 (en) | System and Method for Robust Optimization for Trajectory-Centric Model-Based Reinforcement Learning | |
| CN106094813B (en) | Humanoid robot gait's control method based on model correlation intensified learning | |
| Hasan et al. | Model-based fault diagnosis algorithms for robotic systems | |
| El Kazdadi et al. | Equality constrained differential dynamic programming | |
| Han et al. | Robust learning-based control for uncertain nonlinear systems with validation on a soft robot | |
| Oshin et al. | Differentiable robust model predictive control | |
| Dawson et al. | A Bayesian approach to breaking things: Efficiently predicting and repairing failure modes via sampling | |
| Ribeiro | A tutorial on reinforcement learning techniques | |
| Choi et al. | Constraint-guided online data selection for scalable data-driven safety filters in uncertain robotic systems | |
| Frey et al. | Advanced-step real-time iterations with four levels–new error bounds and fast implementation in acados | |
| Kolaric et al. | Local policy optimization for trajectory-centric reinforcement learning | |
| Cao et al. | A differential dynamic programming framework for inverse reinforcement learning | |
| Valadas et al. | Learning low-dimensional strain models of soft robots by looking at the evolution of their shape with application to model-based control | |
| Desaraju et al. | Fast nonlinear model predictive control via partial enumeration | |
| Bahadorian et al. | Robust time-varying model predictive control with application to mobile robot unmanned path tracking | |
| Di Vito et al. | Learning to solve differential equation constrained optimization problems | |
| Ménager et al. | Contact-implicit inverse dynamics |