A feasibility-driven approach to control-limited DDP
Mastalli et al., 2022
- Document ID
- 11354171532654064251
- Author
- Mastalli C
- Merkt W
- Marti-Saumell J
- Ferrolho H
- Solà J
- Mansard N
- Vijayakumar S
- Publication year
- 2022
- Publication venue
- Autonomous Robots
Snippet
Differential dynamic programming (DDP) is a direct single shooting method for trajectory optimization. Its efficiency derives from the exploitation of temporal structure (inherent to optimal control problems) and explicit roll-out/integration of the system dynamics. However …
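As a rough illustration of the "explicit roll-out/integration of the system dynamics" that the snippet mentions, the following minimal sketch integrates a system forward from an initial state under a fixed control sequence and accumulates the cost, which is the basic single-shooting evaluation. It is not the authors' algorithm: it omits DDP's backward pass and any control-limit handling. The function names, the double-integrator model, the cost terms, and the step size are all hypothetical choices made for this example.

```python
def rollout(x0, controls, step, stage_cost, terminal_cost):
    """Single-shooting roll-out: integrate the dynamics forward from x0
    under the given control sequence and accumulate the total cost."""
    states = [x0]
    total = 0.0
    for u in controls:
        x = states[-1]
        total += stage_cost(x, u)   # running cost at the current node
        states.append(step(x, u))   # explicit integration to the next node
    total += terminal_cost(states[-1])
    return states, total


# Hypothetical test system: 1-D double integrator, explicit Euler, DT = 0.1.
DT = 0.1


def step(x, u):
    pos, vel = x
    return (pos + DT * vel, vel + DT * u)


def stage_cost(x, u):
    # Penalize control effort only (assumed cost for illustration).
    return 0.5 * DT * u * u


def terminal_cost(x):
    # Penalize distance from the assumed goal: position 1.0 at rest.
    pos, vel = x
    return 0.5 * ((pos - 1.0) ** 2 + vel ** 2)


# Accelerate for five steps, then brake for five steps.
controls = [1.0] * 5 + [-1.0] * 5
states, cost = rollout((0.0, 0.0), controls, step, stage_cost, terminal_cost)
```

Because the states are produced by the roll-out rather than treated as free decision variables, the only unknowns are the controls. This is what makes the method a single-shooting one.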
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/048—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators using a predictor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/13—Differential equations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Mastalli et al. | A feasibility-driven approach to control-limited DDP | |
| Carius et al. | Trajectory optimization with implicit hard contacts | |
| Xie et al. | Glide: Generalizable quadrupedal locomotion in diverse environments with a centroidal model | |
| Shahid et al. | Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning | |
| Carius et al. | Constrained stochastic optimal control with learned importance sampling: A path integral approach | |
| Urbain et al. | Morphological properties of mass–spring networks for optimal locomotion learning | |
| Manchester et al. | Robust direct trajectory optimization using approximate invariant funnels | |
| Rey et al. | Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies | |
| Ting | Stability analysis and design of Takagi–Sugeno fuzzy systems | |
| Bhourji et al. | Reinforcement learning DDPG–PPO agent-based control system for rotary inverted pendulum | |
| Yang et al. | Online adaptive teleoperation via motion primitives for mobile robots | |
| Agboh et al. | Pushing fast and slow: Task-adaptive planning for non-prehensile manipulation under uncertainty | |
| Manzl et al. | Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity | |
| Lee et al. | Online gain adaptation of whole-body control for legged robots with unknown disturbances | |
| Lutter | A differentiable newton–euler algorithm for real-world robotics | |
| Ben Hazem | Study of Q-learning and deep Q-network learning control for a rotary inverted pendulum system | |
| Choi et al. | Constraint-guided online data selection for scalable data-driven safety filters in uncertain robotic systems | |
| Ben Hazem et al. | Model-free trajectory tracking control of a 5-DOF mitsubishi robotic arm using deep deterministic policy gradient algorithm | |
| Xiong et al. | Nonlinear control strategies for 3-DOF control moment gyroscope using deep reinforcement learning | |
| Arena et al. | A data-driven neural network model predictive steering controller for a bio-inspired quadruped robot | |
| Liu et al. | Dynamic delayed feedback control for stabilizing the giant swing motions of an underactuated three-link gymnastic robot | |
| Ji et al. | Robust walking and sim-to-real optimization for quadruped robots via reinforcement learning | |
| Sun et al. | A trajectory tracking method based on robust model predictive control for a bionic ankle–foot aided by a tensegrity mechanism | |
| Gazar et al. | Nonlinear stochastic trajectory optimization for centroidal momentum motion generation of legged robots | |
| Pan et al. | Time integrating articulated body dynamics using position-based collocation methods |