[go: up one dir, main page]

Mastalli et al., 2022 - Google Patents

A feasibility-driven approach to control-limited DDP

Mastalli et al., 2022

View HTML
Document ID
11354171532654064251
Author
Mastalli C
Merkt W
Marti-Saumell J
Ferrolho H
Solà J
Mansard N
Vijayakumar S
Publication year
Publication venue
Autonomous Robots

External Links

Snippet

Differential dynamic programming (DDP) is a direct single shooting method for trajectory optimization. Its efficiency derives from the exploitation of temporal structure (inherent to optimal control problems) and explicit roll-out/integration of the system dynamics. However …
Continue reading at link.springer.com (HTML) (other versions)

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/048Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators using a predictor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • G06F17/5009Computer-aided design using simulation
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/13Differential equations
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models

Similar Documents

Publication Publication Date Title
Mastalli et al. A feasibility-driven approach to control-limited DDP
Carius et al. Trajectory optimization with implicit hard contacts
Xie et al. Glide: Generalizable quadrupedal locomotion in diverse environments with a centroidal model
Shahid et al. Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning
Carius et al. Constrained stochastic optimal control with learned importance sampling: A path integral approach
Urbain et al. Morphological properties of mass–spring networks for optimal locomotion learning
Manchester et al. Robust direct trajectory optimization using approximate invariant funnels
Rey et al. Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies
Ting Stability analysis and design of Takagi–Sugeno fuzzy systems
Bhourji et al. Reinforcement learning DDPG–PPO agent-based control system for rotary inverted pendulum
Yang et al. Online adaptive teleoperation via motion primitives for mobile robots
Agboh et al. Pushing fast and slow: Task-adaptive planning for non-prehensile manipulation under uncertainty
Manzl et al. Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity
Lee et al. Online gain adaptation of whole-body control for legged robots with unknown disturbances
Lutter A differentiable newton–euler algorithm for real-world robotics
Ben Hazem Study of Q-learning and deep Q-network learning control for a rotary inverted pendulum system
Choi et al. Constraint-guided online data selection for scalable data-driven safety filters in uncertain robotic systems
Ben Hazem et al. Model-free trajectory tracking control of a 5-DOF mitsubishi robotic arm using deep deterministic policy gradient algorithm
Xiong et al. Nonlinear control strategies for 3-DOF control moment gyroscope using deep reinforcement learning
Arena et al. A data-driven neural network model predictive steering controller for a bio-inspired quadruped robot
Liu et al. Dynamic delayed feedback control for stabilizing the giant swing motions of an underactuated three-link gymnastic robot
Ji et al. Robust walking and sim-to-real optimization for quadruped robots via reinforcement learning
Sun et al. A trajectory tracking method based on robust model predictive control for a bionic ankle–foot aided by a tensegrity mechanism
Gazar et al. Nonlinear stochastic trajectory optimization for centroidal momentum motion generation of legged robots
Pan et al. Time integrating articulated body dynamics using position-based collocation methods