[go: up one dir, main page]

Taets et al., 2025 - Google Patents

Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical Systems

Taets et al., 2025

View PDF
Document ID
12900621473167151815
Author
Taets J
Lefebvre T
Ostyn F
Crevecoeur G
Publication year
Publication venue
IEEE Access

External Links

Snippet

Reinforcement Learning (RL) for underactuated mechanical systems presents unique challenges due to the limited control inputs and complex dynamics of the system. Efficient exploration poses a significant problem for two reasons: First, these systems must start each …
Continue reading at ieeexplore.ieee.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/18Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form
    • G05B19/19Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form characterised by positioning or contouring control systems, e.g. to control position from one programmed point to another or to control movement along a programmed continuous path
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/39Robotics, robotics to robotics hand
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run

Similar Documents

Publication Publication Date Title
EP3924884B1 (en) System and method for robust optimization for trajectory-centric model-based reinforcement learning
Carron et al. Data-driven model predictive control for trajectory tracking with a robotic arm
Johannsmeier et al. A framework for robot manipulation: Skill formalism, meta learning and adaptive control
Taylor et al. Episodic learning with control lyapunov functions for uncertain robotic systems
Spong An historical perspective on the control of robotic manipulators
Marcucci et al. Approximate hybrid model predictive control for multi-contact push recovery in complex environments
Tan Event-triggered distributed H∞ constrained control of physically interconnected large-scale partially unknown strict-feedback systems
Joseph et al. Reinforcement learning with misspecified model classes
Rey et al. Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies
Kurtz et al. Contact-implicit trajectory optimization with hydroelastic contact and ilqr
Haarnoja Acquiring diverse robot skills via maximum entropy deep reinforcement learning
US11392104B2 (en) System and method for feasibly positioning servomotors with unmodeled dynamics
Faust et al. Continuous action reinforcement learning for control-affine systems with unknown dynamics
Chen et al. Beyond inverted pendulums: Task-optimal simple models of legged locomotion
Lee et al. Online gain adaptation of whole-body control for legged robots with unknown disturbances
Ganai et al. Learning stabilization control from observations by learning lyapunov-like proxy models
Hazem et al. Reinforcement learning-based intelligent trajectory tracking for a 5-DOF Mitsubishi robotic arm: Comparative evaluation of DDPG, LC-DDPG, and TD3-ADX
Sangiovanni et al. Deep reinforcement learning based self-configuring integral sliding mode control scheme for robot manipulators
Surovik et al. Learning an expert skill-space for replanning dynamic quadruped locomotion over obstacles
Taets et al. Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical Systems
US12124230B2 (en) System and method for polytopic policy optimization for robust feedback control during learning
Kandhasamy et al. Scalable decentralized multi-robot trajectory optimization in continuous-time
Wang et al. Adaptive dynamic programming-based finite-time optimal backstepping force/position control of reconfigurable robot manipulators via Pareto optimal
Al Homsi Online generation of time-optimal trajectories for industrial robots in dynamic environments
Biswas et al. Training a legged robot to walk using machine learning and trajectory control for high positional accuracy