Taets et al., 2025 - Google Patents

Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical Systems

Taets et al., 2025

Document ID: 12900621473167151815
Author: Taets J; Lefebvre T; Ostyn F; Crevecoeur G
Publication year: 2025
Publication venue: IEEE Access

External Links

Cited by

Snippet

Reinforcement Learning (RL) for underactuated mechanical systems presents unique challenges due to the limited control inputs and complex dynamics of the system. Efficient exploration poses a significant problem for two reasons: First, these systems must start each …

Continue reading at ieeexplore.ieee.org (PDF) (other versions)

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/18—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form
- G05B19/19—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form characterised by positioning or contouring control systems, e.g. to control position from one programmed point to another or to control movement along a programmed continuous path
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/39—Robotics, robotics to robotics hand
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run

Similar Documents

Publication	Publication Date	Title
EP3924884B1 (en)	2023-08-30	System and method for robust optimization for trajectory-centric model-based reinforcement learning
Carron et al.	2019	Data-driven model predictive control for trajectory tracking with a robotic arm
Johannsmeier et al.	2019	A framework for robot manipulation: Skill formalism, meta learning and adaptive control
Taylor et al.	2019	Episodic learning with control lyapunov functions for uncertain robotic systems
Spong	2022	An historical perspective on the control of robotic manipulators
Marcucci et al.	2017	Approximate hybrid model predictive control for multi-contact push recovery in complex environments
Tan	2019	Event-triggered distributed H∞ constrained control of physically interconnected large-scale partially unknown strict-feedback systems
Joseph et al.	2013	Reinforcement learning with misspecified model classes
Rey et al.	2018	Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies
Kurtz et al.	2022	Contact-implicit trajectory optimization with hydroelastic contact and ilqr
Haarnoja	2018	Acquiring diverse robot skills via maximum entropy deep reinforcement learning
US11392104B2 (en)	2022-07-19	System and method for feasibly positioning servomotors with unmodeled dynamics
Faust et al.	2014	Continuous action reinforcement learning for control-affine systems with unknown dynamics
Chen et al.	2024	Beyond inverted pendulums: Task-optimal simple models of legged locomotion
Lee et al.	2022	Online gain adaptation of whole-body control for legged robots with unknown disturbances
Ganai et al.	2023	Learning stabilization control from observations by learning lyapunov-like proxy models
Hazem et al.	2025	Reinforcement learning-based intelligent trajectory tracking for a 5-DOF Mitsubishi robotic arm: Comparative evaluation of DDPG, LC-DDPG, and TD3-ADX
Sangiovanni et al.	2018	Deep reinforcement learning based self-configuring integral sliding mode control scheme for robot manipulators
Surovik et al.	2021	Learning an expert skill-space for replanning dynamic quadruped locomotion over obstacles
Taets et al.	2025	Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical Systems
US12124230B2 (en)	2024-10-22	System and method for polytopic policy optimization for robust feedback control during learning
Kandhasamy et al.	2020	Scalable decentralized multi-robot trajectory optimization in continuous-time
Wang et al.	2025	Adaptive dynamic programming-based finite-time optimal backstepping force/position control of reconfigurable robot manipulators via Pareto optimal
Al Homsi	2016	Online generation of time-optimal trajectories for industrial robots in dynamic environments
Biswas et al.	2023	Training a legged robot to walk using machine learning and trajectory control for high positional accuracy