Taets et al., 2025 - Google Patents
Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical SystemsTaets et al., 2025
View PDF- Document ID
- 12900621473167151815
- Author
- Taets J
- Lefebvre T
- Ostyn F
- Crevecoeur G
- Publication year
- Publication venue
- IEEE Access
External Links
Snippet
Reinforcement Learning (RL) for underactuated mechanical systems presents unique challenges due to the limited control inputs and complex dynamics of the system. Efficient exploration poses a significant problem for two reasons: First, these systems must start each …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/18—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form
- G05B19/19—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form characterised by positioning or contouring control systems, e.g. to control position from one programmed point to another or to control movement along a programmed continuous path
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/39—Robotics, robotics to robotics hand
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3924884B1 (en) | System and method for robust optimization for trajectory-centric model-based reinforcement learning | |
| Carron et al. | Data-driven model predictive control for trajectory tracking with a robotic arm | |
| Johannsmeier et al. | A framework for robot manipulation: Skill formalism, meta learning and adaptive control | |
| Taylor et al. | Episodic learning with control lyapunov functions for uncertain robotic systems | |
| Spong | An historical perspective on the control of robotic manipulators | |
| Marcucci et al. | Approximate hybrid model predictive control for multi-contact push recovery in complex environments | |
| Tan | Event-triggered distributed H∞ constrained control of physically interconnected large-scale partially unknown strict-feedback systems | |
| Joseph et al. | Reinforcement learning with misspecified model classes | |
| Rey et al. | Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies | |
| Kurtz et al. | Contact-implicit trajectory optimization with hydroelastic contact and ilqr | |
| Haarnoja | Acquiring diverse robot skills via maximum entropy deep reinforcement learning | |
| US11392104B2 (en) | System and method for feasibly positioning servomotors with unmodeled dynamics | |
| Faust et al. | Continuous action reinforcement learning for control-affine systems with unknown dynamics | |
| Chen et al. | Beyond inverted pendulums: Task-optimal simple models of legged locomotion | |
| Lee et al. | Online gain adaptation of whole-body control for legged robots with unknown disturbances | |
| Ganai et al. | Learning stabilization control from observations by learning lyapunov-like proxy models | |
| Hazem et al. | Reinforcement learning-based intelligent trajectory tracking for a 5-DOF Mitsubishi robotic arm: Comparative evaluation of DDPG, LC-DDPG, and TD3-ADX | |
| Sangiovanni et al. | Deep reinforcement learning based self-configuring integral sliding mode control scheme for robot manipulators | |
| Surovik et al. | Learning an expert skill-space for replanning dynamic quadruped locomotion over obstacles | |
| Taets et al. | Energy-Based Exploration for Reinforcement Learning of Underactuated Mechanical Systems | |
| US12124230B2 (en) | System and method for polytopic policy optimization for robust feedback control during learning | |
| Kandhasamy et al. | Scalable decentralized multi-robot trajectory optimization in continuous-time | |
| Wang et al. | Adaptive dynamic programming-based finite-time optimal backstepping force/position control of reconfigurable robot manipulators via Pareto optimal | |
| Al Homsi | Online generation of time-optimal trajectories for industrial robots in dynamic environments | |
| Biswas et al. | Training a legged robot to walk using machine learning and trajectory control for high positional accuracy |