[go: up one dir, main page]

Moezzi, 2024 - Google Patents

Towards Sample-Efficient Reinforcement Learning Methods for Robotic Manipulation Tasks

Moezzi, 2024

Document ID
10941318954184863966
Author
Moezzi M
Publication year

External Links

Snippet

Reinforcement learning (RL) provides a framework for solving complex tasks without pre- existing knowledge of the systems involved. These algorithms hold significant promise for control and robotics, areas that have historically depended on precise system dynamics and …
Continue reading at search.proquest.com (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00Digital computing or data processing equipment or methods, specially adapted for specific applications
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems

Similar Documents

Publication Publication Date Title
Lanillos et al. Active inference in robotics and artificial agents: Survey and challenges
Morales et al. A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
US11714996B2 (en) Learning motor primitives and training a machine learning system using a linear-feedback-stabilized policy
Li et al. Prodmp: A unified perspective on dynamic and probabilistic movement primitives
Shahid et al. Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning
Bekey et al. Neural networks in robotics
Amarjyoti Deep reinforcement learning for robotic manipulation-the state of the art
Schaal et al. Learning control in robotics
Hanna et al. Grounded action transformation for sim-to-real reinforcement learning
Tavassoli et al. Learning skills from demonstrations: A trend from motion primitives to experience abstraction
Kargin et al. A reinforcement learning approach for continuum robot control
Levine Motor skill learning with local trajectory methods
Gawali et al. Development of improved coyote optimization with deep neural network for intelligent skill knowledge transfer for human to robot interaction
Rohrer BECCA: Reintegrating AI for Natural World Interaction.
Bonsignorio et al. An imitation learning approach for the control of a low-cost low-accuracy robotic arm for unstructured environments
Moezzi Towards Sample-Efficient Reinforcement Learning Methods for Robotic Manipulation Tasks
Learning Tactile grasp refinement using deep reinforcement learning and analytic grasp stability metrics
Zhang et al. A Review on Robot Manipulation Methods in Human-Robot Interactions
Grimes et al. Learning nonparametric policies by imitation
Navez Contributions to the Concept of Embodied Intelligence in Soft Robotics through Control and Design Co-Optimization
Chen et al. Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions
Li Reinforcement learning-based motion planning in partially observable environments for complex tasks
Zheng Data-Driven Robotic Manipulation of Deformable Objects Using Tactile Feedback: From Model-Free to Model-Based Approaches
Weideman Robot navigation in cluttered environments with deep reinforcement learning
Gillen Improving Reinforcement Learning for Robotics with Control and Dynamical Systems Theory