Jia et al., 2021 - Google Patents

A coach-based bayesian reinforcement learning method for snake robot control

Jia et al., 2021

Document ID: 9194706796038913359
Author: Jia Y; Ma S
Publication year: 2021
Publication venue: IEEE Robotics and Automation Letters

External Links

Cited by

Snippet

Reinforcement Learning (RL) usually needs thousands of episodes, leading its applications on physical robots expensive and challenging. Little research has been reported about snake robot control using RL due to additional difficulty of high redundancy of freedom. We …

Continue reading at ieeexplore.ieee.org (other versions)

241000270295 Serpentes 0 title abstract description 41

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
- G06N3/0635—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means using analogue means
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models

Similar Documents

Publication	Publication Date	Title
Jia et al.	2021	A coach-based bayesian reinforcement learning method for snake robot control
Shahid et al.	2022	Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning
Sigaud et al.	2011	On-line regression algorithms for learning mechanical models of robots: a survey
Lim et al.	2020	Prediction of reward functions for deep reinforcement learning via Gaussian process regression
Wang et al.	2021	Hybrid trajectory and force learning of complex assembly tasks: A combined learning framework
Tanneberg et al.	2019	Intrinsic motivation and mental replay enable efficient online adaptation in stochastic recurrent networks
Wang et al.	2021	Sim2sim evaluation of a novel data-efficient differentiable physics engine for tensegrity robots
Tavassoli et al.	2023	Learning skills from demonstrations: A trend from motion primitives to experience abstraction
Batti et al.	2021	Autonomous smart robot for path predicting and finding in maze based on fuzzy and neuro‐fuzzy approaches
Lee et al.	2018	Safe end-to-end imitation learning for model predictive control
Lutter	2023	A differentiable newton–euler algorithm for real-world robotics
Umlauft et al.	2017	Bayesian uncertainty modeling for programming by demonstration
Poudel et al.	2022	Learning to control dc motor for micromobility in real time with reinforcement learning
Tanwani et al.	2018	Generalizing robot imitation learning with invariant hidden semi-Markov models
Schperberg et al.	2023	Real-to-sim: Predicting residual errors of robotic systems with sparse data using a learning-based unscented kalman filter
Al-Mahasneh et al.	2022	Novel general regression neural networks for improving control accuracy of nonlinear MIMO discrete-time systems
Surovik et al.	2021	Learning an expert skill-space for replanning dynamic quadruped locomotion over obstacles
Yang et al.	2022	Mpr-rl: Multi-prior regularized reinforcement learning for knowledge transfer
Verma et al.	2020	Deep reinforcement learning for single-shot diagnosis and adaptation in damaged robots
Tiboni et al.	2022	Online vs. offline adaptive domain randomization benchmark
Rzayev et al.	2022	Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Jia et al.	2022	Distributed Coach-Based Reinforcement Learning Controller for Snake Robot Locomotion
Schperberg et al.	2022	Real-to-sim: Deep learning with auto-tuning to predict residual errors using sparse data
Xiao et al.	2019	Learning locomotion skills via model-based proximal meta-reinforcement learning
Yu et al.	2020	Deep Q‐Network with Predictive State Models in Partially Observable Domains