Jia et al., 2021 - Google Patents
A coach-based bayesian reinforcement learning method for snake robot controlJia et al., 2021
- Document ID
- 9194706796038913359
- Author
- Jia Y
- Ma S
- Publication year
- Publication venue
- IEEE Robotics and Automation Letters
External Links
Snippet
Reinforcement Learning (RL) usually needs thousands of episodes, leading its applications on physical robots expensive and challenging. Little research has been reported about snake robot control using RL due to additional difficulty of high redundancy of freedom. We …
- 241000270295 Serpentes 0 title abstract description 41
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
- G06N3/0635—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means using analogue means
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Jia et al. | A coach-based bayesian reinforcement learning method for snake robot control | |
| Shahid et al. | Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning | |
| Sigaud et al. | On-line regression algorithms for learning mechanical models of robots: a survey | |
| Lim et al. | Prediction of reward functions for deep reinforcement learning via Gaussian process regression | |
| Wang et al. | Hybrid trajectory and force learning of complex assembly tasks: A combined learning framework | |
| Tanneberg et al. | Intrinsic motivation and mental replay enable efficient online adaptation in stochastic recurrent networks | |
| Wang et al. | Sim2sim evaluation of a novel data-efficient differentiable physics engine for tensegrity robots | |
| Tavassoli et al. | Learning skills from demonstrations: A trend from motion primitives to experience abstraction | |
| Batti et al. | Autonomous smart robot for path predicting and finding in maze based on fuzzy and neuro‐fuzzy approaches | |
| Lee et al. | Safe end-to-end imitation learning for model predictive control | |
| Lutter | A differentiable newton–euler algorithm for real-world robotics | |
| Umlauft et al. | Bayesian uncertainty modeling for programming by demonstration | |
| Poudel et al. | Learning to control dc motor for micromobility in real time with reinforcement learning | |
| Tanwani et al. | Generalizing robot imitation learning with invariant hidden semi-Markov models | |
| Schperberg et al. | Real-to-sim: Predicting residual errors of robotic systems with sparse data using a learning-based unscented kalman filter | |
| Al-Mahasneh et al. | Novel general regression neural networks for improving control accuracy of nonlinear MIMO discrete-time systems | |
| Surovik et al. | Learning an expert skill-space for replanning dynamic quadruped locomotion over obstacles | |
| Yang et al. | Mpr-rl: Multi-prior regularized reinforcement learning for knowledge transfer | |
| Verma et al. | Deep reinforcement learning for single-shot diagnosis and adaptation in damaged robots | |
| Tiboni et al. | Online vs. offline adaptive domain randomization benchmark | |
| Rzayev et al. | Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | |
| Jia et al. | Distributed Coach-Based Reinforcement Learning Controller for Snake Robot Locomotion | |
| Schperberg et al. | Real-to-sim: Deep learning with auto-tuning to predict residual errors using sparse data | |
| Xiao et al. | Learning locomotion skills via model-based proximal meta-reinforcement learning | |
| Yu et al. | Deep Q‐Network with Predictive State Models in Partially Observable Domains |