Švaco et al., 2018 - Google Patents
A reinforcement learning based algorithm for robot action planningŠvaco et al., 2018
View PDF- Document ID
- 6866558472718171591
- Author
- Švaco M
- Jerbić B
- Polančec M
- Šuligoj F
- Publication year
- Publication venue
- International Conference on Robotics in Alpe-Adria Danube Region
External Links
Snippet
The learning process that arises in response to the visual perception of the environment is the starting point for numerous research in the field of applied and cognitive robotics. In this research, we propose a reinforcement learning based action planning algorithm for the …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Wang et al. | A survey for machine learning-based control of continuum robots | |
| Švaco et al. | A reinforcement learning based algorithm for robot action planning | |
| Bhourji et al. | Reinforcement learning DDPG–PPO agent-based control system for rotary inverted pendulum | |
| Han et al. | Stable learning-based tracking control of underactuated balance robots | |
| Wang et al. | Intent inference in shared-control teleoperation system in consideration of user behavior | |
| Ramamurthy et al. | Leveraging domain knowledge for reinforcement learning using MMC architectures | |
| Mronga et al. | A constraint-based approach for human–robot collision avoidance | |
| Zhu et al. | Fuzzy adaptive model predictive control for image-based visual servoing of robot manipulators with kinematic constraints | |
| Afzali et al. | A modified convergence DDPG algorithm for robotic manipulation | |
| Das et al. | A modified real time A* algorithm and its performance analysis for improved path planning of mobile robot | |
| Benrabah et al. | Constrained nonlinear predictive control using neural networks and teaching–learning-based optimization | |
| Das Sharma et al. | Harmony search-based hybrid stable adaptive fuzzy tracking controllers for vision-based mobile robot navigation | |
| Deng et al. | Learning visual-based deformable object rearrangement with local graph neural networks | |
| Xia et al. | Learning sampling distribution for motion planning with local reconstruction-based self-organizing incremental neural network | |
| Waga et al. | A new method for mobile robots to learn an optimal policy from an expert using deep imitation learning | |
| Cao et al. | Shape control of elastic deformable linear objects for robotic cable assembly | |
| Geng et al. | Reinforcement extreme learning machine for mobile robot navigation | |
| Precup et al. | Nature-inspired optimization algorithms for path planning and fuzzy tracking control of mobile robots | |
| Zhang et al. | Deep q-learning with explainable and transferable domain rules | |
| Xu et al. | Discounted sampling policy gradient for robot multi-objective visual control | |
| Ding et al. | Research on manipulator motion planning for complex systems based on deep learning | |
| Li et al. | Robot navigation in crowds environment base deep reinforcement learning with POMDP | |
| Tsai et al. | Advances and challenges on intelligent learning in control systems | |
| Gorodetskiy et al. | Model-based policy optimization with neural differential equations for robotic arm control | |
| Wang et al. | Transformer-based path planning for single-arm and dual-arm robots in dynamic environments |