Lima et al., 1994 - Google Patents
Hierarchical reinforcement learning and decision making for Intelligent MachinesLima et al., 1994
View PDF- Document ID
- 17349039998069846003
- Author
- Lima P
- Saridis G
- Publication year
- Publication venue
- Proceedings of the 1994 IEEE International Conference on Robotics and Automation
External Links
Snippet
A methodology for performance improvement of intelligent machines based on hierarchical reinforcement learning is introduced. Machine decision making and learning are based on a cost function which includes reliability and a computational cost of algorithms at the three …
- 230000002787 reinforcement 0 title abstract description 10
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/45—Nc applications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/18—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form
- G05B19/19—Numerical control [NC], i.e. automatically operating machines, in particular machine tools, e.g. in a manufacturing environment, so as to execute positioning, movement or co-ordinated operations by means of programme data in numerical form characterised by positioning or contouring control systems, e.g. to control position from one programmed point to another or to control movement along a programmed continuous path
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/39—Robotics, robotics to robotics hand
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/418—Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B21/00—Systems involving sampling of the variable controlled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3924884B1 (en) | System and method for robust optimization for trajectory-centric model-based reinforcement learning | |
| Ribeiro | Reinforcement learning agents | |
| Chatzilygeroudis et al. | Using parameterized black-box priors to scale up model-based policy search for robotics | |
| Kaelbling et al. | Pre-image backchaining in belief space for mobile manipulation | |
| Rego et al. | Learning‐based robust neuro‐control: A method to compute control Lyapunov functions | |
| CN114137950A (en) | Method and equipment for carrying out social perception model predictive control on robot equipment | |
| Morales et al. | LAMDA control approaches applied to trajectory tracking for mobile robots | |
| Caarls et al. | Parallel online temporal difference learning for motor control | |
| Xie et al. | A hierarchical control and learning network for redundant manipulators with unknown physical parameters | |
| Pan et al. | Self-evolving fuzzy system based inverse dynamics learning control for nonlinear systems with uncertainties | |
| Liu et al. | Safe model-based control from signal temporal logic specifications using recurrent neural networks | |
| De la Sen et al. | Basic theoretical results for expert systems. Application to the supervision of adaptation transients in planar robots | |
| US12124230B2 (en) | System and method for polytopic policy optimization for robust feedback control during learning | |
| Lima et al. | Hierarchical reinforcement learning and decision making for Intelligent Machines | |
| Sanci et al. | A novel adaptive lssvr-based inverse optimal controller with integrator for nonlinear non-affine systems | |
| Deng et al. | Adaptive gait modeling and optimization for principally kinematic systems | |
| Reuter et al. | Genetic programming-based inverse kinematics for robotic manipulators | |
| Zhou et al. | An Online Dynamic Parameter Identification Approach for Robotic Manipulator with Reformulated Physical Feasibility | |
| Wang et al. | A learning-based tune-free control framework for large scale autonomous driving system deployment | |
| CN119871469B (en) | Mechanical arm control method and device, electronic equipment and storage medium | |
| Tamizi | Towards Generalizable Motion Planning: Efficient and Safe Learning-Based Frameworks | |
| Tsymbal et al. | Predicate-Based Model of Problem-Solving for Robotic Actions Planning. Mathematics 2021, 9, 3044 | |
| Zeng et al. | A Novel Uncalibrated Visual Servoing Controller Baesd on Model-Free Adaptive Control Method with Neural Network | |
| Ndlovu et al. | Deep Reinforcement Learning Based Dynamic Controller for Trajectory Tracking of Mobile Robots | |
| Yang et al. | A new approach of adaptive reinforcement learning control |