On learning with imperfect representations

Kalyanakrishnan et al., 2011

Document ID
926201761782441903
Author
Kalyanakrishnan S
Stone P
Publication year
2011
Publication venue
2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)

Snippet

In this paper we present a perspective on the relationship between learning and representation in sequential decision making tasks. We undertake a brief survey of existing real-world applications, which demonstrates that the classical “tabular” representation …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/12 Computer systems based on biological models using genetic models
    • G06N3/126 Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass
    • G06N99/005 Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design
    • G06F17/5009 Computer-aided design using simulation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computer systems utilising knowledge based models
    • G06N5/04 Inference methods or devices
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00 Program-control systems
    • G05B2219/30 Nc systems

Similar Documents

Publication Publication Date Title
Parker-Holder et al. Automated reinforcement learning (autorl): A survey and open problems
CN112668235B (en) Robot control method based on DDPG algorithm of offline model pre-training learning
Bloembergen et al. Evolutionary dynamics of multi-agent learning: A survey
Du et al. A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications
Xu et al. Learning to explore via meta-policy gradient
Boutilier Planning, learning and coordination in multiagent decision processes
CN109669452A (en) Cloud robot task scheduling method and system based on parallel reinforcement learning
Werbos Reinforcement learning and approximate dynamic programming (RLADP)—foundations, common misconceptions, and the challenges ahead
Smith et al. Traditional heuristic versus Hopfield neural network approaches to a car sequencing problem
EP3938960A1 (en) A bilevel method and system for designing multi-agent systems and simulators
Wang et al. On the convergence of the monte carlo exploring starts algorithm for reinforcement learning
Liotet et al. Learning a belief representation for delayed reinforcement learning
CN120103855A (en) Collaborative path planning method for heterogeneous multi-UAVs based on multi-agent deep reinforcement learning
Showalter et al. Neuromodulated multiobjective evolutionary neurocontrollers without speciation
Kwiatkowski et al. Understanding reinforcement learned crowds
CN116128028A (en) An Efficient Deep Reinforcement Learning Algorithm for Combinatorial Optimization of Continuous Decision Spaces
Vasant Hybrid mesh adaptive direct search genetic algorithms and line search approaches for fuzzy optimization problems in production planning
Kalyanakrishnan et al. On learning with imperfect representations
CN108830483A (en) Multi-agent system task planning method
Li et al. Introspective Reinforcement Learning and Learning from Demonstration.
Jasna et al. Application of game theory in path planning of multiple robots
Cavalieri Performance optimization of flexible manufacturing systems using artificial neural networks
Srinivasaiah et al. Reinforcement learning strategies using Monte-Carlo to solve the blackjack problem
Shi et al. Adaptive reinforcement q-learning algorithm for swarm-robot system using pheromone mechanism
Forbes et al. Real-time reinforcement learning in continuous domains