A survey on intrinsic motivation in reinforcement learning
Aubret et al., 2019 - Google Patents
- Document ID: 3754803781149163337
- Authors: Aubret A, Matignon L, Hassas S
- Publication year: 2019
- Publication venue: arXiv preprint arXiv:1908.06976
Snippet
The reinforcement learning (RL) research area is very active, with an important number of new contributions; especially considering the emergent field of deep RL (DRL). However, a number of scientific and technical challenges still need to be addressed, amongst which we …
Classifications
- G06N5/04—Inference methods or devices
- G06N3/04—Architectures, e.g. interconnection topology
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N7/005—Probabilistic networks
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6279—Classification techniques relating to the number of classes
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric, the criterion being a learning criterion using neural networks only
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
Similar Documents
| Publication | Title |
|---|---|
| Aubret et al. | A survey on intrinsic motivation in reinforcement learning |
| Liu et al. | LIBERO: Benchmarking knowledge transfer for lifelong robot learning |
| Naeem et al. | A gentle introduction to reinforcement learning and its application in different fields |
| Hao et al. | Exploration in deep reinforcement learning: From single-agent to multiagent domain |
| Ramakrishnan et al. | An exploration of embodied visual exploration |
| Puig et al. | Watch-And-Help: A challenge for social perception and human-AI collaboration |
| Bai et al. | Evolutionary reinforcement learning: A survey |
| Farquhar et al. | TreeQN and ATreeC: Differentiable tree-structured models for deep reinforcement learning |
| Das et al. | Neural modular control for embodied question answering |
| Fang et al. | Adaptive procedural task generation for hard-exploration problems |
| Azar et al. | World discovery models |
| Hu et al. | Heterogeneous crowd simulation using parametric reinforcement learning |
| Cobo et al. | Automatic task decomposition and state abstraction from demonstration |
| Mitsopoulos et al. | Toward a psychology of deep reinforcement learning agents using a cognitive architecture |
| Mediratta et al. | The generalization gap in offline reinforcement learning |
| Doncieux et al. | DREAM architecture: a developmental approach to open-ended learning in robotics |
| Núñez-Molina et al. | A review of symbolic, subsymbolic and hybrid methods for sequential decision making |
| Cao et al. | Enhancing human-AI collaboration through logic-guided reasoning |
| Sarkar et al. | QKSA: Quantum Knowledge Seeking Agent--resource-optimized reinforcement learning using quantum process tomography |
| Taioli et al. | Unsupervised active visual search with Monte Carlo planning under uncertain detections |
| Ma et al. | Exploiting bias for cooperative planning in multi-agent tree search |
| Ishida | Spatial reasoning and planning for deep embodied agents |
| Grefenstette | Learning decision strategies with genetic algorithms |
| Oh et al. | View-action representation learning for active first-person vision |
| Veeriah | Discovery in Reinforcement Learning |