Gregor et al., 2014 - Google Patents

Novelty detector for reinforcement learning based on forecasting

Gregor et al., 2014

Document ID: 17979173510913255563
Author: Gregor M; Spalek J
Publication year: 2014
Publication venue: 2014 IEEE 12th International Symposium on Applied Machine Intelligence and Informatics (SAMI)

External Links

Cited by

Snippet

The paper proposes a novelty detector based on an artificial neural network forecaster. It shows how such forecaster can be constructed and as a novelty detector. Two variations of the forecaster are presented-one is based on backpropagation, and the other on Rprop. It is …

Continue reading at ieeexplore.ieee.org (other versions)

230000013016 learning 0 title abstract description 52

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
- G06N3/0635—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means using analogue means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Padakandla et al.	2020	Reinforcement learning algorithm for non-stationary environments
Yang et al.	2018	Hierarchical deep reinforcement learning for continuous action control
dos Santos et al.	2014	Reactive search strategies using reinforcement learning, local search algorithms and variable neighborhood search
Rakitianskaia et al.	2012	Training feedforward neural networks with dynamic particle swarm optimisation
Kujanpää et al.	2023	Hierarchical imitation learning with vector quantized models
CN107967513B (en)	2019-02-15	Multirobot intensified learning collaboratively searching method and system
Bahle et al.	2016	Lifelong learning and collaboration of smart technical systems in open-ended environments--Opportunistic collaborative interactive learning
Hafez et al.	2015	Topological Q-learning with internally guided exploration for mobile robot navigation
Ngo et al.	2013	Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
Gregor et al.	2014	Novelty detector for reinforcement learning based on forecasting
Leventi-Peetz et al.	2021	Scope and sense of explainability for ai-systems
Othmani-Guibourg et al.	2019	LSTM Path-Maker: a new LSTM-based strategy for the multi-agent patrolling
Zhang et al.	2014	Clique-based cooperative multiagent reinforcement learning using factor graphs
Fernandez-Gauna et al.	2013	Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control
Sherstan	2020	Representation and general value functions
Pan et al.	2025	A Survey of Continual Reinforcement Learning
CN113779396B (en)	2023-09-01	Question recommending method and device, electronic equipment and storage medium
García et al.	2017	Incremental reinforcement learning for multi-objective robotic tasks
Cawalla et al.	2024	Graph Reinforcement Learning for Courses of Action Analysis
Waldock et al.	2016	Learning a robot controller using an adaptive hierarchical fuzzy rule-based system
Hwang et al.	2012	Induced states in a decision tree constructed by Q-learning
Raja	2025	Reinforcement learning in dynamic environments: challenges and future directions
Macedo et al.	2016	Genetic programming algorithms for dynamic environments
Osawa et al.	2016	An implementation of working memory using stacked half restricted Boltzmann machine: Toward to restricted Boltzmann machine-based cognitive architecture
Taheri Yeganeh et al.	2024	Active Inference Meeting Energy-Efficient Control of Parallel and Identical Machines