Zhu et al., 2019 - Google Patents

Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems

Document ID: 12662984623474725972
Author: Zhu X; Yuan X; Wang Y; Sun C
Publication year: 2019
Publication venue: 2019 Chinese Control Conference (CCC)

Snippet

In this paper, the consensus control of leader-follower multi-agent systems is investigated. To achieve the consensus of the discrete-time multi-agent systems, the data-driven iterative neighbor and target Q-learning algorithm is proposed. To implement the proposed method …
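
For orientation, leader-follower consensus in discrete time can be illustrated with a fixed-gain protocol built on each follower's local neighborhood tracking error. This is a minimal sketch and not the Q-learning algorithm of the paper, whose details are not given in the snippet. The scalar single-integrator dynamics, the chain topology, the gain value, and the names `local_errors`, `simulate`, `adjacency`, and `pinning` are all assumptions made for illustration.

```python
def local_errors(states, leader, adjacency, pinning):
    """Local neighborhood tracking error of each follower.

    e_i = sum_j a_ij * (x_i - x_j) + b_i * (x_i - x_leader)
    """
    n = len(states)
    errors = []
    for i in range(n):
        neighbor_term = sum(
            adjacency[i][j] * (states[i] - states[j]) for j in range(n)
        )
        leader_term = pinning[i] * (states[i] - leader)
        errors.append(neighbor_term + leader_term)
    return errors


def simulate(states, leader, adjacency, pinning, gain=0.3, steps=200):
    """Iterate x_i(k+1) = x_i(k) - gain * e_i(k) with a constant leader state.

    Assumed dynamics: scalar single integrators with a hand-picked gain.
    A learning-based method would adapt the control law from data instead.
    """
    states = list(states)
    for _ in range(steps):
        errors = local_errors(states, leader, adjacency, pinning)
        states = [x - gain * e for x, e in zip(states, errors)]
    return states


# Hypothetical three-follower chain; only follower 0 observes the leader.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
pinning = [1, 0, 0]
final_states = simulate(
    [5.0, -2.0, 0.5], leader=1.0, adjacency=adjacency, pinning=pinning
)
```

With this topology and gain, every follower state approaches the leader value of 1.0. The gain must be small relative to the graph's connectivity for the iteration to converge, which is the kind of hand tuning that a data-driven approach aims to avoid.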

Classifications

- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N3/00—Computer systems based on biological models
        - G06N3/02—Computer systems based on biological models using neural network models
          - G06N3/04—Architectures, e.g. interconnection topology
          - G06N3/08—Learning methods
            - G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
      - G06N5/00—Computer systems utilising knowledge based models
        - G06N5/04—Inference methods or devices
        - G06N5/02—Knowledge representation
          - G06N5/022—Knowledge engineering, knowledge acquisition
      - G06N99/00—Subject matter not provided for in other groups of this subclass
        - G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
      - G06N7/00—Computer systems based on specific mathematical models
  - G05—CONTROLLING; REGULATING
    - G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
      - G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
        - G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
          - G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
            - G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
          - G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
            - G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
      - G05B17/00—Systems involving the use of models or simulators of said systems
        - G05B17/02—Systems involving the use of models or simulators of said systems electric

Similar Documents

Publication	Publication Date	Title
Wen et al.	2019	Optimized formation control using simplified reinforcement learning for a class of multiagent systems with unknown dynamics
Ye et al.	2018	Distributed adaptive event-triggered fault-tolerant consensus of multiagent systems with general linear dynamics
Yang et al.	2018	Distributed $H_\infty$ state estimation over a filtering network with time-varying and switching topology and partial information exchange
Zhao et al.	2013	$H_\infty$ Consensus and Synchronization of Nonlinear Systems Based on A Novel Fuzzy Model
Yoo	2013	Distributed consensus tracking for multiple uncertain nonlinear strict-feedback systems under a directed graph
Zhan et al.	2018	Distributed model predictive consensus with self-triggered mechanism in general linear multiagent systems
Zhao et al.	2017	Adaptive neural consensus tracking for nonlinear multiagent systems using finite-time command filtered backstepping
CN113900380B (en)	2023-02-28	Robust output formation tracking control method and system for heterogeneous cluster system
Wang et al.	2020	Model-free reinforcement learning for fully cooperative consensus problem of nonlinear multiagent systems
Zhang et al.	2020	Sampled-data consensus of linear time-varying multiagent networks with time-varying topologies
Xie et al.	2021	Hybrid event-triggered approach for quasi-consensus of uncertain multi-agent systems with impulsive protocols
Liu et al.	2019	Quasi-synchronization of heterogeneous networks with a generalized Markovian topology and event-triggered communication
Liu et al.	2020	Reduced-order observer-based leader-following formation control for discrete-time linear multi-agent systems
Guo et al.	2023	Lyapunov-based output containment control of heterogeneous multi-agent systems with Markovian switching topologies and distributed delays
Wang et al.	2022	Distributed cooperative learning for discrete-time strict-feedback multi agent systems over directed graphs
Li et al.	2023	Dynamic target enclosing control scheme for multi-agent systems via a signed graph-based approach
Jiang et al.	2020	Dissipativity-based consensus tracking of singular multiagent systems with switching topologies and communication delays
Wang et al.	2019	Bipartite tracking consensus control of nonlinear high-order multi-agent systems subject to exogenous disturbances
Abdelatti et al.	2018	Cooperative deterministic learning control for a group of homogeneous nonlinear uncertain robot manipulators
Liu et al.	2024	Adaptive containment control of heterogeneous high-order fully actuated multi-agent systems
Xiang et al.	2023	Fixed-time bipartite output consensus of heterogeneous multi-agent systems with event-triggered observation
Wang et al.	2022	Suboptimal leader-to-coordination control for nonlinear systems with switching topologies: A learning-based method
Zhu et al.	2019	Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
Mahmoud et al.	2020	Consensus in multi-agent systems over time-varying networks
Zamani et al.	2014	Minimum-energy distributed filtering