Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
Zhu et al., 2019
- Document ID
- 12662984623474725972
- Author
- Zhu X
- Yuan X
- Wang Y
- Sun C
- Publication year
- 2019
- Publication venue
- 2019 Chinese Control Conference (CCC)
Snippet
In this paper, the consensus control of leader-follower multi-agent systems is investigated. To achieve the consensus of the discrete-time multi-agent systems, the data-driven iterative neighbor and target Q-learning algorithm is proposed. To implement the proposed method …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Wen et al. | Optimized formation control using simplified reinforcement learning for a class of multiagent systems with unknown dynamics | |
| Ye et al. | Distributed adaptive event-triggered fault-tolerant consensus of multiagent systems with general linear dynamics | |
| Yang et al. | Distributed $H_\infty$ state estimation over a filtering network with time-varying and switching topology and partial information exchange | |
| Zhao et al. | $H_\infty$ Consensus and Synchronization of Nonlinear Systems Based on A Novel Fuzzy Model | |
| Yoo | Distributed consensus tracking for multiple uncertain nonlinear strict-feedback systems under a directed graph | |
| Zhan et al. | Distributed model predictive consensus with self-triggered mechanism in general linear multiagent systems | |
| Zhao et al. | Adaptive neural consensus tracking for nonlinear multiagent systems using finite-time command filtered backstepping | |
| CN113900380B (en) | Robust output formation tracking control method and system for heterogeneous cluster system | |
| Wang et al. | Model-free reinforcement learning for fully cooperative consensus problem of nonlinear multiagent systems | |
| Zhang et al. | Sampled-data consensus of linear time-varying multiagent networks with time-varying topologies | |
| Xie et al. | Hybrid event-triggered approach for quasi-consensus of uncertain multi-agent systems with impulsive protocols | |
| Liu et al. | Quasi-synchronization of heterogeneous networks with a generalized Markovian topology and event-triggered communication | |
| Liu et al. | Reduced-order observer-based leader-following formation control for discrete-time linear multi-agent systems | |
| Guo et al. | Lyapunov-based output containment control of heterogeneous multi-agent systems with Markovian switching topologies and distributed delays | |
| Wang et al. | Distributed cooperative learning for discrete-time strict-feedback multi agent systems over directed graphs | |
| Li et al. | Dynamic target enclosing control scheme for multi-agent systems via a signed graph-based approach | |
| Jiang et al. | Dissipativity-based consensus tracking of singular multiagent systems with switching topologies and communication delays | |
| Wang et al. | Bipartite tracking consensus control of nonlinear high-order multi-agent systems subject to exogenous disturbances | |
| Abdelatti et al. | Cooperative deterministic learning control for a group of homogeneous nonlinear uncertain robot manipulators | |
| Liu et al. | Adaptive containment control of heterogeneous high‐order fully actuated multi‐agent systems | |
| Xiang et al. | Fixed-time bipartite output consensus of heterogeneous multi-agent systems with event-triggered observation | |
| Wang et al. | Suboptimal leader-to-coordination control for nonlinear systems with switching topologies: A learning-based method | |
| Zhu et al. | Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems | |
| Mahmoud et al. | Consensus in multi-agent systems over time-varying networks | |
| Zamani et al. | Minimum-energy distributed filtering |