[go: up one dir, main page]

Zhu et al., 2019 - Google Patents

Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems

Zhu et al., 2019

Document ID
12662984623474725972
Author
Zhu X
Yuan X
Wang Y
Sun C
Publication year
Publication venue
2019 Chinese Control Conference (CCC)

External Links

Snippet

In this paper, the consensus control of leader-follower multi-agent systems is investigated. To achieve the consensus of the discrete-time multi-agent systems, the data-driven iterative neighbor and target Q-learning algorithm is proposed. To implement the proposed method …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models

Similar Documents

Publication Publication Date Title
Wen et al. Optimized formation control using simplified reinforcement learning for a class of multiagent systems with unknown dynamics
Ye et al. Distributed adaptive event-triggered fault-tolerant consensus of multiagent systems with general linear dynamics
Yang et al. Distributed $ H_\infty $ state estimation over a filtering network with time-varying and switching topology and partial information exchange
Zhao et al. $ H_ {\infty} $ Consensus and Synchronization of Nonlinear Systems Based on A Novel Fuzzy Model
Yoo Distributed consensus tracking for multiple uncertain nonlinear strict-feedback systems under a directed graph
Zhan et al. Distributed model predictive consensus with self-triggered mechanism in general linear multiagent systems
Zhao et al. Adaptive neural consensus tracking for nonlinear multiagent systems using finite-time command filtered backstepping
CN113900380B (en) Robust output formation tracking control method and system for heterogeneous cluster system
Wang et al. Model-free reinforcement learning for fully cooperative consensus problem of nonlinear multiagent systems
Zhang et al. Sampled-data consensus of linear time-varying multiagent networks with time-varying topologies
Xie et al. Hybrid event-triggered approach for quasi-consensus of uncertain multi-agent systems with impulsive protocols
Liu et al. Quasi-synchronization of heterogeneous networks with a generalized Markovian topology and event-triggered communication
Liu et al. Reduced-order observer-based leader-following formation control for discrete-time linear multi-agent systems
Guo et al. Lyapunov-based output containment control of heterogeneous multi-agent systems with Markovian switching topologies and distributed delays
Wang et al. Distributed cooperative learning for discrete-time strict-feedback multi agent systems over directed graphs
Li et al. Dynamic target enclosing control scheme for multi-agent systems via a signed graph-based approach
Jiang et al. Dissipativity-based consensus tracking of singular multiagent systems with switching topologies and communication delays
Wang et al. Bipartite tracking consensus control of nonlinear high-order multi-agent systems subject to exogenous disturbances
Abdelatti et al. Cooperative deterministic learning control for a group of homogeneous nonlinear uncertain robot manipulators
Liu et al. Adaptive containment control of heterogeneous high‐order fully actuated multi‐agent systems
Xiang et al. Fixed-time bipartite output consensus of heterogeneous multi-agent systems with event-triggered observation
Wang et al. Suboptimal leader-to-coordination control for nonlinear systems with switching topologies: A learning-based method
Zhu et al. Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
Mahmoud et al. Consensus in multi-agent systems over time-varying networks
Zamani et al. Minimum-energy distributed filtering