[go: up one dir, main page]

CN116974185A - Multi-agent binary consistency control method, device, equipment and storage medium - Google Patents

Multi-agent binary consistency control method, device, equipment and storage medium Download PDF

Info

Publication number
CN116974185A
CN116974185A CN202310856203.XA CN202310856203A CN116974185A CN 116974185 A CN116974185 A CN 116974185A CN 202310856203 A CN202310856203 A CN 202310856203A CN 116974185 A CN116974185 A CN 116974185A
Authority
CN
China
Prior art keywords
agent
consistency
relationship
communication
binary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310856203.XA
Other languages
Chinese (zh)
Inventor
赵杰梅
刘明明
陈彬彬
谌毅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Polytechnic University
Original Assignee
Wuhan Polytechnic University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Polytechnic University filed Critical Wuhan Polytechnic University
Priority to CN202310856203.XA priority Critical patent/CN116974185A/en
Publication of CN116974185A publication Critical patent/CN116974185A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0205Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system
    • G05B13/024Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/15Correlation function computation including computation of convolution operations
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Mathematical Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Computational Mathematics (AREA)
  • Software Systems (AREA)
  • Algebra (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Computer And Data Communications (AREA)

Abstract

The application belongs to the technical field of multi-agent system control, and discloses a multi-agent binary consistency control method, a device, equipment and a storage medium. The application obtains a symbol diagram according to the communication relation by determining the communication relation among the intelligent agents in the multi-intelligent agent system, determines the cooperative countermeasure relation among the intelligent agents according to the symbol diagram, obtains the communication state of the intelligent agents according to the cooperative countermeasure relation, determines the bipartite consistency judging condition according to the communication state of the intelligent agents, determines the bipartite consistency of the multi-intelligent agent according to the communication state of the intelligent agents and the bipartite consistency judging condition, and eliminates the time lag and external interference influence of the intelligent agents by determining the cooperative countermeasure relation among the intelligent agents so as to achieve the bipartite consistency tracking control of the multi-intelligent agent system.

Description

多智能体二分一致性的控制方法、装置、设备及存储介质Multi-agent bipartite consistency control method, device, equipment and storage medium

技术领域Technical field

本发明涉及多智能体系统控制技术领域,尤其涉及一种多智能体二分一致性的控制方法、装置、设备及存储介质。The present invention relates to the technical field of multi-agent system control, and in particular to a multi-agent bipartite consistency control method, device, equipment and storage medium.

背景技术Background technique

多智能体系统作为分布式人工智能的重要分支,在移动机器人编队控制,卫星编队系统,生物系统,社交系统等领域受到了广泛的研究和关注。一致性作为协同控制的基础,是当前多智能体系统研究的核心问题之一,在大多数一致性问题的研究中,都只注重协作交互,即使用正权重表示系统的通信关系。多智能体之间往往存在着竞争关系,通过将智能体进行分组,各组中智能体之间依据同组或异组分别采用合作或竞争的方式可以完成不同的任务,但在系统的实际运行过程中,各智能体难免受到时滞和某些未知的不确定性干扰,从而降低到系统运行的高效性与稳定性。As an important branch of distributed artificial intelligence, multi-agent systems have received extensive research and attention in fields such as mobile robot formation control, satellite formation systems, biological systems, and social systems. Consistency, as the basis of collaborative control, is one of the core issues in current research on multi-agent systems. Most studies on consistency issues only focus on collaborative interaction, that is, using positive weights to represent the communication relationship of the system. There is often a competitive relationship between multiple agents. By grouping the agents, the agents in each group can complete different tasks by cooperating or competing according to the same group or different groups. However, in the actual operation of the system, During the process, each agent will inevitably be disturbed by time delays and some unknown uncertainties, thereby reducing the efficiency and stability of the system operation.

上述内容仅用于辅助理解本发明的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist in understanding the technical solution of the present invention, and does not represent an admission that the above content is prior art.

发明内容Contents of the invention

本发明的主要目的在于提供一种多智能体二分一致性的控制方法、装置、设备及存储介质,旨在解决现有技术带有扰动和参数不确定性以及内部时滞的技术问题。The main purpose of the present invention is to provide a multi-agent bipartite consistency control method, device, equipment and storage medium, aiming to solve the technical problems of disturbance, parameter uncertainty and internal time lag in the existing technology.

为实现上述目的,本发明提供了一种多智能体二分一致性的控制方法,所述方法包括以下步骤:In order to achieve the above objectives, the present invention provides a multi-agent binary consistency control method, which method includes the following steps:

确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图;Determine the communication relationship between the various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship;

根据所述符号图确定各个智能体之间的合作对抗关系;Determine the cooperative and confrontational relationship between each agent according to the symbolic diagram;

根据所述合作对抗关系得到各个智能体的通信状态;Obtain the communication status of each agent according to the cooperative confrontation relationship;

根据所述各个智能体的通信状态确定二分一致性判断条件;Determine the binary consistency judgment condition according to the communication status of each agent;

根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。The binary consistency of the multi-agent is determined according to the communication status of each agent and the binary consistency judgment condition.

可选地,所述合作对抗关系包括合作关系与对抗关系,所述根据所述符号图确定各个智能体之间的合作对抗关系,包括:Optionally, the cooperative confrontation relationship includes a cooperative relationship and a confrontation relationship. Determining the cooperative confrontation relationship between each agent according to the symbolic diagram includes:

根据所述符号图确定各个智能体的位置,根据所述位置得到节点集;Determine the position of each agent according to the symbol graph, and obtain a node set according to the position;

根据所述各个智能体之间的通信关系,得到邻接矩阵;According to the communication relationship between the various agents, an adjacency matrix is obtained;

根据所述邻接矩阵与所述节点集得到各个智能体之间的合作对抗关系,在所述智能体之间对应的权重值为正数时,确定所述智能体之间为合作关系;The cooperative and antagonistic relationship between each agent is obtained according to the adjacency matrix and the node set. When the corresponding weight value between the agents is a positive number, it is determined that the cooperative relationship between the agents is;

在所述智能体之间对应的权重值为负数时,所述智能体之间为对抗关系。When the corresponding weight value between the agents is a negative number, there is an adversarial relationship between the agents.

可选地,所述多智能体中包括一个领导者和若干个跟随者,所述根据所述合作对抗关系得到各个智能体的通信状态,包括:Optionally, the multi-agent includes a leader and several followers, and obtaining the communication status of each agent according to the cooperative confrontation relationship includes:

根据所述合作对抗关系确定所述领导者的控制输入和所述跟随者的控制输入;Determine the control input of the leader and the control input of the follower according to the cooperative confrontation relationship;

根据所述领导者的控制输入、系统时滞、领导者当前状态和领导者的非线性时滞函数得到领导者的通信状态;Obtain the communication status of the leader according to the leader's control input, system time delay, leader's current state and leader's nonlinear time delay function;

根据所述跟随者的控制输入、系统时滞、跟随者的当前状态、跟随者智能体的不稳定因素和跟随者的非线性时滞函数得到跟随者的通信状态。The communication status of the follower is obtained according to the follower's control input, system time delay, the follower's current state, the instability factors of the follower agent and the follower's nonlinear time delay function.

可选地,所述根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性之前,还包括:Optionally, before determining the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition, the method further includes:

通过标量控制增益、反馈控制增益矩阵、符号函数、非线性函数构建控制器;Construct a controller through scalar control gain, feedback control gain matrix, symbolic function, and nonlinear function;

所述控制器能够对所述各个智能体进行通信控制,更新各个智能体的状态;The controller can perform communication control on each intelligent agent and update the status of each intelligent agent;

根据领导者与跟随者的更新后的状态,以及符号函数得到一致性误差;The consistency error is obtained based on the updated status of the leader and follower and the sign function;

根据所述一致性误差更新所述二分一致性判断条件。The binary consistency judgment condition is updated according to the consistency error.

可选地,所述根据所述各个智能体的状态与对应的判断条件确定所述多智能体的二分一致性,包括:Optionally, determining the bipartite consistency of the multi-agent based on the status of each agent and the corresponding judgment condition includes:

根据多智能体系统中的跟随者和领导者之间的状态误差构建李雅普诺夫函数;Construct a Lyapunov function based on the state error between the follower and the leader in a multi-agent system;

根据所述李雅普诺夫函数得到一致性误差之间的对应关系;The corresponding relationship between consistency errors is obtained according to the Lyapunov function;

根据所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系、标量控制增益以及反馈控制增益得到第一变化量与第二变化量;The first change amount and the second change amount are obtained according to the corresponding relationship between the state error between the follower and the leader and the consistency error, the scalar control gain and the feedback control gain;

在所述第一变化量与第二变化量均不大于零时,所述多智能体系统实现二分一致性。When neither the first change amount nor the second change amount is greater than zero, the multi-agent system achieves bipartite consistency.

可选地,所述根据所述李雅普诺夫函数得到所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系之后,还包括:Optionally, after obtaining the corresponding relationship between the state error between the follower and the leader and the consistency error according to the Lyapunov function, the method further includes:

根据闭环误差系统对所述李雅普诺夫函数进行求导,得到所述闭环误差系统的二分一致性条件,所述闭环误差系统包括领导者和跟随者。The Lyapunov function is differentiated according to the closed-loop error system to obtain the bipartite consistency condition of the closed-loop error system, which includes a leader and a follower.

此外,为实现上述目的,本发明还提出一种多智能体二分一致性的控制装置,所述多智能体二分一致性的控制装置包括:In addition, in order to achieve the above object, the present invention also proposes a control device for multi-agent bipartite consistency. The control device for multi-agent bipartite consistency includes:

通信遍历模块,用于确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图;The communication traversal module is used to determine the communication relationship between various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship;

通信分类模块,用于根据所述符号图确定各个智能体之间的合作对抗关系;A communication classification module, used to determine the cooperative and confrontational relationship between various agents based on the symbol graph;

通信确认模块,用于根据所述合作对抗关系得到各个智能体的通信状态;A communication confirmation module, used to obtain the communication status of each agent according to the cooperative confrontation relationship;

通信判断模块,用于根据所述各个智能体的通信状态确定二分一致性判断条件;A communication judgment module, used to determine the binary consistency judgment condition according to the communication status of each agent;

结果输出模块,用于根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。A result output module is configured to determine the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition.

此外,为实现上述目的,本发明还提出一种多智能体二分一致性的控制设备,所述多智能体二分一致性的控制设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的多智能体二分一致性的控制程序,所述多智能体二分一致性的控制程序配置为实现如上文所述的多智能体二分一致性的控制方法的步骤。In addition, to achieve the above object, the present invention also proposes a multi-agent bipartite consistency control device. The multi-agent bipartite consistency control device includes: a memory, a processor and a device that is stored in the memory and can be used in The multi-agent binary consistency control program running on the processor is configured to implement the steps of the multi-agent binary consistency control method as described above.

此外,为实现上述目的,本发明还提出一种存储介质,所述存储介质上存储有多智能体二分一致性的控制程序,所述多智能体二分一致性的控制程序被处理器执行时实现如上文所述的多智能体二分一致性的控制方法的步骤。In addition, in order to achieve the above object, the present invention also proposes a storage medium. A control program for bipartite consistency of multi-agent is stored on the storage medium. The control program of bipartite consistency of multi-agent is implemented when executed by a processor. The steps of the multi-agent bipartite consistency control method as described above.

本发明通过确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图,根据所述符号图确定各个智能体之间的合作对抗关系,根据所述合作对抗关系得到各个智能体的通信状态,根据所述各个智能体的通信状态确定二分一致性判断条件,根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性,通过对智能体之间的通信关系确定出智能体之间的合作与对抗关系,消除智能体的时滞及外部干扰影响,达到对多智能体系统的二分一致性跟踪控制。The present invention determines the communication relationship between each agent in the multi-agent system, obtains a symbol diagram based on the communication relationship, determines the cooperative confrontation relationship between the various agents based on the symbol diagram, and obtains the cooperative confrontation relationship based on the cooperative confrontation relationship. The communication status of each intelligent agent is determined according to the communication status of each intelligent agent. The binary consistency judgment condition is determined according to the communication status of each intelligent agent and the binary consistency judgment condition. The binary consistency judgment condition of the multi-agent is determined. , by determining the cooperation and confrontation relationships between agents through the communication relationship between agents, eliminating the time lag and external interference effects of agents, and achieving bipartite consistent tracking control of multi-agent systems.

附图说明Description of the drawings

图1是本发明实施例方案涉及的硬件运行环境的多智能体二分一致性的控制设备的结构示意图;Figure 1 is a schematic structural diagram of a multi-agent binary consistency control device of a hardware operating environment involved in an embodiment of the present invention;

图2为本发明多智能体二分一致性的控制方法第一实施例的流程示意图;Figure 2 is a schematic flow chart of the first embodiment of the multi-agent binary consistency control method of the present invention;

图3为本发明多智能体二分一致性的控制方法第二实施例的流程示意图;Figure 3 is a schematic flow chart of the second embodiment of the multi-agent binary consistency control method of the present invention;

图4为本发明多智能体二分一致性的控制装置第一实施例的结构框图。Figure 4 is a structural block diagram of a first embodiment of a multi-agent bipartite consistency control device of the present invention.

本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization of the purpose, functional features and advantages of the present invention will be further described with reference to the embodiments and the accompanying drawings.

具体实施方式Detailed ways

应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。It should be understood that the specific embodiments described here are only used to explain the present invention and are not intended to limit the present invention.

参照图1,图1为本发明实施例方案涉及的硬件运行环境的多智能体二分一致性的控制设备结构示意图。Referring to Figure 1, Figure 1 is a schematic structural diagram of a multi-agent binary consistency control device of a hardware operating environment involved in an embodiment of the present invention.

如图1所示,该多智能体二分一致性的控制设备可以包括:处理器1001,例如中央处理器(Central Processing Unit,CPU),通信总线1002、用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如无线保真(Wireless-Fidelity,Wi-Fi)接口)。存储器1005可以是高速的随机存取存储器(RandomAccess Memory,RAM)存储器,也可以是稳定的非易失性存储器(Non-Volatile Memory,NVM),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in Figure 1, the multi-agent binary consistency control device may include: a processor 1001, such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. Among them, the communication bus 1002 is used to realize connection communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard). The optional user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface or a wireless interface (such as a Wireless-Fidelity (Wi-Fi) interface). The memory 1005 may be a high-speed random access memory (Random Access Memory, RAM) memory or a stable non-volatile memory (Non-Volatile Memory, NVM), such as a disk memory. The memory 1005 may optionally be a storage device independent of the aforementioned processor 1001.

本领域技术人员可以理解,图1中示出的结构并不构成对多智能体二分一致性的控制设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the structure shown in Figure 1 does not constitute a limitation on the multi-agent bipartite consistency control device, and may include more or less components than shown in the figure, or combine certain components, or Different component arrangements.

如图1所示,作为一种存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及多智能体二分一致性的控制程序。As shown in Figure 1, the memory 1005 as a storage medium may include an operating system, a network communication module, a user interface module, and a multi-agent binary consistency control program.

在图1所示的多智能体二分一致性的控制设备中,网络接口1004主要用于与网络服务器进行数据通信;用户接口1003主要用于与用户进行数据交互;本发明多智能体二分一致性的控制设备中的处理器1001、存储器1005可以设置在多智能体二分一致性的控制设备中,所述多智能体二分一致性的控制设备通过处理器1001调用存储器1005中存储的多智能体二分一致性的控制程序,并执行本发明实施例提供的多智能体二分一致性的控制方法。In the multi-agent binary consistency control device shown in Figure 1, the network interface 1004 is mainly used for data communication with the network server; the user interface 1003 is mainly used for data interaction with the user; the multi-agent binary consistency of the present invention The processor 1001 and the memory 1005 in the control device can be set in the multi-agent binary consistency control device. The multi-agent binary consistency control device calls the multi-agent binary stored in the memory 1005 through the processor 1001. Consistency control program, and execute the multi-agent binary consistency control method provided by the embodiment of the present invention.

本发明实施例提供了一种多智能体二分一致性的控制方法,参照图2,图2为本发明一种多智能体二分一致性的控制方法第一实施例的流程示意图。Embodiments of the present invention provide a control method for binary consistency of multiple agents. Refer to Figure 2 , which is a schematic flow chart of a method for controlling binary consistency of multiple agents according to the first embodiment of the present invention.

本实施例中,所述多智能体二分一致性的控制方法包括以下步骤:In this embodiment, the multi-agent binary consistency control method includes the following steps:

步骤S10:确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图。Step S10: Determine the communication relationship between the various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship.

需要说明的是,本实施例的执行主体是多智能体二分一致性的控制设备,其中,该多智能体二分一致性的控制设备具有数据处理,数据通信及程序运行等功能,所述多智能体二分一致性的控制设备可以为集成控制器,控制计算机等设备,当然还可以为其他具备相似功能的设备,本实施例对此不做限制。It should be noted that the execution subject of this embodiment is a multi-agent bipartite consistent control device, wherein the multi-agent bipartite consistent control device has functions such as data processing, data communication and program running. The multi-agent bipartite consistency control device The control device for body bipartite consistency can be an integrated controller, a control computer and other devices, and of course it can also be other devices with similar functions, which is not limited in this embodiment.

应当理解的是,在一个多智能体系统中可以包括若干个智能体,其中,多智能体之间能够相互建立通信,各个智能体之间相互协作,或是相互竞争地完成特定任务,由于在多智能体之间存在不通的通信关系,并且这两种通信关系上存在逻辑对立,因此,为了能够更好的来描述多智能体之间的通信关系,因此,可以根据多智能体之间的通信关系得到符号图G=(V,E,A),其中,V表示节点集,A为邻接矩阵,E表示两个智能体之间能够互相通信的边集。It should be understood that a multi-agent system can include several agents, in which the multiple agents can establish communication with each other, and the agents can cooperate with each other or compete with each other to complete specific tasks. There are unreasonable communication relationships between multiple agents, and there are logical oppositions between these two communication relationships. Therefore, in order to better describe the communication relationship between multiple agents, we can use the The communication relationship obtains the symbolic graph G = (V, E, A), where V represents the node set, A is the adjacency matrix, and E represents the edge set between two agents that can communicate with each other.

步骤S20:根据所述符号图确定各个智能体之间的合作对抗关系。Step S20: Determine the cooperation and confrontation relationship between the various agents according to the symbol graph.

需要说明的是,各个智能体之间存在通信关系,而通信关系根据智能体之间对特定任务的存在合作协力完成,或是多个智能体之间竞争完成特定任务。It should be noted that there is a communication relationship between various agents, and the communication relationship is based on the existence of cooperation between the agents on specific tasks, or the competition between multiple agents to complete specific tasks.

在具体实现中,将符号图G的Laplacian矩阵L定义为[lij]N×N,其中,In the specific implementation, the Laplacian matrix L of the symbol graph G is defined as [l ij ] N×N , where,

在一个含有N个智能体的多智能体系统中,用符号图G=(V,E,A)表示,其中V={v1,…,vN}是节点集,A=[aij]∈Rn×n为邻接矩阵,节点vi代表第i个智能体,边集(vi,vj)∈E表示智能体i和智能体j可以相互通信,对于符号图G的邻接矩阵A,当有(vi,vj)∈E,智能体i和智能体j为合作关系,aij>0;智能体i和智能体j为对抗关系,aij<0。In a multi-agent system containing N agents, it is represented by a symbolic graph G = (V, E, A), where V = {v 1 ,..., v N } is the node set, and A = [a ij ] ∈R n×n is the adjacency matrix, node v i represents the i-th agent, and the edge set (v i , v j )∈E indicates that agent i and agent j can communicate with each other. For the adjacency matrix A of the symbolic graph G , when there is (v i , v j )∈E, agent i and agent j are in a cooperative relationship, a ij >0; agent i and agent j are in an antagonistic relationship, a ij <0.

步骤S30:根据所述合作对抗关系得到各个智能体的通信状态。Step S30: Obtain the communication status of each intelligent agent according to the cooperation and confrontation relationship.

在具体实现中,在符号图G中,存在两个非空节点集V1和V2,每个节点集内部的智能体的交互行为是合作的,若V1中的智能体和V2中的智能体存在交互行为,则这个行为是对抗的。并且V1和V2满足V1∪V2=V,当vi,vj∈Vs(s∈1,2)时,aij≥0;当vi∈Vs,vj∈Vt,s≠t(s,t∈{1,2})时,aij≤0,则称符号图G是结构平衡的,否则称其为结构不平衡。对于结构平衡的符号图,则存在一个符号矩阵S=diag(s1,s2,…,sN)使得矩阵SAS所有的项都是非负的,当i∈Vs时si=1;当i∈Vt时si=-1。根据各个智能体对应的节点集属于V1和V2,能得到当前的智能体的通信状态。In the specific implementation, in the symbolic graph G, there are two non-empty node sets V 1 and V 2 , and the interactive behavior of the agents inside each node set is cooperative. If the agents in V 1 and the agents in V 2 If there is interactive behavior among the agents, then this behavior is confrontational. And V 1 and V 2 satisfy V 1 ∪V 2 =V, When v i , v j ∈V s (s∈1,2), a ij ≥ 0; when v i ∈V s , v j ∈V t , s≠t(s,t∈{1,2}) When a ij ≤ 0, the symbolic graph G is said to be structurally balanced, otherwise it is said to be structurally unbalanced. For the structurally balanced symbolic graph, there is a symbolic matrix S = diag (s 1 , s 2 ,..., s N ) such that all entries of the matrix SAS are non-negative. When i∈V s , s i = 1; when When i∈V t , s i =-1. According to the node set corresponding to each agent belonging to V 1 and V 2 , the current communication status of the agent can be obtained.

步骤S40:根据所述各个智能体的通信状态确定二分一致性判断条件。Step S40: Determine the binary consistency judgment condition according to the communication status of each agent.

需要说明的是,二分一致性判断条件是用于对多智能体在通信过程中,根据各个智能体之间的通信状态来判断当前智能体是否解决了带有扰动和参数不确定性以及内部时滞的问题的判断条件。It should be noted that the binary consistency judgment condition is used to determine whether the current agent has solved the problem of disturbance, parameter uncertainty and internal time based on the communication status between each agent during the communication process of multiple agents. Judgment conditions for stagnation problems.

进一步地,所述多智能体中包括一个领导者和若干个跟随者,所述根据所述合作对抗关系得到各个智能体的通信状态,包括:Further, the multi-agent includes a leader and several followers, and obtaining the communication status of each agent according to the cooperative confrontation relationship includes:

根据所述合作对抗关系确定所述领导者的控制输入和所述跟随者的控制输入;Determine the control input of the leader and the control input of the follower according to the cooperative confrontation relationship;

根据所述领导者的控制输入、系统时滞、领导者当前状态和领导者的非线性时滞函数得到领导者的通信状态;Obtain the communication status of the leader according to the leader's control input, system time delay, leader's current state and leader's nonlinear time delay function;

根据所述跟随者的控制输入、系统时滞、跟随者的当前状态、跟随者智能体的不稳定因素和跟随者的非线性时滞函数得到跟随者的通信状态。The communication status of the follower is obtained according to the follower's control input, system time delay, the follower's current state, the instability factors of the follower agent and the follower's nonlinear time delay function.

在具体实现中,在该多智能体系统中包括一个领导者和若干个跟随者,其中,领导者与跟随者构成了整个多智能体系统,而对于多智能体系统可以为:In specific implementation, the multi-agent system includes a leader and several followers, where the leader and followers constitute the entire multi-agent system, and the multi-agent system can be:

其中,领导者的个数为1,跟随者的个数为N,xi(t)∈Rn,ui(t)∈Rp,分别表示第i个智能体的状态和控制输入。f(xi(t-τ(t)))表示非线性时滞函数,τ(t)表示系统的时滞且满足0<τ(t)<τ,其中τ和μ是已知常数。/>表示为包含第i个智能体的外部干扰以及参数不确定性的匹配不确定项。A,B和F是常数矩阵。x0(t)∈Rn代表领导者智能体的状态,u0(t)∈Rp代表领导者智能体的控制输入,f(x0(t-τ(t)))表示领导者的非线性时滞函数。在对于领导者-跟随者多智能体系统(2)在设计的分布式控制器下满足:Among them, the number of leaders is 1, the number of followers is N, x i (t)∈R n , u i (t)∈R p , respectively representing the state and control input of the i-th agent. f(x i (t-τ(t))) represents the nonlinear time delay function, τ(t) represents the time delay of the system and satisfies 0<τ(t)<τ, where τ and μ are known constants. /> Represented as a matching uncertainty term including the external interference of the i-th agent and parameter uncertainty. A, B and F are constant matrices. x 0 (t)∈R n represents the state of the leader agent, u 0 (t)∈R p represents the control input of the leader agent, f(x 0 (t-τ(t))) represents the leader’s Nonlinear time delay function. For the leader-follower multi-agent system (2) under the designed distributed controller, it satisfies:

步骤S50:根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。Step S50: Determine the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition.

在具体实现中,若多智能体中能够满足式(3)所描述的条件时,说明当前的多智能体系统实现二分一致性。对于节点集V1和V2中的智能体而言,每个节点集内部的智能体的交互行为是合作的,若V1中的智能体和V2中的智能体存在交互行为,则这个行为是对抗的。在进行二分一致性判定时首先判断出当前的智能体属于哪一个节点集,若当前智能体属于V1对应的节点集时,需要将当前节点的状态与领导者当前的状态的差做极限,判断差值的绝对值的极限是否趋于0,并重复对V1节点中的每一个节点进行相同的处理方式进行处理,相似的,若当前的节点为V2节点集时,需要将当前节点的状态与领导者当前的状态的和做极限,判断差值的绝对值的极限是否趋于0,并重复对V2节点中的每一个节点进行相同的处理方式进行处理,若所得结果均趋于0,则说明当前的多智能体系统实现了二分一致性。In specific implementation, if the conditions described in equation (3) can be satisfied in multi-agent systems, it means that the current multi-agent system achieves bipartite consistency. For the agents in node sets V 1 and V 2 , the interactive behavior of the agents within each node set is cooperative. If the agents in V 1 and the agents in V 2 have interactive behaviors, then this Behavior is confrontational. When making a binary consistency determination, first determine which node set the current agent belongs to. If the current agent belongs to the node set corresponding to V 1 , the difference between the current node's state and the leader's current state needs to be limited. Determine whether the limit of the absolute value of the difference tends to 0, and repeat the same processing method for each node in the V 1 node. Similarly, if the current node is the V 2 node set, the current node needs to be The sum of the state and the current state of the leader is used as the limit, and it is judged whether the limit of the absolute value of the difference tends to 0, and the same processing method is repeated for each node in the V 2 node. If the results are all tending to is 0, it means that the current multi-agent system has achieved bipartite consistency.

进一步地,所述合作对抗关系包括合作关系与对抗关系,所述根据所述符号图确定各个智能体之间的合作对抗关系,包括:Further, the cooperative confrontation relationship includes a cooperative relationship and a confrontation relationship. Determining the cooperative confrontation relationship between each agent according to the symbolic diagram includes:

根据所述符号图确定各个智能体的位置,根据所述位置得到节点集;Determine the position of each agent according to the symbol graph, and obtain a node set according to the position;

根据所述各个智能体之间的通信关系,得到邻接矩阵;According to the communication relationship between the various agents, an adjacency matrix is obtained;

根据所述邻接矩阵与所述节点集得到各个智能体之间的合作对抗关系,在所述智能体之间对应的权重值为正数时,确定所述智能体之间为合作关系;The cooperative and antagonistic relationship between each agent is obtained according to the adjacency matrix and the node set. When the corresponding weight value between the agents is a positive number, it is determined that the cooperative relationship between the agents is;

在所述智能体之间对应的权重值为负数时,所述智能体之间为对抗关系。When the corresponding weight value between the agents is a negative number, there is an adversarial relationship between the agents.

在具体实现中,在得到符号图时,符号图中能反映出智能体之间的通信关系,并且能够间接反映出智能体所处于的位置,根据智能体所在的位置确定出智能体所处在的节点集,在多智能体系统中包括两个非空的节点子集V1和V2,。根据节点集V1和V2能够判断出当前的符号图是否处于平衡状态,若两个存在交互行为的节点在同一个节点集中时,说明这两个节点之间是合作关系,若两个存在交互行为的节点分别在不同的节点集中时,这两个节点之间是对抗关系,若节点集中的节点都能够满足上述的关系时,说明当前的符号图时结构平衡的,对于结构平衡的符号图存在符号矩阵,能使矩阵SAS所有的项都是非负的。In the specific implementation, when the symbolic graph is obtained, the symbolic graph can reflect the communication relationship between the agents, and can indirectly reflect the location of the agents. According to the location of the agents, the location of the agents can be determined. The node set includes two non-empty node subsets V 1 and V 2 in the multi-agent system. According to the node sets V 1 and V 2 , it can be judged whether the current symbolic graph is in a balanced state. If two nodes with interactive behaviors are in the same node set, it means that there is a cooperative relationship between the two nodes. If there are When the interactive nodes are in different node sets, there is an antagonistic relationship between the two nodes. If the nodes in the node set can all satisfy the above relationship, it means that the current symbol graph is structurally balanced. For structurally balanced symbols The graph has a signed matrix that makes all entries of the matrix SAS non-negative.

本实施例通过确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图,根据所述符号图确定各个智能体之间的合作对抗关系,根据所述合作对抗关系得到各个智能体的通信状态,根据所述各个智能体的通信状态确定二分一致性判断条件,根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性,通过对智能体之间的通信关系确定出智能体之间的合作与对抗关系,消除智能体的时滞及外部干扰影响,达到对多智能体系统的二分一致性跟踪控制。This embodiment determines the communication relationship between each agent in the multi-agent system, obtains a symbol diagram based on the communication relationship, determines the cooperative confrontation relationship between the various agents based on the symbol diagram, and determines the cooperative confrontation relationship between the various agents based on the symbolic diagram. Obtain the communication status of each intelligent agent, determine the binary consistency judgment condition according to the communication status of each intelligent agent, and determine the binary consistency of the multi-agent according to the communication status of each intelligent agent and the binary consistency judgment condition. property, by determining the cooperation and confrontation relationships between agents through the communication relationship between agents, eliminating the time lag and external interference effects of agents, and achieving bipartite consistent tracking control of multi-agent systems.

参考图3,图3为本发明一种多智能体二分一致性的控制方法第二实施例的流程示意图。Referring to Figure 3, Figure 3 is a schematic flow chart of a second embodiment of a multi-agent binary consistency control method of the present invention.

基于上述第一实施例,本实施例多智能体二分一致性的控制方法在所述步骤S50之前,还包括:Based on the above-mentioned first embodiment, the multi-agent binary consistency control method in this embodiment also includes before step S50:

步骤S501:通过标量控制增益、反馈控制增益矩阵、符号函数、非线性函数构建控制器;Step S501: Construct a controller through scalar control gain, feedback control gain matrix, symbolic function, and nonlinear function;

步骤S502:所述控制器能够对所述各个智能体进行通信控制,更新各个智能体的状态;Step S502: The controller can perform communication control on each intelligent agent and update the status of each intelligent agent;

步骤S503:根据领导者与跟随者的更新后的状态,以及符号函数得到一致性误差;Step S503: Obtain the consistency error according to the updated status of the leader and follower and the sign function;

步骤S504:根据所述一致性误差更新所述二分一致性判断条件。Step S504: Update the binary consistency judgment condition according to the consistency error.

在具体实现中,基于智能体与其邻居节点间的信息交互,为了解决一阶智能体带有扰动和时滞等问题,提出第i个跟随者的分布式控制器。In the specific implementation, based on the information interaction between the agent and its neighbor nodes, in order to solve the problems of first-order agents with disturbances and time delays, a distributed controller of the i-th follower is proposed.

式(4)中,c1>0,c2>0表示标量控制增益,K表示反馈控制增益矩阵,sgn(·)表示符号函数,g(·)是一个非线性函数,定义如下:In formula (4), c 1 >0, c 2 >0 represents the scalar control gain, K represents the feedback control gain matrix, sgn(·) represents the sign function, and g(·) is a nonlinear function, defined as follows:

令第i个智能体和领导者之间的状态误差定义为:ei(t)=xi(t)-six0(t),误差信号则领导者-跟随者闭环误差系统可以描述为:Let the state error between the i-th agent and the leader be defined as: e i (t) = x i (t) - s i x 0 (t), error signal Then the leader-follower closed-loop error system can be described as:

将控制器(4)带入动力系统(6),可得:Bringing the controller (4) into the power system (6), we can get:

其中 in

可得一致性跟踪误差为:make The available consistency tracking error is:

其中G(e(t))表示为where G(e(t)) is expressed as

根据得到的一致性跟踪误差对更新所述二分一致性判断条件。The binary consistency judgment condition is updated according to the obtained consistency tracking error pair.

进一步地,所述根据所述各个智能体的状态与对应的判断条件确定所述多智能体的二分一致性,包括:Further, determining the bipartite consistency of the multi-agent according to the status of each agent and the corresponding judgment condition includes:

根据多智能体系统中的跟随者和领导者之间的状态误差构建李雅普诺夫函数;Construct a Lyapunov function based on the state error between the follower and the leader in a multi-agent system;

根据所述李雅普诺夫函数得到一致性误差之间的对应关系;The corresponding relationship between consistency errors is obtained according to the Lyapunov function;

根据所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系、标量控制增益以及反馈控制增益得到第一变化量与第二变化量;The first change amount and the second change amount are obtained according to the corresponding relationship between the state error between the follower and the leader and the consistency error, the scalar control gain and the feedback control gain;

在所述第一变化量与第二变化量均不大于零时,所述多智能体系统实现二分一致性。When neither the first change amount nor the second change amount is greater than zero, the multi-agent system achieves bipartite consistency.

在具体实现中,针对多智能体系统,构建李雅普诺夫函数:In the specific implementation, for multi-agent systems, the Lyapunov function is constructed:

V(t)=V1(t)+V2(t) (9)V(t)=V 1 (t)+V 2 (t) (9)

其中in

注意到K=-FTR,对V1(t)求导得:Noting that K=-F T R, taking the derivative with respect to V 1 (t) gives:

对于所述多智能体系统,存在以下假设:For the multi-agent system described, the following assumptions exist:

假设1:如果无向符号图G是连通的且结构平衡的,其包含一个以领导者为根节点的有向生成树,其子图Gs包含N个跟随者。Assumption 1: If the undirected symbolic graph G is connected and structurally balanced, it contains a directed spanning tree with the leader as the root node, and its subgraph G s contains N followers.

对于领导者与跟随者之间的通信关系则通过一个对角矩阵A0描述,A0=diag(a10,…,aN0),若跟随者i可以与领导者进行通信,则ai0>0,否则ai0=0。The communication relationship between the leader and the follower is described by a diagonal matrix A 0 , A 0 =diag(a 10 ,...,a N0 ). If the follower i can communicate with the leader, then a i0 > 0, otherwise a i0 =0.

使用L代表关于图G的Laplacian矩阵,由于领导者没有邻居,L可以被表示为:Using L represents the Laplacian matrix about the graph G, since the leader has no neighbors, L can be expressed as:

其中L2∈RN×1,L1∈RN×NAmong them, L 2R N×1 and L 1R N×N .

假设2:存在一个常量β>0,使得||u0(t)||≤β。Assumption 2: There is a constant β>0 such that ||u 0 (t)||≤β.

假设3:是连续且有界的,即存在常数γ>0,使得 Assumption 3: is continuous and bounded, that is, there is a constant γ>0, such that

假设4:非线性函数f满足如下条件:Assumption 4: The nonlinear function f satisfies the following conditions:

||f(xi(t-τ(t)),t)-sif(x0(t-τ(t)),t)||≤κ||xi(t-τ(t))-six0(t-τ(t))||,其中常数κ>0。||f(x i (t-τ(t)),t)-s i f(x 0 (t-τ(t)),t)||≤κ||x i (t-τ(t) )-s i x 0 (t-τ(t))||, where the constant κ>0.

同时还包括两个引理:It also includes two lemmas:

引理1:对于一个无向连通图G,若包含一个以领导者为根节点的有向生成树,则其关于图G的拉普拉斯矩阵L为正定的。Lemma 1: For an undirected connected graph G, if it contains a directed spanning tree with the leader as the root node, then its Laplacian matrix L about the graph G is positive definite.

引理2:对于任意给定的向量X,Y∈Rn和正定矩阵P∈Rn×n,有Lemma 2: For any given vector X, Y∈R n and positive definite matrix P∈R n×n , we have

2XTY≤XTPX+YTP-1Y。2X T Y≤X T PX+Y T P -1 Y.

若假设1-4成立。当有条件c2≥ρ,K=-FTR,其中/>λ2是L1的最小非零特征值,使得不等式If assumptions 1-4 are true. When there are conditions c 2 ≥ρ,K=-F T R, where/> λ 2 is the smallest nonzero eigenvalue of L 1 such that the inequality

成立,则通过上述分布式控制器可以使领导者-跟随者多智能体系统实现二分一致性。Established, then the leader-follower multi-agent system can achieve bipartite consistency through the above distributed controller.

进一步地,,所述根据所述李雅普诺夫函数得到所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系之后,还包括:Further, after obtaining the corresponding relationship between the state error between the follower and the leader and the consistency error according to the Lyapunov function, the method further includes:

根据闭环误差系统对所述李雅普诺夫函数进行求导,得到所述闭环误差系统的二分一致性条件,所述闭环误差系统包括领导者和跟随者。The Lyapunov function is differentiated according to the closed-loop error system to obtain the bipartite consistency condition of the closed-loop error system, which includes a leader and a follower.

在具体实现中,首先根据领导者-跟随者闭环误差系统对李雅普诺夫函数中第一项进行求导得到根据上述的假设2和假设3,得到/>Γ=γ+β,则上述(11)式中的/>能够进一步表示为:In the specific implementation, first, the first term in the Lyapunov function is derived according to the leader-follower closed-loop error system to obtain According to the above assumptions 2 and 3, we get/> Γ=γ+β, then in the above formula (11)/> It can be further expressed as:

而在(11)式中的则进一步表示为:And in equation (11) Then it is further expressed as:

而已知c2≥ρ,将上述第(13)式,(14)式代入到(11)式中,得到And it is known that c 2 ≥ρ, substituting the above equations (13) and (14) into equation (11), we get

根据引理1可知,拉普拉斯矩阵L1是正定的,并且关于G的子图Gs是无向连通的,且至少有一个智能体能与领导者进行通信,则存在一个如下的酉矩阵,According to Lemma 1, it can be seen that the Laplacian matrix L 1 is positive definite, and the subgraph G s about G is undirected and connected, and at least one agent can communicate with the leader, then there is a unitary matrix as follows ,

其中Y1∈RN×N-1通过状态转换得到/>其中同时,(λ1≤λ2…≤λN)为L1的特征值。令/>(15)式中的可以化简为:in Y 1 ∈R N×N-1 is obtained through state transition/> in At the same time, (λ 1 ≤λ 2 …≤λ N ) is the eigenvalue of L 1 . Order/> (15) in Eq. It can be simplified to:

由引理2和假设4可将式(15)中式进一步表示为:According to Lemma 2 and Assumption 4, equation (15) can be The formula is further expressed as:

将(17)、(18)式带入(15)式中,得到Putting equations (17) and (18) into equation (15), we get

求导,注意到/>可得right Find the derivative and note/> Available

根据系统时滞对所述李雅普诺夫函数第二项进行求导,此时能够将(19)式和(20)式代入到(9)式,能够得到:Derive the second term of the Lyapunov function according to the system time delay. At this time, equations (19) and (20) can be substituted into equation (9), and we can get:

根据定理1可知,Δ1≤0,Δ2≤0,则当t→∞时,系统误差趋近于0,因此,领导者-跟随者系统(2)在控制器(4)下可渐近实现二分一致性。According to Theorem 1, Δ 1 ≤ 0, Δ 2 ≤ 0, then When t→∞, the system error approaches 0. Therefore, the leader-follower system (2) can asymptotically achieve bipartite consistency under the controller (4).

本实施例在满足符号图结构平衡的条件下,不仅处理了智能体的内部时滞,还为每一个跟随者设计了相应的分布式控制器,解决了智能体带有的某些不确定项包括参数不确定性和外部干扰的影响。最终实现了一组智能体跟踪领导者达到状态值一致,另一组智能体收敛到与领导者模相等符号相反的状态值,实现了二分一致性跟踪目标。Under the condition that the symbolic graph structure is balanced, this embodiment not only handles the internal time delay of the agent, but also designs a corresponding distributed controller for each follower, solving certain uncertainties associated with the agent. Includes parameter uncertainties and the effects of external disturbances. In the end, a group of agents tracked the leader to achieve consistent state values, and the other group of agents converged to a state value with the same modulo equal sign and opposite sign as the leader, achieving the bipartite consistent tracking goal.

此外,本发明实施例还提出一种存储介质,所述存储介质上存储有多智能体二分一致性的控制程序,所述多智能体二分一致性的控制程序被处理器执行时实现如上文所述的多智能体二分一致性的控制方法的步骤。In addition, the embodiment of the present invention also proposes a storage medium. The storage medium stores a multi-agent binary consistency control program. When the multi-agent binary consistency control program is executed by a processor, the control program is implemented as described above. The steps of the multi-agent bipartite consistency control method described above.

参照图4,图4为本发明多智能体二分一致性的控制装置第一实施例的结构框图。Referring to Figure 4, Figure 4 is a structural block diagram of a first embodiment of a multi-agent binary consistency control device of the present invention.

如图4所示,本发明实施例提出的多智能体二分一致性的控制装置包括:As shown in Figure 4, the multi-agent binary consistency control device proposed by the embodiment of the present invention includes:

通信遍历模块10,用于确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图;The communication traversal module 10 is used to determine the communication relationship between various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship;

通信分类模块20,用于根据所述符号图确定各个智能体之间的合作对抗关系;The communication classification module 20 is used to determine the cooperative and confrontational relationship between each agent according to the symbol graph;

通信确认模块30,用于根据所述合作对抗关系得到各个智能体的通信状态;The communication confirmation module 30 is used to obtain the communication status of each intelligent agent according to the cooperative confrontation relationship;

通信判断模块40,用于根据所述各个智能体的通信状态确定二分一致性判断条件;The communication judgment module 40 is used to determine the binary consistency judgment condition according to the communication status of each agent;

结果输出模块50,用于根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。The result output module 50 is configured to determine the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition.

本实施例通过确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图,根据所述符号图确定各个智能体之间的合作对抗关系,根据所述合作对抗关系得到各个智能体的通信状态,根据所述各个智能体的通信状态确定二分一致性判断条件,根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性,通过对智能体之间的通信关系确定出智能体之间的合作与对抗关系,消除智能体的时滞及外部干扰影响,达到对多智能体系统的二分一致性跟踪控制。This embodiment determines the communication relationship between each agent in the multi-agent system, obtains a symbol diagram based on the communication relationship, determines the cooperative confrontation relationship between the various agents based on the symbol diagram, and determines the cooperative confrontation relationship between the various agents based on the symbolic diagram. Obtain the communication status of each intelligent agent, determine the binary consistency judgment condition according to the communication status of each intelligent agent, and determine the binary consistency of the multi-agent according to the communication status of each intelligent agent and the binary consistency judgment condition. property, by determining the cooperation and confrontation relationships between agents through the communication relationship between agents, eliminating the time lag and external interference effects of agents, and achieving bipartite consistent tracking control of multi-agent systems.

在一实施例中,所述通信分类模块20,还用于根据所述符号图确定各个智能体的位置,根据所述位置得到节点集;根据所述各个智能体之间的通信关系,得到邻接矩阵;根据所述邻接矩阵与所述节点集得到各个智能体之间的合作对抗关系,在所述智能体之间对应的权重值为正数时,确定所述智能体之间为合作关系;在所述智能体之间对应的权重值为负数时,所述智能体之间为对抗关系。In one embodiment, the communication classification module 20 is also used to determine the position of each agent according to the symbol graph, and obtain a node set according to the position; and obtain the adjacency according to the communication relationship between the agents. Matrix; obtain the cooperative and antagonistic relationship between each agent according to the adjacency matrix and the node set, and when the corresponding weight value between the agents is a positive number, it is determined that the cooperative relationship between the agents; When the corresponding weight value between the agents is a negative number, there is an adversarial relationship between the agents.

在一实施例中,所述通信确认模块30,还用于根据所述合作对抗关系确定所述领导者的控制输入和所述跟随者的控制输入;根据所述领导者的控制输入、系统时滞、领导者当前状态和领导者的非线性时滞函数得到领导者的通信状态;根据所述跟随者的控制输入、系统时滞、跟随者的当前状态、跟随者智能体的不稳定因素和跟随者的非线性时滞函数得到跟随者的通信状态。In one embodiment, the communication confirmation module 30 is further configured to determine the control input of the leader and the control input of the follower according to the cooperative confrontation relationship; according to the control input of the leader, the system time The leader's communication state is obtained by using the delay, the leader's current state and the leader's nonlinear time delay function; according to the follower's control input, system time delay, the follower's current state, the instability factors of the follower agent and The follower's nonlinear delay function obtains the follower's communication status.

在一实施例中,所述结果输出模块50,还用于通过标量控制增益、反馈控制增益矩阵、符号函数、非线性函数构建控制器;所述控制器能够对所述各个智能体进行通信控制,更新各个智能体的状态;根据领导者与跟随者的更新后的状态,以及符号函数得到一致性误差;根据所述一致性误差更新所述二分一致性判断条件。In one embodiment, the result output module 50 is also used to construct a controller through scalar control gain, feedback control gain matrix, symbolic function, and nonlinear function; the controller can perform communication control on each intelligent agent. , update the status of each agent; obtain the consistency error based on the updated status of the leader and follower, and the sign function; update the binary consistency judgment condition based on the consistency error.

在一实施例中,所述结果输出模块50,还用于根据多智能体系统中的跟随者和领导者之间的状态误差构建李雅普诺夫函数;根据所述李雅普诺夫函数得到一致性误差之间的对应关系;根据所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系、标量控制增益以及反馈控制增益得到第一变化量与第二变化量;在所述第一变化量与第二变化量均不大于零时,所述多智能体系统实现二分一致性。In one embodiment, the result output module 50 is also used to construct a Lyapunov function based on the state error between the follower and the leader in the multi-agent system; obtain the consistency error based on the Lyapunov function The corresponding relationship between the follower and the leader; the first change amount and the second change amount are obtained according to the corresponding relationship between the state error between the follower and the leader and the consistency error, the scalar control gain and the feedback control gain; in When neither the first change amount nor the second change amount is greater than zero, the multi-agent system achieves bipartite consistency.

在一实施例中,所述结果输出模块50,还用于根据闭环误差系统对所述李雅普诺夫函数进行求导,得到所述闭环误差系统的二分一致性条件,所述闭环误差系统包括领导者和跟随者。In one embodiment, the result output module 50 is also used to derive the Lyapunov function according to a closed-loop error system to obtain the bipartite consistency condition of the closed-loop error system. The closed-loop error system includes a leader followers and followers.

应当理解的是,以上仅为举例说明,对本发明的技术方案并不构成任何限定,在具体应用中,本领域的技术人员可以根据需要进行设置,本发明对此不做限制。It should be understood that the above are only examples and do not constitute any limitation on the technical solution of the present invention. In specific applications, those skilled in the art can make settings as needed, and the present invention does not impose any limitations on this.

需要说明的是,以上所描述的工作流程仅仅是示意性的,并不对本发明的保护范围构成限定,在实际应用中,本领域的技术人员可以根据实际的需要选择其中的部分或者全部来实现本实施例方案的目的,此处不做限制。It should be noted that the workflow described above is only illustrative and does not limit the scope of the present invention. In practical applications, those skilled in the art can select some or all of them for implementation according to actual needs. The purpose of this embodiment is not limited here.

应该理解的是,虽然本申请实施例中的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,其可以以其他的顺序执行。而且,图中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,其执行顺序也不必然是依次进行,而是可以与其他步骤或者其他步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although each step in the flow chart in the embodiment of the present application is displayed in sequence as indicated by the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated in this article, the execution of these steps is not strictly limited in order, and they can be executed in other orders. Moreover, at least some of the steps in the figure may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times, and their execution order is not necessarily sequential. may be performed in turn or alternately with other steps or sub-steps of other steps or at least part of stages.

此外,需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。Furthermore, it should be noted that, as used herein, the terms "include", "comprises" or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article or system that includes a list of elements includes not only those elements, but also other elements not expressly listed or elements inherent to the process, method, article or system. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of other identical elements in the process, method, article, or system that includes that element.

上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The above serial numbers of the embodiments of the present invention are only for description and do not represent the advantages and disadvantages of the embodiments.

通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如只读存储器(Read Only Memory,ROM)/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus the necessary general hardware platform. Of course, it can also be implemented by hardware, but in many cases the former is better. implementation. Based on this understanding, the technical solution of the present invention can be embodied in the form of a software product that is essentially or contributes to the existing technology. The computer software product is stored in a storage medium (such as a read-only memory). , ROM)/RAM, magnetic disk, optical disk), including several instructions to cause a terminal device (which can be a mobile phone, computer, server, or network device, etc.) to execute the method described in various embodiments of the present invention.

以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only preferred embodiments of the present invention, and do not limit the patent scope of the present invention. Any equivalent structure or equivalent process transformation made using the description and drawings of the present invention may be directly or indirectly used in other related technical fields. , are all similarly included in the scope of patent protection of the present invention.

Claims (9)

1.一种多智能体二分一致性的控制方法,其特征在于,所述多智能体二分一致性的控制方法包括:1. A control method for binary consistency of multi-agent, characterized in that the control method for binary consistency of multi-agent includes: 确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图;Determine the communication relationship between the various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship; 根据所述符号图确定各个智能体之间的合作对抗关系;Determine the cooperative and confrontational relationship between each agent according to the symbolic graph; 根据所述合作对抗关系得到各个智能体的通信状态;Obtain the communication status of each agent according to the cooperative confrontation relationship; 根据所述各个智能体的通信状态确定二分一致性判断条件;Determine the binary consistency judgment condition according to the communication status of each agent; 根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。The binary consistency of the multi-agent is determined according to the communication status of each agent and the binary consistency judgment condition. 2.如权利要求1所述的方法,其特征在于,所述合作对抗关系包括合作关系与对抗关系,所述根据所述符号图确定各个智能体之间的合作对抗关系,包括:2. The method according to claim 1, wherein the cooperative confrontation relationship includes a cooperative relationship and a confrontation relationship, and determining the cooperative confrontation relationship between each agent according to the symbol diagram includes: 根据所述符号图确定各个智能体的位置,根据所述位置得到节点集;Determine the position of each agent according to the symbol graph, and obtain a node set according to the position; 根据所述各个智能体之间的通信关系,得到邻接矩阵;According to the communication relationship between the various agents, an adjacency matrix is obtained; 根据所述邻接矩阵与所述节点集得到各个智能体之间的合作对抗关系,在所述智能体之间对应的权重值为正数时,确定所述智能体之间为合作关系;The cooperative and antagonistic relationship between each agent is obtained according to the adjacency matrix and the node set. When the corresponding weight value between the agents is a positive number, it is determined that the cooperative relationship between the agents is; 在所述智能体之间对应的权重值为负数时,所述智能体之间为对抗关系。When the corresponding weight value between the agents is a negative number, there is an adversarial relationship between the agents. 3.如权利要求1所述的方法,其特征在于,所述多智能体中包括一个领导者和若干个跟随者,所述根据所述合作对抗关系得到各个智能体的通信状态,包括:3. The method of claim 1, wherein the multi-agent includes a leader and several followers, and obtaining the communication status of each agent according to the cooperative confrontation relationship includes: 根据所述合作对抗关系确定所述领导者的控制输入和所述跟随者的控制输入;Determine the control input of the leader and the control input of the follower according to the cooperative confrontation relationship; 根据所述领导者的控制输入、系统时滞、领导者当前状态和领导者的非线性时滞函数得到领导者的通信状态;Obtain the communication status of the leader according to the leader's control input, system time delay, leader's current state and leader's nonlinear time delay function; 根据所述跟随者的控制输入、系统时滞、跟随者的当前状态、跟随者智能体的不稳定因素和跟随者的非线性时滞函数得到跟随者的通信状态。The communication status of the follower is obtained according to the follower's control input, system time delay, the follower's current state, the instability factors of the follower agent and the follower's nonlinear time delay function. 4.如权利要求1所述的方法,其特征在于,所述根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性之前,还包括:4. The method of claim 1, wherein before determining the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition, it further includes: 通过标量控制增益、反馈控制增益矩阵、符号函数、非线性函数构建控制器;Construct a controller through scalar control gain, feedback control gain matrix, symbolic function, and nonlinear function; 所述控制器能够对所述各个智能体进行通信控制,更新各个智能体的状态;The controller can perform communication control on each intelligent agent and update the status of each intelligent agent; 根据领导者与跟随者的更新后的状态,以及符号函数得到一致性误差;The consistency error is obtained based on the updated status of the leader and follower and the sign function; 根据所述一致性误差更新所述二分一致性判断条件。The binary consistency judgment condition is updated according to the consistency error. 5.如权利要求4所述的方法,其特征在于,所述根据所述各个智能体的状态与对应的判断条件确定所述多智能体的二分一致性,包括:5. The method of claim 4, wherein determining the bipartite consistency of the multi-agent according to the status of each agent and corresponding judgment conditions includes: 根据多智能体系统中的跟随者和领导者之间的状态误差构建李雅普诺夫函数;Construct a Lyapunov function based on the state error between the follower and the leader in a multi-agent system; 根据所述李雅普诺夫函数得到一致性误差之间的对应关系;The corresponding relationship between consistency errors is obtained according to the Lyapunov function; 根据所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系、标量控制增益以及反馈控制增益得到第一变化量与第二变化量;The first change amount and the second change amount are obtained according to the corresponding relationship between the state error between the follower and the leader and the consistency error, the scalar control gain and the feedback control gain; 在所述第一变化量与第二变化量均不大于零时,所述多智能体系统实现二分一致性。When neither the first change amount nor the second change amount is greater than zero, the multi-agent system achieves bipartite consistency. 6.如权利要求5所述的方法,其特征在于,所述根据所述李雅普诺夫函数得到所述跟随者和领导者之间的状态误差与所述一致性误差之间的对应关系之后,还包括:6. The method of claim 5, wherein after obtaining the corresponding relationship between the state error between the follower and the leader and the consistency error according to the Lyapunov function, Also includes: 根据闭环误差系统对所述李雅普诺夫函数进行求导,得到所述闭环误差系统的二分一致性条件,所述闭环误差系统包括领导者和跟随者。The Lyapunov function is differentiated according to the closed-loop error system to obtain the bipartite consistency condition of the closed-loop error system, which includes a leader and a follower. 7.一种多智能体二分一致性的控制装置,其特征在于,所述多智能体二分一致性的控制装置包括:7. A control device for multi-agent binary consistency, characterized in that the control device for multi-agent binary consistency includes: 通信遍历模块,用于确定多智能体系统中各个智能体之间的通信关系,根据所述通信关系得到符号图;The communication traversal module is used to determine the communication relationship between various agents in the multi-agent system, and obtain the symbol graph based on the communication relationship; 通信分类模块,用于根据所述符号图确定各个智能体之间的合作对抗关系;A communication classification module, used to determine the cooperative and confrontational relationship between various agents based on the symbol graph; 通信确认模块,用于根据所述合作对抗关系得到各个智能体的通信状态;A communication confirmation module, used to obtain the communication status of each agent according to the cooperative confrontation relationship; 通信判断模块,用于根据所述各个智能体的通信状态确定二分一致性判断条件;A communication judgment module, used to determine the binary consistency judgment condition according to the communication status of each agent; 结果输出模块,用于根据所述各个智能体的通信状态与所述二分一致性判断条件确定所述多智能体的二分一致性。A result output module is configured to determine the binary consistency of the multi-agent according to the communication status of each agent and the binary consistency judgment condition. 8.一种多智能体二分一致性的控制设备,其特征在于,所述设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的多智能体二分一致性的控制程序,所述多智能体二分一致性的控制程序配置为实现如权利要求1至6中任一项所述的多智能体二分一致性的控制方法的步骤。8. A multi-agent binary consistency control device, characterized in that the device includes: a memory, a processor, and a multi-agent binary consistency stored in the memory and capable of running on the processor. The control program of the multi-agent binary consistency is configured to implement the steps of the control method of the multi-agent binary consistency according to any one of claims 1 to 6. 9.一种存储介质,其特征在于,所述存储介质上存储有多智能体二分一致性的控制程序,所述多智能体二分一致性的控制程序被处理器执行时实现如权利要求1至6任一项所述的多智能体二分一致性的控制方法的步骤。9. A storage medium, characterized in that a multi-agent binary consistency control program is stored on the storage medium, and when the multi-agent binary consistency control program is executed by a processor, the implementation of claims 1 to 1 Steps of the multi-agent bipartite consistency control method described in any one of 6.
CN202310856203.XA 2023-07-12 2023-07-12 Multi-agent binary consistency control method, device, equipment and storage medium Pending CN116974185A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310856203.XA CN116974185A (en) 2023-07-12 2023-07-12 Multi-agent binary consistency control method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310856203.XA CN116974185A (en) 2023-07-12 2023-07-12 Multi-agent binary consistency control method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN116974185A true CN116974185A (en) 2023-10-31

Family

ID=88477629

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310856203.XA Pending CN116974185A (en) 2023-07-12 2023-07-12 Multi-agent binary consistency control method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116974185A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117973431A (en) * 2024-03-25 2024-05-03 四川航天职业技术学院(四川航天高级技工学校) Optimal bipartite consensus control method, device, equipment and storage medium
CN118363306A (en) * 2024-06-14 2024-07-19 天津工业大学 Hysteresis consistency control method based on human-computer interaction multi-agent system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117973431A (en) * 2024-03-25 2024-05-03 四川航天职业技术学院(四川航天高级技工学校) Optimal bipartite consensus control method, device, equipment and storage medium
CN118363306A (en) * 2024-06-14 2024-07-19 天津工业大学 Hysteresis consistency control method based on human-computer interaction multi-agent system

Similar Documents

Publication Publication Date Title
Ozturk et al. Extracting priority rules for dynamic multi-objective flexible job shop scheduling problems using gene expression programming
Peng et al. Input–output data-based output antisynchronization control of multiagent systems using reinforcement learning approach
Liu et al. Antagonistic interaction-based bipartite consensus control for heterogeneous networked systems
Wieland et al. On topology and dynamics of consensus among linear high-order agents
Gupta* et al. A sub-optimal algorithm to synthesize control laws for a network of dynamic agents
CN113110039B (en) Finite time distributed aggregation optimization method of multi-agent system
US11099528B2 (en) Process optimization by grouping mixed integer nonlinear programming constraints
CN115268275B (en) A method and system for tracking consistency of multi-agent systems based on state observers
CN116974185A (en) Multi-agent binary consistency control method, device, equipment and storage medium
Hu et al. Distributed Nash equilibrium seeking: Continuous-time control-theoretic approaches
Agarwal et al. Regret guarantees for model-based reinforcement learning with long-term average constraints
Nojoumian et al. Bumpless stabilisation of uncertain switched positive linear systems under synchronous and asynchronous switching
CN118244638A (en) Event-triggering-based nonlinear multi-agent fixed time-varying formation control method
Gao et al. Cooperative distributed model predictive control of multiple coupled linear systems
Gadjov et al. On the exact convergence to Nash equilibrium in monotone regimes under partial-information
Almeida et al. Application of predictive control to the Hegselmann‐Krause model
Zhou et al. Leaderless and leader-following consensus of heterogeneous second-order multi-agent systems on time scales: an asynchronous impulsive approach
Guan et al. Finite‐time adaptive fuzzy consensus control for nonlinear multi‐agent systems under switching‐directed topologies
Li et al. Event-triggered bipartite consensus for multi-agent systems associated with signed graphs
CN114817744B (en) Recommendation method and device based on multiple agents
Zhang et al. Input‐constrained optimal output synchronization of heterogeneous multiagent systems via observer‐based model‐free reinforcement learning
Du et al. Transfer reinforcement learning for mixed observability Markov decision processes with time-varying interval-valued parameters and its application in pandemic control
WO2025074369A1 (en) System and method for efficient collaborative marl training using tensor networks
CN117669648A (en) A dual-channel heterogeneous graph node representation learning method based on meta-path
Fu et al. Global bipartite output consensus of discrete-time heterogeneous linear systems subject to input saturation: A model-free approach

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination