CN113927596B

CN113927596B - Width neural learning-based teleoperation limited time control method for time-varying output constraint robot

Info

Publication number: CN113927596B
Application number: CN202111167440.2A
Authority: CN
Inventors: 黄攀峰; 李陇南; 马志强; 鹿振宇; 常海涛; 陈海飞
Original assignee: Northwestern Polytechnical University
Current assignee: Northwestern Polytechnical University
Priority date: 2021-10-07
Filing date: 2021-10-07
Publication date: 2023-08-01
Anticipated expiration: 2041-10-07
Also published as: CN113927596A

Abstract

The invention discloses a time-varying output constraint robot finite time control method of width neural learning. Based on integral barrier Lyapunov function, width neural learning algorithm and finite time theory, a time-varying output constrained finite-time controller based on width neural learning is innovatively proposed. The direct integral obstacle Lyapunov function ensures that the output of the system is within the time-varying boundary; the width neural learning algorithm combines the advantages of traditional neural network and width learning, and based on the inverse dynamics observer, the width learning algorithm solves the problem of external force perception. At the same time, the model uncertainty in the design process of the control rate u _xj is eliminated; the finite time theory ensures the robot's high-precision and rapid tracking of the reference signal. In summary, the algorithm ensures a stable, safe and efficient interaction between the robot and the environment, and improves the reliability and efficiency of the teleoperation system.

Description

A time-varying output-constrained robot teleoperation based on width neural learning time control method

技术领域technical field

本发明属于机器人控制技术领域，具体涉及一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法。The invention belongs to the technical field of robot control, and in particular relates to a time-varying output-constrained robot teleoperation finite-time control method based on width neural learning.

背景技术Background technique

遥操作技术充分利用人的智能和机器人的操作能力。它很大程度上延伸了人类在遥远、非结构化及危险环境中的感知和行为能力，是深空、深海和深地探测中不可或缺的关键技术。同智能机器人相比，遥操作技术充分考虑了当前智能技术的不足，如突发情况中涉及的决策问题，操作中的安全性和约束性问题。该技术将人类感知和决策能力同机器人的操作能力结合，整体增强了遥操作系统在非结构化环境下面对突发情况的处理能力，是当前最为现实的人机混合智能策略。Teleoperation technology makes full use of human intelligence and robot operation ability. It largely extends human perception and behavior capabilities in remote, unstructured and dangerous environments, and is an indispensable key technology in deep space, deep sea and deep ground exploration. Compared with intelligent robots, teleoperation technology fully considers the deficiencies of current intelligent technologies, such as decision-making issues involved in emergencies, safety and constraint issues in operations. This technology combines human perception and decision-making capabilities with robot operation capabilities, and overall enhances the teleoperation system's ability to deal with emergencies in an unstructured environment. It is currently the most realistic human-machine hybrid intelligence strategy.

遥操作系统的组成复杂，操作指令的生成机制和指令的远距离传输等不可避免给系统造成不确定时延。不确定时延严重影响系统的稳定性，并使系统的性能退化；模型不确定性同样给系统的稳定性造成威胁。与此同时，受操作时间和工作空间的限制，机器人末端执行器需要在满足物理约束的同时，在预期时间内完成操作任务。如深空探测中，受在轨操作时间的限制，机器人在执行巡检任务时(如通过狭小空间)，需要在保证安全性的情况下，利用有限的时间完成在轨操作任务。The composition of the teleoperation system is complex, and the generation mechanism of operation instructions and the long-distance transmission of instructions inevitably cause uncertain delays to the system. Uncertain delay seriously affects the stability of the system and degrades the performance of the system; model uncertainty also poses a threat to the stability of the system. At the same time, limited by the operation time and work space, the robot end effector needs to complete the operation task within the expected time while satisfying the physical constraints. For example, in deep space exploration, due to the limitation of on-orbit operation time, when the robot performs inspection tasks (such as passing through a small space), it needs to use limited time to complete the on-orbit operation task under the condition of ensuring safety.

发明内容Contents of the invention

本发明解决的技术问题是：基于以上难题，本专利提出了一种时变输出受限的机器人遥操作有限时间控制方法，为遥操作系统开展实际工作提供了一种可行的方案。The technical problem solved by the present invention is: Based on the above problems, this patent proposes a time-varying output-limited robot teleoperation limited-time control method, which provides a feasible solution for the actual work of the teleoperation system.

本发明的技术方案是：一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法，其特征在于，包括以下步骤：The technical solution of the present invention is: a time-varying output-constrained robot teleoperation finite-time control method based on width neural learning, which is characterized in that it includes the following steps:

步骤1：对机械臂进行动力学建模；Step 1: Dynamically model the robotic arm;

步骤2：通过宽度神经学习算法和逆动力学观测器实现对操作者力和环境力的估计；Step 2: Realize the estimation of operator force and environment force by width neural learning algorithm and inverse dynamics observer;

步骤3：设计时变输出约束有限时间控制律，消解模型不确定影响，实现遥操作机器人的高精度追踪和快速性收敛。Step 3: Design a finite-time control law with time-varying output constraints, eliminate the influence of model uncertainty, and realize high-precision tracking and rapid convergence of the teleoperated robot.

本发明进一步的技术方案是：所述系统为一对n自由度机械臂，模型表达式为：A further technical solution of the present invention is: the system is a pair of n-degree-of-freedom mechanical arms, and the model expression is:

其中，j∈{m,s},分别为主从机器人标识。q_j∈R^n×1分别为关节空间的加速度，速度和位置；/>x_j∈R^n×1分别为操作空间的加速度，速度和位置；M_xj为惯性量矩阵，C_xj为离心力和哥氏力矩阵，g_xj为重力项矩阵，u_xj表示控制输入，F_mν＝F_h表示操作者施加力，F_sν＝F_e表示从端机器人和环境间的接触力。Among them, j ∈ {m, s}, respectively identify the master and slave robots. q _j ∈R ^n×1 are the acceleration, velocity and position of the joint space respectively; /> x _j ∈ R ^n×1 are the acceleration, velocity and position of the operating space; M _xj is the inertia matrix, C _xj is the centrifugal force and Coriolis force matrix, g _xj is the gravity term matrix, u _xj is the control input, F _mν =F _h represents the force exerted by the operator, and F _sν =F _e represents the contact force between the slave robot and the environment.

本发明进一步的技术方案是：所述步骤2包括以下子步骤：The further technical solution of the present invention is: said step 2 includes the following sub-steps:

步骤2.1：将模型线性化为：Step 2.1: Linearize the model as:

式中为线性回归矩阵，η为有关机械臂的参数向量，Θ∈R^n×1为二者乘积；In the formula is the linear regression matrix, η is the parameter vector of the manipulator, and Θ∈R ^n×1 is the product of the two;

步骤2.2：利用深度神经学习算法对外力评估：Step 2.2: External force evaluation using deep neural learning algorithm:

其中q_j,为神经网络的输入；where q _j , is the input of the neural network;

步骤2.3：基于逆动力学观测器，对机械臂的动力学模型(15)进一步线性化为Step 2.3: Based on the inverse dynamics observer, the dynamic model (15) of the manipulator is further linearized as

步骤2.4：设计深度神经学习算法实现对外力进行估计：Step 2.4: Design a deep neural learning algorithm to estimate the external force:

该神经网络的输入为q_j,通过宽度神经学习算法可实现对交互力的估计。The input of the neural network is q _j , Estimation of the interaction force is achieved by a wide neural learning algorithm.

本发明进一步的技术方案是：所述步骤3中，包括以下子步骤：The further technical scheme of the present invention is: in described step 3, comprise following sub-step:

步骤3.1：通过设计控制器u_xj,实现对参考轨迹有限时间追踪，同时要保证输出x_η1在约束区域内，即/> Step 3.1: By designing the controller u _xj , realize the reference trajectory Limited time tracking, while ensuring that the output x _η1 is within the constraint area, ie />

主从机器人的控制律为The control law of the master-slave robot is

式(22)中：为/>的广义逆，满足如下条件In formula (22): for /> The generalized inverse of , satisfying the following conditions

式(22)中更新律选取如下：The update law in formula (22) is selected as follows:

式(22)中误差变量e_j1,e_j2,j∈{m,s}为In formula (22), the error variables e _j1 , e _j2 , j∈{m,s} are

式中：为主从端的参考轨迹，α_j1为待设计的虚拟控制量。In the formula: α _j1 is the virtual control quantity to be designed.

式(10)则基于操作者行为可生成主端的参考轨迹，Equation (10) can generate the reference trajectory of the master end based on the operator's behavior,

式(27)其中分别代表操作者力和环境力的估计值，式中：/>分别为主端参考轨迹的加速度、速度和位置；/>分别为操作者力估计和环境力估计的比例系数；M_r,C_r,g_r分别为操作者行为的目标阻抗参数；Equation (27) where represent the estimated values of operator force and environmental force, respectively, where: /> are the acceleration, velocity and position of the host reference trajectory respectively; /> are the proportional coefficients of operator force estimation and environmental force estimation; M _r , C _r , g _r are target impedance parameters of operator behavior;

步骤3.2：从端的参考轨迹为：T_f(t)为主端到从端的网络传输时延；设计虚拟控制量α_j1：Step 3.2: The reference trajectory of the slave is: T _f (t) is the network transmission delay from the host to the slave; design the virtual control quantity α _j1 :

式中0＜ξ_j＜1，和γ_j1分别为：where 0<ξ _j <1, and γ _j1 are respectively:

式(22)中：分别为操作者力和机器人与环境间的接触力，k_j1,k_j2,χ_j,b_j为正常数，/>为正对角阵。此处神经网络的权重向量为步骤2宽度神经学习算法得到的W_j，/> In formula (22): are the operator force and the contact force between the robot and the environment, k _j1 , k _j2 , χ _j , b _j are positive constants, /> is a positive diagonal matrix. Here the weight vector of the neural network is W _j obtained by the width neural learning algorithm in step 2, />

发明效果Invention effect

本发明的技术效果在于：本发明聚焦于解决遥操作系统中存在的时延和不确定性造成系统失稳、机器人与未知环境交互的外力感知困难和机器人的操作空间和操作时间受限等问题，提出了一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法。其中，宽度神经学习算法将增量学习和RBF神经网络有效地结合起来，实现对操作者力和环境力的估计，同时消解了系统模型不确定性造成的负面影响；时变输出约束算法确保了机器人末端位置不会超出时变边界，提高了操作的安全性；有限时间控制方法确保了机器人轨迹的快速追踪能力。该方法勿需力传感器，系统的模型勿须精确已知，在保证系统安全操作的前提下，实现了遥操作机器人的高精度追踪和快速性控制。The technical effects of the present invention are: the present invention focuses on solving problems such as system instability caused by time delay and uncertainty in the teleoperating system, difficulty in sensing external forces when the robot interacts with an unknown environment, and limited operating space and operating time of the robot. , a finite-time control method for robot teleoperation with time-varying output constraints based on width neural learning is proposed. Among them, the width neural learning algorithm effectively combines incremental learning and RBF neural network to realize the estimation of operator force and environmental force, and at the same time eliminate the negative impact caused by the uncertainty of the system model; the time-varying output constraint algorithm ensures The end position of the robot will not exceed the time-varying boundary, which improves the safety of the operation; the limited time control method ensures the fast tracking ability of the robot trajectory. This method does not require a force sensor, and the model of the system does not need to be accurately known. On the premise of ensuring the safe operation of the system, the high-precision tracking and rapid control of the teleoperated robot are realized.

与现有技术相比，本发明具有以下优点：Compared with the prior art, the present invention has the following advantages:

(1)本发明设计一种宽度神经学习算法，实现对操作者力和环境力的估计；同时消除控制器设计中模型不确定性的影响。同传统神经网路相比，该网络可保证较高的估计准确率，同时节省大量的计算压力且不需要充足的学习条件；(1) The present invention designs a width neural learning algorithm to realize the estimation of operator force and environment force; meanwhile, it eliminates the influence of model uncertainty in controller design. Compared with the traditional neural network, the network can guarantee a higher estimation accuracy, while saving a lot of computing pressure and not requiring sufficient learning conditions;

(2)设计一种鲁棒性强的有限时间控制器，实现了机器人在操作时间受限情况下的有限时间控制。该发明具有更快的收敛速度，达到机器人与环境的高效交互的目标。(2) A robust finite-time controller is designed to realize the finite-time control of the robot when the operating time is limited. The invention has a faster convergence speed and achieves the goal of efficient interaction between the robot and the environment.

(3)本发明考虑了操作过程的安全性问题，对机器人的输出进行约束，且约束边界为时变边界，而常值边界为时变边界的特殊形式，因此该方法具有更强的实用性和实际意义。(3) The present invention considers the safety problem of the operation process, and constrains the output of the robot, and the constraint boundary is a time-varying boundary, and the constant value boundary is a special form of the time-varying boundary, so the method has stronger practicability and practical significance.

附图说明Description of drawings

图1遥操作系统控制框架图；Fig. 1 teleoperation system control frame diagram;

图2宽度神经学习算法框架；Figure 2 Width Neural Learning Algorithm Framework;

图3基于逆动力学的宽度神经学习算法；Figure 3. Width neural learning algorithm based on inverse dynamics;

图4仿真效果图(x,y,z轴为例)；(a)图为x轴和y轴上的追踪效果Figure 4 Simulation effect diagram (x, y, z axis as an example); (a) The picture shows the tracking effect on the x-axis and y-axis

(b)图为z轴和操作空间上的追踪效果(b) The picture shows the tracking effect on the z-axis and the operating space

具体实施方式Detailed ways

参见图1-图4，本方法具体步骤及给出如下：Referring to Fig. 1-Fig. 4, the concrete steps of this method and are provided as follows:

步骤一：系统的动力学建模Step 1: Dynamic modeling of the system

步骤二：设计了一种宽度神经学习算法，随后基于逆动力学观测器，实现对操作者力和环境力的估计；Step 2: Design a width neural learning algorithm, and then realize the estimation of operator force and environmental force based on the inverse dynamics observer;

步骤三：设计时变输出约束有限时间控制律，消解模型不确定影响，实现遥操作机器人的高精度追踪和快速性收敛(主端同从端控制律类似，因此采用变量j∈{m,s}统一表述)Step 3: Design a finite-time control law with time-varying output constraints, eliminate the influence of model uncertainty, and realize high-precision tracking and rapid convergence of the teleoperated robot (the control law of the master side is similar to that of the slave side, so the variable j∈{m,s }unified expression)

综合以上步骤可实现遥操作系统与环境间稳定、安全和高效交互。Combining the above steps, the stable, safe and efficient interaction between the teleoperation system and the environment can be realized.

遥操作系统较复杂，为了更清晰地展开后续的论述，因此有必要对总体框架进行概要。The teleoperation system is complex, so it is necessary to summarize the general framework in order to develop the subsequent discussion more clearly.

步骤一：该系统由一对n自由度机械臂组成，在操作空间的动力学形式如下，为了简便起见，将主从端动力学统一书写为：Step 1: The system consists of a pair of n-degree-of-freedom manipulators. The dynamics in the operating space are as follows. For simplicity, the dynamics of the master and slave ends are uniformly written as:

式中：j∈{m,s},分别为主从机器人标识。q_j∈R^n×1分别为关节空间的加速度，速度和位置。/>x_j∈R^n×1分别为操作空间的加速度，速度和位置。M_xj为惯性量矩阵，C_xj为离心力和哥氏力矩阵，g_xj为重力项矩阵，u_xj表示控制输入，F_mν＝F_h表示操作者施加力，F_sν＝F_e表示从端机器人和环境间的接触力。In the formula: j ∈ {m, s}, respectively, the identification of master and slave robots. q _j ∈R ^n×1 are the acceleration, velocity and position of the joint space respectively. /> x _j ∈R ^n×1 are the acceleration, velocity and position of the operation space, respectively. M _xj is the inertia matrix, C _xj is the centrifugal force and Coriolis force matrix, g _xj is the gravity term matrix, u _xj represents the control input, F _mν = F _h represents the force applied by the operator, F _sν = F _e represents the slave robot contact with the environment.

步骤二：基于逆动力学观测器，设计了一种宽度神经学习算法对交互力(主端操作者和主端机器人间的接触力，从端机器人和环境间的接触力)进行估计。机械臂的动力学模型(15)可被线性化为Step 2: Based on the inverse dynamics observer, a width neural learning algorithm is designed to estimate the interaction force (the contact force between the master operator and the master robot, and the contact force between the slave robot and the environment). The dynamic model (15) of the manipulator can be linearized as

式中为线性回归矩阵，η为有关机械臂的参数向量，Θ∈R^n×1为二者乘积。由于无法对机械臂进行精准建模且模型本身存在一定的偏差，传统的逆动力学观测器无法实现对外力的精准估计。因此，设计深度神经学习算法实现对外力进行估计：In the formula is the linear regression matrix, η is the parameter vector of the manipulator, and Θ∈R ^n×1 is the product of the two. Due to the inability to accurately model the manipulator and the model itself has certain deviations, the traditional inverse dynamics observer cannot accurately estimate the external force. Therefore, a deep neural learning algorithm is designed to estimate the external force:

该神经网络的输入为q_j,通过宽度神经学习算法可实现对交互力的估计。由于外部环境的变化和系统的不确定性，传统算法需要调节神经网络参数并对其重新训练，而本发明设计的宽度神经学习算法，将RBF和增量学习结合起来，通过设定阈值偏差，自适应地增加节点来得到更好的训练网络，见图2。基于逆动力学观测器，利用此宽度神经网络算法可实现交互力的估计；同时宽度神经学习算法可补偿系统模型的不确定性。基于宽度神经学习算法伪代码见表一.The input of the neural network is q _j , Estimation of the interaction force is achieved by a wide neural learning algorithm. Due to the change of the external environment and the uncertainty of the system, the traditional algorithm needs to adjust the neural network parameters and retrain it, but the width neural learning algorithm designed by the present invention combines RBF and incremental learning, and by setting the threshold deviation, Adaptively increase nodes to get a better training network, see Figure 2. Based on the inverse dynamics observer, this width neural network algorithm can be used to estimate the interaction force; at the same time, the width neural learning algorithm can compensate the uncertainty of the system model. Pseudo-code of the width-based neural learning algorithm is shown in Table 1.

表一基于宽度神经的学习算法Table 1 Learning Algorithm Based on Width Neural

表格中：X是跟q_j,有关的输入向量，W,β分别为RBF神经网络的权重向量和径向基向量，A^m和A^m+1为均为定义的模式矩阵。In the table: X is followed by q _j , The relevant input vectors, W and β are the weight vector and the radial basis vector of the RBF neural network respectively, and A ^m and A ^m+1 are defined pattern matrices.

式中：D＝(A^m)⁺H_m+1，Where: D=(A ^m ) ⁺ H _m+1 ,

式中C＝H_m+1-A^mD，因此权重可得：In the formula, C=H _m+1 -A ^m D, so the weight can be obtained:

W_ei＝(λI+(A^m)^TA^m)^-1(A^m)^TY (20)W _ei =(λI+(A ^m ) ^T A ^m ) ^-1 (A ^m ) ^T Y (20)

步骤三：为了保证操作的安全性，需要对机器人的位置输出进行限制，即机器人的末端位置在时变边界内。本专利采用直接障碍李雅普诺夫函数(IBLF)、宽度神经学习算法和有限时间理论，首次提出基于宽度神经学习的时变输出约束有限时间控制方法。Step 3: In order to ensure the safety of the operation, it is necessary to limit the position output of the robot, that is, the end position of the robot is within the time-varying boundary. This patent adopts the direct barrier Lyapunov function (IBLF), width neural learning algorithm and finite time theory, and proposes a time-varying output constraint finite time control method based on width neural learning for the first time.

控制目标：通过设计控制器u_xj,实现对参考轨迹有限时间追踪，同时要保证输出x_η1在约束区域内，即/> Control objective: by designing the controller u _xj , to realize the control of the reference trajectory Limited time tracking, while ensuring that the output x _η1 is within the constraint area, ie />

设计主从机器人的控制律为Design the control law of the master-slave robot as

式(26)则基于操作者行为可生成主端的参考轨迹，Equation (26) can generate the reference trajectory of the master end based on the operator's behavior,

式(27)其中分别代表操作者力和环境力的估计值(勿需力测量装置，在步骤一中利用宽度神经学习算法进行估计，见步骤二)，式中：/>分别为主端参考轨迹的加速度、速度和位置；/>分别为操作者力估计和环境力估计的比例系数；M_r,C_r,g_r分别为操作者行为的目标阻抗参数。Equation (27) where Represent the estimated values of operator force and environmental force respectively (no force measurement device is required, and the width neural learning algorithm is used to estimate in step 1, see step 2), where: /> are the acceleration, velocity and position of the host reference trajectory respectively; /> are the proportional coefficients of operator force estimation and environmental force estimation; M _r , C _r , and g _r are the target impedance parameters of operator behavior, respectively.

从端的参考轨迹为：T_f(t)为主端到从端的网络传输时延。式(22)，设计的虚拟控制量α_j1 The reference trajectory of the slave end is: T _f (t) is the network transmission delay from the master to the slave. Equation (22), the designed virtual control quantity α _j1

针对遥操作系统(15)，选取虚拟控制量(28)、控制量(22)和更新律(24)，确保了遥操作系统的闭环稳定性，同时实现机器人与环境间的安全，高效交互。For the teleoperation system (15), the virtual control quantity (28), control quantity (22) and update law (24) are selected to ensure the closed-loop stability of the teleoperation system and realize safe and efficient interaction between the robot and the environment.

系统的整体流程框架：基于时变输出约束的遥操作系统控制框架如图1所示。主端的位置信号x_m(t)经通讯链路传递到从端，得到从端的参考信号随后设计基于宽度神经学习的时变输出约束有限时间控制器u_xs(见步骤3)，可实现从端机器人对参考信号/>的高精度和快速性追踪。同时，利用宽度神经学习算法对从端的环境力进行估计(见步骤2)，并将虚拟的环境参数传递到主端。在主端，利用主端的运动信息和从端的虚拟环境参数，在主端实现从端环境力的重建。为了使操作者具备较好的力感知，主端的参考轨迹基于操作者的行为生成。随后在主端设计基于宽度神经学习的时变输出约束有限时间控制器u_xm(见步骤3)，实现主端机器人对主端参考信号/>的高精度和快速性追踪。(由于主端控制律u_xm和从端控制律u_xs形式一样，故统一表述)The overall process framework of the system: the control framework of the remote control system based on time-varying output constraints is shown in Figure 1. The position signal x _m (t) of the master end is transmitted to the slave end through the communication link, and the reference signal of the slave end is obtained Then design a time-varying output-constrained finite-time controller u _xs based on width neural learning (see step 3), which can realize the slave robot to the reference signal /> High precision and fast tracking. At the same time, the environmental force of the slave is estimated by using the width neural learning algorithm (see step 2), and the virtual environment parameters are passed to the master. At the master end, the reconstruction of the environment force of the slave end is realized at the master end by using the motion information of the master end and the virtual environment parameters of the slave end. In order to enable the operator to have better force perception, the reference trajectory of the master is generated based on the operator's behavior. Then, a time-varying output constraint finite-time controller u _xm based on width neural learning is designed on the master end (see step 3), to realize the master-end robot’s control of the master-end reference signal /> High precision and fast tracking. (Since the master-side control law u _xm and the slave-side control law u _xs have the same form, they are expressed uniformly)

综上所述，本发明公开了一种宽度神经学习的时变输出约束机器人有限时间控制方法。基于积分障碍李雅普诺夫函数、宽度神经学习算法和有限时间理论，创新性地提出了一种基于宽度神经学习的时变输出约束有限时间控制器。直接积分障碍李雅普诺夫函数保证了系统的输出在时变边界内；宽度神经学习算法融合了传统神经网络和宽度学习的优点，基于逆动力学观测器，利用宽度学习算法解决了外力感知难题，同时消解了控制率u_xj设计过程中的模型不确定性；有限时间理论保证了机器人对参考信号的高精度和快速性追踪。综上所述，该算法确保了机器人与环境间稳定、安全与高效交互，提升了遥操作系统的可靠性和高效性。In summary, the present invention discloses a time-varying output-constrained robot finite-time control method based on width neural learning. Based on integral barrier Lyapunov function, width neural learning algorithm and finite time theory, a time-varying output constrained finite-time controller based on width neural learning is innovatively proposed. The direct integral obstacle Lyapunov function ensures that the output of the system is within the time-varying boundary; the width neural learning algorithm combines the advantages of traditional neural network and width learning, and based on the inverse dynamics observer, the width learning algorithm solves the problem of external force perception. At the same time, the model uncertainty in the design process of the control rate u _xj is eliminated; the finite time theory ensures the robot's high-precision and rapid tracking of the reference signal. In summary, the algorithm ensures a stable, safe and efficient interaction between the robot and the environment, and improves the reliability and efficiency of the teleoperation system.

Claims

1. A teleoperation limited time control method of a time-varying output constraint robot based on width neural learning is characterized by comprising the following steps:

step 1: carrying out dynamic modeling on the mechanical arm; the mechanical arm is a pair of mechanical arms with n degrees of freedom, and the model expression is as follows:

wherein j is { m, s, is the master-slave robot identity respectively;acceleration, speed and position of joint space respectively; />Acceleration, speed and position of the operation space respectively; m is M _xj As an inertial mass matrix, C _xj For centrifugal and Golgi force matrices, g _xj As a gravity term matrix, u _xj Representing a control input;

step 2: estimating the force of an operator and the environmental force through a width neural learning algorithm and an inverse dynamics observer;

step 2.1: linearizing the model to:

in the method, in the process of the invention,is a linear regression matrix, eta is a parameter vector related to the mechanical arm, and theta epsilon R ^n×1 Is the product of the two;

step 2.2: external force assessment using a deep neural learning algorithm:

wherein the method comprises the steps ofIs an input to the neural network;

step 2.3: based on the inverse dynamics observer, the dynamics model (1) of the mechanical arm is further linearized into:

in the middle ofIs a linear regression matrix, eta is a parameter vector related to the mechanical arm, and theta epsilon R ^n×1 Is the product of the two;

step 2.4: the deep neural learning algorithm is designed to realize the estimation of external force:

the input of the neural network isThe interactive force can be estimated through a width neural learning algorithm;

step 3: and designing a time-varying output constraint finite time control law, resolving the uncertain influence of the model, and realizing high-precision tracking and rapid convergence of the teleoperation robot.

Step 3.1: by designing the controller u _xj Realize the reference trackLimited time tracking while guaranteeing output x _η1 Within the restricted area, i.e.)>

The control law of the master-slave robot is as follows:

in formula (6):the operator force and the contact force between the robot and the environment, respectively +.>Is->Is generalized inverse of (1), satisfies the following conditions

The update law in equation (6) is selected as follows:

error variable e in formula (6) _j1 ,e _j2 J epsilon { m, s } is

Wherein:alpha is the reference track of the master and slave ends _j1 The virtual control quantity to be designed;

a reference trajectory of the master may be generated based on operator behavior,

in (10) whereinRepresenting an estimate of the operator force and the ambient force, respectively, wherein: />Acceleration, speed and position of the main end reference track respectively; />Scaling factors for the operator force estimate and the ambient force estimate, respectively; mr, cr and gr are respectively target impedance parameters of the operator behavior;

step 3.2: the reference track of the slave end is:tf (t) is the network transmission delay from the master end to the slave end; designing virtual control quantity alpha _j1 ：

Wherein, the xi j is more than 0 and less than 1,and γj1 are respectively:

wherein: k (k) _j1 ,k _j2 ,χ _j ,b _j Is a normal number of times, and the number of times is equal to the normal number,is a right-angle array; the weight vector of the neural network is W obtained by the step 2 wide neural learning algorithm _j ，/>