CN113927596B - Width neural learning-based teleoperation limited time control method for time-varying output constraint robot - Google Patents
Width neural learning-based teleoperation limited time control method for time-varying output constraint robot Download PDFInfo
- Publication number
- CN113927596B CN113927596B CN202111167440.2A CN202111167440A CN113927596B CN 113927596 B CN113927596 B CN 113927596B CN 202111167440 A CN202111167440 A CN 202111167440A CN 113927596 B CN113927596 B CN 113927596B
- Authority
- CN
- China
- Prior art keywords
- time
- force
- robot
- width
- learning algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1602—Programme controls characterised by the control system, structure, architecture
- B25J9/161—Hardware, e.g. neural networks, fuzzy logic, interfaces, processor
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1656—Programme controls characterised by programming, planning systems for manipulators
- B25J9/1664—Programme controls characterised by programming, planning systems for manipulators characterised by motion, path, trajectory planning
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1679—Programme controls characterised by the tasks executed
- B25J9/1689—Teleoperation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2111/00—Details relating to CAD techniques
- G06F2111/04—Constraint-based CAD
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2119/00—Details relating to the type or aim of the analysis or the optimisation
- G06F2119/14—Force analysis or force optimisation, e.g. static or dynamic forces
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Computation (AREA)
- Robotics (AREA)
- Artificial Intelligence (AREA)
- Mechanical Engineering (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Automation & Control Theory (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Fuzzy Systems (AREA)
- Computer Hardware Design (AREA)
- Medical Informatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Geometry (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Feedback Control In General (AREA)
- Manipulator (AREA)
Abstract
本发明公开了一种宽度神经学习的时变输出约束机器人有限时间控制方法。基于积分障碍李雅普诺夫函数、宽度神经学习算法和有限时间理论,创新性地提出了一种基于宽度神经学习的时变输出约束有限时间控制器。直接积分障碍李雅普诺夫函数保证了系统的输出在时变边界内;宽度神经学习算法融合了传统神经网络和宽度学习的优点,基于逆动力学观测器,利用宽度学习算法解决了外力感知难题,同时消解了控制率uxj设计过程中的模型不确定性;有限时间理论保证了机器人对参考信号的高精度和快速性追踪。综上所述,该算法确保了机器人与环境间稳定、安全与高效交互,提升了遥操作系统的可靠性和高效性。
The invention discloses a time-varying output constraint robot finite time control method of width neural learning. Based on integral barrier Lyapunov function, width neural learning algorithm and finite time theory, a time-varying output constrained finite-time controller based on width neural learning is innovatively proposed. The direct integral obstacle Lyapunov function ensures that the output of the system is within the time-varying boundary; the width neural learning algorithm combines the advantages of traditional neural network and width learning, and based on the inverse dynamics observer, the width learning algorithm solves the problem of external force perception. At the same time, the model uncertainty in the design process of the control rate u xj is eliminated; the finite time theory ensures the robot's high-precision and rapid tracking of the reference signal. In summary, the algorithm ensures a stable, safe and efficient interaction between the robot and the environment, and improves the reliability and efficiency of the teleoperation system.
Description
技术领域technical field
本发明属于机器人控制技术领域,具体涉及一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法。The invention belongs to the technical field of robot control, and in particular relates to a time-varying output-constrained robot teleoperation finite-time control method based on width neural learning.
背景技术Background technique
遥操作技术充分利用人的智能和机器人的操作能力。它很大程度上延伸了人类在遥远、非结构化及危险环境中的感知和行为能力,是深空、深海和深地探测中不可或缺的关键技术。同智能机器人相比,遥操作技术充分考虑了当前智能技术的不足,如突发情况中涉及的决策问题,操作中的安全性和约束性问题。该技术将人类感知和决策能力同机器人的操作能力结合,整体增强了遥操作系统在非结构化环境下面对突发情况的处理能力,是当前最为现实的人机混合智能策略。Teleoperation technology makes full use of human intelligence and robot operation ability. It largely extends human perception and behavior capabilities in remote, unstructured and dangerous environments, and is an indispensable key technology in deep space, deep sea and deep ground exploration. Compared with intelligent robots, teleoperation technology fully considers the deficiencies of current intelligent technologies, such as decision-making issues involved in emergencies, safety and constraint issues in operations. This technology combines human perception and decision-making capabilities with robot operation capabilities, and overall enhances the teleoperation system's ability to deal with emergencies in an unstructured environment. It is currently the most realistic human-machine hybrid intelligence strategy.
遥操作系统的组成复杂,操作指令的生成机制和指令的远距离传输等不可避免给系统造成不确定时延。不确定时延严重影响系统的稳定性,并使系统的性能退化;模型不确定性同样给系统的稳定性造成威胁。与此同时,受操作时间和工作空间的限制,机器人末端执行器需要在满足物理约束的同时,在预期时间内完成操作任务。如深空探测中,受在轨操作时间的限制,机器人在执行巡检任务时(如通过狭小空间),需要在保证安全性的情况下,利用有限的时间完成在轨操作任务。The composition of the teleoperation system is complex, and the generation mechanism of operation instructions and the long-distance transmission of instructions inevitably cause uncertain delays to the system. Uncertain delay seriously affects the stability of the system and degrades the performance of the system; model uncertainty also poses a threat to the stability of the system. At the same time, limited by the operation time and work space, the robot end effector needs to complete the operation task within the expected time while satisfying the physical constraints. For example, in deep space exploration, due to the limitation of on-orbit operation time, when the robot performs inspection tasks (such as passing through a small space), it needs to use limited time to complete the on-orbit operation task under the condition of ensuring safety.
发明内容Contents of the invention
本发明解决的技术问题是:基于以上难题,本专利提出了一种时变输出受限的机器人遥操作有限时间控制方法,为遥操作系统开展实际工作提供了一种可行的方案。The technical problem solved by the present invention is: Based on the above problems, this patent proposes a time-varying output-limited robot teleoperation limited-time control method, which provides a feasible solution for the actual work of the teleoperation system.
本发明的技术方案是:一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法,其特征在于,包括以下步骤:The technical solution of the present invention is: a time-varying output-constrained robot teleoperation finite-time control method based on width neural learning, which is characterized in that it includes the following steps:
步骤1:对机械臂进行动力学建模;Step 1: Dynamically model the robotic arm;
步骤2:通过宽度神经学习算法和逆动力学观测器实现对操作者力和环境力的估计;Step 2: Realize the estimation of operator force and environment force by width neural learning algorithm and inverse dynamics observer;
步骤3:设计时变输出约束有限时间控制律,消解模型不确定影响,实现遥操作机器人的高精度追踪和快速性收敛。Step 3: Design a finite-time control law with time-varying output constraints, eliminate the influence of model uncertainty, and realize high-precision tracking and rapid convergence of the teleoperated robot.
本发明进一步的技术方案是:所述系统为一对n自由度机械臂,模型表达式为:A further technical solution of the present invention is: the system is a pair of n-degree-of-freedom mechanical arms, and the model expression is:
其中,j∈{m,s},分别为主从机器人标识。qj∈Rn×1分别为关节空间的加速度,速度和位置;/>xj∈Rn×1分别为操作空间的加速度,速度和位置;Mxj为惯性量矩阵,Cxj为离心力和哥氏力矩阵,gxj为重力项矩阵,uxj表示控制输入,Fmν=Fh表示操作者施加力,Fsν=Fe表示从端机器人和环境间的接触力。Among them, j ∈ {m, s}, respectively identify the master and slave robots. q j ∈R n×1 are the acceleration, velocity and position of the joint space respectively; /> x j ∈ R n×1 are the acceleration, velocity and position of the operating space; M xj is the inertia matrix, C xj is the centrifugal force and Coriolis force matrix, g xj is the gravity term matrix, u xj is the control input, F mν =F h represents the force exerted by the operator, and F sν =F e represents the contact force between the slave robot and the environment.
本发明进一步的技术方案是:所述步骤2包括以下子步骤:The further technical solution of the present invention is: said step 2 includes the following sub-steps:
步骤2.1:将模型线性化为:Step 2.1: Linearize the model as:
式中为线性回归矩阵,η为有关机械臂的参数向量,Θ∈Rn×1为二者乘积;In the formula is the linear regression matrix, η is the parameter vector of the manipulator, and Θ∈R n×1 is the product of the two;
步骤2.2:利用深度神经学习算法对外力评估:Step 2.2: External force evaluation using deep neural learning algorithm:
其中qj,为神经网络的输入;where q j , is the input of the neural network;
步骤2.3:基于逆动力学观测器,对机械臂的动力学模型(15)进一步线性化为Step 2.3: Based on the inverse dynamics observer, the dynamic model (15) of the manipulator is further linearized as
式中为线性回归矩阵,η为有关机械臂的参数向量,Θ∈Rn×1为二者乘积;In the formula is the linear regression matrix, η is the parameter vector of the manipulator, and Θ∈R n×1 is the product of the two;
步骤2.4:设计深度神经学习算法实现对外力进行估计:Step 2.4: Design a deep neural learning algorithm to estimate the external force:
该神经网络的输入为qj,通过宽度神经学习算法可实现对交互力的估计。The input of the neural network is q j , Estimation of the interaction force is achieved by a wide neural learning algorithm.
本发明进一步的技术方案是:所述步骤3中,包括以下子步骤:The further technical scheme of the present invention is: in described step 3, comprise following sub-step:
步骤3.1:通过设计控制器uxj,实现对参考轨迹有限时间追踪,同时要保证输出xη1在约束区域内,即/> Step 3.1: By designing the controller u xj , realize the reference trajectory Limited time tracking, while ensuring that the output x η1 is within the constraint area, ie />
主从机器人的控制律为The control law of the master-slave robot is
式(22)中:为/>的广义逆,满足如下条件In formula (22): for /> The generalized inverse of , satisfying the following conditions
式(22)中更新律选取如下:The update law in formula (22) is selected as follows:
式(22)中误差变量ej1,ej2,j∈{m,s}为In formula (22), the error variables e j1 , e j2 , j∈{m,s} are
式中:为主从端的参考轨迹,αj1为待设计的虚拟控制量。In the formula: α j1 is the virtual control quantity to be designed.
式(10)则基于操作者行为可生成主端的参考轨迹,Equation (10) can generate the reference trajectory of the master end based on the operator's behavior,
式(27)其中分别代表操作者力和环境力的估计值,式中:/>分别为主端参考轨迹的加速度、速度和位置;/>分别为操作者力估计和环境力估计的比例系数;Mr,Cr,gr分别为操作者行为的目标阻抗参数;Equation (27) where represent the estimated values of operator force and environmental force, respectively, where: /> are the acceleration, velocity and position of the host reference trajectory respectively; /> are the proportional coefficients of operator force estimation and environmental force estimation; M r , C r , g r are target impedance parameters of operator behavior;
步骤3.2:从端的参考轨迹为:Tf(t)为主端到从端的网络传输时延;设计虚拟控制量αj1:Step 3.2: The reference trajectory of the slave is: T f (t) is the network transmission delay from the host to the slave; design the virtual control quantity α j1 :
式中0<ξj<1,和γj1分别为:where 0<ξ j <1, and γ j1 are respectively:
式(22)中:分别为操作者力和机器人与环境间的接触力,kj1,kj2,χj,bj为正常数,/>为正对角阵。此处神经网络的权重向量为步骤2宽度神经学习算法得到的Wj,/> In formula (22): are the operator force and the contact force between the robot and the environment, k j1 , k j2 , χ j , b j are positive constants, /> is a positive diagonal matrix. Here the weight vector of the neural network is W j obtained by the width neural learning algorithm in step 2, />
发明效果Invention effect
本发明的技术效果在于:本发明聚焦于解决遥操作系统中存在的时延和不确定性造成系统失稳、机器人与未知环境交互的外力感知困难和机器人的操作空间和操作时间受限等问题,提出了一种基于宽度神经学习的时变输出约束机器人遥操作有限时间控制方法。其中,宽度神经学习算法将增量学习和RBF神经网络有效地结合起来,实现对操作者力和环境力的估计,同时消解了系统模型不确定性造成的负面影响;时变输出约束算法确保了机器人末端位置不会超出时变边界,提高了操作的安全性;有限时间控制方法确保了机器人轨迹的快速追踪能力。该方法勿需力传感器,系统的模型勿须精确已知,在保证系统安全操作的前提下,实现了遥操作机器人的高精度追踪和快速性控制。The technical effects of the present invention are: the present invention focuses on solving problems such as system instability caused by time delay and uncertainty in the teleoperating system, difficulty in sensing external forces when the robot interacts with an unknown environment, and limited operating space and operating time of the robot. , a finite-time control method for robot teleoperation with time-varying output constraints based on width neural learning is proposed. Among them, the width neural learning algorithm effectively combines incremental learning and RBF neural network to realize the estimation of operator force and environmental force, and at the same time eliminate the negative impact caused by the uncertainty of the system model; the time-varying output constraint algorithm ensures The end position of the robot will not exceed the time-varying boundary, which improves the safety of the operation; the limited time control method ensures the fast tracking ability of the robot trajectory. This method does not require a force sensor, and the model of the system does not need to be accurately known. On the premise of ensuring the safe operation of the system, the high-precision tracking and rapid control of the teleoperated robot are realized.
与现有技术相比,本发明具有以下优点:Compared with the prior art, the present invention has the following advantages:
(1)本发明设计一种宽度神经学习算法,实现对操作者力和环境力的估计;同时消除控制器设计中模型不确定性的影响。同传统神经网路相比,该网络可保证较高的估计准确率,同时节省大量的计算压力且不需要充足的学习条件;(1) The present invention designs a width neural learning algorithm to realize the estimation of operator force and environment force; meanwhile, it eliminates the influence of model uncertainty in controller design. Compared with the traditional neural network, the network can guarantee a higher estimation accuracy, while saving a lot of computing pressure and not requiring sufficient learning conditions;
(2)设计一种鲁棒性强的有限时间控制器,实现了机器人在操作时间受限情况下的有限时间控制。该发明具有更快的收敛速度,达到机器人与环境的高效交互的目标。(2) A robust finite-time controller is designed to realize the finite-time control of the robot when the operating time is limited. The invention has a faster convergence speed and achieves the goal of efficient interaction between the robot and the environment.
(3)本发明考虑了操作过程的安全性问题,对机器人的输出进行约束,且约束边界为时变边界,而常值边界为时变边界的特殊形式,因此该方法具有更强的实用性和实际意义。(3) The present invention considers the safety problem of the operation process, and constrains the output of the robot, and the constraint boundary is a time-varying boundary, and the constant value boundary is a special form of the time-varying boundary, so the method has stronger practicability and practical significance.
附图说明Description of drawings
图1遥操作系统控制框架图;Fig. 1 teleoperation system control frame diagram;
图2宽度神经学习算法框架;Figure 2 Width Neural Learning Algorithm Framework;
图3基于逆动力学的宽度神经学习算法;Figure 3. Width neural learning algorithm based on inverse dynamics;
图4仿真效果图(x,y,z轴为例);(a)图为x轴和y轴上的追踪效果Figure 4 Simulation effect diagram (x, y, z axis as an example); (a) The picture shows the tracking effect on the x-axis and y-axis
(b)图为z轴和操作空间上的追踪效果(b) The picture shows the tracking effect on the z-axis and the operating space
具体实施方式Detailed ways
参见图1-图4,本方法具体步骤及给出如下:Referring to Fig. 1-Fig. 4, the concrete steps of this method and are provided as follows:
步骤一:系统的动力学建模Step 1: Dynamic modeling of the system
步骤二:设计了一种宽度神经学习算法,随后基于逆动力学观测器,实现对操作者力和环境力的估计;Step 2: Design a width neural learning algorithm, and then realize the estimation of operator force and environmental force based on the inverse dynamics observer;
步骤三:设计时变输出约束有限时间控制律,消解模型不确定影响,实现遥操作机器人的高精度追踪和快速性收敛(主端同从端控制律类似,因此采用变量j∈{m,s}统一表述)Step 3: Design a finite-time control law with time-varying output constraints, eliminate the influence of model uncertainty, and realize high-precision tracking and rapid convergence of the teleoperated robot (the control law of the master side is similar to that of the slave side, so the variable j∈{m,s }unified expression)
综合以上步骤可实现遥操作系统与环境间稳定、安全和高效交互。Combining the above steps, the stable, safe and efficient interaction between the teleoperation system and the environment can be realized.
遥操作系统较复杂,为了更清晰地展开后续的论述,因此有必要对总体框架进行概要。The teleoperation system is complex, so it is necessary to summarize the general framework in order to develop the subsequent discussion more clearly.
步骤一:该系统由一对n自由度机械臂组成,在操作空间的动力学形式如下,为了简便起见,将主从端动力学统一书写为:Step 1: The system consists of a pair of n-degree-of-freedom manipulators. The dynamics in the operating space are as follows. For simplicity, the dynamics of the master and slave ends are uniformly written as:
式中:j∈{m,s},分别为主从机器人标识。qj∈Rn×1分别为关节空间的加速度,速度和位置。/>xj∈Rn×1分别为操作空间的加速度,速度和位置。Mxj为惯性量矩阵,Cxj为离心力和哥氏力矩阵,gxj为重力项矩阵,uxj表示控制输入,Fmν=Fh表示操作者施加力,Fsν=Fe表示从端机器人和环境间的接触力。In the formula: j ∈ {m, s}, respectively, the identification of master and slave robots. q j ∈R n×1 are the acceleration, velocity and position of the joint space respectively. /> x j ∈R n×1 are the acceleration, velocity and position of the operation space, respectively. M xj is the inertia matrix, C xj is the centrifugal force and Coriolis force matrix, g xj is the gravity term matrix, u xj represents the control input, F mν = F h represents the force applied by the operator, F sν = F e represents the slave robot contact with the environment.
步骤二:基于逆动力学观测器,设计了一种宽度神经学习算法对交互力(主端操作者和主端机器人间的接触力,从端机器人和环境间的接触力)进行估计。机械臂的动力学模型(15)可被线性化为Step 2: Based on the inverse dynamics observer, a width neural learning algorithm is designed to estimate the interaction force (the contact force between the master operator and the master robot, and the contact force between the slave robot and the environment). The dynamic model (15) of the manipulator can be linearized as
式中为线性回归矩阵,η为有关机械臂的参数向量,Θ∈Rn×1为二者乘积。由于无法对机械臂进行精准建模且模型本身存在一定的偏差,传统的逆动力学观测器无法实现对外力的精准估计。因此,设计深度神经学习算法实现对外力进行估计:In the formula is the linear regression matrix, η is the parameter vector of the manipulator, and Θ∈R n×1 is the product of the two. Due to the inability to accurately model the manipulator and the model itself has certain deviations, the traditional inverse dynamics observer cannot accurately estimate the external force. Therefore, a deep neural learning algorithm is designed to estimate the external force:
该神经网络的输入为qj,通过宽度神经学习算法可实现对交互力的估计。由于外部环境的变化和系统的不确定性,传统算法需要调节神经网络参数并对其重新训练,而本发明设计的宽度神经学习算法,将RBF和增量学习结合起来,通过设定阈值偏差,自适应地增加节点来得到更好的训练网络,见图2。基于逆动力学观测器,利用此宽度神经网络算法可实现交互力的估计;同时宽度神经学习算法可补偿系统模型的不确定性。基于宽度神经学习算法伪代码见表一.The input of the neural network is q j , Estimation of the interaction force is achieved by a wide neural learning algorithm. Due to the change of the external environment and the uncertainty of the system, the traditional algorithm needs to adjust the neural network parameters and retrain it, but the width neural learning algorithm designed by the present invention combines RBF and incremental learning, and by setting the threshold deviation, Adaptively increase nodes to get a better training network, see Figure 2. Based on the inverse dynamics observer, this width neural network algorithm can be used to estimate the interaction force; at the same time, the width neural learning algorithm can compensate the uncertainty of the system model. Pseudo-code of the width-based neural learning algorithm is shown in Table 1.
表一基于宽度神经的学习算法Table 1 Learning Algorithm Based on Width Neural
表格中:X是跟qj,有关的输入向量,W,β分别为RBF神经网络的权重向量和径向基向量,Am和Am+1为均为定义的模式矩阵。In the table: X is followed by q j , The relevant input vectors, W and β are the weight vector and the radial basis vector of the RBF neural network respectively, and A m and A m+1 are defined pattern matrices.
式中:D=(Am)+Hm+1,Where: D=(A m ) + H m+1 ,
式中C=Hm+1-AmD,因此权重可得:In the formula, C=H m+1 -A m D, so the weight can be obtained:
Wei=(λI+(Am)TAm)-1(Am)TY (20)W ei =(λI+(A m ) T A m ) -1 (A m ) T Y (20)
步骤三:为了保证操作的安全性,需要对机器人的位置输出进行限制,即机器人的末端位置在时变边界内。本专利采用直接障碍李雅普诺夫函数(IBLF)、宽度神经学习算法和有限时间理论,首次提出基于宽度神经学习的时变输出约束有限时间控制方法。Step 3: In order to ensure the safety of the operation, it is necessary to limit the position output of the robot, that is, the end position of the robot is within the time-varying boundary. This patent adopts the direct barrier Lyapunov function (IBLF), width neural learning algorithm and finite time theory, and proposes a time-varying output constraint finite time control method based on width neural learning for the first time.
控制目标:通过设计控制器uxj,实现对参考轨迹有限时间追踪,同时要保证输出xη1在约束区域内,即/> Control objective: by designing the controller u xj , to realize the control of the reference trajectory Limited time tracking, while ensuring that the output x η1 is within the constraint area, ie />
设计主从机器人的控制律为Design the control law of the master-slave robot as
式(22)中:为/>的广义逆,满足如下条件In formula (22): for /> The generalized inverse of , satisfying the following conditions
式(22)中更新律选取如下:The update law in formula (22) is selected as follows:
式(22)中误差变量ej1,ej2,j∈{m,s}为In formula (22), the error variables e j1 , e j2 , j∈{m,s} are
式中:为主从端的参考轨迹,αj1为待设计的虚拟控制量。In the formula: α j1 is the virtual control quantity to be designed.
式(26)则基于操作者行为可生成主端的参考轨迹,Equation (26) can generate the reference trajectory of the master end based on the operator's behavior,
式(27)其中分别代表操作者力和环境力的估计值(勿需力测量装置,在步骤一中利用宽度神经学习算法进行估计,见步骤二),式中:/>分别为主端参考轨迹的加速度、速度和位置;/>分别为操作者力估计和环境力估计的比例系数;Mr,Cr,gr分别为操作者行为的目标阻抗参数。Equation (27) where Represent the estimated values of operator force and environmental force respectively (no force measurement device is required, and the width neural learning algorithm is used to estimate in step 1, see step 2), where: /> are the acceleration, velocity and position of the host reference trajectory respectively; /> are the proportional coefficients of operator force estimation and environmental force estimation; M r , C r , and g r are the target impedance parameters of operator behavior, respectively.
从端的参考轨迹为:Tf(t)为主端到从端的网络传输时延。式(22),设计的虚拟控制量αj1 The reference trajectory of the slave end is: T f (t) is the network transmission delay from the master to the slave. Equation (22), the designed virtual control quantity α j1
式中0<ξj<1,和γj1分别为:where 0<ξ j <1, and γ j1 are respectively:
式(22)中:分别为操作者力和机器人与环境间的接触力,kj1,kj2,χj,bj为正常数,/>为正对角阵。此处神经网络的权重向量为步骤2宽度神经学习算法得到的Wj,/> In formula (22): are the operator force and the contact force between the robot and the environment, k j1 , k j2 , χ j , b j are positive constants, /> is a positive diagonal matrix. Here the weight vector of the neural network is W j obtained by the width neural learning algorithm in step 2, />
针对遥操作系统(15),选取虚拟控制量(28)、控制量(22)和更新律(24),确保了遥操作系统的闭环稳定性,同时实现机器人与环境间的安全,高效交互。For the teleoperation system (15), the virtual control quantity (28), control quantity (22) and update law (24) are selected to ensure the closed-loop stability of the teleoperation system and realize safe and efficient interaction between the robot and the environment.
系统的整体流程框架:基于时变输出约束的遥操作系统控制框架如图1所示。主端的位置信号xm(t)经通讯链路传递到从端,得到从端的参考信号随后设计基于宽度神经学习的时变输出约束有限时间控制器uxs(见步骤3),可实现从端机器人对参考信号/>的高精度和快速性追踪。同时,利用宽度神经学习算法对从端的环境力进行估计(见步骤2),并将虚拟的环境参数传递到主端。在主端,利用主端的运动信息和从端的虚拟环境参数,在主端实现从端环境力的重建。为了使操作者具备较好的力感知,主端的参考轨迹基于操作者的行为生成。随后在主端设计基于宽度神经学习的时变输出约束有限时间控制器uxm(见步骤3),实现主端机器人对主端参考信号/>的高精度和快速性追踪。(由于主端控制律uxm和从端控制律uxs形式一样,故统一表述)The overall process framework of the system: the control framework of the remote control system based on time-varying output constraints is shown in Figure 1. The position signal x m (t) of the master end is transmitted to the slave end through the communication link, and the reference signal of the slave end is obtained Then design a time-varying output-constrained finite-time controller u xs based on width neural learning (see step 3), which can realize the slave robot to the reference signal /> High precision and fast tracking. At the same time, the environmental force of the slave is estimated by using the width neural learning algorithm (see step 2), and the virtual environment parameters are passed to the master. At the master end, the reconstruction of the environment force of the slave end is realized at the master end by using the motion information of the master end and the virtual environment parameters of the slave end. In order to enable the operator to have better force perception, the reference trajectory of the master is generated based on the operator's behavior. Then, a time-varying output constraint finite-time controller u xm based on width neural learning is designed on the master end (see step 3), to realize the master-end robot’s control of the master-end reference signal /> High precision and fast tracking. (Since the master-side control law u xm and the slave-side control law u xs have the same form, they are expressed uniformly)
综上所述,本发明公开了一种宽度神经学习的时变输出约束机器人有限时间控制方法。基于积分障碍李雅普诺夫函数、宽度神经学习算法和有限时间理论,创新性地提出了一种基于宽度神经学习的时变输出约束有限时间控制器。直接积分障碍李雅普诺夫函数保证了系统的输出在时变边界内;宽度神经学习算法融合了传统神经网络和宽度学习的优点,基于逆动力学观测器,利用宽度学习算法解决了外力感知难题,同时消解了控制率uxj设计过程中的模型不确定性;有限时间理论保证了机器人对参考信号的高精度和快速性追踪。综上所述,该算法确保了机器人与环境间稳定、安全与高效交互,提升了遥操作系统的可靠性和高效性。In summary, the present invention discloses a time-varying output-constrained robot finite-time control method based on width neural learning. Based on integral barrier Lyapunov function, width neural learning algorithm and finite time theory, a time-varying output constrained finite-time controller based on width neural learning is innovatively proposed. The direct integral obstacle Lyapunov function ensures that the output of the system is within the time-varying boundary; the width neural learning algorithm combines the advantages of traditional neural network and width learning, and based on the inverse dynamics observer, the width learning algorithm solves the problem of external force perception. At the same time, the model uncertainty in the design process of the control rate u xj is eliminated; the finite time theory ensures the robot's high-precision and rapid tracking of the reference signal. In summary, the algorithm ensures a stable, safe and efficient interaction between the robot and the environment, and improves the reliability and efficiency of the teleoperation system.
Claims (1)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111167440.2A CN113927596B (en) | 2021-10-07 | 2021-10-07 | Width neural learning-based teleoperation limited time control method for time-varying output constraint robot |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111167440.2A CN113927596B (en) | 2021-10-07 | 2021-10-07 | Width neural learning-based teleoperation limited time control method for time-varying output constraint robot |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN113927596A CN113927596A (en) | 2022-01-14 |
| CN113927596B true CN113927596B (en) | 2023-08-01 |
Family
ID=79277869
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202111167440.2A Active CN113927596B (en) | 2021-10-07 | 2021-10-07 | Width neural learning-based teleoperation limited time control method for time-varying output constraint robot |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN113927596B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWI808852B (en) * | 2022-08-01 | 2023-07-11 | 崑山科技大學 | Method for stable control of six-axis robotic arm by deep learning |
| CN116476067B (en) * | 2023-05-06 | 2025-06-20 | 中国科学院深圳先进技术研究院 | Robot motion control method, device and medium |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107422639A (en) * | 2017-07-13 | 2017-12-01 | 西北工业大学 | A kind of bilateral teleoperation control method based on time delay estimadon |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105904461B (en) * | 2016-05-16 | 2017-10-31 | 西北工业大学 | A kind of adaptive remote operating control method of neutral net based on RBF |
| US11948074B2 (en) * | 2018-05-14 | 2024-04-02 | Samsung Electronics Co., Ltd. | Method and apparatus with neural network parameter quantization |
| CN109240086B (en) * | 2018-10-16 | 2020-02-28 | 浙江大学 | An Adaptive Robust Control Method for a Nonlinear Bilateral Teleoperating System |
| CN110221542B (en) * | 2019-06-04 | 2021-09-17 | 西北工业大学 | Fixed time cooperative tracking control method for second-order nonlinear multi-agent system |
| US11213953B2 (en) * | 2019-07-26 | 2022-01-04 | Google Llc | Efficient robot control based on inputs from remote client devices |
| US12263590B2 (en) * | 2019-10-29 | 2025-04-01 | Sanctuary Cognitive Systems Corporation | Machine vision parsing of three-dimensional environments employing neural networks |
| US20210178600A1 (en) * | 2019-12-12 | 2021-06-17 | Mitsubishi Electric Research Laboratories, Inc. | System and Method for Robust Optimization for Trajectory-Centric ModelBased Reinforcement Learning |
| CN111241747B (en) * | 2020-01-13 | 2023-07-25 | 佛山科学技术学院 | Method, system, computer equipment and storage medium for solving inverse kinematics of mechanical arm |
| CN111482966B (en) * | 2020-05-14 | 2022-08-09 | 重庆邮电大学 | Force and position control method of robot force sense remote control system |
| CN112060088A (en) * | 2020-08-28 | 2020-12-11 | 华中科技大学 | A telemanipulation method for precise capture of non-cooperative targets under variable delay conditions |
| CN112809666B (en) * | 2020-12-17 | 2023-04-28 | 安徽工业大学 | 5-DOF mechanical arm strength position tracking algorithm based on neural network |
| CN112668190B (en) * | 2020-12-30 | 2024-03-15 | 长安大学 | A three-finger dexterous hand controller construction method, system, equipment and storage medium |
-
2021
- 2021-10-07 CN CN202111167440.2A patent/CN113927596B/en active Active
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107422639A (en) * | 2017-07-13 | 2017-12-01 | 西北工业大学 | A kind of bilateral teleoperation control method based on time delay estimadon |
Also Published As
| Publication number | Publication date |
|---|---|
| CN113927596A (en) | 2022-01-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| He et al. | Admittance-based controller design for physical human–robot interaction in the constrained task space | |
| He et al. | Reinforcement learning control of a flexible two-link manipulator: An experimental investigation | |
| Jin et al. | Novel joint-drift-free scheme at acceleration level for robotic redundancy resolution with tracking error theoretically eliminated | |
| CN112276954B (en) | Multi-joint mechanical arm impedance control method based on limited time output state limitation | |
| CN115256386B (en) | Neural Adaptive Control Method for Uncertain Manipulators Considering Tracking Error Constraints | |
| CN107662208A (en) | Flexible joint mechanical arm finite time self-adaptive backstepping control method based on neural network | |
| CN109085749A (en) | A kind of non-linear remote operating bilateral control method based on adaptive fuzzy inverting | |
| CN113927596B (en) | Width neural learning-based teleoperation limited time control method for time-varying output constraint robot | |
| CN113199477B (en) | A Reinforcement Learning-Based Trajectory Tracking Control Method for Baxter Manipulator | |
| Xia et al. | Advanced motion control of hydraulic manipulator with precise compensation of dynamic friction | |
| CN115338871B (en) | A constrained adaptive robust control method and system for a two-degree-of-freedom robotic arm | |
| Chen et al. | Adaptive stiffness visual servoing for unmanned aerial manipulators with prescribed performance | |
| Hu et al. | Impedance with Finite‐Time Control Scheme for Robot‐Environment Interaction | |
| CN114527664A (en) | Self-adaptive tracking control method of dynamic uncertainty system with asymmetric time lag | |
| Wei et al. | Asymmetric time‐varying BLF‐based model‐free hybrid force/position control for SEA‐based 2‐DOF manipulator | |
| Wu et al. | A hierarchical data-driven predictive control of image-based visual servoing systems with unknown dynamics | |
| CN116551682A (en) | Coordinated Active Disturbance Rejection Control Method for Composite Robot Arms Based on Optimum Operating Region | |
| CN114179089B (en) | Robust region tracking control method for mechanical arm | |
| Zhou et al. | AI based robot safe learning and control | |
| Hua et al. | Analysis and Design for Networked Teleoperation System | |
| CN114488791B (en) | A teleoperation event-triggered fixed-time control method based on operator intention understanding | |
| CN118636144A (en) | A control method of dual-joint serial robot arm based on composite fuzzy learning | |
| Xia et al. | A new fault tolerant strategy using adaptive time delay estimation for robot manipulators with actuator faults | |
| Han et al. | An Adaptive Fuzzy Control Model for Multi-Joint Manipulators. | |
| Qin et al. | Robotic Arm Trajectory Tracking Control Based on An RBF Neural Network Adaptive Control Algorithm |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |