CN113927596A

CN113927596A - Time-varying output constraint robot teleoperation finite time control method based on width neural learning

Info

Publication number: CN113927596A
Application number: CN202111167440.2A
Authority: CN
Inventors: 黄攀峰; 李陇南; 马志强; 鹿振宇; 常海涛; 陈海飞
Original assignee: Northwestern Polytechnical University
Current assignee: Northwestern Polytechnical University
Priority date: 2021-10-07
Filing date: 2021-10-07
Publication date: 2022-01-14
Anticipated expiration: 2041-10-07
Also published as: CN113927596B

Abstract

The invention discloses a limited-time control method for a time-varying output-constrained robot with width neural learning. Based on the integral barrier Lyapunov function, breadth neural learning algorithm and finite time theory, a time-varying output constraint finite time controller based on breadth neural learning is innovatively proposed. The direct integration obstacle Lyapunov function ensures that the output of the system is within the time-varying boundary; the breadth neural learning algorithm combines the advantages of traditional neural networks and breadth learning, based on the inverse dynamics observer, and uses the breadth learning algorithm to solve the external force perception problem, At the same time, the model uncertainty in the design process of the control rate u _xj is eliminated; the finite time theory ensures the high precision and fast tracking of the reference signal by the robot. In summary, the algorithm ensures the stable, safe and efficient interaction between the robot and the environment, and improves the reliability and efficiency of the teleoperating system.

Description

Time-varying output constraint robot teleoperation finite time control method based on width neural learning

Technical Field

The invention belongs to the technical field of robot control, and particularly relates to a time-varying output constraint robot teleoperation limited time control method based on width neural learning.

Background

Teleoperation technology makes full use of human intelligence and the operational capabilities of robots. The method greatly extends the perception and behavior ability of human beings in remote, unstructured and dangerous environments, and is an indispensable key technology in deep space, deep sea and deep ground exploration. Compared with an intelligent robot, the teleoperation technology fully considers the defects of the current intelligent technology, such as decision problems involved in emergency situations and safety and constraint problems in operation. The technology combines human perception and decision-making capability with the operation capability of the robot, integrally enhances the processing capability of a teleoperation system in the case of emergency under an unstructured environment, and is the most practical human-computer hybrid intelligent strategy at present.

The composition of the remote operation system is complex, and the generation mechanism of the operation instruction, the remote transmission of the instruction and the like cannot avoid causing uncertain time delay to the system. The uncertain time delay seriously affects the stability of the system and degrades the performance of the system; model uncertainty also poses a threat to the stability of the system. At the same time, due to the constraints of operation time and working space, the robotic end effector needs to complete an operation task within a desired time while satisfying physical constraints. If in deep space exploration, the robot is limited by the on-orbit operation time, and when the robot executes an inspection task (such as through a narrow space), the on-orbit operation task needs to be completed within a limited time under the condition of ensuring the safety.

Disclosure of Invention

The technical problem solved by the invention is as follows: based on the difficult problems, the patent provides a time-varying output-limited robot teleoperation limited time control method, and a feasible scheme is provided for a teleoperation system to carry out actual work.

The technical scheme of the invention is as follows: a time-varying output constraint robot teleoperation finite time control method based on width neural learning is characterized by comprising the following steps:

step 1: performing dynamic modeling on the mechanical arm;

step 2: estimating the force of an operator and the environmental force by a width neural learning algorithm and an inverse dynamics observer;

and step 3: and designing a time-varying output constraint finite time control law, eliminating uncertain influence of a model, and realizing high-precision tracking and rapid convergence of the teleoperation robot.

The further technical scheme of the invention is as follows: the system is a pair of n-degree-of-freedom mechanical arms, and the model expression is as follows:

wherein j belongs to { m, s }, and is respectively a master-slave robot identifier.

q_j∈R^n×1Acceleration, velocity and position of the joint space, respectively;

x_j∈R^n×1acceleration, velocity and position of the operating space, respectively; m_xjIs a matrix of inertia quantities, C_xjIs a matrix of centrifugal and Coriolis forces, g_xjIs a gravity term matrix, u_xjRepresenting a control input, F_mν＝F_hIndicating operator applied force, F_sν＝F_eRepresenting the contact force between the slave robot and the environment.

The further technical scheme of the invention is as follows: the step 2 comprises the following substeps:

step 2.1: the model was linearized as:

in the formula

Is a linear regression matrix, eta is a parameter vector related to the mechanical arm, theta belongs to R^n×1Is the product of the two;

step 2.2: evaluating external force by using a deep neural learning algorithm:

wherein q is_j,

Is an input to the neural network;

step 2.3: based on the inverse dynamics observer, the dynamics model (15) of the mechanical arm is further linearized

In the formula

step 2.4: the external force is estimated by designing a deep neural learning algorithm:

the input of the neural network is q_j,

The estimation of the interaction force can be realized through a width neural learning algorithm.

The further technical scheme of the invention is as follows: the step 3 comprises the following substeps:

step 3.1: by designing the controller u_xjRealize the reference track

Tracking for a limited time while guaranteeing an output x_η1Within a restricted area, i.e.

The control law of the master-slave robot is

In formula (22):

is composed of

Satisfies the following conditions

The update law in equation (22) is selected as follows:

error variable e in equation (22)_j1,e_j2J is as { m, s }

In the formula:

is a reference track of the master and slave terminals, alpha_j1Is a virtual control quantity to be designed.

Equation (10) then generates a reference trajectory for the master based on the operator behavior,

formula (27) wherein

Representing estimates of operator force and environmental force, respectively, wherein:

acceleration, speed and position of the main end reference track respectively;

scaling factors for the operator force estimate and the environmental force estimate, respectively; m_r,C_r,g_rTarget impedance parameters for operator behavior, respectively;

step 3.2: the reference trajectories of the slave end are:

T_f(t) network transmission delay from the master to the slave; designing a virtual control quantity alpha_j1：

In the formula, 0 < xi_j＜1，

And gamma_j1Respectively as follows:

in formula (22):

respectively operator force and contact force between robot and environment, k_j1,k_j2,χ_j,b_jIs a normal number, and is,

is opposite to the angular array. The weight vector of the neural network is W obtained by the step 2 width neural learning algorithm_j，

Effects of the invention

The invention has the technical effects that: the invention focuses on solving the problems of system instability caused by time delay and uncertainty in a teleoperation system, difficulty in external force perception of interaction between a robot and an unknown environment, limited operation space and operation time of the robot and the like, and provides a time-varying output constraint robot teleoperation limited time control method based on width neural learning. The wide neural learning algorithm effectively combines incremental learning and an RBF neural network, realizes the estimation of operator force and environmental force, and simultaneously resolves the negative influence caused by the uncertainty of a system model; the time-varying output constraint algorithm ensures that the tail end position of the robot does not exceed a time-varying boundary, and the operation safety is improved; the limited time control method ensures the rapid tracking capability of the robot track. The method does not need a force sensor, the model of the system does not need to be known accurately, and the high-precision tracking and the rapidity control of the teleoperation robot are realized on the premise of ensuring the safe operation of the system.

Compared with the prior art, the invention has the following advantages:

(1) the invention designs a width neural learning algorithm to realize the estimation of operator force and environmental force; while eliminating the effect of model uncertainty in the controller design. Compared with the traditional neural network, the network can ensure higher estimation accuracy, simultaneously saves a large amount of calculation pressure and does not need sufficient learning conditions;

(2) a limited time controller with strong robustness is designed, and limited time control of the robot under the condition that the operation time is limited is realized. The invention has faster convergence speed and achieves the aim of efficient interaction between the robot and the environment.

(3) The invention considers the safety problem of the operation process, constrains the output of the robot, and the constraint boundary is a time-varying boundary, while the constant boundary is a special form of the time-varying boundary, so the method has stronger practicability and practical significance.

Drawings

FIG. 1 is a diagram of a teleoperational system control framework;

FIG. 2 a breadth neural learning algorithm framework;

FIG. 3 is a broad neural learning algorithm based on inverse dynamics;

FIG. 4 is a simulation effect diagram (x, y, z axes are examples); (a) illustrating the tracking effect on the x-axis and y-axis

(b) Illustrating the tracking effect on the z-axis and the operation space

Detailed Description

Referring to fig. 1-4, the method comprises the following steps:

the method comprises the following steps: dynamic modeling of a system

Step two: designing a width neural learning algorithm, and then estimating the force of an operator and the environmental force based on an inverse dynamics observer;

step three: designing a time-varying output constraint finite time control law, eliminating uncertain influence of a model, realizing high-precision tracking and rapid convergence of the teleoperation robot (the master end is similar to the slave end control law, so that a variable j is in accordance with { m, s } in a unified way)

The steps are integrated, so that stable, safe and efficient interaction between the teleoperation system and the environment can be realized.

Teleoperation systems are complex and an overview of the overall framework is necessary to more clearly expand the subsequent discussion.

The method comprises the following steps: the system consists of a pair of n-degree-of-freedom mechanical arms, the dynamics form in an operation space is as follows, and for the sake of simplicity, the dynamics of a master end and a slave end are uniformly written as follows:

in the formula: j belongs to { m, s }, and is respectively a master-slave robot identifier.

q_j∈R^n×1Acceleration, velocity and position of the joint space, respectively.

x_j∈R^n×1Acceleration, velocity and position of the operating space, respectively. M_xjIs a matrix of inertia quantities, C_xjIs a matrix of centrifugal and Coriolis forces, g_xjIs a gravity term matrix, u_xjRepresenting a control input, F_mν＝F_hIndicating operator applied force, F_sν＝F_eRepresenting the contact force between the slave robot and the environment.

Step two: based on an inverse dynamics observer, a wide neural learning algorithm is designed to estimate the interaction force (the contact force between a main-end operator and a main-end robot, and the contact force between a slave-end robot and the environment). The dynamic model (15) of the mechanical arm can be linearized

In the formula

Is a linear regression matrix, eta is a parameter vector related to the mechanical arm, theta belongs to R^n×1Is the product of the two. Because the mechanical arm cannot be accurately modeled and the model has certain deviation, the traditional inverse dynamics observer cannot realize accurate estimation of the external force. Therefore, designing a deep neural learning algorithm enables estimation of external forces:

the input of the neural network is q_j,

The estimation of the interaction force can be realized through a width neural learning algorithm. Due to the change of the external environment and the uncertainty of the system, the traditional algorithm needs to adjust and retrain the neural network parameters, but the wide neural learning algorithm designed by the invention combines RBF and incremental learning, and obtains a better training network by setting threshold deviation and increasing nodes in a self-adaptive manner, as shown in figure 2. Based on an inverse dynamics observer, the estimation of the interaction force can be realized by utilizing the wide neural network algorithm; while the broad neural learning algorithm can compensate for uncertainty in the system model. The pseudo code is shown in table one based on the width neural learning algorithm.

TABLE-learning algorithm based on breadth nerve

In the table: x is heel q_j,

Related input vector, W, beta are weight vector and radial basis of RBF neural network respectivelyVector, A^mAnd A^m+1Are all defined mode matrices.

In the formula: d ═ a^m)⁺H_m+1，

Wherein C is H_m+1-A^mD, so the weight can be:

W_ei＝(λI+(A^m)^TA^m)^-1(A^m)^TY (20)

step three: in order to ensure the safety of the operation, the position output of the robot needs to be limited, namely the end position of the robot is within the time-varying boundary. The method adopts direct barrier Lyapunov function (IBLF), a breadth neural learning algorithm and a finite time theory, and firstly provides a time-varying output constraint finite time control method based on breadth neural learning.

A control target: by designing the controller u_xjRealize the reference track

The control law of master-slave robot is designed as

In formula (22):

is composed of

Satisfies the following conditions

The update law in equation (22) is selected as follows:

error variable e in equation (22)_j1,e_j2J is as { m, s }

In the formula:

Equation (26) then generates a reference trajectory for the master based on the operator behavior,

formula (27) wherein

Representing the estimated values of the operator force and the environmental force, respectively (without force measuring means, estimated in step one using a neuro-learning algorithm, see step two), in which:

acceleration, speed and position of the main end reference track respectively;

scaling factors for the operator force estimate and the environmental force estimate, respectively; m_r,C_r,g_rRespectively, target impedance parameters for operator behavior.

The reference trajectories of the slave end are:

T_fand (t) is the network transmission delay from the master end to the slave end. Equation (22), designed virtual control quantity α_j1

In the formula, 0 < xi_j＜1，

And gamma_j1Respectively as follows:

in formula (22):

is opposite to the angular array. Here the weight vector of the neural network is step 2 width neurologyW obtained by learning algorithm_j，

Aiming at the teleoperation system (15), virtual control quantity (28), control quantity (22) and updating law (24) are selected, so that the closed-loop stability of the teleoperation system is ensured, and meanwhile, the safe and efficient interaction between the robot and the environment is realized.

Overall process framework of the system: a teleoperational system control framework based on time-varying output constraints is shown in fig. 1. Position signal x of the master_m(t) transmitting to the slave end through the communication link to obtain the reference signal of the slave end

Then designing a time-varying output constraint finite time controller u based on the width neural learning_xs(see step 3), the reference signal can be realized by the slave robot

High precision and fast tracking. Meanwhile, the environment force of the slave end is estimated by using a width neural learning algorithm (see step 2), and virtual environment parameters are transmitted to the master end. And at the master end, reconstructing the environment force of the slave end at the master end by utilizing the motion information of the master end and the virtual environment parameters of the slave end. In order to provide better force perception for the operator, the reference trajectory of the main terminal is generated based on the behavior of the operator. Then designing a time-varying output constraint finite time controller u based on the width neural learning at the main end_xm(see step 3), realize the master robot to the master reference signal

High precision and fast tracking. (due to master control law u_xmAnd slave control law u_xsThe same form, so unify the expression)

In conclusion, the invention discloses a finite time control method of a time-varying output constraint robot for width neural learning. Lyapunov function based on integral obstacle and breadth neural learningAn algorithm and a finite time theory creatively provide a time-varying output constraint finite time controller based on the width neural learning. The direct integral barrier Lyapunov function ensures that the output of the system is in a time-varying boundary; the width neural learning algorithm integrates the advantages of the traditional neural network and the width learning, based on the inverse dynamics observer, solves the external force perception problem by using the width learning algorithm, and simultaneously resolves the control rate u_xjModel uncertainty in the design process; the finite time theory ensures high precision and rapidity tracking of the reference signal by the robot. In conclusion, the algorithm ensures stable, safe and efficient interaction between the robot and the environment, and improves the reliability and the efficiency of the teleoperation system.

Claims

1. a time-varying output constraint robot teleoperation limited-time control method based on breadth neural learning, is characterized in that, comprises the following steps:

Step 1: Dynamic modeling of the robotic arm;

Step 2: Realize the estimation of operator force and environmental force through the breadth neural learning algorithm and inverse dynamics observer;

Step 3: Design a time-varying output constraint finite-time control law to eliminate the uncertainty of the model and achieve high-precision tracking and rapid convergence of the teleoperated robot.

2. a kind of time-varying output constraint robot teleoperation limited-time control method based on width neural learning as claimed in claim 1, is characterized in that, described system is a pair of n-degree-of-freedom mechanical arms, and the model expression is:

Among them, j∈{m,s}, respectively, the master and slave robot identification.

are the acceleration, velocity and position of the joint space, respectively;

are the acceleration, velocity and position of the operating space, respectively; M _xj is the inertia matrix, C _xj is the centrifugal force and Coriolis force matrix, g _xj is the gravity term matrix, u _xj represents the control input, F _mν = F _h represents the operator exerted Force, F _sν = _Fe represents the contact force between the slave robot and the environment.

3. a kind of time-varying output constraint robot teleoperation limited time control method based on breadth neural learning as claimed in claim 1 or 2, is characterized in that, described step 2 comprises the following sub-steps:

Step 2.1: Linearize the model as:

in the formula

is the linear regression matrix, η is the parameter vector about the manipulator, Θ∈R ^n×1 is the product of the two;

Step 2.2: Use deep neural learning algorithms to evaluate external forces:

in

is the input of the neural network;

Step 2.3: Based on the inverse dynamics observer, the dynamic model (1) of the manipulator is further linearized as

in the formula

Step 2.4: Design a deep neural learning algorithm to estimate the external force:

The input to this neural network is

The estimation of the interaction force can be achieved by a breadth neural learning algorithm.

4. a kind of time-varying output constraint robot teleoperation limited time control method based on breadth neural learning as claimed in claim 1, is characterized in that, in described step 3, comprises the following sub-steps:

Step 3.1: By designing the controller u _xj , realize the reference trajectory

Finite time tracking, while ensuring that the output x _η1 is within the constraint region, that is

The control law of the master-slave robot is

In formula (8):

for

The generalized inverse of , which satisfies the following conditions

The update law in formula (8) is selected as follows:

In formula (8), the error variables e _j1 , e _j2 , j∈{m,s} are

where:

It is the reference trajectory of the master and slave, and α _j1 is the virtual control quantity to be designed.

Equation (12) can generate the reference trajectory of the master based on the operator's behavior,

Formula (13) where

represent the estimated values of operator force and environmental force, respectively, where:

are the acceleration, velocity and position of the reference trajectory of the main terminal, respectively;

are the proportional coefficients of operator force estimation and environmental force estimation, respectively _; Mr , _Cr , and _gr are the target impedance parameters of operator behavior;

Step 3.2: The reference trajectory of the slave is:

T _f (t) network transmission delay from master to slave; design virtual control quantity α _j1 :

where 0 < ξ _j < 1,

and γ _j1 are:

In formula (8):

j∈{m,s} are the operator force and the contact force between the robot and the environment, respectively, k _j1 , k _j2 , χ _j , b _j are positive numbers,

is a positive diagonal matrix. Here, the weight vector of the neural network is W _j obtained by the width neural learning algorithm in step 2,