[go: up one dir, main page]

CN116604546A - Industrial mechanical arm control method, device, equipment and storage medium - Google Patents

Industrial mechanical arm control method, device, equipment and storage medium Download PDF

Info

Publication number
CN116604546A
CN116604546A CN202310049146.4A CN202310049146A CN116604546A CN 116604546 A CN116604546 A CN 116604546A CN 202310049146 A CN202310049146 A CN 202310049146A CN 116604546 A CN116604546 A CN 116604546A
Authority
CN
China
Prior art keywords
industrial
mechanical arm
optimal
system model
equation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310049146.4A
Other languages
Chinese (zh)
Inventor
鲁仁全
郭子杰
李鸿一
程志键
任鸿儒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Peng Cheng Laboratory
Original Assignee
Guangdong University of Technology
Peng Cheng Laboratory
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology, Peng Cheng Laboratory filed Critical Guangdong University of Technology
Priority to CN202310049146.4A priority Critical patent/CN116604546A/en
Publication of CN116604546A publication Critical patent/CN116604546A/en
Pending legal-status Critical Current

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1602Programme controls characterised by the control system, structure, architecture
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1602Programme controls characterised by the control system, structure, architecture
    • B25J9/1605Simulation of manipulator lay-out, design, modelling of manipulator
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1656Programme controls characterised by programming, planning systems for manipulators
    • B25J9/1664Programme controls characterised by programming, planning systems for manipulators characterised by motion, path, trajectory planning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Automation & Control Theory (AREA)
  • Feedback Control In General (AREA)

Abstract

The invention discloses a control method, a device, equipment and a storage medium for an industrial mechanical arm, which relate to the technical field of intelligent control, wherein the method comprises the following steps: acquiring physical characteristic data and preset performance indexes of an industrial mechanical arm; constructing a constrained system model of the industrial mechanical arm according to the physical characteristic data and the preset performance index; performing equivalent system conversion on the constrained system model to obtain an unconstrained system model; constructing a Hamiltonian-Jacobian-Belman equation according to the unconstrained system model and an optimal control strategy of the industrial mechanical arm; and solving a Hamiltonian-Jacobian-Bellman equation by a self-adaptive dynamic programming method to obtain an optimal control law so as to control the industrial mechanical arm according to the optimal control law. The invention solves the problem that the existing industrial mechanical arm control method can not realize optimal control while meeting the constraint of the preset performance index, and realizes high-precision control of the industrial mechanical arm.

Description

工业机械臂控制方法、装置、设备及存储介质Industrial mechanical arm control method, device, equipment and storage medium

技术领域technical field

本发明涉及智能控制技术领域,尤其涉及一种工业机械臂控制方法、装置、设备及存储介质。The present invention relates to the technical field of intelligent control, in particular to an industrial mechanical arm control method, device, equipment and storage medium.

背景技术Background technique

随着技术的不断发展及生产效率与操作精度的不断提高,工业机械臂已被广泛应用于对轨迹跟踪精度有较高要求的机械加工、装配及焊接等行业。为了提高工业机械臂的控制精度,通常需要对性能指标提出要求,因此,在控制器设计过程中结合预设性能指标进行控制具有重要意义。最优控制是一类考虑系统控制性能和节能效应的控制策略,工业机械臂的运动过程属于高度耦合的非线性系统,这给传统的最优控制方法带来了巨大的挑战。With the continuous development of technology and the continuous improvement of production efficiency and operation accuracy, industrial robotic arms have been widely used in machining, assembly, welding and other industries that have high requirements for trajectory tracking accuracy. In order to improve the control accuracy of industrial manipulators, it is usually necessary to put forward requirements for performance indicators. Therefore, it is of great significance to combine preset performance indicators in the process of controller design for control. Optimal control is a control strategy that considers system control performance and energy-saving effects. The motion process of industrial manipulators is a highly coupled nonlinear system, which brings great challenges to traditional optimal control methods.

因此,设计一种满足预设性能指标约束且能实现最优控制的工业机械臂控制器是亟待解决的问题。Therefore, it is an urgent problem to design an industrial manipulator controller that satisfies the preset performance index constraints and can achieve optimal control.

发明内容Contents of the invention

本发明的主要目的在于:提供一种工业机械臂控制方法、装置、设备及存储介质,旨在解决现有工业机械臂控制方法无法在满足预设性能指标约束的同时实现最优控制的技术问题。The main purpose of the present invention is to provide an industrial manipulator control method, device, equipment and storage medium, aiming to solve the technical problem that the existing industrial manipulator control method cannot achieve optimal control while satisfying the preset performance index constraints .

为实现上述目的,本发明采用如下技术方案:To achieve the above object, the present invention adopts the following technical solutions:

第一方面,本发明提供了一种工业机械臂控制方法,所述方法包括:In a first aspect, the present invention provides a method for controlling an industrial robot arm, the method comprising:

获取工业机械臂的物理特性数据和预设性能指标;Obtain physical characteristic data and preset performance indicators of industrial robotic arms;

根据所述物理特性数据和所述预设性能指标,构建所述工业机械臂的受约束系统模型;Constructing a constrained system model of the industrial manipulator according to the physical characteristic data and the preset performance index;

对所述受约束系统模型进行等价系统转换,得到无约束系统模型;performing an equivalent system conversion on the constrained system model to obtain an unconstrained system model;

根据所述无约束系统模型和所述工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程;Constructing the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator;

通过自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据所述最优控制律对所述工业机械臂进行控制。The Hamilton-Jacobi-Bellman equation is solved by an adaptive dynamic programming method to obtain an optimal control law, so as to control the industrial mechanical arm according to the optimal control law.

可选地,上述工业机械臂控制方法中,所述根据所述物理特性数据和所述预设性能指标,构建所述工业机械臂的受约束系统模型的步骤包括:Optionally, in the above method for controlling the industrial manipulator, the step of constructing the constrained system model of the industrial manipulator according to the physical characteristic data and the preset performance index includes:

根据所述物理特性数据,构建所述工业机械臂的状态空间方程;Constructing a state space equation of the industrial manipulator according to the physical characteristic data;

根据所述预设性能指标,定义预设性能函数;defining a preset performance function according to the preset performance index;

根据所述状态空间方程和所述预设性能函数,得到所述受约束系统模型。The constrained system model is obtained according to the state space equation and the preset performance function.

可选地,上述工业机械臂控制方法中,所述根据所述物理特性数据,构建所述工业机械臂的状态空间方程的步骤包括:Optionally, in the above method for controlling the industrial manipulator, the step of constructing the state space equation of the industrial manipulator according to the physical characteristic data includes:

对所述物理特性数据进行建模,得到所述工业机械臂的动力学模型;Modeling the physical characteristic data to obtain a dynamic model of the industrial mechanical arm;

对所述动力学模型进行转换,得到所述工业机械臂的状态空间方程。The dynamic model is converted to obtain the state space equation of the industrial mechanical arm.

可选地,上述工业机械臂控制方法中,所述根据所述无约束系统模型和所述工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程的步骤包括:Optionally, in the above method for controlling the industrial manipulator, the step of constructing the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator includes:

根据所述无约束系统模型和所述工业机械臂的最优控制策略,定义代价函数;Define a cost function according to the unconstrained system model and the optimal control strategy of the industrial manipulator;

根据所述代价函数,构建哈密顿-雅克比-贝尔曼方程。According to the cost function, the Hamilton-Jacobi-Bellman equation is constructed.

可选地,上述工业机械臂控制方法中,所述根据所述无约束系统模型和所述工业机械臂的最优控制策略,定义代价函数的步骤包括:Optionally, in the above method for controlling an industrial manipulator, the step of defining a cost function according to the unconstrained system model and the optimal control strategy of the industrial manipulator includes:

根据所述无约束系统模型,定义位置跟踪误差和混合误差;According to the unconstrained system model, define position tracking error and mixing error;

根据所述位置跟踪误差和所述混合误差,得到误差向量;Obtain an error vector according to the position tracking error and the mixed error;

根据所述误差向量和所述工业机械臂的最优控制策略,定义代价函数。A cost function is defined according to the error vector and the optimal control strategy of the industrial manipulator.

可选地,上述工业机械臂控制方法中,所述根据所述代价函数,构建哈密顿-雅克比-贝尔曼方程的步骤包括:Optionally, in the above method for controlling an industrial manipulator, the step of constructing the Hamilton-Jacobi-Bellman equation according to the cost function includes:

根据所述代价函数,定义哈密顿函数和最优代价函数;According to the cost function, define a Hamiltonian function and an optimal cost function;

利用贝尔曼最优原则求解所述最优代价函数,得到所述最优代价函数的最优解;Solving the optimal cost function by using Bellman's optimal principle to obtain the optimal solution of the optimal cost function;

将所述最优解代入所述哈密顿函数,得到所述哈密顿-雅克比-贝尔曼方程。The optimal solution is substituted into the Hamiltonian function to obtain the Hamilton-Jacobi-Bellman equation.

可选地,上述工业机械臂控制方法中,所述通过自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律的步骤包括:Optionally, in the above method for controlling an industrial manipulator, the step of solving the Hamilton-Jacobi-Bellman equation by an adaptive dynamic programming method to obtain an optimal control law includes:

采用基于神经网络架构的自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律。An adaptive dynamic programming method based on a neural network architecture is used to solve the Hamilton-Jacobi-Bellman equation to obtain an optimal control law.

第二方面,本发明提供了一种工业机械臂控制装置,所述装置包括:In a second aspect, the present invention provides an industrial robotic arm control device, the device comprising:

数据获取模块,用于获取工业机械臂的物理特性数据和预设性能指标;The data acquisition module is used to acquire the physical characteristic data and preset performance indicators of the industrial mechanical arm;

模型构建模块,用于根据所述物理特性数据和所述预设性能指标,构建所述工业机械臂的受约束系统模型;A model construction module, configured to construct a constrained system model of the industrial manipulator according to the physical characteristic data and the preset performance index;

系统转换模块,用于对所述受约束系统模型进行等价系统转换,得到无约束系统模型;A system conversion module, configured to perform equivalent system conversion on the constrained system model to obtain an unconstrained system model;

方程构建模块,用于根据所述无约束系统模型和所述工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程;An equation construction module, configured to construct the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator;

最优控制模块,用于通过自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据所述最优控制律对所述工业机械臂进行控制。The optimal control module is used to solve the Hamilton-Jacobi-Bellman equation through an adaptive dynamic programming method to obtain an optimal control law, so as to control the industrial mechanical arm according to the optimal control law.

第三方面,本发明提供了一种工业机械臂控制设备,所述设备包括处理器和存储器,所述存储器中存储有工业机械臂控制程序,所述工业机械臂控制程序被所述处理器执行时,实现如上述的工业机械臂控制方法。In a third aspect, the present invention provides an industrial manipulator control device, the device includes a processor and a memory, an industrial manipulator control program is stored in the memory, and the industrial manipulator control program is executed by the processor , realize the above-mentioned industrial manipulator control method.

第四方面,本发明提供了一种计算机可读存储介质,所述存储介质上存储有计算机程序,所述计算机程序被一个或多个处理器执行时,实现如上述的工业机械臂控制方法。In a fourth aspect, the present invention provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by one or more processors, the above-mentioned method for controlling an industrial robot arm is realized.

本发明提供的上述一个或多个技术方案,可以具有如下优点或至少实现了如下技术效果:The above one or more technical solutions provided by the present invention may have the following advantages or at least achieve the following technical effects:

本发明提出的一种工业机械臂控制方法、装置、设备及存储介质,通过根据工业机械臂的物理特性数据和预设性能指标构建工业机械臂的受约束系统模型,对受约束系统模型进行等价系统转换,得到无约束系统模型,再根据无约束系统模型和工业机械臂的最优控制策略构建哈密顿-雅克比-贝尔曼方程,通过自适应动态规划方法求解哈密顿-雅克比-贝尔曼方程,得到最优控制律,从而根据最优控制律对工业机械臂进行控制,在保证预设性能指标的基础上进行最优控制,实现了对工业机械臂的高精度控制;本发明的方法可以使得工业机械臂的系统输出有效跟踪参考信号,并使得跟踪误差等满足预设要求,提高了控制精度,还具有节能效应。An industrial manipulator control method, device, equipment, and storage medium proposed by the present invention construct a constrained system model of an industrial manipulator based on the physical characteristic data and preset performance indicators of the industrial manipulator, and perform an equalization process on the constrained system model. Then, the unconstrained system model is obtained, and the Hamilton-Jacobi-Bellman equation is constructed according to the unconstrained system model and the optimal control strategy of the industrial manipulator, and the Hamilton-Jacobi-Bellman equation is solved by the adaptive dynamic programming method. Mann equation to obtain the optimal control law, thereby controlling the industrial mechanical arm according to the optimal control law, and performing optimal control on the basis of ensuring the preset performance index, and realizing high-precision control of the industrial mechanical arm; The method can enable the system of the industrial manipulator to output an effective tracking reference signal, and make the tracking error and the like meet preset requirements, improve the control accuracy, and also have an energy-saving effect.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. Those skilled in the art can also obtain other drawings according to the provided drawings without creative work.

图1为本发明工业机械臂控制方法的流程示意图;Fig. 1 is the schematic flow sheet of industrial manipulator control method of the present invention;

图2为本发明涉及的工业机械臂控制设备的硬件结构示意图;Fig. 2 is a schematic diagram of the hardware structure of the industrial manipulator control device involved in the present invention;

图3为本发明实施例一中单连杆机械臂的物理特性示意图;3 is a schematic diagram of the physical characteristics of the single-link robotic arm in Embodiment 1 of the present invention;

图4为本发明实施例一中评价网络权重的收敛曲线图;FIG. 4 is a graph of convergence curves for evaluating network weights in Embodiment 1 of the present invention;

图5为本发明实施例一中参考信号yd与系统状态x1的曲线图;5 is a graph of reference signal yd and system state x1 in Embodiment 1 of the present invention;

图6为本发明实施例一中参考信号与系统状态x2的曲线图;Figure 6 is a reference signal in Embodiment 1 of the present invention plot against system state x2 ;

图7为本发明实施例一中跟踪误差与预设性能界线的曲线图;FIG. 7 is a graph of tracking error and preset performance boundary in Embodiment 1 of the present invention;

图8为本发明工业机械臂控制装置的功能模块示意图。Fig. 8 is a schematic diagram of the functional modules of the industrial robot control device of the present invention.

本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization of the purpose of the present invention, functional characteristics and advantages will be further described in conjunction with the embodiments and with reference to the accompanying drawings.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例只是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动的前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only the present invention. Some, but not all, embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

需要说明,在本发明中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。另外,在本发明中,使用用于表示元件的诸如“模块”、“部件”或“单元”的后缀仅为了有利于本发明的说明,其本身没有特定的意义。因此,“模块”、“部件”或“单元”可以混合地使用。对于本领域的普通技术人员而言,可以根据具体情况理解上述术语在本发明中的具体含义。另外,各个实施例的技术方案可以相互结合,但是,是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时,应当认为这种技术方案的结合不存在,也不在本发明要求的保护范围之内。It should be noted that in the present invention, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion, so that a process, method, article or system comprising a series of elements includes not only those elements, but also Other elements not expressly listed, or inherent to the process, method, article, or system are also included. Without further limitations, an element defined by the phrase "comprising..." does not preclude the presence of additional identical elements in the process, method, article or system comprising the element. In addition, in the present invention, use of suffixes such as 'module', 'part' or 'unit' for denoting elements is only for facilitating description of the present invention and has no specific meaning by itself. Therefore, 'module', 'part' or 'unit' may be used in combination. Those of ordinary skill in the art can understand the specific meanings of the above terms in the present invention according to specific situations. In addition, the technical solutions of various embodiments can be combined with each other, but it is based on the realization of those skilled in the art. When the combination of technical solutions is contradictory or cannot be realized, it should be considered that the combination of technical solutions does not exist. Also not within the scope of protection required by the present invention.

对现有技术的分析发现,快速响应、高精度的位置跟踪控制一直以来都是工业机械臂的研究热点,目前,大多数对工业机械臂控制方法都停留在保证跟踪误差渐进收敛的阶段,存在反应速度慢,超调量过高等问题,为进一步提高工业机械臂的控制精度,通常需要对收敛速率、最大超调量、稳态误差等性能指标提出要求。因此,在控制器设计过程中结合预设性能指标进行控制具有重要意义。The analysis of the existing technology found that fast response and high-precision position tracking control has always been a research hotspot of industrial manipulators. At present, most of the control methods for industrial manipulators are still in the stage of ensuring the gradual convergence of tracking errors. In order to further improve the control accuracy of industrial manipulators due to problems such as slow response speed and high overshoot, it is usually necessary to put forward requirements on performance indicators such as convergence rate, maximum overshoot, and steady-state error. Therefore, it is of great significance to control with preset performance indicators in the process of controller design.

此外,在关注工业机械臂高精度控制的同时,工业界也对其能耗提出了更高的要求。降低工业机械臂的控制成本,减少对能源的消耗,对当前工业发展来说尤为重要。最优控制是一类考虑系统控制性能和节能效应的控制策略。研究表明,工业机械臂的运动过程属于高度耦合的非线性系统,这给传统的最优控制方法带来了巨大的挑战。In addition, while paying attention to the high-precision control of industrial manipulators, the industry has also put forward higher requirements for its energy consumption. It is particularly important for the current industrial development to reduce the control cost of industrial robotic arms and reduce energy consumption. Optimal control is a kind of control strategy that considers system control performance and energy-saving effect. Studies have shown that the motion process of industrial manipulators is a highly coupled nonlinear system, which brings great challenges to traditional optimal control methods.

针对强非线性系统的最优控制问题的解决方案有自适应动态规划方法。基于自适应动态规划设计技术,所得到的最优控制器在保证被控系统稳定的同时,还能使系统性能达到最优。然而,目前来说,对于工业机械臂的高精度跟踪控制问题,设计满足预设性能指标约束的最优控制器仍然是个亟待解决的问题。There are adaptive dynamic programming methods for solutions to optimal control problems of strongly nonlinear systems. Based on the adaptive dynamic programming design technology, the obtained optimal controller not only ensures the stability of the controlled system, but also optimizes the performance of the system. However, at present, for the high-precision tracking control problem of industrial manipulators, it is still an urgent problem to design an optimal controller that satisfies the constraints of preset performance indicators.

鉴于现有技术中工业机械臂控制方法无法在满足预设性能指标约束的同时实现最优控制的技术问题,本发明提供了一种工业机械臂控制方法,总体思路如下:In view of the technical problem that the industrial manipulator control method in the prior art cannot achieve optimal control while satisfying the preset performance index constraints, the present invention provides an industrial manipulator control method, the general idea is as follows:

获取工业机械臂的物理特性数据和预设性能指标;根据物理特性数据和预设性能指标,构建工业机械臂的受约束系统模型;对受约束系统模型进行等价系统转换,得到无约束系统模型;根据无约束系统模型和工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程;通过自适应动态规划方法求解哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据最优控制律对工业机械臂进行控制。Obtain the physical characteristic data and preset performance indicators of the industrial manipulator; construct the constrained system model of the industrial manipulator according to the physical characteristic data and preset performance indicators; perform equivalent system conversion on the constrained system model to obtain the unconstrained system model ; According to the unconstrained system model and the optimal control strategy of the industrial manipulator, the Hamilton-Jacobi-Bellman equation is constructed; the optimal control law is obtained by solving the Hamilton-Jacobi-Bellman equation through the adaptive dynamic programming method, In order to control the industrial manipulator according to the optimal control law.

通过上述技术方案,在保证预设性能指标的基础上进行最优控制,实现了对工业机械臂的高精度控制;本发明的方法可以使得工业机械臂的系统输出有效跟踪参考信号,并使得跟踪误差等满足预设要求,提高了控制精度,还具有节能效应。Through the above technical scheme, optimal control is carried out on the basis of ensuring the preset performance indicators, and high-precision control of the industrial manipulator is realized; the method of the present invention can make the system of the industrial manipulator output an effective tracking reference signal, and make the tracking The error and the like meet the preset requirements, which improves the control accuracy and also has energy-saving effects.

下面结合附图,通过具体的实施例和实施方式对本发明提供的工业机械臂控制方法、装置、设备及存储介质进行详细说明。The industrial manipulator control method, device, equipment and storage medium provided by the present invention will be described in detail below through specific embodiments and implementations with reference to the accompanying drawings.

实施例一Embodiment one

参照图1的流程示意图,提出本发明工业机械臂控制方法的第一实施例,该工业机械臂控制方法应用于工业机械臂控制设备。Referring to the schematic flowchart of FIG. 1 , a first embodiment of the method for controlling an industrial robot arm of the present invention is proposed, and the method for controlling an industrial robot arm is applied to an industrial robot arm control device.

工业机械臂控制设备是指能够实现数据传输的终端设备或控制设备,可以是手机、电脑、嵌入式工控机等终端设备,也可以是位于系统内的控制器、处理器等控制设备。Industrial robotic arm control equipment refers to terminal equipment or control equipment that can realize data transmission, which can be terminal equipment such as mobile phones, computers, embedded industrial computers, etc., or control equipment such as controllers and processors located in the system.

如图2所示,为工业机械臂控制设备的硬件结构示意图。工业机械臂控制设备可以包括:处理器1001,例如CPU(Central Processing Unit,中央处理器),通信总线1002,用户接口1003,网络接口1004,存储器1005。As shown in Fig. 2, it is a schematic diagram of the hardware structure of the control equipment of the industrial manipulator. The industrial robot arm control device may include: a processor 1001 , such as a CPU (Central Processing Unit, central processing unit), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 .

具体的,通信总线1002用于实现这些组件之间的连接通信;用户接口1003用于连接客户端,与客户端进行数据通信,用户接口1003可以包括输出单元,如显示屏、输入单元,如键盘;网络接口1004用于连接后台服务器,与后台服务器进行数据通信,网络接口1004可以包括输入/输出接口,比如标准的有线接口、无线接口,如Wi-Fi接口;存储器1005用于存储各种类型的数据,这些数据例如可以包括该工业机械臂控制设备中任何应用程序或方法的指令,以及应用程序相关的数据,存储器1005可以是高速RAM存储器,也可以是稳定的存储器,例如磁盘存储器;可选的,存储器1005还可以是独立于处理器1001的存储装置,继续参照图2,存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及工业机械臂控制程序;Specifically, the communication bus 1002 is used to realize connection and communication between these components; the user interface 1003 is used to connect the client and perform data communication with the client, and the user interface 1003 may include an output unit, such as a display screen, and an input unit, such as a keyboard The network interface 1004 is used to connect the background server, and carries out data communication with the background server, and the network interface 1004 can include input/output interfaces, such as standard wired interfaces, wireless interfaces, such as Wi-Fi interfaces; memory 1005 is used to store various types of These data, for example, can include instructions of any application program or method in the industrial manipulator control device, as well as application-related data, and the memory 1005 can be a high-speed RAM memory, or a stable memory, such as a disk memory; Optionally, the memory 1005 can also be a storage device independent of the processor 1001. With continued reference to FIG. 2, the memory 1005 can include an operating system, a network communication module, a user interface module, and an industrial manipulator control program;

处理器1001用于调用存储器1005中存储的工业机械臂控制程序,并执行以下操作:The processor 1001 is used to call the industrial manipulator control program stored in the memory 1005, and perform the following operations:

获取工业机械臂的物理特性数据和预设性能指标;Obtain physical characteristic data and preset performance indicators of industrial robotic arms;

根据物理特性数据和预设性能指标,构建工业机械臂的受约束系统模型;Construct a constrained system model of an industrial manipulator based on physical characteristic data and preset performance indicators;

对受约束系统模型进行等价系统转换,得到无约束系统模型;Perform equivalent system transformation on the constrained system model to obtain the unconstrained system model;

根据无约束系统模型和工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程;According to the unconstrained system model and the optimal control strategy of the industrial manipulator, the Hamilton-Jacobi-Bellman equation is constructed;

通过自适应动态规划方法求解哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据最优控制律对工业机械臂进行控制。The Hamilton-Jacobi-Bellman equation is solved by the adaptive dynamic programming method, and the optimal control law is obtained, so as to control the industrial manipulator according to the optimal control law.

基于上述的工业机械臂控制设备,下面结合图1所示的流程示意图,对本实施例的工业机械臂控制方法进行详细描述。该方法可以包括以下步骤:Based on the above-mentioned industrial manipulator control device, the method for controlling an industrial manipulator in this embodiment will be described in detail below in conjunction with the schematic flowchart shown in FIG. 1 . The method may include the steps of:

步骤S100:获取工业机械臂的物理特性数据和预设性能指标。Step S100: Obtain physical characteristic data and preset performance indicators of the industrial robotic arm.

具体的,工业机械臂可以是单连杆机械臂,例如一类非线性单连杆机械臂。物理特性数据是指该工业机械臂的硬件结构所对应的机械特性,例如工业机械臂的有效载荷的质量、机械臂的长度、转动惯量、摩擦力系数、转动角度和控制输入等参数特性。预设性能指标是指工业机械臂实现预设性能控制的参数指标,例如工业机械臂的跟踪误差要求,跟踪误差为工业机械臂的输出信号与需要跟踪的输入信号的差值,跟踪误差指标可以是一个范围值。Specifically, the industrial robot arm may be a single-link robot arm, such as a type of nonlinear single-link robot arm. The physical characteristic data refers to the mechanical characteristics corresponding to the hardware structure of the industrial robotic arm, such as the mass of the payload of the industrial robotic arm, the length of the robotic arm, the moment of inertia, the coefficient of friction, the angle of rotation, and control input. The preset performance index refers to the parameter index for the industrial manipulator to realize the preset performance control, such as the tracking error requirement of the industrial manipulator. The tracking error is the difference between the output signal of the industrial manipulator and the input signal that needs to be tracked. The tracking error index can be is a range value.

步骤S200:根据所述物理特性数据和所述预设性能指标,构建所述工业机械臂的受约束系统模型。Step S200: Construct a constrained system model of the industrial manipulator according to the physical characteristic data and the preset performance index.

具体的,控制设备获取到工业机械臂的物理特性数据和预设性能指标后,可以构建该工业机械臂的受约束系统模型。该模型可以是动力学模型,也可以是状态方程的模型。Specifically, after the control device obtains the physical characteristic data and preset performance indicators of the industrial manipulator, it can construct a constrained system model of the industrial manipulator. The model can be a kinetic model or an equation of state model.

进一步地,步骤S200可以包括:Further, step S200 may include:

步骤S210:根据所述物理特性数据,构建所述工业机械臂的状态空间方程。Step S210: Construct a state space equation of the industrial robot arm according to the physical characteristic data.

具体的,根据工业机械臂的物理特性数据构建状态空间方程时,可以先建立一个动力学模型再转换成状态空间方程的形式,也可以通过设定好的程序或方法,例如一些现有的建模软件,直接将物理特性数据输入后获得对应的状态空间方程。Specifically, when constructing the state space equation according to the physical characteristic data of the industrial manipulator, a dynamic model can be established first and then converted into the form of the state space equation. The modeling software directly inputs the physical characteristic data to obtain the corresponding state space equation.

更进一步地,步骤S210可以包括:Further, step S210 may include:

步骤S211:对所述物理特性数据进行建模,得到所述工业机械臂的动力学模型。Step S211: Modeling the physical characteristic data to obtain a dynamic model of the industrial mechanical arm.

本实施例中,以一类单连杆机械臂为例,如图1所示为单连杆机械臂的物理特性示意图,获取单连杆机械臂的物理特性,包括单连杆机械臂的有效载荷的质量M,重力加速度g,单连杆机械臂的长度H,转动角度θ(t),转动惯量G,控制设备对单连杆机械臂的控制输入u(t)。对单连杆机械臂的物理特性进行建模,得到的动力学模型为:In this embodiment, a class of single-link manipulator is taken as an example. Figure 1 is a schematic diagram of the physical characteristics of the single-link manipulator. The mass M of the load, the acceleration of gravity g, the length H of the single-link manipulator, the rotation angle θ(t), the moment of inertia G, and the control input u(t) of the control device to the single-link manipulator. The physical characteristics of the single-link manipulator are modeled, and the dynamic model obtained is:

其中,M表示单连杆机械臂的有效载荷的质量,g表示重力加速度,H表示单连杆机械臂的长度,G表示转动惯量,D表示摩擦力系数,θ(t)表示转动角度,u(t)表示控制输入,t表示时间。Among them, M is the mass of the payload of the single-link manipulator, g is the acceleration of gravity, H is the length of the single-link manipulator, G is the moment of inertia, D is the coefficient of friction, θ(t) is the rotation angle, u (t) represents a control input, and t represents time.

步骤S212:对所述动力学模型进行转换,得到所述工业机械臂的状态空间方程。Step S212: Convert the dynamic model to obtain the state space equation of the industrial manipulator.

本实施例中,根据单连杆机械臂的物理特性,可以将建模得到的动力学模型的方程转化为状态空间方程。令x1=θ(t),得到单连杆机械臂的状态空间方程为:In this embodiment, according to the physical characteristics of the single-link manipulator, the equation of the dynamic model obtained by modeling can be transformed into a state space equation. Let x 1 =θ(t), The state space equation of the single link manipulator is obtained as:

y=x1 y=x 1

其中,y表示工业机械臂的输出信号。Among them, y represents the output signal of the industrial manipulator.

假设该单连杆机械臂的需要跟踪的输入信号为yd,其一阶导数二阶导数/>均存在,那么可以进而得到单连杆机械臂的状态空间方程的紧凑结构:Assuming that the input signal to be tracked of the single-link manipulator is y d , its first derivative second derivative /> Both exist, then the compact structure of the state space equation of the single-link manipulator can be obtained:

y=x1 y=x 1

其中, in,

步骤S220:根据所述预设性能指标,定义预设性能函数;Step S220: Define a preset performance function according to the preset performance index;

预设性能控制是一种可以预先确定收敛速度与控制精度等动态性能指标的实用性技术,可以使得工业机械臂的跟踪误差保持在两个指定性能函数组成的有限范围内,从而保证了工业机械臂的高动态性能。Preset performance control is a practical technology that can predetermine dynamic performance indicators such as convergence speed and control accuracy. It can keep the tracking error of the industrial manipulator within a limited range composed of two specified performance functions, thus ensuring the stability of the industrial machinery. High dynamic performance of the arm.

本实施例中,单连杆机械臂的跟踪误差为输出信号y与需要跟踪的输入信号yd的差值,假设该单连杆机械臂的跟踪误差要求为位于b与B之间,则预设性能指标为b<y-yd<B,则根据单连杆机械臂的预设性能指标确定的预设性能函数为:In this embodiment, the tracking error of the single-link manipulator is the difference between the output signal y and the input signal y d that needs to be tracked. Assuming that the tracking error of the single-link manipulator is required to be between b and B, the predicted Assuming that the performance index is b<yy d <B, the preset performance function determined according to the preset performance index of the single-link manipulator is:

其中,δ1,l1,γ1,δ2,l2和γ2均为正的设计参数,可根据实际需要设定。Among them, δ 1 , l 1 , γ 1 , δ 2 , l 2 and γ 2 are all positive design parameters, which can be set according to actual needs.

步骤S230:根据所述状态空间方程和所述预设性能函数,得到所述受约束系统模型。Step S230: Obtain the constrained system model according to the state space equation and the preset performance function.

本实施例中,根据预设性能函数可以得到单连杆机械臂的输出约束为b+yd<y<B+yd,再结合前述状态空间方程,可得到受约束系统模型。In this embodiment, according to the preset performance function, the output constraint of the single-link manipulator can be obtained as b+y d <y<B+y d , combined with the aforementioned state space equation, the constrained system model can be obtained.

步骤S300:对所述受约束系统模型进行等价系统转换,得到无约束系统模型。Step S300: performing an equivalent system transformation on the constrained system model to obtain an unconstrained system model.

本实施例中,为了实现高精度控制的目标,基于预设性能函数的系统转换技术被引入。对于单连杆机械臂的输出信号,定义一个非线性映射函数为:In this embodiment, in order to achieve the goal of high-precision control, a system conversion technology based on a preset performance function is introduced. For the output signal of a single-link manipulator, a nonlinear mapping function is defined as:

ρ1=T(x1,B,b,yd)ρ 1 =T(x 1 ,B,b,y d )

其中,a tanh(·)表示反正切函数;in, a tanh( ) means the arc tangent function;

对该非线性映射函数求导,可以得到:Deriving the nonlinear mapping function, we can get:

继续得到:make continue to get:

其中,in,

利用系统变换技术来处理预设性能约束问题,通过定义基于预设性能指标的非线性映射函数,可以把受约束的系统转化成无约束的等价系统。针对转化后的无约束系统设计的控制策略可以使得工业机械臂的跟踪误差保持在两个预设性能函数组成的有限范围内,例如上述的b与B范围内,从而有效提高了控制精度。The system transformation technology is used to deal with the problem of preset performance constraints, and the constrained system can be transformed into an unconstrained equivalent system by defining a nonlinear mapping function based on the preset performance index. The control strategy designed for the converted unconstrained system can keep the tracking error of the industrial manipulator within a limited range composed of two preset performance functions, such as the above b and B ranges, thereby effectively improving the control accuracy.

步骤S400:根据所述无约束系统模型和所述工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程。Step S400: Construct the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator.

进一步地,步骤S400可以包括:Further, step S400 may include:

步骤S410:根据所述无约束系统模型和所述工业机械臂的最优控制策略,定义代价函数。Step S410: Define a cost function according to the unconstrained system model and the optimal control strategy of the industrial manipulator.

具体的,为均衡控制精度和控制输入所消耗的能量,从而实现最优控制的目标,定义一个代价函数。Specifically, in order to balance the control accuracy and the energy consumed by the control input, so as to achieve the goal of optimal control, a cost function is defined.

更进一步地,步骤S410可以包括:Further, step S410 may include:

步骤S411:根据所述无约束系统模型,定义位置跟踪误差和混合误差。Step S411: According to the unconstrained system model, define position tracking error and mixing error.

本实施例中,针对转化后无约束的系统,定义位置跟踪误差z为:In this embodiment, for the converted unconstrained system, the position tracking error z is defined as:

z=ρ1-sd z=ρ 1 -s d

其中, in,

并定义混合误差θ为:And define the mixing error θ as:

其中,是一个正常数,可根据实际需要设定,/>表示转动速率跟踪误差,/>为sd的一阶导数。in, is a normal number and can be set according to actual needs, /> Indicates the rotation rate tracking error, /> is the first derivative of s d .

步骤S412:根据所述位置跟踪误差和所述混合误差,得到误差向量。Step S412: Obtain an error vector according to the position tracking error and the mixed error.

本实施例中,基于前述定义的位置跟踪误差z和混合误差θ,定义一个新的误差向量X为:In this embodiment, based on the previously defined position tracking error z and mixed error θ, a new error vector X is defined as:

从而可以得到误差向量X的动力学方程为:Thus, the dynamic equation of the error vector X can be obtained as:

其中,in,

步骤S413:根据所述误差向量和所述工业机械臂的最优控制策略,定义代价函数。Step S413: Define a cost function according to the error vector and the optimal control strategy of the industrial manipulator.

本实施例中,根据误差向量和工业机械臂的最优控制策略,定义的代价函数为:In this embodiment, according to the error vector and the optimal control strategy of the industrial manipulator, the defined cost function is:

其中,是正定矩阵,R是正常数,可根据实际需要设定,R>0。in, is a positive definite matrix, R is a positive constant, can be set according to actual needs, R>0.

步骤S420:根据所述代价函数,构建哈密顿-雅克比-贝尔曼方程。Step S420: Construct the Hamilton-Jacobi-Bellman equation according to the cost function.

具体的,通过基于最优控制策略定义的代价函数来推导哈密顿-雅克比-贝尔曼方程,可以将最优控制问题转化成求解哈密顿-雅克比-贝尔曼方程。Specifically, by deriving the Hamilton-Jacobi-Bellman equation based on the cost function defined by the optimal control strategy, the optimal control problem can be transformed into solving the Hamilton-Jacobi-Bellman equation.

更进一步地,步骤S420可以包括:Further, step S420 may include:

步骤S421:根据所述代价函数,定义哈密顿函数和最优代价函数。Step S421: According to the cost function, define a Hamiltonian function and an optimal cost function.

本实施例中,根据步骤S413定义的代价函数定义哈密顿函数为:In this embodiment, according to the cost function defined in step S413, the Hamiltonian function is defined as:

其中, in,

最优控制时,代价函数要实现最小,以通过最小的控制输入得到期望的控制精度。本实施例中,定义最优代价函数为:In optimal control, the cost function should be minimized to obtain the desired control accuracy with the minimum control input. In this embodiment, the optimal cost function is defined as:

其中,Ω表示针对单连杆机械臂的允许的控制策略的集合,V*(X)满足V*(0)=0。Wherein, Ω represents a set of allowable control strategies for a single-link manipulator, and V * (X) satisfies V * (0)=0.

步骤S422:利用贝尔曼最优原则求解所述最优代价函数,得到所述最优代价函数的最优解。Step S422: Solving the optimal cost function by using the Bellman optimality principle to obtain an optimal solution of the optimal cost function.

根据贝尔曼最优原则,可以得到According to the Bellman optimality principle, we can get

其中, in,

可以得到最优代价函数的最优解为:Depend on The optimal solution of the optimal cost function can be obtained as:

步骤S423:将所述最优解代入所述哈密顿函数,得到所述哈密顿-雅克比-贝尔曼方程。Step S423: Substituting the optimal solution into the Hamiltonian function to obtain the Hamilton-Jacobi-Bellman equation.

本实施例中,将最优解u*(t)代入步骤S421定义的哈密顿函数,可以得到哈密顿-雅克比-贝尔曼方程为:In the present embodiment, substituting the optimal solution u * (t) into the Hamiltonian function defined in step S421, the Hamilton-Jacobi-Bellman equation can be obtained as:

步骤S500:通过自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据所述最优控制律对所述工业机械臂进行控制。Step S500: solving the Hamilton-Jacobi-Bellman equation by an adaptive dynamic programming method to obtain an optimal control law, so as to control the industrial manipulator according to the optimal control law.

具体的,由于哈密顿-雅克比-贝尔曼方程的强非线性性质,很难直接求解得到最优代价函数和最优控制律。因此,可以采用单网络的自适应动态规划方法近似求解哈密顿-雅克比-贝尔曼方程。Specifically, due to the strongly nonlinear nature of the Hamilton-Jacobi-Bellman equation, it is difficult to directly solve the optimal cost function and optimal control law. Therefore, the Hamilton-Jacobi-Bellman equation can be approximated by using the adaptive dynamic programming method of a single network.

进一步地,步骤S500可以包括:Further, step S500 may include:

步骤S510:采用基于神经网络架构的自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律。Step S510: Solving the Hamilton-Jacobi-Bellman equation by using an adaptive dynamic programming method based on a neural network architecture to obtain an optimal control law.

本实施例中,基于神经网络架构建立单网络的评价网络,最优代价函数V*(X)可以被评价网络近似为:In this embodiment, a single-network evaluation network is established based on the neural network architecture, and the optimal cost function V * (X) can be approximated by the evaluation network as:

其中,为理想的权重向量,/>为基函数向量,/>为近似误差,m表示神经网络节点数量,且/> in, is the ideal weight vector, /> is the basis function vector, /> is the approximate error, m represents the number of neural network nodes, and />

表示理想权重的估计值,最优代价函数可以估计为:make Represents the estimated value of the ideal weight, the optimal cost function can be estimated as:

则可以得到近似的最优控制律为:Then the approximate optimal control law can be obtained as:

从而控制设备可以根据该最优控制律来对应控制该单连杆机械臂,保证预设性能指标的基础上进行最优控制,并实现对该单连杆机械臂的高精度控制。Therefore, the control device can control the single-link manipulator correspondingly according to the optimal control law, perform optimal control on the basis of ensuring preset performance indicators, and realize high-precision control of the single-link manipulator.

通过单网络自适应动态规划方法来近似获得最优控制律,实现对工业机械臂的控制,并采用单评价网络来近似最优代价函数,相比传统的执行-评价双网络结构,有助于减少计算量和内存需求。Approximately obtain the optimal control law through a single network adaptive dynamic programming method, realize the control of industrial manipulators, and use a single evaluation network to approximate the optimal cost function, compared with the traditional execution-evaluation dual network structure, it is helpful Reduce computation and memory requirements.

可选地,设计的评价网络的权重更新律可以为:Optionally, the weight update law of the designed evaluation network can be:

其中,β为正的设计参数,可根据实际需要设定, Among them, β is a positive design parameter, which can be set according to actual needs,

为了验证本实施例提供的工业机械臂控制方法的有效性,进行如下仿真实验:In order to verify the effectiveness of the industrial manipulator control method provided in this embodiment, the following simulation experiments are carried out:

在仿真实验中,控制目标设定为使单连杆机械臂的输出信号y以最优的方式跟踪上参考信号yd=0.2sin(t)。根据单连杆机械臂的实际系统,物理特性数据分别取值为M=1kg,g=9.8m/s2,l=1m,D=2N·m·s/rad。预设性能函数为b=-0.1-e-t和B=0.1+e-t,单连杆机械臂的状态初始值为x1(0)=0.6,x2(0)=-1.5。评价网络的函数为σc(X)=[z22,zθ]T,其权重初始值为wc(0)=[100,250,50]T。另外,设定其他参数为Q=[800,0;0,800],R=1,β=2。In the simulation experiment, the control target is set to make the output signal y of the single-link manipulator track the upper reference signal y d =0.2 sin(t) optimally. According to the actual system of the single-link manipulator, the physical characteristic data are M=1kg, g=9.8m/s 2 , l=1m, D=2N·m·s/rad. The preset performance functions are b=-0.1-e -t and B=0.1+e -t , and the initial state values of the single-link manipulator are x 1 (0)=0.6, x 2 (0)=-1.5. The function of the evaluation network is σ c (X)=[z 22 ,zθ] T , and its weight initial value is w c (0)=[100,250,50] T . In addition, set other parameters as Q=[800,0;0,800], R=1, β=2.

选取李亚普诺夫函数求时间导数的方式来进行结果分析,基于仿真实验中实际情况所设计的参数,此处不再举例,分析后可以得到/>根据李亚普诺夫稳定性定理可以知道,跟踪误差z和混合误差θ以及评价网络的权重估计误差都是一致且最终有界的,即说明单连杆机械臂的输出信号y可以跟踪上参考信号yd,评价网络的权重可以收敛接近于理想值,如图4所示为评价网络权重的收敛曲线图,图中,横轴表示时间,单位为秒(s),纵轴表示评价网络权重的值。由该图可以看出,评价网络可以准确近似代价函数,从而所得到的控制输入u(t)则可以看成是最优的。Choose the Lyapunov function The time derivative method is used to analyze the results. Based on the parameters designed in the actual situation in the simulation experiment, no examples are given here. After the analysis, you can get /> According to the Lyapunov stability theorem, it can be known that the tracking error z and mixing error θ and the weight estimation error of the evaluation network are consistent and ultimately bounded, which means that the output signal y of the single-link manipulator can track the upper reference signal y d , and the weight of the evaluation network can converge close to the ideal value, as shown in Figure 4. The convergence of the evaluation network weight In the graph, in the graph, the horizontal axis represents time in seconds (s), and the vertical axis represents the value of the evaluation network weight. It can be seen from the figure that the evaluation network can accurately approximate the cost function, so the obtained control input u(t) can be regarded as optimal.

如图5所示为本实施例中参考信号yd与系统状态x1的曲线图,图中,横轴表示时间,单位为秒(s),纵轴表示各曲线对应的值;如图6所示为本实施例中参考信号与系统状态x2的曲线图,图中,横轴表示时间,单位为秒(s),纵轴表示各曲线对应的值;如图7所示为本实施例中跟踪误差与预设性能界线的曲线图,图中,横轴表示时间,单位为秒(s),纵轴表示各曲线对应的值,其中,预设性能界线为预设性能指标的取值所对应的曲线。由图5-图7可以看出,输出信号y下所对应的单连杆机械臂的系统状态与参考信号yd是保持一致的,该控制方法的跟踪该效果好,且跟踪误差满足预设性能指标的要求,可以实现单连杆机械臂的高精度控制。As shown in Fig. 5, it is the curve diagram of reference signal y d and system state x 1 in the present embodiment, among the figure, horizontal axis represents time, and unit is second (s), and vertical axis represents the value corresponding to each curve; Fig. 6 Shown is the reference signal in this example and the graph of the system state x 2 , in the figure, the horizontal axis represents time, and the unit is second (s), and the vertical axis represents the value corresponding to each curve; As shown in Figure 7, it is the tracking error and the preset performance boundary in the present embodiment In the graph, the horizontal axis represents time in seconds (s), and the vertical axis represents the value corresponding to each curve, wherein the preset performance boundary is the curve corresponding to the value of the preset performance index. From Figures 5 to 7, it can be seen that the system state of the single-link manipulator corresponding to the output signal y is consistent with the reference signal y d , the tracking effect of this control method is good, and the tracking error meets the preset The requirements of the performance index can realize the high-precision control of the single-link manipulator.

本实施例提供的工业机械臂控制方法,通过根据工业机械臂的物理特性数据和预设性能指标构建工业机械臂的受约束系统模型,对受约束系统模型进行等价系统转换,得到无约束系统模型,再根据无约束系统模型和工业机械臂的最优控制策略构建哈密顿-雅克比-贝尔曼方程,通过自适应动态规划方法求解哈密顿-雅克比-贝尔曼方程,得到最优控制律,从而根据最优控制律对工业机械臂进行控制,在保证预设性能指标的基础上进行最优控制,实现了对工业机械臂的高精度控制;本发明的方法可以使得工业机械臂的系统输出有效跟踪参考信号,并使得跟踪误差等满足预设要求,提高了控制精度,还具有节能效应。The control method of the industrial manipulator provided in this embodiment constructs the constrained system model of the industrial manipulator according to the physical characteristic data and preset performance indicators of the industrial manipulator, performs equivalent system conversion on the constrained system model, and obtains an unconstrained system Then construct the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator, and solve the Hamilton-Jacobi-Bellman equation through the adaptive dynamic programming method to obtain the optimal control law , so that the industrial manipulator is controlled according to the optimal control law, the optimal control is carried out on the basis of ensuring the preset performance index, and the high-precision control of the industrial manipulator is realized; the method of the present invention can make the system of the industrial manipulator The effective tracking reference signal is output, and the tracking error and the like meet the preset requirements, which improves the control precision and also has an energy-saving effect.

实施例二Embodiment two

基于同一发明构思,参照图8,提出本发明工业机械臂控制装置的第一实施例,该装置可以为虚拟装置,应用于工业机械臂控制设备。Based on the same inventive concept, referring to FIG. 8 , a first embodiment of the control device for industrial manipulators of the present invention is proposed. The device can be a virtual device, which is applied to control equipment for industrial manipulators.

下面结合图8所示的功能模块示意图,对本实施例提供的工业机械臂控制装置进行详细描述,装置可以包括:The industrial manipulator control device provided in this embodiment will be described in detail below in conjunction with the functional module schematic diagram shown in FIG. 8 . The device may include:

数据获取模块,用于获取工业机械臂的物理特性数据和预设性能指标;The data acquisition module is used to acquire the physical characteristic data and preset performance indicators of the industrial mechanical arm;

模型构建模块,用于根据所述物理特性数据和所述预设性能指标,构建所述工业机械臂的受约束系统模型;A model construction module, configured to construct a constrained system model of the industrial manipulator according to the physical characteristic data and the preset performance index;

系统转换模块,用于对所述受约束系统模型进行等价系统转换,得到无约束系统模型;A system conversion module, configured to perform equivalent system conversion on the constrained system model to obtain an unconstrained system model;

方程构建模块,用于根据所述无约束系统模型和所述工业机械臂的最优控制策略,构建哈密顿-雅克比-贝尔曼方程;An equation construction module, configured to construct the Hamilton-Jacobi-Bellman equation according to the unconstrained system model and the optimal control strategy of the industrial manipulator;

最优控制模块,用于通过自适应动态规划方法求解所述哈密顿-雅克比-贝尔曼方程,得到最优控制律,以根据所述最优控制律对所述工业机械臂进行控制。The optimal control module is used to solve the Hamilton-Jacobi-Bellman equation through an adaptive dynamic programming method to obtain an optimal control law, so as to control the industrial mechanical arm according to the optimal control law.

需要说明,本实施例提供的工业机械臂控制装置中各个模块可实现的功能和对应达到的技术效果可以参照本发明工业机械臂控制方法各个实施例中具体实施方式的描述,为了说明书的简洁,此处不再赘述。It should be noted that the functions and corresponding technical effects achieved by each module in the industrial manipulator control device provided in this embodiment can refer to the description of the specific implementation in each embodiment of the industrial manipulator control method of the present invention. For the sake of brevity of the description, I won't repeat them here.

实施例三Embodiment three

基于同一发明构思,参照图2的硬件结构示意图,本实施例提供了一种工业机械臂控制设备,该设备可以包括处理器和存储器,存储器中存储有工业机械臂控制程序,该工业机械臂控制程序被处理器执行时,实现本发明工业机械臂控制方法各个实施例的全部或部分步骤。Based on the same inventive concept, referring to the schematic diagram of the hardware structure in Fig. 2, this embodiment provides an industrial manipulator control device, which may include a processor and a memory, and an industrial manipulator control program is stored in the memory, and the industrial manipulator control When the program is executed by the processor, all or part of the steps of the various embodiments of the industrial robot arm control method of the present invention are realized.

具体的,工业机械臂控制设备是指能够实现数据传输的终端设备或控制设备,可以是手机、电脑、嵌入式工控机等终端设备,也可以是位于系统内的控制器、处理器等控制设备。Specifically, industrial robotic arm control equipment refers to terminal equipment or control equipment capable of data transmission, which can be terminal equipment such as mobile phones, computers, embedded industrial computers, etc., or control equipment such as controllers and processors located in the system .

本领域技术人员可以理解,图2中示出的硬件结构并不构成对本发明工业机械臂控制设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the hardware structure shown in FIG. 2 does not constitute a limitation to the industrial robot arm control device of the present invention, and may include more or less components than those shown in the illustration, or combine certain components, or have different Part placement.

可以理解,工业机械臂控制设备还可以包括通信总线,用户接口和网络接口。其中,通信总线用于实现这些组件之间的连接通信;用户接口用于连接客户端,与客户端进行数据通信,用户接口可以包括输出单元,如显示屏、输入单元,如键盘;网络接口用于连接后台服务器,与后台服务器进行数据通信,网络接口可以包括输入/输出接口,比如标准的有线接口、无线接口。It can be understood that the industrial manipulator control device may also include a communication bus, a user interface and a network interface. Among them, the communication bus is used to realize the connection and communication between these components; the user interface is used to connect the client and perform data communication with the client. The user interface can include an output unit, such as a display screen, and an input unit, such as a keyboard; For connecting to the background server and performing data communication with the background server, the network interface may include an input/output interface, such as a standard wired interface and a wireless interface.

存储器用于存储各种类型的数据,这些数据例如可以包括该工业机械臂控制设备中任何应用程序或方法的指令,以及应用程序相关的数据。存储器可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,例如静态随机存取存储器(Static RandomAccess Memory,简称SRAM),随机存取存储器(Random Access Memory,简称RAM),电可擦除可编程只读存储器(Electrically Erasable Programmable Read-Only Memory,简称EEPROM),可擦除可编程只读存储器(Erasable Programmable Read-Only Memory,简称EPROM),可编程只读存储器(Programmable Read-Only Memory,简称PROM),只读存储器(Read-Only Memory,简称ROM),磁存储器,快闪存储器,磁盘或光盘,可选的,存储器还可以是独立于处理器的存储装置。The memory is used to store various types of data, which may include, for example, instructions of any application program or method in the industrial robot arm control device, as well as data related to the application program. The memory may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as Static Random Access Memory (SRAM for short), Random Access Memory (RAM for short), Electrically Erasable Programmable Read-Only Memory (EEPROM), Erasable Programmable Read-Only Memory (EPROM), Programmable Read-Only Memory (EPROM) -Only Memory, referred to as PROM), read-only memory (Read-Only Memory, referred to as ROM), magnetic storage, flash memory, magnetic disk or optical disk, optionally, the memory can also be a storage device independent of the processor.

处理器用于调用存储器中存储的工业机械臂控制程序,并执行如上述的工业机械臂控制方法,处理器可以是专用集成电路(Application Specific Integrated Circuit,简称ASIC)、数字信号处理器(Digital Signal Processor,简称DSP)、数字信号处理设备(Digital Signal Processing Device,简称DSPD)、可编程逻辑器件(Programmable LogicDevice,简称PLD)、现场可编程门阵列(Field Programmable Gate Array,简称FPGA)、控制器、微控制器、微处理器或其他电子元件,用于执行如上述工业机械臂控制方法各个实施例的全部或部分步骤。The processor is used to call the industrial manipulator control program stored in the memory, and execute the above-mentioned industrial manipulator control method. The processor can be an application specific integrated circuit (ASIC for short), a digital signal processor (Digital Signal Processor) , DSP for short), digital signal processing device (Digital Signal Processing Device, DSPD for short), programmable logic device (Programmable Logic Device, PLD for short), field programmable gate array (Field Programmable Gate Array, FPGA for short), controller, microcontroller A controller, a microprocessor or other electronic components are used to execute all or part of the steps in the various embodiments of the above-mentioned industrial robot control method.

实施例四Embodiment four

基于同一发明构思,本实施例提供了一种计算机可读存储介质,如闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、可编程只读存储器(PROM)、可擦除可编程只读存储器(EPROM)、电可擦除可编程只读存储器(EEPROM)、磁性存储器、磁盘、光盘、服务器等等,该存储介质上存储有计算机程序,该计算机程序可被一个或多个处理器执行,该计算机程序被处理器执行时可以实现本发明工业机械臂控制方法各个实施例的全部或部分步骤。Based on the same inventive concept, this embodiment provides a computer-readable storage medium, such as flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), Read Only Memory (ROM), Programmable Read Only Memory (PROM), Erasable Programmable Read Only Memory (EPROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Magnetic Memory, Disk, CD, server, etc., the storage medium stores a computer program, the computer program can be executed by one or more processors, when the computer program is executed by the processor, all of the various embodiments of the industrial manipulator control method of the present invention can be realized or partial steps.

需要说明,上述本发明实施例序号仅为了描述,不代表实施例的优劣。以上实施例仅为本发明的可选实施例,并非因此限制本发明的专利范围,凡是在本发明的发明构思下,利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均包括在本发明的专利保护范围内。It should be noted that the serial numbers of the above embodiments of the present invention are for description only, and do not represent the advantages and disadvantages of the embodiments. The above embodiments are only optional embodiments of the present invention, and are not therefore limiting the patent scope of the present invention. Under the inventive concept of the present invention, the equivalent structure or equivalent process transformation made by utilizing the description of the present invention and the contents of the accompanying drawings, Or directly or indirectly used in other related technical fields, all are included in the patent protection scope of the present invention.

Claims (10)

1. An industrial robot control method, comprising:
acquiring physical characteristic data and preset performance indexes of an industrial mechanical arm;
constructing a constrained system model of the industrial mechanical arm according to the physical characteristic data and the preset performance index;
performing equivalent system conversion on the constrained system model to obtain an unconstrained system model;
constructing a Hamiltonian-Jacobian-Belman equation according to the unconstrained system model and an optimal control strategy of the industrial mechanical arm;
and solving the Hamiltonian-Jacobian-Bellman equation by a self-adaptive dynamic programming method to obtain an optimal control law, so as to control the industrial mechanical arm according to the optimal control law.
2. The industrial robot control method of claim 1, wherein the step of constructing a constrained system model of the industrial robot based on the physical characteristic data and the preset performance index comprises:
constructing a state space equation of the industrial mechanical arm according to the physical characteristic data;
defining a preset performance function according to the preset performance index;
and obtaining the constrained system model according to the state space equation and the preset performance function.
3. The industrial robot control method according to claim 2, wherein the step of constructing a state space equation of the industrial robot based on the physical characteristic data comprises:
modeling the physical characteristic data to obtain a dynamic model of the industrial mechanical arm;
and converting the dynamic model to obtain a state space equation of the industrial mechanical arm.
4. The industrial robot control method of claim 1, wherein the constructing a hamilton-jacobian-bellman equation according to the unconstrained system model and the optimal control strategy of the industrial robot comprises:
defining a cost function according to the unconstrained system model and an optimal control strategy of the industrial mechanical arm;
and constructing a Hamiltonian-Jacobian-Belman equation according to the cost function.
5. The method of claim 4, wherein defining a cost function based on the unconstrained system model and an optimal control strategy for the industrial robot comprises:
defining a position tracking error and a mixing error according to the unconstrained system model;
obtaining an error vector according to the position tracking error and the mixed error;
and defining a cost function according to the error vector and an optimal control strategy of the industrial mechanical arm.
6. The industrial robot control method of claim 4, wherein constructing a hamilton-jacobian-bellman equation from the cost function comprises:
according to the cost function, a Hamiltonian function and an optimal cost function are defined;
solving the optimal cost function by utilizing a Belman optimal principle to obtain an optimal solution of the optimal cost function;
substituting the optimal solution into the Hamiltonian to obtain the Hamiltonian-Jacobian-Belman equation.
7. The industrial robot control method according to claim 1, wherein the step of solving the hamilton-jacobian-bellman equation by an adaptive dynamic programming method to obtain an optimal control law comprises:
and solving the Hamiltonian-Jacobian-Bellman equation by adopting a self-adaptive dynamic programming method based on a neural network architecture to obtain an optimal control law.
8. An industrial robot control device, the device comprising:
the data acquisition module is used for acquiring physical characteristic data and preset performance indexes of the industrial mechanical arm;
the model construction module is used for constructing a constrained system model of the industrial mechanical arm according to the physical characteristic data and the preset performance index;
the system conversion module is used for carrying out equivalent system conversion on the constrained system model to obtain an unconstrained system model;
the equation construction module is used for constructing a Hamiltonian-Jacobian-Bellman equation according to the unconstrained system model and an optimal control strategy of the industrial mechanical arm;
and the optimal control module is used for solving the Hamiltonian-Jacobian-Bellman equation through a self-adaptive dynamic programming method to obtain an optimal control law so as to control the industrial mechanical arm according to the optimal control law.
9. An industrial robot control device, characterized in that the device comprises a processor and a memory, on which an industrial robot control program is stored, which when executed by the processor, implements the industrial robot control method according to any one of claims 1 to 7.
10. A computer-readable storage medium, characterized in that the storage medium has stored thereon a computer program which, when executed by one or more processors, implements the industrial robot control method according to any one of claims 1 to 7.
CN202310049146.4A 2023-02-01 2023-02-01 Industrial mechanical arm control method, device, equipment and storage medium Pending CN116604546A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310049146.4A CN116604546A (en) 2023-02-01 2023-02-01 Industrial mechanical arm control method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310049146.4A CN116604546A (en) 2023-02-01 2023-02-01 Industrial mechanical arm control method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN116604546A true CN116604546A (en) 2023-08-18

Family

ID=87680608

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310049146.4A Pending CN116604546A (en) 2023-02-01 2023-02-01 Industrial mechanical arm control method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116604546A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117086865A (en) * 2023-08-02 2023-11-21 广东工业大学 Input-limited robot arm tracking control method, system, equipment and medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130119916A1 (en) * 2011-11-10 2013-05-16 Yebin Wang Energy Efficient Motion Control System
CN113103237A (en) * 2021-04-25 2021-07-13 长春工业大学 A reconfigurable manipulator control method and system for unknown environmental constraints

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130119916A1 (en) * 2011-11-10 2013-05-16 Yebin Wang Energy Efficient Motion Control System
CN113103237A (en) * 2021-04-25 2021-07-13 长春工业大学 A reconfigurable manipulator control method and system for unknown environmental constraints

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZIJIE GUO等: "Distributed Optimal Consensus Learning Control for Nonlinear MASs with Output Constraints", 2021 6TH INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING, 22 November 2021 (2021-11-22), pages 108 - 112 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117086865A (en) * 2023-08-02 2023-11-21 广东工业大学 Input-limited robot arm tracking control method, system, equipment and medium
CN117086865B (en) * 2023-08-02 2025-10-31 广东工业大学 Mechanical arm tracking control method, system, equipment and medium based on input limitation

Similar Documents

Publication Publication Date Title
Nakajima et al. Exploiting the dynamics of soft materials for machine learning
KR102578263B1 (en) Multi-agent system consistency distributed control method for agriculture based on sampling data
Nakajima et al. Information processing via physical soft body
CN107984472B (en) A Design Method of Variable-parameter Neural Solver for Motion Planning of Redundant Manipulators
CN112338912B (en) Finite time stability control method and system for flexible single-chain mechanical arm
CN116466735B (en) A spacecraft formation attitude orientation collaborative control method and related equipment
CN113805483B (en) Robot control method and device based on model prediction and computer equipment
CN117086865B (en) Mechanical arm tracking control method, system, equipment and medium based on input limitation
CN112558470A (en) Optimal consistency control method and device for actuator saturated multi-agent system
Xiong et al. A stable control for second-order nonholonomic planar underactuated mechanical system: energy attenuation approach
CN116604546A (en) Industrial mechanical arm control method, device, equipment and storage medium
WO2020093253A1 (en) Robot motion control method, control system and storage device
CN118636153B (en) Robust self-adaptive shear thickening fluid admittance control method under characteristic space
CN109996247A (en) Networked resource concocting method, device, equipment and storage medium
CN116872219A (en) Robot control method, electronic device and storage medium based on U-K equation
CN119188729A (en) Robotic arm control method and stability evaluation method based on double evaluation network
CN117086886B (en) Robot dynamic error prediction method and system based on mechanism data hybrid driving
CN116038709B (en) Distributed optimization control method, device and medium for nonlinear multi-manipulator system
CN113927585A (en) Robot balance control method and device, readable storage medium and robot
CN115958591A (en) Robot end load quality identification method, computer device and storage medium
Guizhi et al. Determination of stability correction parameters for dynamic equations of constrained multibody systems
CN116372937A (en) Robotic arm control method, device, computer-readable storage medium, and robotic arm
CN115598979B (en) A method and device for identifying model parameters of a hydraulic system and hydraulic engineering machinery
CN119270861A (en) A design method, device, equipment and medium for feedback controller of inspection robot
CN116276965A (en) Fixed time track tracking control method for mechanical arm based on nonsingular terminal sliding mode

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination