[go: up one dir, main page]

CN118011826A - UAV control strategy safety verification and correction method and system - Google Patents

UAV control strategy safety verification and correction method and system Download PDF

Info

Publication number
CN118011826A
CN118011826A CN202410200544.6A CN202410200544A CN118011826A CN 118011826 A CN118011826 A CN 118011826A CN 202410200544 A CN202410200544 A CN 202410200544A CN 118011826 A CN118011826 A CN 118011826A
Authority
CN
China
Prior art keywords
uav
safety
drone
control strategy
state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410200544.6A
Other languages
Chinese (zh)
Inventor
陈时雨
李衍杰
楼云江
林可
赵凯东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Institute Of Technology shenzhen Shenzhen Institute Of Science And Technology Innovation Harbin Institute Of Technology
Original Assignee
Harbin Institute Of Technology shenzhen Shenzhen Institute Of Science And Technology Innovation Harbin Institute Of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Institute Of Technology shenzhen Shenzhen Institute Of Science And Technology Innovation Harbin Institute Of Technology filed Critical Harbin Institute Of Technology shenzhen Shenzhen Institute Of Science And Technology Innovation Harbin Institute Of Technology
Priority to CN202410200544.6A priority Critical patent/CN118011826A/en
Publication of CN118011826A publication Critical patent/CN118011826A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)

Abstract

本发明涉及无人机控制策略安全性验证与修正方法及系统,其中方法包括:获取当前无人机系统状态信息、无人机系统的原有无人机飞行控制策略输出的未验证的潜在不安全动作、和无人机自适应调节的安全约束边界;基于优化目标范数,建立并求解带有无人机动力学及安全性约束的有限时域优化问题,得到用于无人机系统飞行控制的安全控制动作;基于无人机自适应调节的安全约束边界,建立并更新无人机状态需要满足的安全性约束;在有限时域范围内迭代计算,得到无人机在未来有限时域范围内的安全状态序列及对应的安全控制动作序列。本发明对潜在的不安全无人机飞行控制策略进行验证与修正,达到对无人机系统既安全又机动的飞行控制目的。

The present invention relates to a method and system for verifying and correcting the safety of a drone control strategy, wherein the method comprises: obtaining the current drone system status information, the unverified potential unsafe actions output by the original drone flight control strategy of the drone system, and the safety constraint boundary of the drone adaptive adjustment; based on the optimization target norm, establishing and solving a finite time domain optimization problem with drone dynamics and safety constraints, and obtaining safety control actions for the drone system flight control; based on the safety constraint boundary of the drone adaptive adjustment, establishing and updating the safety constraints that the drone state needs to meet; iteratively calculating within a finite time domain, and obtaining the safety state sequence of the drone within the future finite time domain and the corresponding safety control action sequence. The present invention verifies and corrects the potential unsafe drone flight control strategy, and achieves the purpose of safe and maneuverable flight control of the drone system.

Description

无人机控制策略安全性验证与修正方法及系统UAV control strategy safety verification and correction method and system

技术领域Technical Field

本发明属于无人机技术领域,特别涉及无人机控制策略安全性验证与修正方法及系统。The present invention belongs to the technical field of unmanned aerial vehicles, and in particular relates to a method and system for verifying and correcting the safety of unmanned aerial vehicle control strategies.

背景技术Background technique

无人机基于其灵活敏捷、垂直起降等优势,广泛应用于搜索救援、电路巡检、快递运输、航拍测绘、农林植保等工、农业领域。尽管如此,作为对自身动力学变化及外部环境扰动等危险因素极为敏感的空中无人系统,随着应用场景多样化及环境复杂度的提升,对无人机飞行控制策略的安全性提出了更高的要求和挑战。然而,由于安全性要求加剧了无人机飞行控制策略的保守程度,其在安全性与机动性之间的权衡也是影响无人机高效执行复杂飞行任务的关键因素。为此,一种更加安全且机动的无人机飞行控制策略成为当前无人机系统的迫切需求。目前,针对无人机飞行控制策略安全性的研究主要分为以下几种方法:Due to their advantages such as flexibility, agility, and vertical take-off and landing, drones are widely used in industrial and agricultural fields such as search and rescue, circuit inspection, express delivery, aerial photography and mapping, and agricultural and forestry plant protection. Nevertheless, as an aerial unmanned system that is extremely sensitive to dangerous factors such as its own dynamic changes and external environmental disturbances, with the diversification of application scenarios and the increase in environmental complexity, higher requirements and challenges are placed on the safety of drone flight control strategies. However, since safety requirements have increased the conservatism of drone flight control strategies, the trade-off between safety and maneuverability is also a key factor affecting the efficient execution of complex flight missions by drones. For this reason, a safer and more maneuverable drone flight control strategy has become an urgent need for current drone systems. At present, research on the safety of drone flight control strategies is mainly divided into the following methods:

(1)基于模型的最优控制方法(1) Model-based optimal control method

此类方法通常将无人机系统建模为确定的刚体,在此基础上,通过将无人机控制策略抽象为满足动力学约束及各类安全性约束的最优控制问题,从而得到具有安全性保证的无人机飞行控制策略,达到无人机安全飞行控制的目的。通过求解一个带有约束的有限时域模型预测控制问题,得到了能够同时满足所有约束的无人机控制策略,解决了无人机在飞行过程中遇到的安全性挑战。然而,该方法需要依赖严格且复杂的数学形式来构造无人机控制策略的优化目标及约束,由此导致了其复杂的非线性结构且往往难以通过近似求解的方式得到其可行解。此外,无人机控制策略对于复杂约束的满足会不断加剧其输出控制动作的保守性,进而使得其无法适用于无人机系统对飞行机动性的需求。This type of method usually models the UAV system as a fixed rigid body. On this basis, by abstracting the UAV control strategy into an optimal control problem that satisfies dynamic constraints and various safety constraints, a UAV flight control strategy with safety guarantees is obtained to achieve the purpose of safe flight control of the UAV. By solving a finite time domain model predictive control problem with constraints, a UAV control strategy that can simultaneously satisfy all constraints is obtained, solving the safety challenges encountered by the UAV during flight. However, this method needs to rely on strict and complex mathematical forms to construct the optimization objectives and constraints of the UAV control strategy, which leads to its complex nonlinear structure and is often difficult to obtain its feasible solution through approximate solutions. In addition, the satisfaction of the complex constraints of the UAV control strategy will continuously increase the conservatism of its output control actions, making it unsuitable for the UAV system's demand for flight maneuverability.

(2)基于无模型的安全强化学习方法(2) Model-free secure reinforcement learning method

强化学习方法作为一种新型的智能自学习序列决策方法,由于其无需依赖复杂的系统模型,仅在与环境不断交互试错的过程中,基于其强大的策略搜索与优化能力,在无人机飞行控制领域逐渐得到了应用。然而,大多数无模型的强化学习方法由于无法显示地表达系统需要满足的安全性约束而使得学习到的无人机控制策略不具备安全性保证。尽管利用带约束的马尔科夫决策过程可以构造出一类安全强化学习方法[7]用于训练满足安全性要求的无人机控制策略,但此类方法由于安全性约束的限制,在一定程度上会影响策略探索与优化的效率,且训练出的无人机安全控制策略均与所处的应用场景和特定的飞行任务进行了深度耦合,不具备通用性和对多样化应用场景的泛化能力。此外,在无人机控制策略的训练过程中,由于其与环境交互存在不断的试错与探索,因此,无法避免其违反安全性约束,进而无法保证其在训练过程中的安全性。As a new type of intelligent self-learning sequential decision-making method, reinforcement learning method has gradually been applied in the field of UAV flight control because it does not rely on complex system models and only interacts with the environment through trial and error. Based on its powerful strategy search and optimization capabilities, it has been gradually applied in the field of UAV flight control. However, most model-free reinforcement learning methods cannot explicitly express the safety constraints that the system needs to meet, so the learned UAV control strategies do not have safety guarantees. Although a class of safety reinforcement learning methods can be constructed using constrained Markov decision processes [7] to train UAV control strategies that meet safety requirements, such methods will affect the efficiency of strategy exploration and optimization to a certain extent due to the limitations of safety constraints. In addition, the trained UAV safety control strategies are deeply coupled with the application scenarios and specific flight missions, and lack universality and generalization capabilities for diverse application scenarios. In addition, during the training process of the UAV control strategy, due to the continuous trial and error and exploration of its interaction with the environment, it is impossible to avoid violating safety constraints, and thus its safety during the training process cannot be guaranteed.

(3)基于安全性验证的方法(3) Methods based on security verification

安全性验证是另一类通过判断无人机控制策略是否违反安全性约束来对其安全性进行验证的方法,与此同时,基于判断结果对无人机控制策略进行安全性修正,从而实现无人机系统的安全飞行控制效果。通过学习的方法训练了一种针对不安全控制策略进行替换的恢复策略作为其违反安全性约束情况下的备用策略,在一定程度上提升了控制策略的安全性。但是,由于该恢复策略是一种基于数据驱动的学习类方法,因此,其本身仍然存在安全风险,无法提供严格的安全性保证。为了避免学习类方法存在的安全风险,通过构造障碍函数的方式,将控制策略限制在具有严格数学形式表征的安全集内。尽管如此,由于该方法存在障碍函数构造难度大且计算复杂度高等问题,无法满足无人机控制对实时性的要求。也有研究提出了一种安全性滤波器用于预测和修正系统原有控制策略来达到安全控制的目的,但是由于其未考虑该滤波器对复杂多样化应用场景的自适应性,因而无法适用于无人机系统在多变场景下执行复杂的飞行任务。Safety verification is another method that verifies the safety of drone control strategies by judging whether they violate safety constraints. At the same time, the safety of drone control strategies is corrected based on the judgment results, thereby achieving safe flight control of drone systems. A recovery strategy that replaces unsafe control strategies is trained through learning methods as a backup strategy in the event of violations of safety constraints, which improves the safety of the control strategy to a certain extent. However, since the recovery strategy is a data-driven learning method, it still has safety risks and cannot provide strict safety guarantees. In order to avoid the safety risks of learning methods, the control strategy is restricted to a safe set with a strict mathematical form by constructing a barrier function. However, due to the difficulty of barrier function construction and high computational complexity, this method cannot meet the real-time requirements of drone control. Some studies have also proposed a safety filter to predict and correct the original control strategy of the system to achieve the purpose of safety control, but since it does not consider the adaptability of the filter to complex and diverse application scenarios, it cannot be applied to drone systems to perform complex flight missions in changing scenarios.

发明内容Summary of the invention

本发明提供无人机控制策略安全性验证与修正方法及系统,旨在至少解决现有技术中存在的技术问题之一。The present invention provides a method and system for verifying and correcting the safety of an unmanned aerial vehicle control strategy, aiming to solve at least one of the technical problems existing in the prior art.

本发明的技术方案涉及无人机控制策略安全性验证与修正方法及系统,所述的无人机控制策略安全性验证与修正方法应用于无人机系统上,所述无人机系统包含原有的无人机飞行控制策略,所述的方法包括以下步骤:The technical solution of the present invention relates to a method and system for verifying and correcting the safety of a drone control strategy. The method for verifying and correcting the safety of a drone control strategy is applied to a drone system, wherein the drone system includes an existing drone flight control strategy. The method comprises the following steps:

S100、获取当前无人机系统状态信息、无人机系统的原有无人机飞行控制策略输出的未验证的潜在不安全动作、和无人机自适应调节的安全约束边界;S100, obtaining the current UAV system status information, the unverified potential unsafe actions output by the original UAV flight control strategy of the UAV system, and the safety constraint boundary of the UAV adaptive adjustment;

S200、基于优化目标范数,建立并求解带有无人机动力学及安全性约束的有限时域优化问题,得到用于无人机系统飞行控制的安全控制动作;S200, based on the optimization target norm, establish and solve a finite time domain optimization problem with UAV dynamics and safety constraints, and obtain a safety control action for UAV system flight control;

S300、基于无人机自适应调节的安全约束边界,建立并更新无人机状态需要满足的安全性约束;S300, establishing and updating the safety constraints that the drone state needs to satisfy based on the safety constraint boundary of the drone adaptive adjustment;

S400、在有限时域范围内迭代计算步骤S200和步骤S300,得到无人机在未来有限时域范围内的安全状态序列及对应的安全控制动作序列。S400, iteratively calculating step S200 and step S300 within a limited time domain to obtain a safety state sequence and a corresponding safety control action sequence of the UAV within a future limited time domain.

进一步,所述步骤S200包括:Further, the step S200 includes:

S210、基于在最大化保留原有无人机飞行控制策略性能的同时,保证其输出控制动作的安全性,定义无人机系统的优化目标范数距离;S210, defining an optimization target norm distance of the UAV system based on maximizing the performance of the original UAV flight control strategy while ensuring the safety of its output control action;

S220、基于接收的无人机状态信息和预测得到的安全控制动作,进行无人机动力学转移,建立在有限时域[0,M-1]范围内安全控制动作的无人机动力学约束。S220. Based on the received UAV state information and the predicted safety control action, the UAV dynamics transfer is performed to establish the UAV dynamics constraints of the safety control action within the limited time domain [0, M-1].

进一步,所述步骤S210中,所述优化目标范数距离为:Further, in step S210, the optimization target norm distance is:

其中,为原有无人机飞行控制策略在t时刻输出的未验证的潜在不安全控制动作,/>为经过安全性验证与修正方法优化后得到的t时刻无人机安全控制动作。in, is the unverified potentially unsafe control action output by the original UAV flight control strategy at time t, /> It is the safety control action of the UAV at time t obtained after safety verification and optimization of the correction method.

进一步,原有无人机飞行控制策略输出的未验证的潜在不安全控制动作为:Furthermore, the original UAV flight control strategy outputs unverified potentially unsafe control actions for:

其中,Ft为无人机中所有机载电机生成的合推力,ωt为无人机在机体坐标系下的三轴角速度。Among them, Ft is the combined thrust generated by all the onboard motors in the UAV, and ωt is the three-axis angular velocity of the UAV in the body coordinate system.

进一步,所述步骤S220中,对有限时域范围内所有的整数m,所述无人机动力学约束为:Furthermore, in step S220, for all integers m within a limited time domain, the UAV dynamics constraint is:

其中,且m为整数,st+m为无人机在t+m时刻的状态,/>为无人机在t+m时刻的安全控制动作,st+m+1为由无人机动力学模型f(·)经过计算得到的下一时刻t+m+1的状态。in, And m is an integer, s t+m is the state of the drone at time t+m, /> is the safety control action of the UAV at time t+m, and s t+m+1 is the state at the next time t+m+1 calculated by the UAV dynamics model f(·).

进一步,所述步骤S220中,在t时刻无人机状态st包括:Further, in step S220, the drone state s t at time t includes:

st=[pt,qt,vt],s t =[p t ,q t ,v t ],

其中,pt为t时刻无人机的位置信息,qt为t时刻无人机的姿态四元数信息,vt为t时刻无人机的线速度信息。Among them, p t is the position information of the UAV at time t, q t is the attitude quaternion information of the UAV at time t, and v t is the linear velocity information of the UAV at time t.

进一步,所述步骤S300中,无人机状态的安全性约束包括无人机在有限时域范围内的状态均需处于安全状态集合S内,Furthermore, in step S300, the safety constraint of the drone state includes that the drone state within a limited time domain must be within the safety state set S.

所述安全状态集合S为:The security state set S is:

其中,st为无人机在t时刻的状态,B为状态安全性约束在每个状态分量上的权重矩阵,为在t时刻无人机状态需要满足的安全状态边界值,Among them, s t is the state of the UAV at time t, B is the weight matrix of the state safety constraint on each state component, is the safety state boundary value that the drone state needs to meet at time t,

其中,为t时刻无人机的位置信息的最小值,/>为t时刻无人机的姿态四元数信息的最小值,/>为t时刻无人机的线速度信息的最小值,/>为t时刻无人机的位置信息的最大值,/>为t时刻无人机的姿态四元数信息的最大值,/>为t时刻无人机的线速度信息的最大值。in, is the minimum value of the drone's position information at time t, /> is the minimum value of the UAV's attitude quaternion information at time t,/> is the minimum value of the UAV's linear velocity information at time t, /> is the maximum value of the drone's position information at time t, /> is the maximum value of the UAV's attitude quaternion information at time t,/> is the maximum value of the UAV’s linear velocity information at time t.

进一步,所述步骤S300中,无人机控制动作的安全性约束包括无人机在有限时域范围内的控制动作均需处于安全动作集合U内,Furthermore, in step S300, the safety constraints of the drone control actions include that the drone control actions within a limited time domain must be within the safety action set U.

所述安全动作集合U内为:The security action set U includes:

其中,U表示无人机控制动作需要满足的安全性约束集合,Among them, U represents the set of safety constraints that the drone control action needs to meet,

其中,A表示控制动作安全性约束在每个控制分量上的权重矩阵,为在t时刻无人机控制动作需要满足的安全边界值,Where A represents the weight matrix of the control action safety constraint on each control component, is the safety boundary value that the drone control action needs to meet at time t,

其中,为无人机中所有机载电机生成的合推力的最小值,/>为无人机中所有机载电机生成的合推力的最大值,/>为无人机在机体坐标系下的三轴角速度的最小值,/>为无人机在机体坐标系下的三轴角速度的最大值。in, is the minimum value of the combined thrust generated by all onboard motors in the drone,/> is the maximum value of the combined thrust generated by all the onboard motors in the drone,/> is the minimum value of the three-axis angular velocity of the drone in the body coordinate system, /> It is the maximum value of the three-axis angular velocity of the UAV in the body coordinate system.

进一步,在有限时域的终止时刻M,无人机状态st+M需要满足状态终止集Send,即,Furthermore, at the end time M of the finite time domain, the drone state s t+M needs to satisfy the state termination set S end , that is,

st+M∈Sends t+M ∈S end ,

其中,Send为状态终止集,状态终止集Send为状态安全性约束集合S的一个子集,即,Among them, S end is the state termination set, and the state termination set S end is a subset of the state safety constraint set S, that is,

其中,S为状态安全约束集合。Among them, S is the set of state safety constraints.

进一步,本发明还提出无人机控制策略安全性验证与修正系统,用于实施无人机控制策略安全性验证与修正方法,所述无人机控制策略安全性验证与修正系统包括:Furthermore, the present invention also proposes a UAV control strategy safety verification and correction system for implementing a UAV control strategy safety verification and correction method, wherein the UAV control strategy safety verification and correction system comprises:

无人机系统,所述无人机系统包含原有的无人机飞行控制策略,原有的无人机飞行控制策略基于无人机系统的无人机状态信息生成未验证的潜在不安全控制动作,控制无人机系统的飞行;An unmanned aerial vehicle system, the unmanned aerial vehicle system including an existing unmanned aerial vehicle flight control strategy, the existing unmanned aerial vehicle flight control strategy generating unverified potentially unsafe control actions based on unmanned aerial vehicle state information of the unmanned aerial vehicle system to control the flight of the unmanned aerial vehicle system;

安全性验证与修正装置,所述安全性验证与修正装置从无人机系统的原有的无人机飞行控制策略中获取潜在不安全控制动作,并生成安全控制动作后传送到无人机系统,所述安全性验证与修正装置与所述无人机系统连接。A safety verification and correction device, wherein the safety verification and correction device obtains potential unsafe control actions from the original UAV flight control strategy of the UAV system, generates safe control actions and transmits them to the UAV system, and the safety verification and correction device is connected to the UAV system.

与现有的技术相比,本发明具有以下的特点:Compared with the prior art, the present invention has the following characteristics:

本发明提出一种统一且通用的即插即用型无人机飞行控制策略安全性验证与修正方法,基于优化理论的思想,通过解耦的方式对潜在的不安全无人机飞行控制策略进行验证与修正,旨在最大化保留原有无人机飞行控制策略机动性能,保证其安全性,达到对无人机系统既安全又机动的飞行控制目的。此外,针对所提出的无人机飞行控制策略安全性验证与修正方法,本发明通过设计与之对应的无人机系统,用于部署和验证其可行性与有效性,并最终实现其在多样化真实场景下的应用。The present invention proposes a unified and universal plug-and-play UAV flight control strategy safety verification and correction method. Based on the idea of optimization theory, the potentially unsafe UAV flight control strategy is verified and corrected by decoupling, aiming to maximize the retention of the original UAV flight control strategy maneuverability, ensure its safety, and achieve the purpose of safe and maneuverable flight control of the UAV system. In addition, for the proposed UAV flight control strategy safety verification and correction method, the present invention designs a corresponding UAV system to deploy and verify its feasibility and effectiveness, and ultimately realizes its application in a variety of real scenarios.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为无人机控制策略安全性验证与修正方法的流程图。Figure 1 is a flow chart of the UAV control strategy safety verification and correction method.

图2为无人机控制策略安全性验证与修正方法的建立并求解带有无人机动力学及安全性约束的有限时域优化问题的流程图。Figure 2 is a flow chart of the establishment of the UAV control strategy safety verification and correction method and the solution of the finite time domain optimization problem with UAV dynamics and safety constraints.

图3为无人机飞行控制策略安全性验证与修正方法及系统框架的示意图。FIG3 is a schematic diagram of the safety verification and correction method and system framework of the UAV flight control strategy.

图4为无人机控制策略安全性验证与修正方法的无人机系统架构的示意图。FIG4 is a schematic diagram of the UAV system architecture of the UAV control strategy safety verification and correction method.

图5为无人机控制策略安全性验证与修正方法的原有无人机飞行控制策略构成的示意图。FIG5 is a schematic diagram of the original UAV flight control strategy structure of the UAV control strategy safety verification and correction method.

图6为无人机控制策略安全性验证与修正方法的安全性验证与修正方法框架的示意图。FIG6 is a schematic diagram of a safety verification and correction method framework for a UAV control strategy safety verification and correction method.

图7为无人机控制策略安全性验证与修正方法的安全性验证与修正优化问题求解流程。FIG7 is a flowchart of solving the safety verification and correction optimization problem of the UAV control strategy safety verification and correction method.

图8为无人机控制策略安全性验证与修正方法的仿真过程中无人机控制动作数据实时显示界面的示意图。FIG8 is a schematic diagram of a real-time display interface of UAV control action data during the simulation process of the UAV control strategy safety verification and correction method.

图9为无人机控制策略安全性验证与修正系统在高度受限(最大高度3m)的大厅环境下无人机起飞测试的仿真结果示意图。FIG9 is a schematic diagram of the simulation results of the drone takeoff test of the drone control strategy safety verification and correction system in a hall environment with limited height (maximum height 3m).

图10为无人机控制策略安全性验证与修正系统在宽度受限(宽度范围[-Figure 10 shows the safety verification and correction system of the UAV control strategy under the condition of limited width (width range [-

0.5,0.5]m)的走廊环境下无人机穿越测试仿真结果示意图。Schematic diagram of the simulation results of the drone crossing test in a corridor environment with a diameter of 0.5, 0.5]m.

图11a至图11b为无人机控制策略安全性验证与修正系统在空间受限(长[-1,1]m、宽[-1,1]m、高[0,2]m)的房间环境下无人机跟踪轨迹测试仿真结果示意图。Figures 11a to 11b are schematic diagrams of the simulation results of the drone tracking trajectory test of the drone control strategy safety verification and correction system in a room environment with limited space (length [-1,1]m, width [-1,1]m, height [0,2]m).

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solution and advantages of the embodiments of the present invention clearer, the technical solution in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

以下将结合实施例和附图对本发明的构思、具体结构及产生的技术效果进行清楚、完整的描述,以充分地理解本发明的目的、方案和效果。The concept, specific structure and technical effects of the present invention will be clearly and completely described below in combination with the embodiments and drawings to fully understand the purpose, scheme and effect of the present invention.

需要说明的是,如无特殊说明,当某一特征被称为“固定”、“连接”在另一个特征,它可以直接固定、连接在另一个特征上,也可以间接地固定、连接在另一个特征上。本文所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。此外,除非另有定义,本文所使用的所有的技术和科学术语与本技术领域的技术人员通常理解的含义相同。本文说明书中所使用的术语只是为了描述具体的实施例,而不是为了限制本发明。本文所使用的术语“和/或”包括一个或多个相关的所列项目的任意的组合。It should be noted that, unless otherwise specified, when a feature is referred to as being "fixed" or "connected" to another feature, it may be directly fixed or connected to another feature, or it may be indirectly fixed or connected to another feature. The singular forms "a", "said" and "the" used herein are also intended to include the plural forms, unless the context clearly indicates otherwise. In addition, unless otherwise defined, all technical and scientific terms used herein have the same meaning as those generally understood by those skilled in the art. The terms used in this specification are intended only to describe specific embodiments and are not intended to limit the invention. The term "and/or" used herein includes any combination of one or more related listed items.

应当理解,尽管在本公开可能采用术语第一、第二、第三等来描述各种元件,但这些元件不应限于这些术语。这些术语仅用来将同一类型的元件彼此区分开。例如,在不脱离本公开范围的情况下,第一元件也可以被称为第二元件,类似地,第二元件也可以被称为第一元件。本文所提供的任何以及所有实例或示例性语言(“例如”、“如”等)的使用仅意图更好地说明本发明的实施例,并且除非另外要求,否则不会对本发明的范围施加限制。此外,本文所采用的行业术语“位姿”是指某个元件相对于空间坐标系的位置和姿态。It should be understood that although the terms first, second, third, etc. may be used to describe various elements in the present disclosure, these elements should not be limited to these terms. These terms are only used to distinguish elements of the same type from each other. For example, without departing from the scope of the present disclosure, the first element may also be referred to as the second element, and similarly, the second element may also be referred to as the first element. The use of any and all examples or exemplary language ("for example", "such as", etc.) provided herein is intended only to better illustrate embodiments of the present invention, and unless otherwise required, will not impose limitations on the scope of the present invention. In addition, the industry term "posture" used herein refers to the position and posture of a certain element relative to a spatial coordinate system.

参照图1至图11b,本发明实施例提供了无人机控制策略安全性验证与修正方法及系统,所述的无人机控制策略安全性验证与修正方法应用于无人机系统上,所述无人机系统包含原有的无人机飞行控制策略,参照图1,所述的方法包括以下步骤:1 to 11 b, an embodiment of the present invention provides a method and system for verifying and correcting the safety of a drone control strategy. The method is applied to a drone system, wherein the drone system includes an existing drone flight control strategy. Referring to FIG. 1 , the method includes the following steps:

S100、获取当前无人机系统状态信息、无人机系统的原有无人机飞行控制策略输出的未验证的潜在不安全动作、和无人机自适应调节的安全约束边界;S100, obtaining the current UAV system status information, the unverified potential unsafe actions output by the original UAV flight control strategy of the UAV system, and the safety constraint boundary of the UAV adaptive adjustment;

S200、基于优化目标范数,建立并求解带有无人机动力学及安全性约束的有限时域优化问题,得到用于无人机系统飞行控制的安全控制动作;S200, based on the optimization target norm, establish and solve a finite time domain optimization problem with UAV dynamics and safety constraints, and obtain a safety control action for UAV system flight control;

S300、基于无人机自适应调节的安全约束边界,建立并更新无人机状态需要满足的安全性约束;S300, establishing and updating the safety constraints that the drone state needs to satisfy based on the safety constraint boundary of the drone adaptive adjustment;

S400、在有限时域范围内迭代计算步骤S200和步骤S300,得到无人机在未来有限时域范围内的安全状态序列及对应的安全控制动作序列。S400, iteratively calculating step S200 and step S300 within a limited time domain to obtain a safety state sequence and a corresponding safety control action sequence of the UAV within a future limited time domain.

与现有的技术相比,本发明具有以下的特点:Compared with the prior art, the present invention has the following characteristics:

本发明提出一种统一且通用的即插即用型无人机飞行控制策略安全性验证与修正方法,基于优化理论的思想,通过解耦的方式对潜在的不安全无人机飞行控制策略进行验证与修正,旨在最大化保留原有无人机飞行控制策略机动性能,保证其安全性,达到对无人机系统既安全又机动的飞行控制目的。此外,针对所提出的无人机飞行控制策略安全性验证与修正方法,本发明通过设计与之对应的无人机系统,用于部署和验证其可行性与有效性,并最终实现其在多样化真实场景下的应用。The present invention proposes a unified and universal plug-and-play UAV flight control strategy safety verification and correction method. Based on the idea of optimization theory, the potentially unsafe UAV flight control strategy is verified and corrected by decoupling, aiming to maximize the retention of the original UAV flight control strategy maneuverability, ensure its safety, and achieve the purpose of safe and maneuverable flight control of the UAV system. In addition, for the proposed UAV flight control strategy safety verification and correction method, the present invention designs a corresponding UAV system to deploy and verify its feasibility and effectiveness, and ultimately realizes its application in a variety of real scenarios.

针对现有无人机安全飞行控制方法存在的求解难度高、保守性强、耦合严重、探索效率低、通用性差、无法自适应于多样化复杂场景等缺陷,本发明的主要目的是提出一种统一且通用的即插即用型无人机飞行控制策略安全性验证与修正方法,基于优化理论的思想,并在此基础上通过解耦的方式对潜在的不安全无人机飞行控制策略进行验证与修正,旨在最大化保留原有无人机飞行控制策略机动性能的同时,保证其安全性,从而达到对无人机系统既安全又机动的飞行控制目的。此外,针对所提出的无人机飞行控制策略安全性验证与修正方法,本发明通过设计与之对应的无人机系统,用于部署和验证其可行性与有效性,并最终实现其在多样化真实场景下的应用。In view of the defects of existing UAV safe flight control methods, such as high difficulty in solving, strong conservatism, serious coupling, low exploration efficiency, poor versatility, and inability to adapt to diverse and complex scenarios, the main purpose of the present invention is to propose a unified and universal plug-and-play UAV flight control strategy safety verification and correction method, based on the idea of optimization theory, and on this basis, verify and correct the potential unsafe UAV flight control strategy by decoupling, aiming to maximize the retention of the original UAV flight control strategy maneuverability while ensuring its safety, so as to achieve the purpose of safe and maneuverable flight control of the UAV system. In addition, for the proposed UAV flight control strategy safety verification and correction method, the present invention designs a corresponding UAV system to deploy and verify its feasibility and effectiveness, and finally realizes its application in diverse real scenarios.

本发明技术方案带来了如下有益效果:The technical solution of the present invention brings the following beneficial effects:

(1)与基于模型的最优控制方法相比,本发明无需构造复杂的非线性优化目标来达到无人机的飞行控制目的,从而大幅度降低了优化问题的求解难度。与此同时,本发明主要强调的是在最大化保留原有无人机飞行控制策略性能的基础上,对潜在的不安全无人机飞行控制策略进行安全性验证与修正,因此,有效避免了所提出的安全性验证与修正方法对原有无人机飞行控制策略性能的影响,进而降低了无人机安全飞行控制策略的整体保守性。(1) Compared with the model-based optimal control method, the present invention does not need to construct complex nonlinear optimization objectives to achieve the flight control purpose of the UAV, thereby greatly reducing the difficulty of solving the optimization problem. At the same time, the present invention mainly emphasizes the safety verification and correction of the potential unsafe UAV flight control strategy on the basis of maximizing the performance of the original UAV flight control strategy. Therefore, the proposed safety verification and correction method is effectively avoided. The impact on the performance of the original UAV flight control strategy, thereby reducing the overall conservatism of the UAV safe flight control strategy.

(2)与基于无模型的安全强化学习方法相比,本发明通过解耦的方式将无人机飞行控制策略需要满足的安全性约束从安全强化学习控制策略的训练中独立出来,同时避免了安全性约束对策略搜索效率的负面影响以及策略与特定场景与任务的严重耦合,从而使得所提出的无人机控制策略安全性验证与修正方法具备了统一且通用的即插即用特性,能够适用于对所有潜在的不安全无人机飞行控制策略进行安全性验证,并最终为无人机系统提供安全飞行保证。(2) Compared with the model-free safety reinforcement learning method, the present invention separates the safety constraints that the UAV flight control strategy needs to meet from the training of the safety reinforcement learning control strategy by decoupling it, while avoiding the negative impact of the safety constraints on the strategy search efficiency and the serious coupling of the strategy with specific scenarios and tasks. As a result, the proposed UAV control strategy safety verification and correction method has a unified and universal plug-and-play feature, which can be applied to the safety verification of all potentially unsafe UAV flight control strategies, and ultimately provide safe flight guarantees for the UAV system.

(3)与现有基于安全性验证的方法相比,本发明基于设计的与所提出安全性验证与修正方法对应的无人机系统,并利用无人机系统中的环境特征感知模块实现了对安全性约束边界的自适应调节,从而使得所提出的无人机控制策略安全性验证与修正方法具备了自适应于多样化复杂环境的能力,极大地扩展了其应用场景。(3) Compared with the existing safety verification-based methods, the present invention is based on a designed UAV system corresponding to the proposed safety verification and correction method, and utilizes the environmental feature perception module in the UAV system to realize adaptive adjustment of the safety constraint boundary, thereby enabling the proposed UAV control strategy safety verification and correction method to have the ability to adapt to diverse and complex environments, greatly expanding its application scenarios.

具体地,在步骤S100后,在得到原有无人机飞行控制策略输出的未验证的潜在不安全控制动作之后,为了对其进行安全性验证与修正,本发明提出了如图4所示的安全性验证与修正方法,通过优化理论的思想将该问题转化为带有无人机动力学及安全性约束的有限时域优化问题进行求解,最终输出具有安全性保证的无人机控制动作。该有限时域优化问题主要由优化目标、无人机动力学约束、以及安全性约束三部分构成。Specifically, after step S100, after obtaining the unverified potential unsafe control action output by the original UAV flight control strategy, in order to verify and correct its safety, the present invention proposes a safety verification and correction method as shown in Figure 4, which transforms the problem into a finite time domain optimization problem with UAV dynamics and safety constraints through the idea of optimization theory, and finally outputs the UAV control action with safety guarantee. The finite time domain optimization problem mainly consists of three parts: optimization objective, UAV dynamics constraint, and safety constraint.

进一步,参照图1至图2,所述步骤S200包括:Further, referring to FIG. 1 and FIG. 2 , the step S200 includes:

S210、基于在最大化保留原有无人机飞行控制策略性能的同时,保证其输出控制动作的安全性,定义无人机系统的优化目标范数距离;S210, defining an optimization target norm distance of the UAV system based on maximizing the performance of the original UAV flight control strategy while ensuring the safety of its output control action;

S220、基于接收的无人机状态信息和预测得到的安全控制动作,进行无人机动力学转移,建立在有限时域[0,M-1]范围内安全控制动作的无人机动力学约束。S220. Based on the received UAV state information and the predicted safety control action, the UAV dynamics transfer is performed to establish the UAV dynamics constraints of the safety control action within the limited time domain [0, M-1].

进一步,参照图1至图2,所述步骤S210中,所述优化目标范数距离为:Further, referring to FIG. 1 and FIG. 2 , in step S210, the optimization target norm distance is:

其中,为原有无人机飞行控制策略在t时刻输出的未验证的潜在不安全控制动作,/>为经过安全性验证与修正方法优化后得到的t时刻无人机安全控制动作。in, is the unverified potentially unsafe control action output by the original UAV flight control strategy at time t, /> It is the safety control action of the UAV at time t obtained after safety verification and optimization of the correction method.

具体地,优化目标,根据本发明所提出的安全性验证与修正方法的优化目的,即在最大化保留原有无人机飞行控制策略性能的同时,保证其输出控制动作的安全性,其仅与未验证的潜在不安全控制动作有关,无需利用复杂的数学形式进行构造,只需最小化该不安全控制动作/>与预测安全控制动作/>之间的范数距离,即可达到上述优化目的。Specifically, the optimization goal is to optimize the safety verification and correction method proposed by the present invention, that is, to maximize the performance of the original UAV flight control strategy while ensuring the safety of its output control action, which is only related to the unverified potential unsafe control action. There is no need to use complex mathematical forms to construct it, just minimize the unsafe control action/> and predictive safety control actions/> The above optimization purpose can be achieved by simply calculating the norm distance between them.

进一步,参照图1至图2,原有无人机飞行控制策略输出的未验证的潜在不安全控制动作为:Further, referring to FIGS. 1 and 2 , the unverified potentially unsafe control actions output by the original UAV flight control strategy for:

其中,Ft为无人机中所有机载电机生成的合推力,ωt为无人机在机体坐标系下的三轴角速度。表示原有无人机飞行控制策略在t时刻输出的不安全控制动作,其由无人机四个机载电机生成的合推力Ft和机体坐标系下的三轴角速度ωt组成。Among them, Ft is the combined thrust generated by all the onboard motors in the drone, and ωt is the three-axis angular velocity of the drone in the body coordinate system. represents the unsafe control action output by the original drone flight control strategy at time t, which is composed of the combined thrust Ft generated by the four onboard motors of the drone and the three-axis angular velocity ωt in the body coordinate system.

进一步,参照图1至图2,所述步骤S220中,对有限时域范围内所有的整数m,所述无人机动力学约束为:Further, referring to FIG. 1 and FIG. 2 , in step S220, for all integers m within a limited time domain, the UAV dynamics constraint is:

其中,且m为整数,/>表示有限时域范围内所有的整数值0,1,…,m,M-1;st+m为无人机在t+m时刻的状态,/>为无人机在t+m时刻的安全控制动作,st+m+1为由无人机动力学模型f(·)经过计算得到的下一时刻t+m+1的状态。/>表示经过安全性验证与修正方法优化后得到的t时刻无人机安全控制动作,同理,/>表示在t+m时刻无人机的安全控制动作。in, and m is an integer, /> represents all integer values 0, 1, ..., m, M-1 within a limited time domain; s t+m is the state of the drone at time t+m, /> is the safety control action of the UAV at time t+m, and s t+m+1 is the state at the next time t+m+1 calculated by the UAV dynamics model f(·). /> represents the safety control action of the drone at time t obtained after safety verification and optimization of the correction method. Similarly, /> Indicates the safety control action of the drone at time t+m.

无人机动力学约束:无人机动力学约束是该优化问题根据实时预测得到的安全控制动作在有限时域[0,M-1]范围内进行无人机状态转移与更新的重要依据,也是保证无人机在未来时刻满足安全性约束的必要条件,因此,基于接收的无人机状态信息和预测得到的安全控制动作进行无人机动力学转移是该优化问题需要满足的约束之一。UAV dynamic constraints: UAV dynamic constraints are an important basis for the optimization problem to transfer and update the UAV state within a limited time domain [0, M-1] according to the safety control actions obtained by real-time prediction. It is also a necessary condition to ensure that the UAV meets the safety constraints at all times in the future. Therefore, UAV dynamic transfer based on the received UAV state information and the predicted safety control actions is one of the constraints that need to be met in this optimization problem.

进一步,参照图1至图2,所述步骤S220中,在t时刻无人机状态st包括:Further, referring to FIG. 1 and FIG. 2 , in step S220, the drone state s t at time t includes:

st=[pt,qt,vt],s t =[p t ,q t ,v t ],

其中,pt为t时刻无人机的位置信息,qt为t时刻无人机的姿态四元数信息,vt为t时刻无人机的线速度信息。Among them, p t is the position information of the UAV at time t, q t is the attitude quaternion information of the UAV at time t, and v t is the linear velocity information of the UAV at time t.

同理,在t+m以及t+m+1时刻的无人机状态分别表示为st+m,st+m+1Similarly, the drone states at time t+m and t+m+1 are represented as s t+m , s t+m+1 respectively.

进一步,参照图1至图2,所述步骤S300中,无人机状态的安全性约束包括无人机在有限时域范围内的状态均需处于安全状态集合S内,Further, referring to FIG. 1 and FIG. 2 , in step S300 , the safety constraint of the drone state includes that the drone state within a limited time domain must be within a safe state set S,

所述安全状态集合S为:The security state set S is:

其中,st为无人机在t时刻的状态,B为状态安全性约束在每个状态分量上的权重矩阵,为在t时刻无人机状态需要满足的安全状态边界值,Among them, s t is the state of the UAV at time t, B is the weight matrix of the state safety constraint on each state component, is the safety state boundary value that the drone state needs to meet at time t,

其中,为t时刻无人机的位置信息的最小值,/>为t时刻无人机的姿态四元数信息的最小值,/>为t时刻无人机的线速度信息的最小值,/>为t时刻无人机的位置信息的最大值,/>为t时刻无人机的姿态四元数信息的最大值,/>为t时刻无人机的线速度信息的最大值。in, is the minimum value of the drone's position information at time t, /> is the minimum value of the UAV's attitude quaternion information at time t,/> is the minimum value of the UAV's linear velocity information at time t, /> is the maximum value of the drone's position information at time t, /> is the maximum value of the UAV's attitude quaternion information at time t,/> is the maximum value of the UAV’s linear velocity information at time t.

进一步,参照图1至图2,所述步骤S300中,无人机控制动作的安全性约束包括无人机在有限时域范围内的控制动作均需处于安全动作集合U内,Further, referring to FIG. 1 and FIG. 2 , in step S300 , the safety constraints of the drone control actions include that the drone control actions within a limited time domain must be within the safety action set U,

所述安全动作集合U内为:The security action set U includes:

其中,U表示无人机控制动作需要满足的安全性约束集合,Among them, U represents the set of safety constraints that the drone control action needs to meet,

其中,A表示控制动作安全性约束在每个控制分量上的权重矩阵,为在t时刻无人机控制动作需要满足的安全边界值,Where A represents the weight matrix of the control action safety constraint on each control component, is the safety boundary value that the drone control action needs to meet at time t,

其中,为无人机中所有机载电机生成的合推力的最小值,/>为无人机中所有机载电机生成的合推力的最大值,/>为无人机在机体坐标系下的三轴角速度的最小值,/>为无人机在机体坐标系下的三轴角速度的最大值。in, is the minimum value of the combined thrust generated by all onboard motors in the drone,/> is the maximum value of the combined thrust generated by all the onboard motors in the drone,/> is the minimum value of the three-axis angular velocity of the drone in the body coordinate system, /> It is the maximum value of the three-axis angular velocity of the UAV in the body coordinate system.

进一步,参照图1至图2,在有限时域的终止时刻M,无人机状态st+M需要满足状态终止集Send,即,Further, referring to FIG. 1 and FIG. 2 , at the end time M of the finite time domain, the drone state s t+M needs to satisfy the state end set S end , that is,

st+M∈Sends t+M ∈S end ,

其中,Send为状态终止集,状态终止集Send为状态安全性约束集合S的一个子集,即,Among them, S end is the state termination set, and the state termination set S end is a subset of the state safety constraint set S, that is,

其中,S为状态安全约束集合。Among them, S is the set of state safety constraints.

具体地,安全性约束是保证无人机状态及控制动作始终处于安全范围内必须要满足的条件。无人机在有限时域范围内的状态和控制动作均需处于安全状态集合S和安全动作集合内U,即每个状态分量pt,qt,vt和控制动作分量Ft,ωt均需满足安全约束边界条件,具体表示为 其中,Send为集合S的一个子集,即表示无人机在有限时域范围内的状态终止集。为了保证该优化问题求解的递归可行性,定义状态终止集Send为不变集,即存在一个控制策略/>对于/>均有π(s)∈U和f(s,π(s))∈Send成立,从而使得当该优化问题在初始时刻t0有可行解时,在未来时刻t>t0同样存在可行解。另外,为了实现该安全性验证与修正方法对多样化场景的自适应能力,本发明利用从无人机系统中环境特征感知模块获取的状态安全约束边界/>中的实时位置信息/>对上述安全性约束中的状态安全集合S进行实时调节,使得其满足 从而使得其与无人机当前所处环境所需满足的位置安全边界条件保持一致。结合上述优化问题的构成部分,通过在有限时域内进行实时高效的求解,最终得到满足安全性约束的无人机控制动作,达到无人机安全飞行控制的目的。Specifically, safety constraints are conditions that must be met to ensure that the drone state and control actions are always within a safe range. The drone state and control actions within a limited time domain must be within the safe state set S and safe action set U, that is, each state component p t , q t , v t and control action component F t , ω t must satisfy the safety constraint boundary conditions, which can be specifically expressed as Among them, S end is a subset of the set S, that is, Represents the state termination set of the UAV within a limited time domain. In order to ensure the recursive feasibility of solving the optimization problem, the state termination set S end is defined as an invariant set, that is, there exists a control strategy/> For/> π(s)∈U and f(s,π(s))∈S end hold, so that when the optimization problem has a feasible solution at the initial time t 0 , it also has a feasible solution at the future time t>t 0. In addition, in order to realize the adaptive ability of the safety verification and correction method to diversified scenarios, the present invention uses the state safety constraint boundary obtained from the environmental feature perception module in the drone system/> Real-time location information in/> The state safety set S in the above safety constraints is adjusted in real time so that it satisfies This makes it consistent with the position safety boundary conditions that the drone is currently in. Combining the components of the above optimization problem, by performing real-time and efficient solutions in a limited time domain, the drone control action that meets the safety constraints is finally obtained, achieving the purpose of safe flight control of the drone.

进一步,参照图3至图6,本发明还提出无人机控制策略安全性验证与修正系统,用于实施无人机控制策略安全性验证与修正方法,所述无人机控制策略安全性验证与修正系统包括:Further, referring to FIGS. 3 to 6 , the present invention also proposes a UAV control strategy safety verification and correction system for implementing a UAV control strategy safety verification and correction method, wherein the UAV control strategy safety verification and correction system comprises:

无人机系统,所述无人机系统包含原有的无人机飞行控制策略,原有的无人机飞行控制策略基于无人机系统的无人机状态信息生成未验证的潜在不安全控制动作,控制无人机系统的飞行;An unmanned aerial vehicle system, the unmanned aerial vehicle system including an existing unmanned aerial vehicle flight control strategy, the existing unmanned aerial vehicle flight control strategy generating unverified potentially unsafe control actions based on unmanned aerial vehicle state information of the unmanned aerial vehicle system to control the flight of the unmanned aerial vehicle system;

安全性验证与修正装置,所述安全性验证与修正装置从无人机系统的原有的无人机飞行控制策略中获取潜在不安全控制动作,并生成安全控制动作后传送到无人机系统,所述安全性验证与修正装置与所述无人机系统连接。A safety verification and correction device, wherein the safety verification and correction device obtains potential unsafe control actions from the original UAV flight control strategy of the UAV system, generates safe control actions and transmits them to the UAV system, and the safety verification and correction device is connected to the UAV system.

参照图3,在一些具体的实施例中,本发明最简实现方案主要包括三个部分,分别为无人机系统、原有无人机飞行控制策略、以及安全性验证与修正方法。无人机系统作为被控对象主要用于接收控制策略经过安全性验证与修正后产生的安全控制动作,通过将该控制动作作用于系统底层执行机构上产生无人机状态信息的变化与更新。与此同时,根据无人机系统周围所处环境的变化,通过实时环境特征感知模块自适应调节安全约束边界,从而使得安全性验证与修正方法具备多样化场景的自适应能力。原有无人机飞行控制策略则基于当前的无人机状态信息生成未验证的潜在不安全控制动作,用于实现对无人机系统的飞行控制。为了使得作用于无人机系统的控制动作具有安全性保证,则需要利用安全性验证与修正方法对潜在的不安全控制动作进行检测,并基于检测结果对其进行安全性修正。在安全性验证与修正方法中,利用同时输入的不安全控制动作、当前无人机系统状态信息、以及自适应调节的安全约束边界对优化问题进行配置,并通过求解带有无人机系统动力学及安全性约束的有限时域优化问题,以得到用于无人机系统飞行控制的安全控制动作。基于上述最简实现方案中三个部分的相互反馈与结合,通过无人机系统状态信息与控制动作之间的不断传输与交互,如此循环,最终保证无人机系统始终处于安全飞行状态。Referring to FIG. 3 , in some specific embodiments, the simplest implementation scheme of the present invention mainly includes three parts, namely, the unmanned aerial vehicle system, the original unmanned aerial vehicle flight control strategy, and the safety verification and correction method. As the controlled object, the unmanned aerial vehicle system is mainly used to receive the safety control action generated by the control strategy after safety verification and correction, and the change and update of the unmanned aerial vehicle state information is generated by applying the control action to the bottom actuator of the system. At the same time, according to the changes in the environment around the unmanned aerial vehicle system, the safety constraint boundary is adaptively adjusted through the real-time environmental feature perception module, so that the safety verification and correction method has the adaptive ability of diversified scenarios. The original unmanned aerial vehicle flight control strategy generates unverified potential unsafe control actions based on the current unmanned aerial vehicle state information, which is used to realize the flight control of the unmanned aerial vehicle system. In order to ensure the safety of the control action acting on the unmanned aerial vehicle system, it is necessary to use the safety verification and correction method to detect the potential unsafe control action, and to correct it for safety based on the detection result. In the safety verification and correction method, the optimization problem is configured using the unsafe control action input at the same time, the current unmanned aerial vehicle system state information, and the adaptively adjusted safety constraint boundary, and the finite time domain optimization problem with the unmanned aerial vehicle system dynamics and safety constraints is solved to obtain the safety control action for the flight control of the unmanned aerial vehicle system. Based on the mutual feedback and combination of the three parts in the above-mentioned simplest implementation scheme, through the continuous transmission and interaction between the drone system status information and control actions, such a cycle is finally implemented, which ultimately ensures that the drone system is always in a safe flight state.

参照图4,在一些具体的实施例中,无人机系统作为被控对象主体,主要用于接收无人机飞行控制策略产生的控制动作,并将其自身的实时状态信息作为反馈量输入到无人机飞行控制策略,形成闭环的无人机控制系统,达到对无人机系统的控制目的。参照图4,无人机系统主要由两个部分构成,分别为机载计算机和底层飞行控制器。机载计算机利用无人机系统搭载的传感器,通过环境特征感知模块和定位模块分别获取无人机所处环境的特征信息与无人机系统自身的状态信息。在获取无人机所处环境的特征信息之后,便可基于该信息对无人机飞行控制策略所要满足的安全约束边界进行自适应调节,从而使得对应的安全性验证与修正方法具备多样化场景的自适应能力,进而保证无人机系统在复杂多变的环境中能够实时获取到相应的安全控制动作,用于实现其安全飞行控制。在获取无人机系统自身的状态信息之后,通过与底层飞行控制器中惯性测量单元测量的无人机姿态信息进行融合,得到更加准确的无人机系统状态估计信息,作为无人机飞行控制策略和与之对应的安全性验证与修正方法的输入,成为获取安全控制动作的必要条件。底层飞行控制器在得到安全控制动作之后,基于惯性测量单元获取的当前无人机姿态信息,通过对无人机系统进行底层角速度控制,最终得到用于驱动执行机构的控制量,实现对无人机系统整体的控制。Referring to FIG4, in some specific embodiments, the UAV system, as the subject of the controlled object, is mainly used to receive the control action generated by the UAV flight control strategy, and inputs its own real-time state information as feedback to the UAV flight control strategy, forming a closed-loop UAV control system to achieve the purpose of controlling the UAV system. Referring to FIG4, the UAV system is mainly composed of two parts, namely, an onboard computer and an underlying flight controller. The onboard computer uses the sensors carried by the UAV system to obtain the characteristic information of the environment in which the UAV is located and the state information of the UAV system itself through the environmental feature perception module and the positioning module. After obtaining the characteristic information of the environment in which the UAV is located, the safety constraint boundary to be satisfied by the UAV flight control strategy can be adaptively adjusted based on the information, so that the corresponding safety verification and correction method has the adaptive ability of diversified scenarios, thereby ensuring that the UAV system can obtain the corresponding safety control action in real time in a complex and changeable environment, which is used to realize its safe flight control. After obtaining the state information of the UAV system itself, it is fused with the UAV attitude information measured by the inertial measurement unit in the underlying flight controller to obtain more accurate UAV system state estimation information, which is used as the input of the UAV flight control strategy and the corresponding safety verification and correction method, and becomes a necessary condition for obtaining safety control actions. After obtaining the safety control action, the underlying flight controller performs underlying angular velocity control on the UAV system based on the current UAV attitude information obtained by the inertial measurement unit, and finally obtains the control quantity used to drive the actuator, thereby achieving overall control of the UAV system.

参照图5,在一些具体的实施例中,原有无人机飞行控制策略是指未经安全性验证并存在潜在安全隐患的无人机飞行控制策略。根据本发明提出的无人机飞行控制策略安全性验证与修正方法具备的统一性与通用性特性,原有无人机飞行控制策略不特指某一种无人机飞行控制策略,其中有N种可能的控制策略,本发明列举了较为普遍的两种,分别为传统控制策略和强化学习控制策略。原有无人机飞行控制策略基于在无人机系统中获取的无人机状态信息,经过不同控制策略算法的求解,均可得到用于无人机系统飞行控制的控制动作。但由于原有无人机飞行控制策略存在的潜在安全隐患,其输出的无人机控制动作同样存在一定的安全风险,因此在作用到实际无人机系统之前,需要通过安全性验证与修正方法的检测和调整,以确保其输出安全的无人机控制动作。Referring to FIG. 5 , in some specific embodiments, the original UAV flight control strategy refers to a UAV flight control strategy that has not been safety verified and has potential safety hazards. According to the unified and universal characteristics of the UAV flight control strategy safety verification and correction method proposed in the present invention, the original UAV flight control strategy does not specifically refer to a certain UAV flight control strategy, among which there are N possible control strategies. The present invention lists two more common ones, namely traditional control strategy and reinforcement learning control strategy. The original UAV flight control strategy is based on the UAV state information obtained in the UAV system. After solving different control strategy algorithms, the control action for the flight control of the UAV system can be obtained. However, due to the potential safety hazards of the original UAV flight control strategy, the output UAV control action also has certain safety risks. Therefore, before acting on the actual UAV system, it needs to be tested and adjusted by the safety verification and correction method to ensure that it outputs safe UAV control actions.

参照图7,在一些具体的实施例中,针对上述优化问题的求解,在时刻t,将原有无人机飞行控制策略输出的不安全控制动作无人机状态信息st、以及状态安全约束边界信息/>同时输入到优化问题中,通过在有限时域范围[0,M-1]内循环迭代M次,便可预测出无人机在未来M个时间步内的安全状态序列st+1,st+2,…,st+M及对应的安全控制动作序列/>取其中t时刻对应的安全控制动作/>作为对原有无人机飞行控制动作/>的修正结果,便可保证无人机始终处于安全飞行状态,从而达到对无人机的安全飞行控制目的。7 , in some specific embodiments, in order to solve the above optimization problem, at time t, the unsafe control action output by the original UAV flight control strategy is UAV state information s t and state safety constraint boundary information/> At the same time, it is input into the optimization problem. By looping and iterating M times in the limited time domain range [0, M-1], the safety state sequence of the UAV in the next M time steps s t+1 , s t+2 ,…, s t+M and the corresponding safety control action sequence can be predicted/> Take the safety control action corresponding to time t/> As an alternative to the original UAV flight control action/> The correction result can ensure that the UAV is always in a safe flight state, thereby achieving the purpose of safe flight control of the UAV.

针对具体的实施过程,本发明将不安全的强化学习(Reinforcement Learning,RL)控制策略作为无人机的原有飞行控制策略,在此基础上,通过提出的具备环境自适应能力的安全性验证与修正方法(Adaptive Safety Predictive Corrector,ASPC)对其进行安全性验证,分别在高度受限的大厅环境、宽度受限的走廊环境、以及空间受限的房间环境下实施了无人机安全飞行控制仿真实验。其中,用于安全性验证与修正方法优化问题求解所需的具体参数配置参照表1:For the specific implementation process, the present invention uses the unsafe reinforcement learning (RL) control strategy as the original flight control strategy of the UAV. On this basis, the proposed safety verification and correction method (Adaptive Safety Predictive Corrector, ASPC) with environmental adaptive capability is used to verify its safety. The UAV safe flight control simulation experiment is implemented in a hall environment with limited height, a corridor environment with limited width, and a room environment with limited space. The specific parameter configuration required for solving the optimization problem of the safety verification and correction method is shown in Table 1:

表1安全性验证与修正方法优化问题参数配置Table 1 Parameter configuration of optimization problem for security verification and correction method

参照图8,对于仿真结果,在无人机仿真实验过程中,原有无人机飞行控制策略输出的不安全控制动作(RL)与经过安全性验证与修正方法(ASPC)优化出的安全控制动作(Safe)数据可实时显示。Referring to FIG8 , for the simulation results, during the UAV simulation experiment, the unsafe control action (RL) output by the original UAV flight control strategy and the safe control action (Safe) data optimized by the safety verification and correction method (ASPC) can be displayed in real time.

从仿真过程中的数据显示可知,本发明提出的安全性验证与修正方法具备实时高效的求解能力,单次求解时间均在2ms以内,满足无人机飞行控制对实时性的要求。此外,通过在无人机飞行过程中录制其飞行轨迹数据及对应时刻的原有控制动作与修正后的安全控制动作序列,并将得到的数据绘制成仿真结果。From the data displayed in the simulation process, it can be seen that the safety verification and correction method proposed in the present invention has real-time and efficient solution capabilities, and the single solution time is within 2ms, which meets the real-time requirements of UAV flight control. In addition, by recording the flight trajectory data of the UAV and the original control actions and the corrected safety control action sequence at the corresponding time during flight, the obtained data is plotted into simulation results.

参照图9至图11b,其中,图9表示无人机在高度受限的大厅环境下的起飞测试结果,无人机的高度和控制动作始终处于安全范围之内;图10表示无人机在宽度受限的走廊环境下的穿越测试结果,无人机的位置始终处于安全边界之内;图11a和图11b分别表示无人机在空间受限的房间环境下的轨迹跟踪测试结果,无人机在轨迹跟踪过程中的飞行轨迹和控制动作序列表明其始终处于安全飞行状态。由上述图中所示的测试结果可知,无人机在不同空间受限环境下执行不同飞行任务时,均能够始终保持在安全状态空间中,从而实现了无人机的安全飞行控制效果,验证了本发明所提出的安全性验证与修正方法的有效性及对多样化环境的自适应能力。通过对比图9和图11b中无人机控制动作在安全性验证前后的区别可知,每当无人机靠近安全性约束边界时,本发明所提出的安全性验证与修正方法均会对原有无人机飞行控制策略输出的不安全控制动作进行实时修正,以保证无人机接收到安全的控制动作,进而达到对无人机安全飞行控制的目的。此外,当无人机未靠近安全性约束边界时,安全性验证与修正方法则并未对原有无人机飞行控制策略进行非必要的修正,从而进一步验证了其最大化保留原有无人机飞行控制策略性能的优化目标原则。Referring to Figures 9 to 11b, Figure 9 shows the take-off test results of the drone in a hall environment with limited height, and the height and control action of the drone are always within the safe range; Figure 10 shows the crossing test results of the drone in a corridor environment with limited width, and the position of the drone is always within the safe boundary; Figures 11a and 11b respectively show the trajectory tracking test results of the drone in a room environment with limited space, and the flight trajectory and control action sequence of the drone during the trajectory tracking process show that it is always in a safe flight state. From the test results shown in the above figures, it can be seen that when the drone performs different flight missions in different space-constrained environments, it can always remain in the safe state space, thereby achieving the safe flight control effect of the drone, verifying the effectiveness of the safety verification and correction method proposed in the present invention and its adaptability to diverse environments. By comparing the difference between the drone control action before and after the safety verification in Figures 9 and 11b, it can be seen that whenever the drone approaches the safety constraint boundary, the safety verification and correction method proposed in the present invention will correct the unsafe control action output by the original drone flight control strategy in real time to ensure that the drone receives a safe control action, thereby achieving the purpose of safe flight control of the drone. In addition, when the UAV was not close to the safety constraint boundary, the safety verification and correction method did not make unnecessary corrections to the original UAV flight control strategy, thereby further verifying its optimization goal principle of maximizing the retention of the original UAV flight control strategy performance.

目前,无人机安全飞行控制策略大多基于最优控制的思想,通过构造复杂的优化目标函数及安全性约束,达到对无人机的安全飞行控制目的。然而,在此过程中,需要依赖严格且复杂的数学形式来构造无人机控制策略的优化目标及约束,由此导致了其复杂的非线性结构且往往难以通过近似求解的方式得到其可行解。此外,无人机控制策略对于复杂约束的满足会不断加剧其输出控制动作的保守性,进而使得其无法适用于无人机系统对飞行机动性的需求。尽管基于无模型的安全强化学习方法能够通过强大的策略搜索与优化能力训练出满足安全性约束的无人机飞行控制策略,但是此类方法由于安全性约束的限制,在一定程度上会影响策略探索与优化的效率,且训练出的无人机安全控制策略均与所处的应用场景和特定的飞行任务进行了深度耦合,不具备通用性和对多样化应用场景的泛化能力。此外,在无人机控制策略的训练过程中,由于其与环境交互存在不断的试错与探索,因此,无法避免其违反安全性约束,进而无法保证其在训练过程中的安全性。基于安全性验证的方法作为另一类可选的用于提升无人机飞行控制安全性的方法,虽然能够通过安全性验证机制对不安全控制策略进行预测与修正,但是同样存在构造难度大、计算复杂度高、环境自适应能力差等问题而无法在真实场景下的无人机系统中进行实时部署与应用。At present, most of the safe flight control strategies of UAVs are based on the idea of optimal control, and the purpose of safe flight control of UAVs is achieved by constructing complex optimization objective functions and safety constraints. However, in this process, it is necessary to rely on strict and complex mathematical forms to construct the optimization objectives and constraints of the UAV control strategy, which leads to its complex nonlinear structure and is often difficult to obtain its feasible solution through approximate solution. In addition, the satisfaction of the complex constraints of the UAV control strategy will continuously increase the conservatism of its output control actions, making it unsuitable for the flight maneuverability requirements of the UAV system. Although the model-free safety reinforcement learning method can train UAV flight control strategies that meet safety constraints through powerful strategy search and optimization capabilities, such methods will affect the efficiency of strategy exploration and optimization to a certain extent due to the limitations of safety constraints, and the trained UAV safety control strategies are deeply coupled with the application scenarios and specific flight tasks, and do not have universality and generalization capabilities for diversified application scenarios. In addition, during the training process of the UAV control strategy, due to the continuous trial and error and exploration of its interaction with the environment, it is impossible to avoid violating safety constraints, and thus its safety during the training process cannot be guaranteed. Safety verification-based methods are another optional method for improving the safety of UAV flight control. Although they can predict and correct unsafe control strategies through safety verification mechanisms, they also have problems such as high construction difficulty, high computational complexity, and poor environmental adaptability, making them impossible to be deployed and applied in real time in UAV systems in real scenarios.

本发明针对无人机飞行控制策略在设计、训练、测试、及部署过程中存在的安全风险,公开了一种安全性验证与修正方法及系统,其主要的技术关键点为:Aiming at the safety risks existing in the design, training, testing and deployment of UAV flight control strategies, the present invention discloses a safety verification and correction method and system, the main technical key points of which are:

(1)针对所有潜在的不安全无人机飞行控制策略,提出了一种统一且通用的即插即用型安全性验证与修正方法,通过解耦的方式对无人机飞行控制策略进行验证与修正,达到了无人机安全飞行控制的目的。(1) For all potentially unsafe UAV flight control strategies, a unified and universal plug-and-play safety verification and correction method is proposed. The UAV flight control strategy is verified and corrected in a decoupled manner, thus achieving the purpose of safe UAV flight control.

(2)该安全性验证与修正方法基于优化理论,通过设计特定的优化目标及安全性约束,在最大化保留原有无人机飞行控制策略性能的基础上,通过实时高效的计算求解得到了安全飞行控制策略,保证了无人机飞行控制的安全性。(2) This safety verification and correction method is based on optimization theory. By designing specific optimization objectives and safety constraints, it obtains a safe flight control strategy through real-time and efficient calculation while maximizing the performance of the original UAV flight control strategy, thereby ensuring the safety of UAV flight control.

(3)该安全性验证与修正方法能够基于当前环境特征信息自适应地调整其安全性约束边界,使得其具备对多样化环境的适应能力,适用于多场景下的无人机安全飞行控制。(3) This safety verification and correction method can adaptively adjust its safety constraint boundaries based on the current environmental feature information, so that it has the ability to adapt to diverse environments and is suitable for safe flight control of drones in multiple scenarios.

(4)另外,本发明基于提出的无人机控制策略安全性验证与修正方法,设计了对应于该方法的无人机系统,用于部署和验证所提出方法的可行性与有效性。(4) In addition, based on the proposed UAV control strategy safety verification and correction method, the present invention designs a UAV system corresponding to the method to deploy and verify the feasibility and effectiveness of the proposed method.

进一步,本发明还提出一种计算机可读存储介质,其上储存有程序指令,所述程序指令被处理器执行时实施所述的方法。应当认识到,本发明实施例中的方法步骤可以由计算机硬件、硬件和软件的组合、或者通过存储在非暂时性计算机可读存储器中的计算机指令来实现或实施。所述方法可以使用标准编程技术。每个程序可以以高级过程或面向对象的编程语言来实现以与计算机系统通信。然而,若需要,该程序可以以汇编或机器语言实现。在任何情况下,该语言可以是编译或解释的语言。此外,为此目的该程序能够在编程的专用集成电路上运行。Further, the present invention also proposes a computer-readable storage medium, on which program instructions are stored, and the described method is implemented when the program instructions are executed by a processor. It should be appreciated that the method steps in the embodiments of the present invention can be implemented or implemented by computer hardware, a combination of hardware and software, or by computer instructions stored in a non-temporary computer-readable memory. The method can use standard programming techniques. Each program can be implemented in a high-level process or object-oriented programming language to communicate with a computer system. However, if necessary, the program can be implemented in assembly or machine language. In any case, the language can be a compiled or interpreted language. In addition, the program can be run on a programmed application-specific integrated circuit for this purpose.

此外,可按任何合适的顺序来执行本文描述的过程的操作,除非本文另外指示或以其他方式明显地与上下文矛盾。本文描述的过程(或变型和/或其组合)可在配置有可执行指令的一个或多个计算机系统的控制下执行,并且可作为共同地在一个或多个处理器上执行的代码(例如,可执行指令、一个或多个计算机程序或一个或多个应用)、由硬件或其组合来实现。所述计算机程序包括可由一个或多个处理器执行的多个指令。Furthermore, the operations of the processes described herein may be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The processes described herein (or variations and/or combinations thereof) may be performed under the control of one or more computer systems configured with executable instructions, and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) that is executed collectively on one or more processors, by hardware, or a combination thereof. The computer program includes a plurality of instructions that may be executed by one or more processors.

进一步,所述方法可以在可操作地连接至合适的任何类型的计算平台中实现,包括但不限于个人电脑、迷你计算机、主框架、工作站、网络或分布式计算环境、单独的或集成的计算机平台、或者与带电粒子工具或其它成像装置通信等等。本发明的各方面可以以存储在非暂时性存储介质或设备上的机器可读代码来实现,无论是可移动的还是集成至计算平台,如硬盘、光学读取和/或写入存储介质、RAM、ROM等,使得其可由可编程计算机读取,当存储介质或设备由计算机读取时可用于配置和操作计算机以执行在此所描述的过程。此外,机器可读代码,或其部分可以通过有线或无线网络传输。当此类媒体包括结合微处理器或其他数据处理器实现上文所述步骤的指令或程序时,本文所述的发明包括这些和其他不同类型的非暂时性计算机可读存储介质。当根据本发明所述的方法和技术编程时,本发明还可以包括计算机本身。Further, the method can be implemented in any type of computing platform that is operably connected to a suitable computer, including but not limited to a personal computer, a minicomputer, a mainframe, a workstation, a network or distributed computing environment, a separate or integrated computer platform, or in communication with a charged particle tool or other imaging device, etc. Various aspects of the present invention can be implemented in machine-readable code stored on a non-transitory storage medium or device, whether removable or integrated into a computing platform, such as a hard disk, an optical read and/or write storage medium, a RAM, a ROM, etc., so that it can be read by a programmable computer, and when the storage medium or device is read by the computer, it can be used to configure and operate the computer to perform the process described herein. In addition, the machine-readable code, or portions thereof, can be transmitted via a wired or wireless network. When such media includes instructions or programs that implement the steps described above in conjunction with a microprocessor or other data processor, the invention described herein includes these and other different types of non-transitory computer-readable storage media. When programmed according to the methods and techniques of the present invention, the present invention can also include the computer itself.

计算机程序能够应用于输入数据以执行本文所述的功能,从而转换输入数据以生成存储至非易失性存储器的输出数据。输出信息还可以应用于一个或多个输出设备如显示器。在本发明优选的实施例中,转换的数据表示物理和有形的对象,包括显示器上产生的物理和有形对象的特定视觉描绘。The computer program can be applied to input data to perform the functions described herein, thereby converting the input data to generate output data stored in a non-volatile memory. The output information can also be applied to one or more output devices such as a display. In a preferred embodiment of the present invention, the converted data represents physical and tangible objects, including specific visual depictions of physical and tangible objects produced on the display.

以上所述,只是本发明的较佳实施例而已,本发明并不局限于上述实施方式,只要其以相同的手段达到本发明的技术效果,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明保护的范围之内。在本发明的保护范围内其技术方案和/或实施方式可以有各种不同的修改和变化。The above is only a preferred embodiment of the present invention. The present invention is not limited to the above implementation. As long as the technical effect of the present invention is achieved by the same means, any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the scope of protection of the present invention. Within the scope of protection of the present invention, its technical scheme and/or implementation method may have various modifications and changes.

Claims (10)

1.无人机控制策略安全性验证与修正方法,所述的无人机控制策略安全性验证与修正方法应用于无人机系统上,所述无人机系统包含原有的无人机飞行控制策略,其特征在于,所述的方法包括以下步骤:1. A method for verifying and correcting the safety of an unmanned aerial vehicle control strategy, wherein the method is applied to an unmanned aerial vehicle system, wherein the unmanned aerial vehicle system includes an existing unmanned aerial vehicle flight control strategy, and wherein the method comprises the following steps: S100、获取当前无人机系统状态信息、无人机系统的原有无人机飞行控制策略输出的未验证的潜在不安全动作、和无人机自适应调节的安全约束边界;S100, obtaining the current UAV system status information, the unverified potential unsafe actions output by the original UAV flight control strategy of the UAV system, and the safety constraint boundary of the UAV adaptive adjustment; S200、基于优化目标范数,建立并求解带有无人机动力学及安全性约束的有限时域优化问题,得到用于无人机系统飞行控制的安全控制动作;S200, based on the optimization target norm, establish and solve a finite time domain optimization problem with UAV dynamics and safety constraints, and obtain a safety control action for UAV system flight control; S300、基于无人机自适应调节的安全约束边界,建立并更新无人机状态需要满足的安全性约束;S300, based on the safety constraint boundary of the UAV adaptive adjustment, establish and update the safety constraints that the UAV state needs to meet; S400、在有限时域范围内迭代计算步骤S200和步骤S300,得到无人机在未来有限时域范围内的安全状态序列及对应的安全控制动作序列。S400, iteratively calculating step S200 and step S300 within a limited time domain to obtain a safety state sequence and a corresponding safety control action sequence of the UAV within a future limited time domain. 2.根据权利要求1所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S200包括:2. The method for verifying and correcting the safety of a drone control strategy according to claim 1, wherein step S200 comprises: S210、基于在最大化保留原有无人机飞行控制策略性能的同时,保证其输出控制动作的安全性,定义无人机系统的优化目标范数距离;S210, defining an optimization target norm distance of the UAV system based on maximizing the performance of the original UAV flight control strategy while ensuring the safety of its output control action; S220、基于接收的无人机状态信息和预测得到的安全控制动作,进行无人机动力学转移,建立在有限时域[0,M-1]范围内安全控制动作的无人机动力学约束。S220. Based on the received UAV state information and the predicted safety control action, the UAV dynamics transfer is performed to establish the UAV dynamics constraints of the safety control action within the limited time domain [0, M-1]. 3.根据权利要求2所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S210中,所述优化目标范数距离为:3. The method for verifying and correcting the safety of the drone control strategy according to claim 2, characterized in that in step S210, the optimization target norm distance is: 其中,为原有无人机飞行控制策略在t时刻输出的未验证的潜在不安全控制动作,/>为经过安全性验证与修正方法优化后得到的t时刻无人机安全控制动作。in, is the unverified potentially unsafe control action output by the original UAV flight control strategy at time t, /> It is the safety control action of the UAV at time t obtained after safety verification and optimization of the correction method. 4.根据权利要求3所述的无人机控制策略安全性验证与修正方法,其特征在于,原有无人机飞行控制策略输出的未验证的潜在不安全控制动作为:4. The method for verifying and correcting the safety of the UAV control strategy according to claim 3 is characterized in that the unverified potentially unsafe control actions output by the original UAV flight control strategy for: 其中,Ft为无人机中所有机载电机生成的合推力,ωt为无人机在机体坐标系下的三轴角速度。Among them, Ft is the combined thrust generated by all the onboard motors in the UAV, and ωt is the three-axis angular velocity of the UAV in the body coordinate system. 5.根据权利要求2所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S220中,对有限时域范围内所有的整数m,所述无人机动力学约束为:5. The method for safety verification and correction of drone control strategy according to claim 2, characterized in that in step S220, for all integers m within a limited time domain, the drone dynamics constraint is: 其中,且m为整数,st+m为无人机在t+m时刻的状态,/>为无人机在t+m时刻的安全控制动作,st+m+1为由无人机动力学模型f(·)经过计算得到的下一时刻t+m+1的状态。in, And m is an integer, s t+m is the state of the drone at time t+m, /> is the safety control action of the UAV at time t+m, and s t+m+1 is the state at the next time t+m+1 calculated by the UAV dynamics model f(·). 6.根据权利要求5所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S220中,在t时刻无人机状态st包括:6. The method for verifying and correcting the safety of the drone control strategy according to claim 5, characterized in that, in the step S220, the drone state s t at time t includes: st=[pt,qt,vt],s t =[p t ,q t ,v t ], 其中,pt为t时刻无人机的位置信息,qt为t时刻无人机的姿态四元数信息,vt为t时刻无人机的线速度信息。Among them, p t is the position information of the UAV at time t, q t is the attitude quaternion information of the UAV at time t, and v t is the linear velocity information of the UAV at time t. 7.根据权利要求1所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S300中,无人机状态的安全性约束包括无人机在有限时域范围内的状态均需处于安全状态集合S内,7. The method for safety verification and correction of drone control strategy according to claim 1 is characterized in that in step S300, the safety constraint of the drone state includes that the state of the drone within a limited time domain must be within the safe state set S, 所述安全状态集合S为:The security state set S is: 其中,st为无人机在t时刻的状态,B为状态安全性约束在每个状态分量上的权重矩阵,为在t时刻无人机状态需要满足的安全状态边界值,Among them, s t is the state of the UAV at time t, B is the weight matrix of the state safety constraint on each state component, is the safety state boundary value that the drone state needs to meet at time t, 其中,为t时刻无人机的位置信息的最小值,/>为t时刻无人机的姿态四元数信息的最小值,/>为t时刻无人机的线速度信息的最小值,/>为t时刻无人机的位置信息的最大值,/>为t时刻无人机的姿态四元数信息的最大值,/>为t时刻无人机的线速度信息的最大值。in, is the minimum value of the drone's position information at time t, /> is the minimum value of the UAV's attitude quaternion information at time t,/> is the minimum value of the UAV's linear velocity information at time t, /> is the maximum value of the drone's position information at time t, /> is the maximum value of the UAV's attitude quaternion information at time t,/> is the maximum value of the UAV’s linear velocity information at time t. 8.根据权利要求1所述的无人机控制策略安全性验证与修正方法,其特征在于,所述步骤S300中,无人机控制动作的安全性约束包括无人机在有限时域范围内的控制动作均需处于安全动作集合U内,8. The method for safety verification and correction of drone control strategy according to claim 1 is characterized in that in step S300, the safety constraints of the drone control action include that the drone control actions within a limited time domain must be within the safety action set U, 所述安全动作集合U内为:The security action set U includes: 其中,U表示无人机控制动作需要满足的安全性约束集合,Among them, U represents the set of safety constraints that the drone control action needs to meet, 其中,A表示控制动作安全性约束在每个控制分量上的权重矩阵,为在t时刻无人机控制动作需要满足的安全边界值,Where A represents the weight matrix of the control action safety constraint on each control component, is the safety boundary value that the drone control action needs to meet at time t, 其中,为无人机中所有机载电机生成的合推力的最小值,/>为无人机中所有机载电机生成的合推力的最大值,/>为无人机在机体坐标系下的三轴角速度的最小值,为无人机在机体坐标系下的三轴角速度的最大值。in, is the minimum value of the combined thrust generated by all onboard motors in the drone,/> is the maximum value of the combined thrust generated by all the onboard motors in the drone,/> is the minimum value of the three-axis angular velocity of the drone in the body coordinate system, It is the maximum value of the three-axis angular velocity of the UAV in the body coordinate system. 9.根据权利要求8所述的无人机控制策略安全性验证与修正方法,其特征在于,9. The method for verifying and correcting the safety of the UAV control strategy according to claim 8 is characterized in that: 在有限时域的终止时刻M,无人机状态st+M需要满足状态终止集Send,即,At the end time M of the finite time domain, the drone state s t+M needs to satisfy the state termination set S end , that is, st+M∈Sends t+M ∈S end , 其中,Send为状态终止集,状态终止集Send为状态安全性约束集合S的一个子集,即,Among them, S end is the state termination set, and the state termination set S end is a subset of the state safety constraint set S, that is, 其中,S为状态安全约束集合。Among them, S is the set of state safety constraints. 10.无人机控制策略安全性验证与修正系统,用于实施如权利要求1至权利要求9中任一项所述的方法,所述无人机控制策略安全性验证与修正系统包括:10. A UAV control strategy safety verification and correction system, used to implement the method according to any one of claims 1 to 9, the UAV control strategy safety verification and correction system comprising: 无人机系统,所述无人机系统包含原有的无人机飞行控制策略,原有的无人机飞行控制策略基于无人机系统的无人机状态信息生成未验证的潜在不安全控制动作,控制无人机系统的飞行;An unmanned aerial vehicle system, the unmanned aerial vehicle system including an existing unmanned aerial vehicle flight control strategy, the existing unmanned aerial vehicle flight control strategy generating unverified potentially unsafe control actions based on unmanned aerial vehicle state information of the unmanned aerial vehicle system to control the flight of the unmanned aerial vehicle system; 安全性验证与修正装置,所述安全性验证与修正装置从无人机系统的原有的无人机飞行控制策略中获取潜在不安全控制动作,并生成安全控制动作后传送到无人机系统,所述安全性验证与修正装置与所述无人机系统连接。A safety verification and correction device, wherein the safety verification and correction device obtains potential unsafe control actions from the original UAV flight control strategy of the UAV system, generates safe control actions and transmits them to the UAV system, and the safety verification and correction device is connected to the UAV system.
CN202410200544.6A 2024-02-23 2024-02-23 UAV control strategy safety verification and correction method and system Pending CN118011826A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410200544.6A CN118011826A (en) 2024-02-23 2024-02-23 UAV control strategy safety verification and correction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410200544.6A CN118011826A (en) 2024-02-23 2024-02-23 UAV control strategy safety verification and correction method and system

Publications (1)

Publication Number Publication Date
CN118011826A true CN118011826A (en) 2024-05-10

Family

ID=90944107

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410200544.6A Pending CN118011826A (en) 2024-02-23 2024-02-23 UAV control strategy safety verification and correction method and system

Country Status (1)

Country Link
CN (1) CN118011826A (en)

Similar Documents

Publication Publication Date Title
Greeff et al. Flatness-based model predictive control for quadrotor trajectory tracking
Nguyen et al. Formation control scheme with reinforcement learning strategy for a group of multiple surface vehicles
Kaminer et al. Path following for small unmanned aerial vehicles using L1 adaptive augmentation of commercial autopilots
Xiao et al. Incremental updating multirobot formation using nonlinear model predictive control method with general projection neural network
CN109947126B (en) Control method, device, device and readable medium for quadrotor unmanned aerial vehicle
CN114355976B (en) Method for controlling unmanned aerial vehicle to complete track tracking under wind disturbance based on learning
CN112947086B (en) Self-adaptive compensation method for actuator faults in formation control of heterogeneous multi-agent system consisting of unmanned aerial vehicle and unmanned vehicle
Bui et al. Lyapunov-based nonlinear model predictive control for attitude trajectory tracking of unmanned aerial vehicles
Zhang et al. Model-based safe reinforcement learning with time-varying constraints: Applications to intelligent vehicles
El Houm et al. Optimal new sliding mode controller combined with modified supertwisting algorithm for a perturbed quadrotor UAV
Liang et al. Finite-time fault-tolerant consensus of UAVs: a switching event-triggered fuzzy control scheme
CN110275432A (en) Unmanned Aerial Vehicle Suspension Load Control System Based on Reinforcement Learning
CN117452975A (en) Security performance cooperative formation control design method for four-rotor unmanned aerial vehicle cluster
Zhang et al. Predefined-time cooperative formation control of heterogeneous unmanned surface vehicle-unmanned aerial vehicle systems with uncertain dynamic estimation
CN116324635A (en) Apparatus and method for controlling a system having uncertainty in its dynamics
Chen et al. Learning agile quadrotor flight in restricted environments with safety guarantees
CN118011826A (en) UAV control strategy safety verification and correction method and system
Bi et al. Neural Network-based Adaptive Finite-time Control for 2-DOF Helicopter Systems with Prescribed Performance and Input Saturation
Tian et al. Lyapunov-based model predictive control for unmanned aerial vehicles with integrated non-singular terminal sliding mode surface
Lee et al. Design of Robust H∞ Guaranteed Cost Controller of Quadrotor UAV for Set-Point Tracking
Zhao et al. Efficient deep learning of robust, adaptive policies using tube mpc-guided data augmentation
CN117873156A (en) A method for online trajectory optimization of aircraft based on deep learning
Jin et al. Modelling, control and development of GUI-based simulator for a quadcopter using nonlinear PID control
Dong et al. Tracking control of vertical taking‐off and landing vehicles with parametric uncertainty
Dai et al. Reinforcement learning control for multi-axis rotor configuration UAV

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination