WO2025092999A1

WO2025092999A1 - Model performance supervision method and apparatus, and device

Info

Publication number: WO2025092999A1
Application number: PCT/CN2024/129460
Authority: WO
Inventors: 贾承璐; 邬华明; 王园园; 吴昊
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2023-11-03
Filing date: 2024-11-01
Publication date: 2025-05-08
Anticipated expiration: 2026-05-03
Also published as: CN119938459A

Abstract

The present application belongs to the field of communications. Disclosed are a model performance supervision method and apparatus, and a device, which can solve the problem of performance supervision for AI models. The model performance supervision method in the embodiments of the present application comprises: a first device acquiring first information, wherein the first information is used for representing the uncertainty or probability distribution of an output of a first AI model, and an input of the first AI model is a first feature; and the first device determining third information on the basis of the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used for determining the validity of the first AI model.

Description

Model performance supervision method, device and equipment

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本申请要求于2023年11月03日提交中国专利局、申请号为202311467210.7、发明名称为“模型性能监督方法、装置及设备”的中国专利申请的优先权，其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed with the China Patent Office on November 3, 2023, with application number 202311467210.7 and invention name “Model performance supervision method, device and equipment”, the entire contents of which are incorporated by reference in this application.

Technical Field

本申请涉及通信领域，并且更具体地，涉及一种模型性能监督方法、装置及设备。The present application relates to the field of communications, and more specifically, to a model performance supervision method, apparatus and device.

Background Art

在新无线(New Radio，NR)系统中，引入了人工智能(Artificial Intelligence，AI)模型来提升系统性能。例如，引入AI模型进行定位、波束管理、信道状态信息(Channel State Information，CSI)预测、移动性管理、CSI压缩等。然而，当无线传播环境发生变化时，AI模型的性能可能难以保证，如何监督AI模型的性能，是一个需要解决问题。In the New Radio (NR) system, artificial intelligence (AI) models are introduced to improve system performance. For example, AI models are introduced for positioning, beam management, channel state information (CSI) prediction, mobility management, CSI compression, etc. However, when the wireless propagation environment changes, the performance of the AI model may be difficult to guarantee. How to supervise the performance of the AI model is a problem that needs to be solved.

发明内容Summary of the invention

本申请实施例提供一种模型性能监督方法、装置及设备，能够解决AI模型的性能监督问题。The embodiments of the present application provide a model performance supervision method, device and equipment, which can solve the performance supervision problem of AI models.

第一方面，提供了一种模型性能监督方法，包括：In the first aspect, a model performance supervision method is provided, comprising:

第一设备获取第一信息，其中，该第一信息用于表征第一AI模型的输出的不确定性或概率分布，该第一AI模型的输入为第一特征；The first device acquires first information, wherein the first information is used to characterize uncertainty or probability distribution of an output of a first AI model, and an input of the first AI model is a first feature;

该第一设备根据该第一信息和第二信息确定第三信息，其中，该第二信息为该第一特征对应的标签，该第三信息用于确定该第一AI模型的有效性。The first device determines third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

第二方面，提供了一种模型性能监督方法，包括：Secondly, a model performance supervision method is provided, including:

第二设备从第一设备接收第三信息，其中，该第三信息基于第一信息和第二信息确定，该第一信息用于表征第一AI模型的输出的不确定性或概率分布，该第一AI模型的输入为第一特征，该第二信息为该第一特征对应的标签；The second device receives third information from the first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first AI model, the input of the first AI model is the first feature, and the second information is a label corresponding to the first feature;

该第二设备根据该第三信息确定该第一AI模型的有效性。The second device determines the validity of the first AI model based on the third information.

第三方面，提供了一种模型性能监督方法，包括：Thirdly, a model performance supervision method is provided, including:

第一设备获取第四信息，其中，该第四信息用于表征第二AI模型的输出的不确定性或概率分布，该第二AI模型的输入为第三特征；The first device obtains fourth information, where the fourth information is used to characterize uncertainty or probability distribution of an output of a second AI model, and an input of the second AI model is a third feature;

该第一设备根据该第四信息确定该第二AI模型的有效性；或，该第一设备向第二设备发送该第四信息。The first device determines the validity of the second AI model based on the fourth information; or, the first device sends the fourth information to the second device.

第四方面，提供了一种模型性能监督方法，包括：Fourthly, a model performance supervision method is provided, including:

第二设备从第一设备接收第四信息，其中，该第四信息用于表征第二AI模型的输出的不确定性或概率分布，该第二AI模型的输入为第三特征；The second device receives fourth information from the first device, wherein the fourth information is used to characterize uncertainty or probability distribution of an output of a second AI model, and an input of the second AI model is a third feature;

该第二设备根据该第四信息确定该第二AI模型的有效性。The second device determines the validity of the second AI model based on the fourth information.

第五方面，提供了一种模型性能监督装置，包括：In a fifth aspect, a model performance monitoring device is provided, comprising:

获取单元，用于获取第一信息，其中，所述第一信息用于表征第一AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征；an acquisition unit, configured to acquire first information, wherein the first information is used to characterize uncertainty or probability distribution of an output of a first AI model, and an input of the first AI model is a first feature;

处理单元，用于根据所述第一信息和第二信息确定第三信息，其中，所述第二信息为所述第一特征对应的标签，所述第三信息用于确定所述第一AI模型的有效性。 A processing unit, used to determine third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

第六方面，提供了一种模型性能监督装置，包括：In a sixth aspect, a model performance monitoring device is provided, comprising:

收发单元，用于从第一设备接收第三信息，其中，所述第三信息基于第一信息和第二信息确定，所述第一信息用于表征第一AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征，所述第二信息为所述第一特征对应的标签；a transceiver unit, configured to receive third information from a first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first AI model, the input of the first AI model is a first feature, and the second information is a label corresponding to the first feature;

处理单元，用于根据所述第三信息确定所述第一AI模型的有效性。A processing unit is used to determine the validity of the first AI model based on the third information.

第七方面，提供了一种模型性能监督装置，包括：In a seventh aspect, a model performance monitoring device is provided, comprising:

获取单元，用于获取第四信息，其中，所述第四信息用于表征第二AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；an acquisition unit, configured to acquire fourth information, wherein the fourth information is used to characterize uncertainty or probability distribution of an output of a second AI model, and an input of the second AI model is a third feature;

处理单元，用于根据所述第四信息确定所述第二AI模型的有效性；或，收发单元，用于向第二设备发送所述第四信息。A processing unit, used to determine the validity of the second AI model according to the fourth information; or a transceiver unit, used to send the fourth information to the second device.

第八方面，提供了一种模型性能监督装置，包括：In an eighth aspect, a model performance monitoring device is provided, comprising:

收发单元，用于从第一设备接收第四信息，其中，所述第四信息用于表征第二AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；a transceiver unit, configured to receive fourth information from the first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second AI model, and the input of the second AI model is the third feature;

处理单元，用于根据所述第四信息确定所述第二AI模型的有效性。A processing unit, configured to determine the validity of the second AI model based on the fourth information.

第九方面，提供了一种模型性能监督设备，所述模型性能监督设备包括处理器和存储器，所述存储器存储可在所述处理器上运行的程序或指令，所述程序或指令被所述处理器执行时实现如第一方面所述的方法的步骤。In the ninth aspect, a model performance monitoring device is provided, which includes a processor and a memory, wherein the memory stores programs or instructions that can be run on the processor, and when the programs or instructions are executed by the processor, the steps of the method described in the first aspect are implemented.

第十方面，提供了一种模型性能监督设备，包括处理器及通信接口；其中，所述通信接口或所述处理器用于获取第一信息，其中，所述第一信息用于表征第一AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征；所述处理器用于根据所述第一信息和第二信息确定第三信息，其中，所述第二信息为所述第一特征对应的标签，所述第三信息用于确定所述第一AI模型的有效性。In the tenth aspect, a model performance supervision device is provided, comprising a processor and a communication interface; wherein the communication interface or the processor is used to obtain first information, wherein the first information is used to characterize the uncertainty or probability distribution of the output of a first AI model, and the input of the first AI model is a first feature; the processor is used to determine third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

第十一方面，提供了一种模型性能监督设备，所述模型性能监督设备包括处理器和存储器，所述存储器存储可在所述处理器上运行的程序或指令，所述程序或指令被所述处理器执行时实现如第二方面所述的方法的步骤。In the eleventh aspect, a model performance monitoring device is provided, which includes a processor and a memory, wherein the memory stores programs or instructions that can be run on the processor, and when the programs or instructions are executed by the processor, the steps of the method described in the second aspect are implemented.

第十二方面，提供了一种模型性能监督设备，包括处理器及通信接口；其中，所述通信接口用于从第一设备接收第三信息，其中，所述第三信息基于第一信息和第二信息确定，所述第一信息用于表征第一AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征，所述第二信息为所述第一特征对应的标签；所述处理器用于根据所述第三信息确定所述第一AI模型的有效性。In the twelfth aspect, a model performance supervision device is provided, comprising a processor and a communication interface; wherein the communication interface is used to receive third information from a first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first AI model, the input of the first AI model is a first feature, and the second information is a label corresponding to the first feature; the processor is used to determine the validity of the first AI model based on the third information.

第十三方面，提供了一种模型性能监督设备，所述模型性能监督设备包括处理器和存储器，所述存储器存储可在所述处理器上运行的程序或指令，所述程序或指令被所述处理器执行时实现如第三方面所述的方法的步骤。In the thirteenth aspect, a model performance monitoring device is provided, which includes a processor and a memory, wherein the memory stores programs or instructions that can be executed on the processor, and when the program or instructions are executed by the processor, the steps of the method described in the third aspect are implemented.

第十四方面，提供了一种模型性能监督设备，包括处理器及通信接口；其中，所述通信接口或所述处理器用于获取第四信息，其中，所述第四信息用于表征第二AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；所述处理器用于根据所述第四信息确定所述第二AI模型的有效性；或，所述通信接口用于向第二设备发送所述第四信息。In the fourteenth aspect, a model performance supervision device is provided, comprising a processor and a communication interface; wherein the communication interface or the processor is used to obtain fourth information, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of a second AI model, and the input of the second AI model is a third feature; the processor is used to determine the validity of the second AI model based on the fourth information; or, the communication interface is used to send the fourth information to a second device.

第十五方面，提供了一种模型性能监督设备，所述模型性能监督设备包括处理器和存储器，所述存储器存储可在所述处理器上运行的程序或指令，所述程序或指令被所述处理器执行时实现如第四方面所述的方法的步骤。In the fifteenth aspect, a model performance monitoring device is provided, which includes a processor and a memory, wherein the memory stores programs or instructions that can be run on the processor, and when the programs or instructions are executed by the processor, the steps of the method described in the fourth aspect are implemented.

第十六方面，提供了一种模型性能监督设备，包括处理器及通信接口；其中，所述通信接口用于从第一设备接收第四信息，其中，所述第四信息用于表征第二AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；所述处理器用于根据所述第四信息确定所述第二AI模型的有效性。In the sixteenth aspect, a model performance supervision device is provided, comprising a processor and a communication interface; wherein the communication interface is used to receive fourth information from a first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of a second AI model, and the input of the second AI model is a third feature; the processor is used to determine the validity of the second AI model based on the fourth information.

第十七方面，提供了一种可读存储介质，所述可读存储介质上存储程序或指令，所述程序或指令被处理器执行时实现如第一方面所述的方法的步骤，或者实现如第二方面所述的方法的步骤，或者实现如第三方面所述的方法的步骤，或者实现如第四方面所述的方法的步骤。In a seventeenth aspect, a readable storage medium is provided, wherein a program or instruction is stored on the readable storage medium. When the program or instruction is executed by the processor, the steps of the method described in the first aspect are implemented, or the steps of the method described in the second aspect are implemented, or the steps of the method described in the third aspect are implemented, or the steps of the method described in the fourth aspect are implemented.

第十八方面，提供了一种无线通信系统，包括：第一设备及第二设备，所述第一设备可用于执行如第一方面或第三方面所述的方法的步骤，所述第二设备可用于执行如第二方面或第四方面所述的方法的步骤。In the eighteenth aspect, a wireless communication system is provided, including: a first device and a second device, wherein the first device can be used to execute the steps of the method described in the first aspect or the third aspect, and the second device can be used to execute the steps of the method described in the second aspect or the fourth aspect.

第十九方面，提供了一种芯片，所述芯片包括处理器和通信接口，所述通信接口和所述处理器耦合，所述处理器用于运行程序或指令，实现如第一方面所述的方法，或实现如第二方面所述的方法，或实现如第三方面所述的方法，或实现如第四方面所述的方法。In the nineteenth aspect, a chip is provided, comprising a processor and a communication interface, wherein the communication interface is coupled to the processor, and the processor is used to run a program or instructions to implement the method as described in the first aspect, or the method as described in the second aspect, or the method as described in the third aspect, or the method as described in the fourth aspect.

第二十方面，提供了一种计算机程序/程序产品，所述计算机程序/程序产品被存储在存储介质中，所述程序/程序产品被至少一个处理器执行以实现如第一方面至第四方面中的至少之一所述的模型性能监督方法的步骤。In the twentieth aspect, a computer program/program product is provided, wherein the computer program/program product is stored in a storage medium, and the program/program product is executed by at least one processor to implement the steps of the model performance supervision method as described in at least one of the first to fourth aspects.

在本申请第一方面或第二方面的实施例中，可以基于第一AI模型的输出的不确定性或概率分布及第一AI模型的输入特征对应的标签确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。In the embodiments of the first aspect or the second aspect of the present application, the third information can be determined based on the uncertainty or probability distribution of the output of the first AI model and the label corresponding to the input feature of the first AI model, and the validity of the first AI model can be determined based on the third information, so as to achieve performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning results of the first AI model. Since the reasoning results of the first AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding label can improve the accuracy of the performance supervision of the first AI model.

在本申请第三方面或第四方面的实施例中，可以基于第二AI模型的输出的不确定性或概率分布确定第二AI模型的有效性，从而实现第二AI模型的性能监督。具体的，第二AI模型的输出的不确定性或概率分布可以提高第二AI模型的推理结果的鲁棒性，由于第二AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第二AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第二AI模型的性能监督中参考第二AI模型的输出的不确定性或概率分布，能够判断第二AI模型的有效性，由于可以不需要外部信息的辅助，因而也更容易实现。In the embodiments of the third aspect or the fourth aspect of the present application, the validity of the second AI model can be determined based on the uncertainty or probability distribution of the output of the second AI model, thereby achieving performance supervision of the second AI model. Specifically, the uncertainty or probability distribution of the output of the second AI model can improve the robustness of the reasoning results of the second AI model. Since the reasoning results of the second AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the second AI model (including relevant information about uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the second AI model, the uncertainty or probability distribution of the output of the second AI model is referred to, and the validity of the second AI model can be judged. Since no external information is required, it is also easier to implement.

BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本申请实施例的技术方案，下面将对本申请实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings required for use in the description of the embodiments of the present application will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present application. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative labor.

图1是本申请实施例提供的一种通信系统架构的示意性图。FIG1 is a schematic diagram of a communication system architecture provided in an embodiment of the present application.

图2是本申请提供的一种神经网络的示意性图。FIG2 is a schematic diagram of a neural network provided in the present application.

图3是本申请提供的一种神经元的示意性图。FIG. 3 is a schematic diagram of a neuron provided in the present application.

图4是根据本申请实施例提供的一种模型性能监督方法的示意性流程图。FIG4 is a schematic flowchart of a model performance supervision method provided according to an embodiment of the present application.

图5是根据本申请实施例提供的一种基于软信息提升定位精度的示意性图。FIG5 is a schematic diagram of improving positioning accuracy based on soft information according to an embodiment of the present application.

图6是根据本申请实施例提供的另一种模型性能监督方法的示意性流程图。FIG6 is a schematic flowchart of another model performance supervision method provided according to an embodiment of the present application.

图7是根据本申请实施例提供的另一种模型性能监督方法的示意性流程图。FIG7 is a schematic flowchart of another model performance supervision method provided according to an embodiment of the present application.

图8是根据本申请实施例提供的再一种模型性能监督方法的示意性流程图。FIG8 is a schematic flowchart of another model performance supervision method provided according to an embodiment of the present application.

图9是根据本申请实施例提供的一种模型性能监督装置的示意性框图。FIG9 is a schematic block diagram of a model performance monitoring device provided according to an embodiment of the present application.

图10是根据本申请实施例提供的另一种模型性能监督装置的示意性框图。FIG10 is a schematic block diagram of another model performance monitoring device provided according to an embodiment of the present application.

图11是根据本申请实施例提供的另一种模型性能监督装置的示意性框图。 FIG11 is a schematic block diagram of another model performance monitoring device provided according to an embodiment of the present application.

图12是根据本申请实施例提供的再一种模型性能监督装置的示意性框图。FIG12 is a schematic block diagram of yet another model performance monitoring device provided according to an embodiment of the present application.

图13是根据本申请实施例提供的一种通信设备的示意性框图。FIG13 is a schematic block diagram of a communication device provided according to an embodiment of the present application.

图14是根据本申请实施例提供的一种终端的硬件结构示意图。FIG14 is a schematic diagram of the hardware structure of a terminal provided according to an embodiment of the present application.

图15是根据本申请实施例提供的一种网络侧设备的示意性框图。FIG15 is a schematic block diagram of a network-side device provided according to an embodiment of the present application.

图16是根据本申请实施例提供的另一种网络侧设备的示意性框图。FIG16 is a schematic block diagram of another network-side device provided according to an embodiment of the present application.

DETAILED DESCRIPTION

下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚描述，显然，所描述的实施例是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域普通技术人员所获得的所有其他实施例，都属于本申请保护的范围。The following will be combined with the drawings in the embodiments of the present application to clearly describe the technical solutions in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in the present application, all other embodiments obtained by ordinary technicians in this field belong to the scope of protection of this application.

本申请的术语“第一”、“第二”等是用于区别类似的对象，而不用于描述特定的顺序或先后次序。应该理解这样使用的术语在适当情况下可以互换，以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施，且“第一”、“第二”所区别的对象通常为一类，并不限定对象的个数，例如第一对象可以是一个，也可以是多个。此外，本申请中的“或”表示所连接对象的至少其中之一。例如“A或B”涵盖三种方案，即，方案一：包括A且不包括B；方案二：包括B且不包括A；方案三：既包括A又包括B。字符“/”一般表示前后关联对象是一种“或”的关系。The terms "first", "second", etc. of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It should be understood that the terms used in this way are interchangeable where appropriate, so that the embodiments of the present application can be implemented in an order other than those illustrated or described herein, and the objects distinguished by "first" and "second" are generally of one type, and the number of objects is not limited, for example, the first object can be one or more. In addition, "or" in the present application represents at least one of the connected objects. For example, "A or B" covers three schemes, namely, Scheme 1: including A but not including B; Scheme 2: including B but not including A; Scheme 3: including both A and B. The character "/" generally indicates that the objects associated with each other are in an "or" relationship.

本申请的术语“指示”既可以是一个直接的指示(或者说显式的指示)，也可以是一个间接的指示(或者说隐含的指示)。其中，直接的指示可以理解为，发送方在发送的指示中明确告知了接收方具体的信息、需要执行的操作或请求结果等内容；间接的指示可以理解为，接收方根据发送方发送的指示确定对应的信息，或者进行判断并根据判断结果确定需要执行的操作或请求结果等。The term "indication" in this application can be a direct indication (or explicit indication) or an indirect indication (or implicit indication). A direct indication can be understood as the sender explicitly informing the receiver of specific information, operations to be performed, or request results in the sent indication; an indirect indication can be understood as the receiver determining the corresponding information according to the indication sent by the sender, or making a judgment and determining the operation to be performed or the request result according to the judgment result.

值得指出的是，本申请实施例所描述的技术不限于物联网(Ambient Internet of Things，IoT)系统，还可用于其他无线通信系统，诸如长期演进型(Long Term Evolution，LTE)/LTE的演进(LTE-Advanced，LTE-A)系统、码分多址(Code Division Multiple Access，CDMA)、时分多址(Time Division Multiple Access，TDMA)、频分多址(Frequency Division Multiple Access，FDMA)、正交频分多址(Orthogonal Frequency Division Multiple Access，OFDMA)、单载波频分多址(Single-carrier Frequency-Division Multiple Access，SC-FDMA)、无线局域网(Wireless Local Area Networks，WLAN)、无线保真(Wireless Fidelity，WiFi)、蓝牙系统、或其他系统。本申请实施例中的术语“系统”和“网络”常被可互换地使用，所描述的技术既可用于以上提及的系统和无线电技术，也可用于其他系统和无线电技术。以下描述出于示例目的描述了新空口(New Radio，NR)系统，并且在以下大部分描述中使用NR术语，但是这些技术也可应用于NR系统以外的系统，如第6代(6^th Generation，6G)通信系统。It is worth noting that the technology described in the embodiments of the present application is not limited to the Ambient Internet of Things (IoT) system, but can also be used in other wireless communication systems, such as Long Term Evolution (LTE)/LTE-Advanced (LTE-A) system, Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Frequency Division Multiple Access (FDMA), Orthogonal Frequency Division Multiple Access (OFDMA), Single-carrier Frequency-Division Multiple Access (SC-FDMA), Wireless Local Area Networks (WLAN), Wireless Fidelity (WiFi), Bluetooth system, or other systems. The terms "system" and "network" in the embodiments of the present application are often used interchangeably, and the described technology can be used for the above-mentioned systems and radio technologies as well as other systems and radio technologies. The following description describes a New Radio (NR) system for example purposes, and NR terminology is used in most of the following description, but these technologies can also be applied to systems other than NR systems, such as ^6th Generation (6G) communication systems.

图1示出本申请实施例可应用的一种无线通信系统的框图。无线通信系统包括终端11和网络侧设备12，其中，终端11可以直接或通过其他网元与网络侧设备12通信。Fig. 1 shows a block diagram of a wireless communication system applicable to the embodiments of the present application. The wireless communication system includes a terminal 11 and a network side device 12, wherein the terminal 11 can communicate with the network side device 12 directly or through other network elements.

其中，终端11可以是手机、平板电脑(Tablet Personal Computer)、膝上型电脑(Laptop Computer)、笔记本电脑、个人数字助理(Personal Digital Assistant，PDA)、掌上电脑、上网本、超级移动个人计算机(Ultra-mobile Personal Computer，UMPC)、移动上网装置(Mobile Internet Device，MID)、增强现实(Augmented Reality，AR)、虚拟现实(Virtual Reality，VR)设备、机器人、可穿戴式设备(Wearable Device)、飞行器(flight vehicle)、车载设备(Vehicle User Equipment，VUE)、船载设备、行人终端(Pedestrian User Equipment，PUE)、智能家居(具有无线通信功能的家居设备，如冰箱、电视、洗衣机或者家具等)、游戏机、个人计算机(Personal Computer，PC)、柜员机或者自助机等终端侧设备。可穿戴式设备包括：智能手表、智能手环、智能耳机、智能眼镜、智能首饰(智能手镯、智能手链、智能戒指、智能项链、智能脚镯、智能脚链等)、智能腕带、智能服装等。其中，车载设备也可以称为车载终端、车载控制器、车载模块、车载部件、车载芯片或车载单元等。需要说明的是，在本申请实施例并不限定终端11的具体类型。Among them, the terminal 11 can be a mobile phone, a tablet computer (Tablet Personal Computer), a laptop computer (Laptop Computer), a notebook computer, a personal digital assistant (Personal Digital Assistant, PDA), a handheld computer, a netbook, an ultra-mobile personal computer (Ultra-mobile Personal Computer, UMPC), a mobile Internet device (Mobile Internet Device, MID), an augmented reality (Augmented Reality, AR), a virtual reality (Virtual Reality, VR) device, a robot, a wearable device (Wearable Device), an aircraft (flight vehicle), a vehicle-mounted device (Vehicle User Equipment, VUE), a ship-mounted device, a pedestrian terminal (Pedestrian User Equipment, PUE), a smart home (home appliances with wireless communication functions, such as refrigerators, televisions, washing machines or furniture, etc.), a game console, a personal computer (Personal Computer, PC), a teller machine or a self-service machine and other terminal side devices. Wearable devices include: smart watches, smart bracelets, smart headphones, smart glasses, smart jewelry (smart bracelets, smart Bracelets, smart rings, smart necklaces, smart anklets, smart anklets, etc.), smart wristbands, smart clothing, etc. Among them, the vehicle-mounted device can also be called a vehicle-mounted terminal, a vehicle-mounted controller, a vehicle-mounted module, a vehicle-mounted component, a vehicle-mounted chip or a vehicle-mounted unit, etc. It should be noted that the specific type of the terminal 11 is not limited in the embodiment of the present application.

其中，网络侧设备12可以包括接入网设备或核心网设备。The network side device 12 may include an access network device or a core network device.

其中，接入网设备也可以称为无线接入网(Radio Access Network,RAN)设备、无线接入网功能或无线接入网单元。接入网设备可以包括基站、无线局域网(Wireless Local Area Network，WLAN)接入点(Access Point，AS)或无线保真(Wireless Fidelity，WiFi)节点等。其中，基站可被称为节点B(Node B，NB)、演进节点B(Evolved Node B，eNB)、下一代节点B(the next generation Node B，gNB)、新空口节点B(New Radio Node B，NR Node B)、接入点、中继站(Relay Base Station，RBS)、服务基站(Serving Base Station，SBS)、基收发机站(Base Transceiver Station，BTS)、无线电基站、无线电收发机、基本服务集(Basic Service Set，BSS)、扩展服务集(Extended Service Set，ESS)、家用B节点(home Node B，HNB)、家用演进型B节点(home evolved Node B)、发送接收点(Transmission Reception Point，TRP)或所述领域中其他某个合适的术语，只要达到相同的技术效果，所述基站不限于特定技术词汇，需要说明的是，在本申请实施例中仅以NR系统中的基站为例进行介绍，并不限定基站的具体类型。Among them, the access network equipment can also be called radio access network (Radio Access Network, RAN) equipment, radio access network function or radio access network unit. The access network equipment may include base stations, wireless local area network (Wireless Local Area Network, WLAN) access points (Access Point, AS) or wireless fidelity (Wireless Fidelity, WiFi) nodes, etc. Among them, the base station can be called node B (Node B, NB), evolved node B (Evolved Node B, eNB), next generation node B (the next generation Node B, gNB), new radio node B (New Radio Node B, NR Node B), access point, relay station (Relay Base Station, RBS), serving base station (Serving Base Station, SBS), base transceiver station (Base Transceiver Station, BTS), radio base station, radio transceiver, base The base station is not limited to specific technical terms as long as the same technical effect is achieved. It should be noted that in the embodiments of the present application, only the base station in the NR system is taken as an example for introduction, and the specific type of the base station is not limited.

其中，核心网设备可以包含但不限于如下至少一项：核心网节点、核心网功能、移动管理实体(Mobility Management Entity，MME)、接入移动管理功能(Access and Mobility Management Function，AMF)、会话管理功能(Session Management Function，SMF)、用户平面功能(User Plane Function，UPF)、策略控制功能(Policy Control Function，PCF)、策略与计费规则功能单元(Policy and Charging Rules Function，PCRF)、边缘应用服务发现功能(Edge Application Server Discovery Function，EASDF)、统一数据管理(Unified Data Management，UDM)、统一数据仓储(Unified Data Repository，UDR)、归属用户服务器(Home Subscriber Server，HSS)、集中式网络配置(Centralized network configuration，CNC)、网络存储功能(Network Repository Function，NRF)、网络开放功能(Network Exposure Function，NEF)、本地NEF(Local NEF，或L-NEF)、绑定支持功能(Binding Support Function，BSF)、应用功能(Application Function，AF)、定位管理功能(Location Management Function，LMF)等。需要说明的是，在本申请实施例中仅以NR系统中的核心网设备为例进行介绍，并不限定核心网设备的具体类型。Among them, the core network equipment may include but is not limited to at least one of the following: core network nodes, core network functions, mobility management entity (Mobility Management Entity, MME), access mobility management function (Access and Mobility Management Function, AMF), session management function (Session Management Function, SMF), user plane function (User Plane Function, UPF), policy control function (Policy Control Function, PCF), policy and charging rules function unit (Policy and Charging Rules Function, PCRF), edge application service discovery function (Edge Application Server Discovery Function, EASDF), unified data management (U nified Data Management, UDM), Unified Data Repository (UDR), Home Subscriber Server (HSS), Centralized network configuration (CNC), Network Repository Function (NRF), Network Exposure Function (NEF), Local NEF (L-NEF), Binding Support Function (BSF), Application Function (AF), Location Management Function (LMF), etc. It should be noted that, in the embodiments of the present application, only the core network device in the NR system is taken as an example for introduction, and the specific type of the core network device is not limited.

为便于更好的理解本申请实施例，对本申请相关的技术进行说明。In order to facilitate a better understanding of the embodiments of the present application, the technologies related to the present application are explained.

人工智能(AI)目前在各个领域获得了广泛的应用，将人工智能融入无线通信网络，显著提升吞吐量、时延以及用户容量等技术指标是未来的无线通信网络的重要任务。AI模块有多种实现方式，例如神经网络、决策树、支持向量机、贝叶斯分类器等。本申请以神经网络为例进行说明，但是并不限定AI模块的具体类型。Artificial intelligence (AI) has been widely used in various fields. Integrating artificial intelligence into wireless communication networks and significantly improving technical indicators such as throughput, latency, and user capacity are important tasks for future wireless communication networks. There are many ways to implement AI modules, such as neural networks, decision trees, support vector machines, Bayesian classifiers, etc. This application takes neural networks as an example for illustration, but does not limit the specific type of AI modules.

示例性示出的一个神经网络可以如图2所示，其中，神经网络由神经元组成，神经元的可以如图3所示，其中α₁,α₂,…α_K为输入，w为权值(乘性系数)，b为偏置(加性系数)，σ(.)为激活函数。常见的激活函数包括Sigmoid、tanh、整流线性单元(Rectified Linear Unit，ReLU)等等。An exemplary neural network can be shown in FIG2 , where the neural network is composed of neurons, and the neurons can be shown in FIG3 , where α ₁ , α ₂ , … α _K are inputs, w is a weight (multiplicative coefficient), b is a bias (additive coefficient), and σ(.) is an activation function. Common activation functions include Sigmoid, tanh, Rectified Linear Unit (ReLU), and the like.

神经网络的参数通过梯度优化算法进行优化。梯度优化算法是一类最小化或者最大化目标函数(有时候也叫损失函数)的算法，而目标函数往往是模型参数和数据的数学组合。例如给定数据X和其对应的标签Y，我们构建一个神经网络模型f(.)，有了模型后，根据输入x就可以得到预测输出f(x)，并且可以计算出预测值和真实值之间的差距(f(x)-Y)，这个就是损失函数。模型训练的目的是找到合适的w,b使上述的损失函数的值达到最小，损失值越小，则说明构建的神经网络模型越接近于真实情况。The parameters of the neural network are optimized using a gradient optimization algorithm. A gradient optimization algorithm is a type of algorithm that minimizes or maximizes an objective function (sometimes called a loss function), and the objective function is often a mathematical combination of model parameters and data. For example, given data X and its corresponding label Y, we build a neural network model f(.). With the model, we can get the predicted output f(x) based on the input x, and we can calculate the difference between the predicted value and the true value (f(x)-Y), which is the loss function. The purpose of model training is to find the appropriate w, b to minimize the value of the above loss function. The smaller the loss value, the closer the constructed neural network model is to the actual situation.

示例性的，在神经网络模型的训练过程中，常见的优化算法，基本都是基于误差(error)反向传播(Back Propagation，BP)算法。BP算法的基本思想是，学习过程由信号的正向传播与误差的反向传播两个过程组成。正向传播时，输入样本从输入层传入，经各隐层逐层处理后，传向输出层。若输出层的实际输出与期望的输出不符，则转入误差的反向传播阶段。误差反传是将输出误差以某种形式通过隐层向输入层逐层反传，并将误差分摊给各层的所有单元，从而获得各层单元的误差信号，此误差信号即作为修正各单元权值的依据。这种信号正向传播与误差反向传播的各层权值调整过程，是周而复始地进行的。权值不断调整的过程，也就是网络的学习训练过程。此过程一直进行到网络输出的误差减少到可接受的程度，或进行到预先设定的学习次数为止。For example, in the training process of neural network models, common optimization algorithms are basically based on the error back propagation (BP) algorithm. The basic idea of BP algorithm is that the learning process is a forward propagation of the signal. The error back propagation process consists of two processes: forward propagation and error back propagation. During forward propagation, the input sample is transmitted from the input layer, processed by each hidden layer layer by layer, and then transmitted to the output layer. If the actual output of the output layer does not match the expected output, the error back propagation stage is entered. Error back propagation is to propagate the output error layer by layer through the hidden layer in some form, and distribute the error to all units in each layer, so as to obtain the error signal of each layer unit. This error signal is used as the basis for correcting the weights of each unit. This process of adjusting the weights of each layer of the signal forward propagation and error back propagation is repeated. The process of continuous adjustment of weights is also the learning and training process of the network. This process continues until the error of the network output is reduced to an acceptable level, or until the pre-set number of learning times is reached.

常见的优化算法有梯度下降(Gradient Descent)、随机梯度下降(Stochastic Gradient Descent，SGD)、mini-batch gradient descent(小批量梯度下降)、动量法(Momentum)、Nesterov(具体为带动量的随机梯度下降)、自适应梯度下降(ADAptive GRADient descent，Adagrad)、Adadelta、均方根误差降速(root mean square prop，RMSprop)、自适应动量估计(Adaptive Moment Estimation，Adam)等。Common optimization algorithms include Gradient Descent, Stochastic Gradient Descent (SGD), mini-batch gradient descent, Momentum, Nesterov (specifically stochastic gradient descent with momentum), Adaptive Gradient Descent (Adagrad), Adadelta, root mean square prop (RMSprop), Adaptive Momentum Estimation (Adam), etc.

这些优化算法在误差反向传播时，可以根据损失函数得到的误差/损失，对神经元求导数/偏导，加上学习速率、之前的梯度/导数/偏导等影响，得到梯度，将梯度传给上一层。When these optimization algorithms are backpropagating errors, they can calculate the derivative/partial derivative of neurons based on the error/loss obtained from the loss function, add the influence of the learning rate, the previous gradient/derivative/partial derivative, etc., get the gradient, and pass the gradient to the previous layer.

为便于理解本申请实施例的技术方案，以下通过具体实施例详述本申请的技术方案。以上相关技术作为可选方案与本申请实施例的技术方案可以进行任意结合，其均属于本申请实施例的保护范围。本申请实施例包括以下内容中的至少部分内容。To facilitate understanding of the technical solutions of the embodiments of the present application, the technical solutions of the present application are described in detail below through specific embodiments. The above related technologies can be combined arbitrarily with the technical solutions of the embodiments of the present application as optional solutions, and they all belong to the protection scope of the embodiments of the present application. The embodiments of the present application include at least part of the following contents.

图4是根据本申请实施例的模型性能监督方法200的示意性流程图，如图4所示，该模型性能监督方法200可以包括如下内容中的至少部分内容：FIG4 is a schematic flow chart of a model performance supervision method 200 according to an embodiment of the present application. As shown in FIG4 , the model performance supervision method 200 may include at least part of the following contents:

S210，第一设备获取第一信息，其中，该第一信息用于表征第一AI模型的输出的不确定性或概率分布，该第一AI模型的输入为第一特征；S210: The first device obtains first information, where the first information is used to characterize uncertainty or probability distribution of an output of a first AI model, and an input of the first AI model is a first feature;

S220，该第一设备根据该第一信息和第二信息确定第三信息，其中，该第二信息为该第一特征对应的标签，该第三信息用于确定该第一AI模型的有效性。S220, the first device determines third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

应理解，图4示出了模型性能监督方法200的步骤或操作，但这些步骤或操作仅是示例，本申请实施例还可以执行其他操作或者图4中的各个操作的变形。It should be understood that FIG4 shows the steps or operations of the model performance supervision method 200, but these steps or operations are merely examples, and the embodiments of the present application may also perform other operations or variations of the operations in FIG4.

在本申请实施例中，可以基于第一AI模型的输出的不确定性或概率分布及第一AI模型的输入特征对应的标签确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。In an embodiment of the present application, third information can be determined based on the uncertainty or probability distribution of the output of the first AI model and the label corresponding to the input feature of the first AI model, and the validity of the first AI model can be determined based on the third information, thereby achieving performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning results of the first AI model. Since the reasoning results of the first AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding label can improve the accuracy of the performance supervision of the first AI model.

示例性的，其他种类的信息可以如下：如运动状态信息(如速度、加速度等)，信号质量的度量信息(如参考信号接收功率(Reference Signal Received Power，RSRP)，信号干扰噪声比(Signal to Interference plus Noise Ratio，SINR)，参考信号接收质量(Reference Signal Received Quality，RSRQ)等)。Exemplarily, other types of information may be as follows: motion state information (such as speed, acceleration, etc.), signal quality measurement information (such as reference signal received power (Reference Signal Received Power, RSRP), signal to interference plus noise ratio (Signal to Interference plus Noise Ratio, SINR), reference signal received quality (Reference Signal Received Quality, RSRQ), etc.).

示例性的，基于本申请实施例获得的第一AI模型的有效性的判断结果，也可以与其他的模型监督方法得到的第一AI模型的有效性的判断结果结合，从而得到最终的第一AI模型的有效性的结论。Exemplarily, the judgment result of the effectiveness of the first AI model obtained based on the embodiment of the present application can also be combined with the judgment result of the effectiveness of the first AI model obtained by other model supervision methods to obtain the final conclusion on the effectiveness of the first AI model.

需要说明的是，在本申请实施例中，第一AI模型的输出的不确定性越高，第一AI模型的输出的准确性越低。It should be noted that in the embodiment of the present application, the higher the uncertainty of the output of the first AI model, the lower the accuracy of the output of the first AI model.

本申请实施例所述的标签是通过某种方式获取，与目标任务关联，这个标签可以通过测量或其他先验信息得到；比如基于AI模型的定位，AI模型的输入是信道状态信息，AI模型输出是位置的不确定性或概率分布，那么这个标签是在信道状态信息所对应的位置信息，位置信息可以通过GPS及其他定位方法得到，或者已知位置的定位参考单元得到。The tag described in the embodiment of the present application is obtained in some way and is associated with the target task. This tag can be obtained through measurement or other prior information; for example, based on the positioning of the AI model, the input of the AI model is the channel state information, and the output of the AI model is the uncertainty or probability distribution of the position. Then this tag is the position information corresponding to the channel state information. The position information can be obtained by GPS and other positioning methods, or by a positioning reference unit with a known position. arrive.

在本申请实施例中，第一信息也可以称之为软信息(如概率分布、置信度、置信区间等)，其中，软信息给出了可能结果的概率分布或置信度。可选地，第一AI模型可以是软信息AI模型，也可以不是软信息AI模型。其中，软信息AI模型是指输出为软信息(如概率分布、置信度、置信区间等)的一类AI模型，既包括经典的概率模型、也包括基于神经网络的AI模型；软信息AI模型度量了不同预测结果的可能性，并给出每个可能结果的概率分布或置信度。In an embodiment of the present application, the first information may also be referred to as soft information (such as probability distribution, confidence level, confidence interval, etc.), wherein the soft information gives the probability distribution or confidence level of the possible results. Optionally, the first AI model may be a soft information AI model or may not be a soft information AI model. Among them, the soft information AI model refers to a type of AI model that outputs soft information (such as probability distribution, confidence level, confidence interval, etc.), including both classic probability models and AI models based on neural networks; the soft information AI model measures the possibility of different prediction results and gives the probability distribution or confidence level of each possible result.

需要说明的是，相比于AI模型推理得到硬信息(hard value)(如视距到达时间(Time of Arrival，TOA)、参考信号时差(Reference Signal Time Difference，RSTD)、到达角(Angle of Arrival，AoA)、离开角(Angle of Departure，AoD)、参考信号接收功率(Reference Signal Received Power，RSRP)、视距(Line Of Sight，LOS)指示、非视距(Non Line Of Sight，NLOS)指示等)，AI模型推理得到软信息能够显著提升推理精度和鲁棒性。具体的，软信息能够更好地描述世界的不确定性，能够提升模型推理的鲁棒性，用于一些对推理可靠性要求比较的业务也能够提供较好的安全性。It should be noted that compared with the hard information (hard value) obtained by AI model reasoning (such as Time of Arrival (TOA), Reference Signal Time Difference (RSTD), Angle of Arrival (AoA), Angle of Departure (AoD), Reference Signal Received Power (RSRP), Line of Sight (LOS) indication, Non Line of Sight (NLOS) indication, etc.), the soft information obtained by AI model reasoning can significantly improve the reasoning accuracy and robustness. Specifically, soft information can better describe the uncertainty of the world, improve the robustness of model reasoning, and provide better security for some businesses that require relatively high reasoning reliability.

在一些实施例中，第一特征的维度为D1维，对应的，第一信息为第二特征的D2组软信息，其中，D2组软信息描述的是第一AI模型输出的范围，如只有一组软信息(即D2＝1)，那这组软信息描述的是整个第一AI模型输出的不确定性；如有至少两组软信息(即D2≥2)，那每组软信息描述的是第一AI模型输出的部分特征的不确定性。In some embodiments, the dimension of the first feature is D1 dimension, and correspondingly, the first information is D2 group of soft information of the second feature, wherein the D2 group of soft information describes the range of the output of the first AI model. If there is only one group of soft information (ie, D2=1), then this group of soft information describes the uncertainty of the output of the entire first AI model; if there are at least two groups of soft information (ie, D2≥2), then each group of soft information describes the uncertainty of some features of the output of the first AI model.

例如，对于预测任务：每一个预测时刻对应一组软信息，或者每一个预测时刻的每一个特征对应一组软信息；对于其他任务，每一个特征对应一组软信息，或所有特征对应一组软信息，比如第一AI模型输出是2维，那么对应2组软信息。For example, for prediction tasks: each prediction moment corresponds to a set of soft information, or each feature at each prediction moment corresponds to a set of soft information; for other tasks, each feature corresponds to a set of soft information, or all features correspond to a set of soft information. For example, if the output of the first AI model is 2-dimensional, then it corresponds to 2 sets of soft information.

例如，对于定位任务，第一AI模型输出是2维的水平位置坐标，2维的水平位置坐标对应一组软信息，或，2维的水平位置坐标中的每一维的水平位置坐标对应一组软信息。For example, for a positioning task, the first AI model output is a 2-dimensional horizontal position coordinate, the 2-dimensional horizontal position coordinate corresponds to a set of soft information, or each dimension of the 2-dimensional horizontal position coordinate corresponds to a set of soft information.

在一些实施例中，本申请中所述的AI模型也可称为AI单元、AI模型/AI单元、机器学习(machine learning，ML)模型、ML单元、AI结构、AI功能、AI特性、神经网络、神经网络函数、神经网络功能等，或者，本申请中所述的AI模型也可以是指能够实现与AI相关的特定的算法、公式、处理流程、能力等的处理单元，或者，本申请中所述的AI模型可以是针对特定数据集的处理方法、算法、功能、模块或单元，或者，本申请中所述的AI模型可以是运行在图形处理单元(graphics processing unit，GPU)、神经网络处理单元(neural processing unit，NPU)、张量处理单元(tensor processing unit，TPU)、专用集成电路(application specific integrated circuit，ASIC)等AI/ML相关硬件上的处理方法、算法、功能、模块或单元，本申请对此不做具体限定。可选地，所述特定数据集包括AI模型的输入或输出。In some embodiments, the AI model described in this application may also be referred to as an AI unit, an AI model/AI unit, a machine learning (ML) model, an ML unit, an AI structure, an AI function, an AI feature, a neural network, a neural network function, a neural network function, etc., or the AI model described in this application may also refer to a processing unit capable of implementing specific algorithms, formulas, processing procedures, capabilities, etc. related to AI, or the AI model described in this application may be a processing method, algorithm, function, module or unit for a specific data set, or the AI model described in this application may be a processing method, algorithm, function, module or unit running on AI/ML related hardware such as a graphics processing unit (GPU), a neural processing unit (NPU), a tensor processing unit (TPU), an application specific integrated circuit (ASIC), etc., and this application does not specifically limit this. Optionally, the specific data set includes the input or output of the AI model.

在一些实施例中，本申请中所述的AI模型的标识，可以是AI单元标识、AI结构标识、AI算法标识，或者，本申请中所述的AI模型关联的特定数据集的标识，或者，本申请中所述的AI模型相关的特定场景、环境、信道特征、设备的标识，或者，本申请中所述的AI模型相关的功能、特性、能力或模块的标识，本申请对此不做具体限定。In some embodiments, the identifier of the AI model described in this application may be an AI unit identifier, an AI structure identifier, an AI algorithm identifier, or an identifier of a specific data set associated with the AI model described in this application, or an identifier of a specific scenario, environment, channel feature, or device related to the AI model described in this application, or an identifier of a function, feature, capability, or module related to the AI model described in this application. This application does not make any specific limitations on this.

需要说明的是，AI模型的泛化能力有限，基于一个场景的数据训练的模型，应用到另一个场景可能会失效；甚至基于同一个场景训练的数据的模型，随着时间的推移，该AI模型也会失效；所述失效是指AI模型的推理精度降低无法达到目标要求，因此，需要监督AI模型的性能。It should be noted that the generalization ability of AI models is limited. A model trained based on data from one scenario may fail when applied to another scenario. Even a model trained based on data from the same scenario will fail over time. Failure refers to the reduction in reasoning accuracy of the AI model to the point where it cannot meet the target requirements. Therefore, the performance of the AI model needs to be supervised.

在一些实施例中，第一AI模型可以为处于激活态的模型，或，第一AI模型可以为处于非激活态的模型。具体的，在第一AI模型为处于非激活态的模型的情况下，在确定第一AI模型的有效性之后，进行AI模型选取时可以从至少两个AI模型中优先选取有效的AI模型，从而有益于AI模型的选取。 In some embodiments, the first AI model may be an activated model, or the first AI model may be an inactivated model. Specifically, when the first AI model is an inactivated model, after determining the validity of the first AI model, when performing AI model selection, an effective AI model may be preferentially selected from at least two AI models, thereby facilitating the selection of an AI model.

在一些实施例中，该第一信息包括但不限于以下至少之一：第二特征的概率密度分布的参数，第二特征的置信区间，第二特征的值，第二特征的值及其概率，第二特征的值及其置信度。本申请实施例明确了第一信息所包含的内容，有益于实现AI模型的性能监督。In some embodiments, the first information includes but is not limited to at least one of the following: a parameter of the probability density distribution of the second feature, a confidence interval of the second feature, a value of the second feature, a value of the second feature and its probability, and a value of the second feature and its confidence. The embodiment of the present application clarifies the content of the first information, which is beneficial to realizing the performance supervision of the AI model.

示例性的，该第一信息所包含的信息的数量可以是一个或至少两个，也即，该第一信息可以包括但不限于以下至少之一：一个或至少两个第二特征的概率密度分布的参数，一个或至少两个第二特征的置信区间，一个或至少两个第二特征的值，一个或至少两个第二特征的值及其概率，一个或至少两个第二特征的值及其置信度。Exemplarily, the amount of information contained in the first information may be one or at least two, that is, the first information may include but is not limited to at least one of the following: parameters of the probability density distribution of one or at least two second features, confidence intervals of one or at least two second features, values of one or at least two second features, values of one or at least two second features and their probabilities, and values of one or at least two second features and their confidence levels.

在一些实施例中，该概率密度分布的参数包括但不限于以下至少之一：概率密度分布的均值或期望，概率密度分布的方差或标准差，概率密度分布的类型指示。In some embodiments, the parameters of the probability density distribution include, but are not limited to, at least one of the following: a mean or expectation of the probability density distribution, a variance or standard deviation of the probability density distribution, and an indication of the type of the probability density distribution.

例如，概率密度分布的类型可以包括但不限于以下至少之一：高斯分布，泊松分布。For example, the type of probability density distribution may include but is not limited to at least one of the following: Gaussian distribution, Poisson distribution.

示例性的，对于高斯分布，置信区间、置信度与均值和标准差存在转换关系，如典型的均值为μ，标准差为σ的高斯分布。For example, for a Gaussian distribution, there is a conversion relationship between the confidence interval, the confidence level, the mean, and the standard deviation, such as a typical Gaussian distribution with a mean of μ and a standard deviation of σ.

例如，90％置信区间为：[μ-1.645σ，μ+1.645σ]，含义是预测目标值有90％的概率在[μ-1.645σ，μ+1.645σ]区间之内。For example, the 90% confidence interval is: [μ-1.645σ, μ+1.645σ], which means that there is a 90% probability that the predicted target value is within the interval [μ-1.645σ, μ+1.645σ].

又例如，95％置信区间为：[μ-1.96σ，μ+1.96σ]，含义是预测目标值有95％的概率在[μ-1.96σ，μ+1.96σ]区间之内。For example, the 95% confidence interval is: [μ-1.96σ, μ+1.96σ], which means that there is a 95% probability that the predicted target value is within the interval [μ-1.96σ, μ+1.96σ].

示例性的，置信区间、置信度与均值和标准差可以存在转换关系，其中，均值为μ，标准差为σ，可以通过概率密度分布函数或概率密度分布的类型得到系数z，置信度为p％的置信区间可以如下：
[μ-z_p％σ，μ+z_p％σ]。Exemplarily, there may be a conversion relationship between the confidence interval, the confidence level, the mean, and the standard deviation, where the mean is μ, the standard deviation is σ, the coefficient z may be obtained by the probability density distribution function or the type of probability density distribution, and the confidence interval with a confidence level of p% may be as follows:
[μ-z _p% σ, μ+z _p% σ].

在一些实施例中，该第一信息为该第一AI模型的输出信息，或者，该第一信息为基于该第一AI模型的输出信息确定的信息。In some embodiments, the first information is output information of the first AI model, or the first information is information determined based on the output information of the first AI model.

示例性的，第一AI模型的输出信息为如下至少之一：第二特征的概率密度分布的参数，第二特征的置信区间，第二特征的值，第二特征的值及其概率，第二特征的值及其置信度。即该第一信息为该第一AI模型的输出信息。Exemplarily, the output information of the first AI model is at least one of the following: a parameter of the probability density distribution of the second feature, a confidence interval of the second feature, a value of the second feature, a value of the second feature and its probability, and a value of the second feature and its confidence. That is, the first information is the output information of the first AI model.

示例性的，第一AI模型的输出信息为第二特征的值，可以基于第二特征的值确定第一信息。可选地，基于第二特征的值及其他信息(如运动状态信息(速度、加速度等)，信号质量的度量信息(如RSRP、SINR、RSRQ等))确定第一信息。例如，信号质量越差，则基于第二特征的值及其他信息得到的第二特征的不确定性越高；又例如，运动速度越快，则基于第二特征的值及其他信息得到的第二特征的不确定性越高。Exemplarily, the output information of the first AI model is the value of the second feature, and the first information can be determined based on the value of the second feature. Optionally, the first information is determined based on the value of the second feature and other information (such as motion state information (speed, acceleration, etc.), signal quality measurement information (such as RSRP, SINR, RSRQ, etc.)). For example, the worse the signal quality, the higher the uncertainty of the second feature obtained based on the value of the second feature and other information; for another example, the faster the movement speed, the higher the uncertainty of the second feature obtained based on the value of the second feature and other information.

例如，第一AI模型的输出信息为第二特征的值，可以基于第二特征的值确定如下至少之一：第二特征的概率密度分布的参数，第二特征的置信区间，第二特征的值的概率，第二特征的值的置信度。即该第一信息为基于该第一AI模型的输出信息确定的信息。For example, if the output information of the first AI model is the value of the second feature, at least one of the following can be determined based on the value of the second feature: a parameter of the probability density distribution of the second feature, a confidence interval of the second feature, a probability of the value of the second feature, and a confidence level of the value of the second feature. That is, the first information is information determined based on the output information of the first AI model.

在一些实施例中，该第一AI模型可以用于实现如下功能中的一项：定位，波束管理，信道状态信息(Channel State Information，CSI)预测，移动性管理，CSI压缩。当然，该第一AI模型也可以用于实现其他功能，本申请对此并不限定。In some embodiments, the first AI model can be used to implement one of the following functions: positioning, beam management, channel state information (CSI) prediction, mobility management, and CSI compression. Of course, the first AI model can also be used to implement other functions, which is not limited in this application.

具体的，本实施例所述的模型性能监督方法200可以用于实现至少两种不同的功能，也即，本实施例所述的模型性能监督方法200可以适用于公共的基于标签的AI模型监督框架，从而可以避免针对不同的功能，分别设计性能监督方案。Specifically, the model performance supervision method 200 described in this embodiment can be used to implement at least two different functions, that is, the model performance supervision method 200 described in this embodiment can be applicable to a public label-based AI model supervision framework, thereby avoiding the need to design performance supervision solutions for different functions.

在一些实施例中，在第一AI模型用于实现定位功能的情况下，第一特征可以包括但不限于以下至少之一：时域信道脉冲响应，RSRP(如层1RSRP或层3RSRP)，频域信道脉冲响应，接收信号的时域波形。可选地，时域信道脉冲响应包括以下至少之一：时间信息，功率信息，相位信息。可选地，频域信道脉冲响应包括以下至少之一：频率信息 (子载波序号及间隔)，功率信息，相位信息。In some embodiments, when the first AI model is used to implement the positioning function, the first feature may include but is not limited to at least one of the following: time domain channel impulse response, RSRP (such as layer 1 RSRP or layer 3 RSRP), frequency domain channel impulse response, time domain waveform of the received signal. Optionally, the time domain channel impulse response includes at least one of the following: time information, power information, phase information. Optionally, the frequency domain channel impulse response includes at least one of the following: frequency information (subcarrier number and interval), power information, phase information.

在一些实施例中，在第一AI模型用于实现定位功能的情况下，第二特征可以包括但不限于以下至少之一：视距TOA、RSTD、AoA、AoD、RSRP、LOS指示、NLOS指示。In some embodiments, when the first AI model is used to implement the positioning function, the second feature may include but is not limited to at least one of the following: line-of-sight TOA, RSTD, AoA, AoD, RSRP, LOS indication, NLOS indication.

具体例如，在第一AI模型用于实现定位功能的情况下，第一AI模型的输入(即第一特征)为时域信道脉冲响应，第一AI模型的输出为中间特征量(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，中间特征量包括如下至少之一：视距TOA、RSTD、AoA、AoD、RSRP、LOS指示、NLOS指示等，基于中间特征量的软信息可以进一步确定位置坐标；第一AI模型的输出也可以为位置坐标的软信息。For example, when the first AI model is used to realize the positioning function, the input of the first AI model (i.e., the first feature) is the time domain channel impulse response, and the output of the first AI model is the soft information of the intermediate feature quantity (i.e., the second feature) (such as parameters of the probability density distribution, confidence interval, confidence level, etc.), and the intermediate feature quantity includes at least one of the following: line of sight TOA, RSTD, AoA, AoD, RSRP, LOS indication, NLOS indication, etc. The position coordinates can be further determined based on the soft information of the intermediate feature quantity; the output of the first AI model may also be the soft information of the position coordinates.

在一些实施例中，在第一AI模型用于实现波束管理功能的情况下，第一特征可以包括历史T1个时刻的波束信息，如序号、角度、L1-RSRP等，第二特征包括未来T2个时刻的波束信息。In some embodiments, when the first AI model is used to implement the beam management function, the first feature may include beam information at T1 historical moments, such as sequence number, angle, L1-RSRP, etc., and the second feature includes beam information at T2 future moments.

具体例如，在第一AI模型用于实现波束管理功能的情况下，第一AI模型的输入(即第一特征)为历史T1个时刻的波束信息，如序号、角度、L1-RSRP等，第一AI模型的输出为未来T2个时刻的波束信息(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如每一个时刻的垂直方向的波束和水平方向波束分别对应一组软信息，如L1-RSRP的置信区间、概率密度分布等，如每一个时刻的波束信息对应一组软信息。For example, when the first AI model is used to implement the beam management function, the input of the first AI model (i.e., the first feature) is the beam information at T1 historical moments, such as serial number, angle, L1-RSRP, etc., and the output of the first AI model is the soft information (such as parameters of probability density distribution, confidence interval, confidence level, etc.) of the beam information (i.e., the second feature) at T2 future moments. For example, the vertical beam and the horizontal beam at each moment correspond to a set of soft information, such as the confidence interval and probability density distribution of L1-RSRP, etc., and the beam information at each moment corresponds to a set of soft information.

在一些实施例中，在第一AI模型用于实现CSI预测功能的情况下，第一特征可以包括历史T1个时刻的CSI，第二特征可以包括未来T2个时刻的CSI。In some embodiments, when the first AI model is used to implement the CSI prediction function, the first feature may include the CSI at T1 historical moments, and the second feature may include the CSI at T2 future moments.

具体例如，在第一AI模型用于实现CSI预测功能的情况下，第一AI模型的输入(即第一特征)为历史T1个时刻的CSI，第一AI模型的输出为未来T2个时刻的CSI(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如每一个时刻的CSI对应一组软信息，如每一个时刻的CSI的每一个维度对应一组软信息，或每一个时刻的CSI对应一组软信息。For example, when the first AI model is used to implement the CSI prediction function, the input of the first AI model (i.e., the first feature) is the CSI at T1 historical moments, and the output of the first AI model is the soft information (such as parameters of the probability density distribution, confidence interval, confidence level, etc.) of the CSI at T2 moments in the future (i.e., the second feature), such as the CSI at each moment corresponds to a set of soft information, such as each dimension of the CSI at each moment corresponds to a set of soft information, or the CSI at each moment corresponds to a set of soft information.

在一些实施例中，在第一AI模型用于实现移动性管理功能的情况下，第一特征可以包括历史T1个时刻的层1RSRP(L1-RSRP)或层3RSRP(L3-RSRP)，第二特征可以包括未来T2个时刻的RSRP或未来T2个时刻的是否发生小区切换的决策。需要说明的是，L3-RSRP为L1-RSRP经过滤波得到的。In some embodiments, when the first AI model is used to implement the mobility management function, the first feature may include the layer 1 RSRP (L1-RSRP) or layer 3 RSRP (L3-RSRP) at the historical T1 moment, and the second feature may include the RSRP at the future T2 moments or the decision of whether a cell switching occurs at the future T2 moments. It should be noted that L3-RSRP is obtained by filtering L1-RSRP.

具体例如，在第一AI模型用于实现移动性管理功能的情况下，第一AI模型的输入(即第一特征)为历史T1个时刻的L1-RSRP或L3-RSRP，第一AI模型的输出为未来T2个时刻的RSRP(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)；或第一AI模型的输出为未来T2个时刻的是否发生小区切换的决策(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如1表示切换，0表示不切换，软信息为0～1之间的值。For example, when the first AI model is used to implement the mobility management function, the input of the first AI model (i.e., the first feature) is the L1-RSRP or L3-RSRP at T1 historical moments, and the output of the first AI model is the soft information (such as parameters of probability density distribution, confidence interval, confidence level, etc.) of the RSRP (i.e., the second feature) at T2 moments in the future; or the output of the first AI model is the soft information (such as parameters of probability density distribution, confidence interval, confidence level, etc.) of the decision on whether a cell switching will occur (i.e., the second feature) at T2 moments in the future, such as 1 for switching and 0 for no switching, and the soft information is a value between 0 and 1.

在一些实施例中，在第一AI模型用于实现CSI压缩功能的情况下，第一特征可以包括未压缩的CSI，第二特征可以包括压缩后的CSI。In some embodiments, when the first AI model is used to implement the CSI compression function, the first feature may include uncompressed CSI, and the second feature may include compressed CSI.

具体例如，在第一AI模型用于实现CSI压缩功能的情况下，第一AI模型的输入(即第一特征)为未压缩的CSI，第一AI模型的输出为压缩后的CSI(即第二特征)的软信息(如概率密度分布的参数、置信区间、置信度等)。For example, when the first AI model is used to implement the CSI compression function, the input of the first AI model (ie, the first feature) is the uncompressed CSI, and the output of the first AI model is the soft information of the compressed CSI (ie, the second feature) (such as parameters of the probability density distribution, confidence interval, confidence level, etc.).

在一些实施例中，该第一特征对应的标签(即第二信息)可以是真实的，也可以是测量或估计得到的，本申请实施例对此并不限定。In some embodiments, the label corresponding to the first feature (ie, the second information) may be real, or may be measured or estimated, and this is not limited in the embodiments of the present application.

需要说明的是，第一特征对应的标签的类型与第一AI模型的输出的类型一致。例如，均为位置信息、均为TOA、均为CSI、均为RSRP等；这取决于具体的任务类型。It should be noted that the type of label corresponding to the first feature is consistent with the type of output of the first AI model. For example, both are location information, both are TOA, both are CSI, both are RSRP, etc.; this depends on the specific task type.

在一些实施例中，该第一特征对应的标签(即第二信息)可以是第一设备测量或估计或存储的，或，该第一特征对应的标签(即第二信息)可以是第一设备从第二设备或其他设备处获取的。 In some embodiments, the label corresponding to the first feature (ie, the second information) may be measured, estimated, or stored by the first device, or the label corresponding to the first feature (ie, the second information) may be obtained by the first device from the second device or other devices.

在一些实施例中，在上述S210中，第一设备获取第一信息，包括以下其中一项：In some embodiments, in the above S210, the first device obtains the first information, including one of the following:

第一设备从第二设备接收第一信息；The first device receives first information from the second device;

第一设备通过第一AI模型的输出信息，获取第一信息；The first device obtains first information through output information of the first AI model;

第一设备从第二设备接收第一AI模型的输出信息，根据第一AI模型的输出信息，获得第一信息。The first device receives output information of the first AI model from the second device, and obtains first information according to the output information of the first AI model.

在一些实施例中，在第一设备通过第一AI模型的输出信息，获取第一信息的情况下，第一AI模型可以部署在第一设备侧。In some embodiments, when the first device obtains the first information through output information of the first AI model, the first AI model can be deployed on the first device side.

在一些实施例中，在第一设备从第二设备接收第一信息的情况下，第一AI模型可以部署在第二设备侧。例如，第二设备通过第一AI模型的输出信息得到第一信息，以及第二设备向第一设备发送第一信息。In some embodiments, when the first device receives the first information from the second device, the first AI model can be deployed on the second device side. For example, the second device obtains the first information through the output information of the first AI model, and the second device sends the first information to the first device.

在一些实施例中，在第一设备从第二设备接收第一AI模型的输出信息，根据第一AI模型的输出信息，获得第一信息的情况下，第一AI模型可以部署在第二设备侧。In some embodiments, when the first device receives output information of the first AI model from the second device and obtains the first information based on the output information of the first AI model, the first AI model can be deployed on the second device side.

在一些实施例中，第一设备根据第三信息确定第一AI模型的有效性。具体的，在第一设备根据第一信息和第二信息确定第三信息之后，第一设备根据第三信息确定第一AI模型的有效性。可选地，第一设备向第二设备发送第一指示信息，其中，第一指示信息用于指示第一AI模型的有效性。例如，第一指示信息占用1比特；其中，取值0表示第一AI模型有效，取值1表示第一AI模型无效；或，取值1表示第一AI模型有效，取值0表示第一AI模型无效。可选地，该第一指示信息包括该第一AI模型的标识或该第一AI模型所关联的功能的标识。In some embodiments, the first device determines the validity of the first AI model based on the third information. Specifically, after the first device determines the third information based on the first information and the second information, the first device determines the validity of the first AI model based on the third information. Optionally, the first device sends first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model. For example, the first indication information occupies 1 bit; wherein a value of 0 indicates that the first AI model is valid, and a value of 1 indicates that the first AI model is invalid; or, a value of 1 indicates that the first AI model is valid, and a value of 0 indicates that the first AI model is invalid. Optionally, the first indication information includes an identifier of the first AI model or an identifier of a function associated with the first AI model.

在一些实施例中，第一设备向第二设备发送第三信息。进一步地，第二设备可以根据第三信息确定第一AI模型的有效性。可选地，第一设备从第二设备接收第二指示信息，其中，第二指示信息用于指示第一AI模型的有效性。例如，第二指示信息占用1比特；其中，取值0表示第一AI模型有效，取值1表示第一AI模型无效；或，取值1表示第一AI模型有效，取值0表示第一AI模型无效。可选地，该第二指示信息包括该第一AI模型的标识或该第一AI模型所关联的功能的标识。In some embodiments, the first device sends third information to the second device. Further, the second device may determine the validity of the first AI model based on the third information. Optionally, the first device receives second indication information from the second device, wherein the second indication information is used to indicate the validity of the first AI model. For example, the second indication information occupies 1 bit; wherein a value of 0 indicates that the first AI model is valid, and a value of 1 indicates that the first AI model is invalid; or, a value of 1 indicates that the first AI model is valid, and a value of 0 indicates that the first AI model is invalid. Optionally, the second indication information includes an identifier of the first AI model or an identifier of a function associated with the first AI model.

示例性的，在第一AI模型部署在第一设备侧的情况下，第一设备通过第一AI模型的输出信息获取第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备根据第三信息确定第一AI模型的有效性，最后，第一设备向第二设备发送第一指示信息，其中，第一指示信息用于指示第一AI模型的有效性。Exemplarily, when the first AI model is deployed on the first device side, the first device obtains first information through the output information of the first AI model, then the first device obtains second information from a local or other device (such as a second device or a device other than the first device and the second device), and the first device determines third information based on the first information and the second information, and then, the first device determines the validity of the first AI model based on the third information, and finally, the first device sends first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model.

示例性的，在第一AI模型部署在第一设备侧的情况下，第一设备通过第一AI模型的输出信息获取第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备向第二设备发送第三信息，之后，第二设备根据第三信息确定第一AI模型的有效性，最后，第二设备向第一设备发送第二指示信息，其中，第二指示信息用于指示第一AI模型的有效性。Exemplarily, when the first AI model is deployed on the first device side, the first device obtains the first information through the output information of the first AI model, then the first device obtains the second information from the local or other device (such as the second device or a device other than the first device and the second device), and the first device determines the third information based on the first information and the second information, and then the first device sends the third information to the second device, after which the second device determines the validity of the first AI model based on the third information, and finally, the second device sends the second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model.

示例性的，在第一AI模型部署在第二设备侧的情况下，第二设备通过第一AI模型的输出信息得到第一信息，以及第二设备向第一设备发送第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备根据第三信息确定第一AI模型的有效性，最后，第一设备向第二设备发送第一指示信息，其中，第一指示信息用于指示第一AI模型的有效性。Exemplarily, when the first AI model is deployed on the second device side, the second device obtains the first information through the output information of the first AI model, and the second device sends the first information to the first device. Then, the first device obtains the second information from the local or other devices (such as the second device or a device other than the first device and the second device), and the first device determines the third information based on the first information and the second information. Then, the first device determines the validity of the first AI model based on the third information. Finally, the first device sends the first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model.

示例性的，在第一AI模型部署在第二设备侧的情况下，第二设备通过第一AI模型的输出信息得到第一信息，以及第二设备向第一设备发送第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备向第二设备发送第三信息，之后，第二设备根据第三信息确定第一AI模型的有效性，最后，第二设备向第一设备发送第二指示信息，其中，第二指示信息用于指示第一AI模型的有效性。Exemplarily, in the case where the first AI model is deployed on the second device side, the second device obtains the first information through the output information of the first AI model, and the second device sends the first information to the first device, then the first device obtains the second information from the local or other device (such as the second device or a device other than the first device and the second device), and the first device determines the third information based on the first information and the second information, and then the first device sends the third information to the second device. Three information, then, the second device determines the validity of the first AI model according to the third information, and finally, the second device sends second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model.

示例性的，在第一AI模型部署在第二设备侧的情况下，第一设备从第二设备接收第一AI模型的输出信息，以及第一设备根据第一AI模型的输出信息，获得第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备根据第三信息确定第一AI模型的有效性，最后，第一设备向第二设备发送第一指示信息，其中，第一指示信息用于指示第一AI模型的有效性。Exemplarily, when the first AI model is deployed on the second device side, the first device receives the output information of the first AI model from the second device, and the first device obtains the first information based on the output information of the first AI model. Then, the first device obtains the second information from the local or other device (such as the second device or a device other than the first device and the second device), and the first device determines the third information based on the first information and the second information. Then, the first device determines the validity of the first AI model based on the third information. Finally, the first device sends the first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model.

示例性的，在第一AI模型部署在第二设备侧的情况下，第一设备从第二设备接收第一AI模型的输出信息，以及第一设备根据第一AI模型的输出信息，获得第一信息，接着，第一设备从本地或其他设备(如第二设备或除第一设备和第二设备之外的设备)获取第二信息，以及第一设备根据第一信息和第二信息确定第三信息，然后，第一设备向第二设备发送第三信息，之后，第二设备根据第三信息确定第一AI模型的有效性，最后，第二设备向第一设备发送第二指示信息，其中，第二指示信息用于指示第一AI模型的有效性。Exemplarily, when the first AI model is deployed on the second device side, the first device receives the output information of the first AI model from the second device, and the first device obtains the first information based on the output information of the first AI model. Then, the first device obtains the second information from the local or other device (such as the second device or a device other than the first device and the second device), and the first device determines the third information based on the first information and the second information. Then, the first device sends the third information to the second device, and then, the second device determines the validity of the first AI model based on the third information. Finally, the second device sends the second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model.

在一些实施例中，该第一设备可以是终端、网络侧设备或第三方服务器，其中，该网络侧设备包括接入网设备或核心网设备。In some embodiments, the first device may be a terminal, a network side device, or a third-party server, wherein the network side device includes an access network device or a core network device.

在一些实施例中，该第二设备可以是终端、网络侧设备或第三方服务器，其中，该网络侧设备包括接入网设备或核心网设备。In some embodiments, the second device may be a terminal, a network side device, or a third-party server, wherein the network side device includes an access network device or a core network device.

在一些实施例中，在第一信息包括第二特征的概率密度分布的参数的情况下，第三信息用于表征第一特征的N个样本的第一概率的均值，或，第三信息用于表征第一特征的N个样本的第一概率的对数的均值；其中，该第一概率为该N个样本中每一个样本对应的标签在该第二特征的概率密度分布下的概率，N为正整数。可选地，N个样本也可以替换为第一AI模型的N次推理过程。In some embodiments, when the first information includes parameters of the probability density distribution of the second feature, the third information is used to characterize the mean of the first probability of N samples of the first feature, or the third information is used to characterize the mean of the logarithm of the first probability of N samples of the first feature; wherein the first probability is the probability of the label corresponding to each sample in the N samples under the probability density distribution of the second feature, and N is a positive integer. Optionally, the N samples can also be replaced by N inference processes of the first AI model.

示例性的，在第一信息包括第二特征的概率密度分布的参数的情况下，第三信息可以为对数似然值。Exemplarily, when the first information includes parameters of the probability density distribution of the second feature, the third information may be a log-likelihood value.

在一些实施例中，在第三信息用于表征第一特征的N个样本的第一概率的均值的情况下，该第三信息基于以下公式1确定：
In some embodiments, when the third information is used to characterize the mean of the first probabilities of N samples of the first feature, the third information is determined based on the following formula 1:

其中，L₁表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的标准差或方差，表示该第i个样本对应的第二特征的概率密度分布的均值(如统计平均值)，y_i表示该第i个样本对应的标签，g(·)表示该第一AI模型的输出所服从的概率密度分布函数。Wherein, _L1 represents the third information, i represents the i-th sample among the N samples, Represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, represents the mean (such as the statistical mean) of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在一些实施例中，在第三信息用于表征第一特征的N个样本的第一概率的对数的均值的情况下，该第三信息基于以下公式2确定：
In some embodiments, when the third information is used to characterize the mean of the logarithm of the first probability of N samples of the first feature, the third information is determined based on the following formula 2:

其中，L₂表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的均值(如统计平均值)，表示该第i个样本对应的第二特征的概率密度分布的标准差或方差，y_i表示该第i个样本对应的标签，s为正整数，g(·)表示该第一AI模型的输出所服从的概率密度分布函数。Wherein, L ₂ represents the third information, i represents the i-th sample among the N samples, represents the mean (such as the statistical mean) of the probability density distribution of the second feature corresponding to the i-th sample, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, s is a positive integer, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

可选地，s由协议约定，或，s由第一设备确定，或，s由第二设备配置或指示。 Optionally, s is agreed upon by a protocol, or s is determined by the first device, or s is configured or indicated by the second device.

在一些实施例中，在上述公式1或公式2中，g(·)与概率密度分布的类型相关，如可以基于概率密度分布的类型确定g(·)。In some embodiments, in the above formula 1 or formula 2, g(·) is related to the type of probability density distribution, for example, g(·) can be determined based on the type of probability density distribution.

例如，以概率密度分布的类型为高斯分布为例，上述公式1可以变型为如下公式3。
For example, taking the Gaussian distribution as the type of probability density distribution, the above formula 1 can be transformed into the following formula 3.

例如，以概率密度分布的类型为高斯分布为例，上述公式2可以变型为如下公式4。
For example, taking the Gaussian distribution as the type of probability density distribution, the above formula 2 can be transformed into the following formula 4.

在一些实施例中，在第一信息包括第二特征的概率密度分布的参数的情况下，该第三信息用于确定该第一AI模型的有效性，包括：In some embodiments, when the first information includes parameters of the probability density distribution of the second feature, the third information is used to determine the validity of the first AI model, including:

在该第三信息大于或等于第一阈值的情况下，该第一AI模型有效；或，When the third information is greater than or equal to the first threshold, the first AI model is valid; or,

在该第三信息小于第一阈值的情况下，该第一AI模型失效。When the third information is less than the first threshold, the first AI model fails.

可选地，第一阈值由协议约定，或，第一阈值由第一设备确定，或，第一阈值由第二设备配置或指示。Optionally, the first threshold is agreed upon by a protocol, or the first threshold is determined by the first device, or the first threshold is configured or indicated by the second device.

在一些实施例中，在第一信息包括第二特征的置信区间的情况下，第三信息用于表征第一特征的N个样本中，对应的标签位于第二特征的置信区间内的样本数与总样本数N的比值，其中，N为正整数。可选地，N个样本也可以替换为第一AI模型的N次推理过程。In some embodiments, when the first information includes the confidence interval of the second feature, the third information is used to characterize the ratio of the number of samples whose corresponding labels are within the confidence interval of the second feature among the N samples of the first feature to the total number of samples N, where N is a positive integer. Optionally, the N samples can also be replaced by N inference processes of the first AI model.

在一些实施例中，在第一信息包括第二特征的置信区间的情况下，该第三信息基于以下公式5确定：
In some embodiments, when the first information includes a confidence interval of the second feature, the third information is determined based on the following formula 5:

其中，L₃表示该第三信息，i表示该N个样本中的第i个样本，x_i表示该第i个样本对应的输入信息，y_i表示该第i个样本对应的标签，当y_i位于该第二特征的置信区间f(x_i)之内时，否则 Wherein, L ₃ represents the third information, i represents the i-th sample in the N samples, x _i represents the input information corresponding to the i-th sample, y _i represents the label corresponding to the i-th sample, when y _i is within the confidence interval f(x _i ) of the second feature, otherwise

在一些实施例中，在第一信息包括第二特征的置信区间的情况下，该第三信息用于确定该第一AI模型的有效性，包括：In some embodiments, when the first information includes a confidence interval of the second feature, the third information is used to determine the validity of the first AI model, including:

在该第三信息大于或等于第二阈值的情况下，该第一AI模型有效；或，When the third information is greater than or equal to the second threshold, the first AI model is valid; or,

在该第三信息小于第二阈值的情况下，该第一AI模型失效。When the third information is less than the second threshold, the first AI model fails.

可选地，第二阈值由协议约定，或，第二阈值由第一设备确定，或，第二阈值由第二设备配置或指示。Optionally, the second threshold is agreed upon by a protocol, or the second threshold is determined by the first device, or the second threshold is configured or indicated by the second device.

在一些实施例中，在第一信息包括第二特征的值及其概率或置信度的情况下，第三信息用于表征第一特征的N个样本对应的标签与第二特征的值之间的加权距离的均值，其中，加权权重基于第二特征的概率或置信度确定，N为正整数。可选地，N个样本也可以替换为第一AI模型的N次推理过程。In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is used to characterize the average of the weighted distances between the labels corresponding to the N samples of the first feature and the value of the second feature, where the weighted weight is determined based on the probability or confidence of the second feature, and N is a positive integer. Optionally, the N samples can also be replaced by N inference processes of the first AI model.

在一些实施例中，在第一信息包括第二特征的值及其概率或置信度的情况下，该第三信息基于以下公式6确定：
In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is determined based on the following formula 6:

其中，L₄表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的均值，y_i表示该第i个样本对应的标签。 Wherein, L ₄ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, and _yi represents the label corresponding to the i-th sample.

示例性的，在上述公式6中，Σ^-1的对角线元素为概率或置信度的P次幂，其中，P大于或等于0，且P为整数。Exemplarily, in the above formula 6, the diagonal elements of Σ ^-1 are P powers of probability or confidence, where P is greater than or equal to 0 and P is an integer.

在一些实施例中，在第一信息包括第二特征的值及其概率或置信度的情况下，该第三信息用于确定该第一AI模型的有效性，包括：In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is used to determine the validity of the first AI model, including:

在该第三信息小于或等于第三阈值的情况下，该第一AI模型有效；或，When the third information is less than or equal to the third threshold, the first AI model is valid; or,

在该第三信息大于第三阈值的情况下，该第一AI模型失效。When the third information is greater than a third threshold, the first AI model fails.

可选地，第三阈值由协议约定，或，第三阈值由第一设备确定，或，第三阈值由第二设备配置或指示。Optionally, the third threshold is agreed upon by a protocol, or the third threshold is determined by the first device, or the third threshold is configured or indicated by the second device.

需要说明的是，在上述公式中，y_i、可以为一个具体的数值，也可以为一个向量或矩阵，本申请实施例对此并不限定。It should be noted that in the above formula, _yi , It can be a specific numerical value, or a vector or a matrix, which is not limited in the embodiments of the present application.

在一些实施例中，该N个样本为M个时间单元内获取的，其中，M为正整数。In some embodiments, the N samples are acquired within M time units, where M is a positive integer.

可选地，该时间单元可以包括如下至少之一：正交频分复用(Orthogonal frequency-division multiplexing，OFDM)符号、时隙、子帧、帧、微秒、毫秒、秒、分钟、小时、天、星期、月。Optionally, the time unit may include at least one of: orthogonal frequency-division multiplexing (OFDM) symbol, time slot, subframe, frame, microsecond, millisecond, second, minute, hour, day, week, month.

可选地，M由第二设备配置或指示，或，M由协议约定。Optionally, M is configured or indicated by the second device, or M is agreed upon by a protocol.

在一些实施例中，该N个样本所关联的场景标识或数据集标识相同。In some embodiments, the scene identifiers or data set identifiers associated with the N samples are the same.

在一些实施例中，第一设备可以从第二设备获取如下参数中的至少之一：In some embodiments, the first device may obtain at least one of the following parameters from the second device:

用于AI模型性能监督的样本数量N；The number of samples N used for AI model performance supervision;

用于AI模型性能监督的最小样本数量N；The minimum number of samples N used for AI model performance supervision;

获取N个样本的时间范围T；Get the time range T of N samples;

概率密度分布的类型及参数；如高斯混合模型，还需要指出所含的高斯分布的个数；The type and parameters of the probability density distribution; for example, in the case of a Gaussian mixture model, the number of Gaussian distributions involved needs to be indicated;

参数P；Parameter P;

参数M。Parameter M.

在一些实施例中，第一设备向第二设备上报如下参数中的至少之一：In some embodiments, the first device reports at least one of the following parameters to the second device:

获取N个样本的时间范围T；Get the time range T of N samples;

参数P；Parameter P;

参数M。Parameter M.

在一些实施例中，该第三信息可以为正向激励信息或负向激励信息；例如，在负向激励信息大于某一阈值或正向激励信息小于或等于某一阈值，认为第一AI模型失效。In some embodiments, the third information may be positive incentive information or negative incentive information; for example, when the negative incentive information is greater than a certain threshold or the positive incentive information is less than or equal to a certain threshold, the first AI model is considered to be invalid.

示例性的，正向激励信息可以包括但不限于如下至少之一：Exemplarily, the positive incentive information may include but is not limited to at least one of the following:

在第一特征的N个样本中，或的样本数量或比例；Among the N samples of the first feature, or the sample size or proportion;

在第一特征的N个样本中，标签在置信区间之内的样本数量或比例；The number or proportion of samples whose labels are within the confidence interval among the N samples of the first feature;

在第一特征的N个样本中，标签和第二特征的值的加权距离小于或等于t₃的样本数量或比例。The number or proportion of samples for which the weighted distance between the label and the value of the second feature is less than or equal to t ₃ among the N samples of the first feature.

示例性的，负向激励信息可以包括但不限于如下至少之一：Exemplarily, the negative incentive information may include but is not limited to at least one of the following:

在第一特征的N个样本中，标签在置信区间之外的样本数量或比例；The number or proportion of samples whose labels are outside the confidence interval among the N samples of the first feature;

在第一特征的N个样本中，标签和第二特征的值的加权距离大于t₃的样本数量或比例。Among the N samples of the first feature, the number or proportion of samples whose weighted distance between the label and the value of the second feature is greater than t ₃ .

具体的，t₁可以为上述第一阈值，t₂可以为上述第二阈值，t₃可以为上述第三阈值。Specifically, _t1 may be the first threshold, _t2 may be the second threshold, and _t3 may be the third threshold.

因此，在本申请实施例中，可以基于第一AI模型的输出的不确定性或概率分布(即第一信息)及第一AI模型的输入特征对应的标签(即第二信息)确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。Therefore, in the embodiment of the present application, the third information can be determined based on the uncertainty or probability distribution of the output of the first AI model (i.e., the first information) and the label corresponding to the input feature of the first AI model (i.e., the second information), and the The validity of the first AI model is determined based on the third information, thereby achieving performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning results of the first AI model. Since the reasoning results of the first AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding labels can improve the accuracy of the performance supervision of the first AI model.

以下通过一个具体的示例描述基于软信息的AI模型定位的方案，软信息可以提高AI模型的推理结果的鲁棒性，因为推理结果涵盖了多个可能的结果及其概率分布。有利于对AI模型的推理结果的进一步处理和利用，如软信息与其他种类的信息(如其他用于实现定位功能的AI模型输出的信息)结合得到更准确的目标结果，用于一些对推理可靠性要求比较的业务也能够提供较好的安全性。The following describes a solution for AI model positioning based on soft information through a specific example. Soft information can improve the robustness of the reasoning results of the AI model because the reasoning results cover multiple possible results and their probability distribution. It is conducive to further processing and utilization of the reasoning results of the AI model, such as combining soft information with other types of information (such as information output by other AI models used to implement positioning functions) to obtain more accurate target results, and it can also provide better security for some businesses that require relatively high reasoning reliability.

示例1Example 1

第i个AI模型的输入为第i个发送接收点(Transmission Reception Point，TRP)的时域信道脉冲响应(channel impulse response，CIR)，包括多径的时间、功率和相位信息；The input of the ith AI model is the time-domain channel impulse response (CIR) of the ith transmission reception point (TRP), including the time, power, and phase information of the multipath.

第i个AI模型的输出为第i个TRP与终端之间的视距TOA的均值μ_i和标准差σ_i；The output of the ith AI model is the mean μ _i and standard deviation σ _i of the line-of-sight TOA between the ith TRP and the terminal;

第i个TRP与终端之间的视距TOA估计x_i建模为高斯分布：
The line-of-sight TOA estimate _xi between the ith TRP and the terminal is modeled as a Gaussian distribution:

对于N个TRP，其似然函数建模为：For N TRPs, the likelihood function is modeled as:

其中，x＝[x₁,...,x_N]^T； Where x = [x ₁ , ..., x _N ] ^T ;

可以将最大似然估计问题转化成加权最小二乘问题：
The maximum likelihood estimation problem can be transformed into a weighted least squares problem:

其中，μ＝[μ₁,...,μ_N]^T，Σ是个N*N的矩阵，其第i个对角线元素为它的目标是找一个位置由这个位置结合N个TRP的坐标得到的N个TOA估计与AI模型估计的N个TOA均值μ的加权距离最小。Where μ = [μ ₁ , ..., μ _N ] ^T , Σ is an N*N matrix, and its i-th diagonal element is Its goal is to find a location The N TOA estimates obtained by combining this position with the coordinates of the N TRPs are The weighted distance to the N TOA mean μ estimated by the AI model is the smallest.

由于和之间的关系是非线性的，因此可以通过线性逼近、贪婪算法求解。下面以粒子群优化算法为例，给出的具体的实现方案和仿真结果，如图5所示。because and The relationship between is nonlinear, so it can be solved by linear approximation and greedy algorithm. Taking the particle swarm optimization algorithm as an example, the specific implementation scheme and simulation results are given as shown in Figure 5.

其中，不同的优化算法、TRP数量，及相应的定位精度可以如下表1所示。Among them, different optimization algorithms, TRP numbers, and corresponding positioning accuracy can be shown in Table 1 below.

表1
Table 1

此外，该框架可以支持多种类型软信息的混合定位，上例中只给出了N个TRP的TOA的软信息μ,σ，另外还可以包括角度的软信息等。其似然函数可以写作如下：
In addition, the framework can support mixed positioning of multiple types of soft information. In the above example, only the soft information μ, σ of TOA of N TRPs is given. In addition, soft information of angles can also be included. Its likelihood function can be written as follows:

x,α,z是指不同类型的信息，比如分别为TOA，AOD和AOA。x, α, and z refer to different types of information, such as TOA, AOD, and AOA, respectively.

其次，不同类型信息的TRP数量也可以不同，如x的TRP数量为N₁，α的TRP数量为N₂，z的TRP数量为N₃，其似然函数可以写作如下：
Secondly, the number of TRPs for different types of information can also be different. For example, the number of TRPs for x is N ₁ , the number of TRPs for α is N ₂ , and the number of TRPs for z is N ₃ . Their likelihood functions can be written as follows:

上文结合图4至图5，详细描述了本申请的第一设备侧实施例，下文结合图6，详细描述本申请的第二设备侧实施例，应理解，第二设备侧实施例与第一设备侧实施例相互对应，类似的描述可以参照第一设备侧实施例。The above, in combination with Figures 4 to 5, describes in detail the first device side embodiment of the present application. The following, in combination with Figure 6, describes in detail the second device side embodiment of the present application. It should be understood that the second device side embodiment corresponds to the first device side embodiment, and similar descriptions can refer to the first device side embodiment.

图6是根据本申请实施例的模型性能监督方法300的示意性流程图，如图6所示，该模型性能监督方法300可以包括如下内容中的至少部分内容：FIG6 is a schematic flow chart of a model performance supervision method 300 according to an embodiment of the present application. As shown in FIG6 , the model performance supervision method 300 may include at least part of the following contents:

S310，第二设备从第一设备接收第三信息，其中，该第三信息基于第一信息和第二信息确定，该第一信息用于表征第一AI模型的输出的不确定性或概率分布，该第一AI模型的输入为第一特征，该第二信息为该第一特征对应的标签；S310, the second device receives third information from the first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first AI model, the input of the first AI model is the first feature, and the second information is a label corresponding to the first feature;

S320，该第二设备根据该第三信息确定该第一AI模型的有效性。S320: The second device determines the validity of the first AI model based on the third information.

应理解，图6示出了模型性能监督方法300的步骤或操作，但这些步骤或操作仅是示例，本申请实施例还可以执行其他操作或者图6中的各个操作的变形。It should be understood that FIG6 shows the steps or operations of the model performance supervision method 300, but these steps or operations are merely examples, and the embodiments of the present application may also perform other operations or variations of the operations in FIG6.

在一些实施例中，该第一信息包括以下至少之一：In some embodiments, the first information includes at least one of the following:

第二特征的概率密度分布的参数，第二特征的置信区间，第二特征的值，第二特征的值及其概率，第二特征的值及其置信度。Parameters of the probability density distribution of the second feature, a confidence interval of the second feature, a value of the second feature, a value of the second feature and its probability, and a value of the second feature and its confidence level.

在一些实施例中，该概率密度分布的参数包括以下至少之一：概率密度分布的均值或期望，概率密度分布的方差或标准差，概率密度分布的类型指示。In some embodiments, the parameters of the probability density distribution include at least one of the following: a mean or expectation of the probability density distribution, a variance or standard deviation of the probability density distribution, and an indication of the type of the probability density distribution.

在一些实施例中，在该第一信息包括该第二特征的概率密度分布的参数的情况下，该第三信息用于表征该第一特征的N个样本的第一概率的均值，或，该第三信息用于表征该第一特征的N个样本的第一概率的对数的均值；In some embodiments, when the first information includes a parameter of a probability density distribution of the second feature, the third information is used to characterize a mean of the first probabilities of N samples of the first feature, or the third information is used to characterize a mean of the logarithms of the first probabilities of N samples of the first feature;

其中，该第一概率为该N个样本中每一个样本对应的标签在该第二特征的概率密度分布下的概率，N为正整数。The first probability is the probability of the label corresponding to each of the N samples under the probability density distribution of the second feature, and N is a positive integer.

在一些实施例中，在该第三信息用于表征该第一特征的N个样本的第一概率的均值的情况下，该第三信息基于以下公式确定：
In some embodiments, when the third information is used to characterize the mean of the first probabilities of N samples of the first feature, the third information is determined based on the following formula:

其中，L₁表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的标准差或方差，表示该第i个样本对应的第二特征的概率密度分布的均值，y_i表示该第i个样本对应的标签，g(·)表示该第一AI模型的输出所服从的概率密度分布函数。Wherein, _L1 represents the third information, i represents the i-th sample among the N samples, Represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在一些实施例中，在该第三信息用于表征该第一特征的N个样本的第一概率的对数的均值的情况下，该第三信息基于以下公式确定：
In some embodiments, when the third information is used to characterize the mean of the logarithm of the first probability of N samples of the first feature, the third information is determined based on the following formula:

其中，L₂表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的均值，表示该第i个样本对应的第二特征的概率密度分布的标准差或方差，y_i表示该第i个样本对应的标签，s为正整数，g(·)表示该第一AI模型的输出所服从的概率密度分布函数。Wherein, L ₂ represents the third information, i represents the i-th sample among the N samples, Indicates that the i-th sample The corresponding mean of the probability density distribution of the second feature, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, s is a positive integer, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在一些实施例中，在该第一信息包括该第二特征的概率密度分布的参数的情况下，上述S320具体可以包括：In some embodiments, when the first information includes parameters of the probability density distribution of the second feature, the above S320 may specifically include:

在第三信息大于或等于第一阈值的情况下，该第二设备确定该第一AI模型有效；或，When the third information is greater than or equal to the first threshold, the second device determines that the first AI model is valid; or,

在第三信息小于第一阈值的情况下，该第二设备确定该第一AI模型失效。When the third information is less than the first threshold, the second device determines that the first AI model is invalid.

在一些实施例中，在该第一信息包括该第二特征的置信区间的情况下，该第三信息用于表征该第一特征的N个样本中，对应的标签位于该第二特征的置信区间内的样本数与总样本数N的比值，其中，N为正整数。In some embodiments, when the first information includes the confidence interval of the second feature, the third information is used to characterize the ratio of the number of samples whose corresponding labels are within the confidence interval of the second feature among the N samples of the first feature to the total number of samples N, where N is a positive integer.

在一些实施例中，在该第一信息包括该第二特征的置信区间的情况下，该第三信息基于以下公式确定：
In some embodiments, when the first information includes a confidence interval of the second feature, the third information is determined based on the following formula:

在一些实施例中，在该第一信息包括该第二特征的置信区间的情况下，上述S320具体可以包括：In some embodiments, when the first information includes the confidence interval of the second feature, the above S320 may specifically include:

在第三信息大于或等于第二阈值的情况下，该第二设备确定该第一AI模型有效；或，When the third information is greater than or equal to the second threshold, the second device determines that the first AI model is valid; or,

在第三信息小于第二阈值的情况下，该第二设备确定该第一AI模型失效。When the third information is less than the second threshold, the second device determines that the first AI model is invalid.

在一些实施例中，在该第一信息包括该第二特征的值及其概率或置信度的情况下，该第三信息用于表征该第一特征的N个样本对应的标签与该第二特征的值之间的加权距离的均值，其中，加权权重基于该第二特征的概率或置信度确定，N为正整数。In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is used to characterize the mean of the weighted distances between the labels corresponding to N samples of the first feature and the value of the second feature, wherein the weighted weight is determined based on the probability or confidence of the second feature, and N is a positive integer.

在一些实施例中，在该第一信息包括该第二特征的值及其概率或置信度的情况下，该第三信息基于以下公式确定：
In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is determined based on the following formula:

其中，L₄表示该第三信息，i表示该N个样本中的第i个样本，表示该第i个样本对应的第二特征的概率密度分布的均值，y_i表示该第i个样本对应的标签。Wherein, L ₄ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, and _yi represents the label corresponding to the i-th sample.

在一些实施例中，在该第一信息包括该第二特征的值及其概率或置信度的情况下，上述S320具体可以包括：In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the above S320 may specifically include:

在第三信息小于或等于第三阈值的情况下，该第二设备确定该第一AI模型有效；或，When the third information is less than or equal to the third threshold, the second device determines that the first AI model is valid; or,

在第三信息大于第三阈值的情况下，该第二设备确定该第一AI模型失效。When the third information is greater than the third threshold, the second device determines that the first AI model is invalid.

在一些实施例中，在该第二设备从该第一设备接收该第三信息之前，该第二设备向该第一设备发送该第一信息。In some embodiments, before the second device receives the third information from the first device, the second device sends the first information to the first device.

在一些实施例中，该第二设备向该第一设备发送第二指示信息，其中，该第二指示信息用于指示该第一AI模型的有效性。可选地，该第二指示信息包括该第一AI模型的标识或该第一AI模型所关联的功能的标识。In some embodiments, the second device sends second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model. Optionally, the second indication information includes an identifier of the first AI model or an identifier of a function associated with the first AI model.

因此，在本申请实施例中，可以基于第一AI模型的输出的不确定性或概率分布(即第一信息)及第一AI模型的输入特征对应的标签(即第二信息)确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。Therefore, in the embodiment of the present application, the third information can be determined based on the uncertainty or probability distribution of the output of the first AI model (i.e., the first information) and the label corresponding to the input feature of the first AI model (i.e., the second information), and the validity of the first AI model can be determined based on the third information, thereby achieving performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning result of the first AI model, by The reasoning results of the first AI model cover multiple possible results and their probability distribution, which is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain more accurate target results. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding labels can improve the accuracy of the performance supervision of the first AI model.

图7是根据本申请实施例的模型性能监督方法400的示意性流程图，如图7所示，该模型性能监督方法400可以包括如下内容中的至少部分内容：FIG. 7 is a schematic flow chart of a model performance supervision method 400 according to an embodiment of the present application. As shown in FIG. 7 , the model performance supervision method 400 may include at least part of the following contents:

S410，第一设备获取第四信息，其中，该第四信息用于表征第二AI模型的输出的不确定性或概率分布，该第二AI模型的输入为第三特征；S410: The first device obtains fourth information, where the fourth information is used to characterize uncertainty or probability distribution of an output of a second AI model, and an input of the second AI model is a third feature;

S420，该第一设备根据该第四信息确定该第二AI模型的有效性；或，该第一设备向第二设备发送该第四信息。S420, the first device determines the validity of the second AI model according to the fourth information; or, the first device sends the fourth information to the second device.

应理解，图7示出了模型性能监督方法400的步骤或操作，但这些步骤或操作仅是示例，本申请实施例还可以执行其他操作或者图7中的各个操作的变形。It should be understood that FIG7 shows the steps or operations of the model performance supervision method 400, but these steps or operations are merely examples, and the embodiments of the present application may also perform other operations or variations of the operations in FIG7.

在本申请实施例中，可以基于第二AI模型的输出的不确定性或概率分布确定第二AI模型的有效性，从而实现第二AI模型的性能监督。具体的，第二AI模型的输出的不确定性或概率分布可以提高第二AI模型的推理结果的鲁棒性，由于第二AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第二AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第二AI模型的性能监督中参考第二AI模型的输出的不确定性或概率分布，能够判断第二AI模型的有效性，由于可以不需要外部信息的辅助，因而也更容易实现。In an embodiment of the present application, the validity of the second AI model can be determined based on the uncertainty or probability distribution of the output of the second AI model, thereby achieving performance supervision of the second AI model. Specifically, the uncertainty or probability distribution of the output of the second AI model can improve the robustness of the reasoning results of the second AI model. Since the reasoning results of the second AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the second AI model (including relevant information about uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the second AI model, the uncertainty or probability distribution of the output of the second AI model is referred to, and the validity of the second AI model can be judged. Since no external information is required, it is also easier to implement.

示例性的，基于本申请实施例获得的第二AI模型的有效性的判断结果，也可以与其他的模型监督方法得到的第二AI模型的有效性的判断结果结合，从而得到最终的第二AI模型的有效性的结论。Exemplarily, the judgment result of the effectiveness of the second AI model obtained based on the embodiment of the present application can also be combined with the judgment result of the effectiveness of the second AI model obtained by other model supervision methods to obtain the final conclusion on the effectiveness of the second AI model.

需要说明的是，在本申请实施例中，第二AI模型的输出的不确定性越高，第二AI模型的输出的准确性越低。It should be noted that in the embodiment of the present application, the higher the uncertainty of the output of the second AI model, the lower the accuracy of the output of the second AI model.

示例性的，本申请实施例所述的第二AI模型与上述第一AI模型可以相同，也可以不同，本申请对此并不限定。Illustratively, the second AI model described in the embodiment of the present application may be the same as or different from the first AI model described above, and the present application is not limited to this.

在本申请实施例中，第四信息也可以称之为软信息(如概率分布、置信度、置信区间等)，其中，软信息给出了可能结果的概率分布或置信度。可选地，第二AI模型可以是软信息AI模型，也可以不是软信息AI模型。其中，软信息AI模型是指输出为软信息(如概率分布、置信度、置信区间等)的一类AI模型，既包括经典的概率模型、也包括基于神经网络的AI模型；软信息AI模型度量了不同预测结果的可能性，并给出每个可能结果的可能性、概率分布或置信度。In an embodiment of the present application, the fourth information may also be referred to as soft information (such as probability distribution, confidence level, confidence interval, etc.), wherein the soft information gives the probability distribution or confidence level of the possible results. Optionally, the second AI model may be a soft information AI model or may not be a soft information AI model. Among them, the soft information AI model refers to a type of AI model that outputs soft information (such as probability distribution, confidence level, confidence interval, etc.), including both classic probability models and AI models based on neural networks; the soft information AI model measures the possibility of different prediction results and gives the possibility, probability distribution or confidence level of each possible result.

需要说明的是，相比于AI模型推理得到硬信息(hard value)(如视距TOA、RSTD、AoA、AoD、RSRP、LOS指示、NLOS指示等)，AI模型推理得到软信息能够显著提升推理精度和鲁棒性。具体的，软信息能够更好地描述世界的不确定性，能够提升模型推理的鲁棒性，用于一些对推理可靠性要求比较的业务也能够提供较好的安全性。It should be noted that compared with the hard information (hard value) (such as TOA, RSTD, AoA, AoD, RSRP, LOS indication, NLOS indication, etc.) obtained by AI model reasoning, the soft information obtained by AI model reasoning can significantly improve the reasoning accuracy and robustness. Specifically, soft information can better describe the uncertainty of the world, improve the robustness of model reasoning, and provide better security for some businesses that require relatively high reasoning reliability.

在一些实施例中，第三特征的维度为D3维，对应的，第四信息为第四特征的D4组软信息，其中，D4组软信息描述的是第二AI模型输出的范围，如只有一组软信息(即D4＝1)，那这组软信息描述的是整个第二AI模型输出的不确定性；如有至少两组软信息(即D4≥2)，那每组软信息描述的是第二AI模型输出的部分特征的不确定性。In some embodiments, the dimension of the third feature is D3, and correspondingly, the fourth information is D4 group of soft information of the fourth feature, wherein D4 group of soft information describes the range of the output of the second AI model. If there is only one group of soft information (ie, D4=1), then this group of soft information describes the uncertainty of the output of the entire second AI model; if there are at least two groups of soft information (ie, D4≥2), then each group of soft information describes the uncertainty of some features of the output of the second AI model.

例如，对于预测任务：每一个预测时刻对应一组软信息，或者每一个预测时刻的每一个特征对应一组软信息；对于其他任务，每一个特征对应一组软信息，或所有特征对应一组软信息。For example, for prediction tasks: each prediction moment corresponds to a set of soft information, or each feature at each prediction moment corresponds to a set of soft information; for other tasks, each feature corresponds to a set of soft information, or all features correspond to a set of soft information.

例如，每一个特征对应一组软信息，第二AI模型的输出是2维，对应2组软信息。 For example, each feature corresponds to a set of soft information, and the output of the second AI model is 2-dimensional, corresponding to 2 sets of soft information.

例如，对于定位任务，第二AI模型输出是2维的水平位置坐标，2维的水平位置坐标对应一组软信息，或，2维的水平位置坐标中的每一维的水平位置坐标对应一组软信息。For example, for the positioning task, the second AI model output is a 2-dimensional horizontal position coordinate, the 2-dimensional horizontal position coordinate corresponds to a set of soft information, or each dimension of the 2-dimensional horizontal position coordinate corresponds to a set of soft information.

在一些实施例中，本申请中所述的AI模型也可称为AI单元、AI模型/AI单元、ML模型、ML单元、AI结构、AI功能、AI特性、神经网络、神经网络函数、神经网络功能等，或者，本申请中所述的AI模型也可以是指能够实现与AI相关的特定的算法、公式、处理流程、能力等的处理单元，或者，本申请中所述的AI模型可以是针对特定数据集的处理方法、算法、功能、模块或单元，或者，本申请中所述的AI模型可以是运行在GPU、NPU、TPU、ASIC等AI/ML相关硬件上的处理方法、算法、功能、模块或单元，本申请对此不做具体限定。可选地，所述特定数据集包括AI模型的输入或输出。In some embodiments, the AI model described in this application may also be referred to as an AI unit, an AI model/AI unit, an ML model, an ML unit, an AI structure, an AI function, an AI feature, a neural network, a neural network function, a neural network function, etc., or the AI model described in this application may also refer to a processing unit capable of implementing specific algorithms, formulas, processing procedures, capabilities, etc. related to AI, or the AI model described in this application may be a processing method, algorithm, function, module or unit for a specific data set, or the AI model described in this application may be a processing method, algorithm, function, module or unit running on AI/ML related hardware such as a GPU, NPU, TPU, ASIC, etc., which is not specifically limited in this application. Optionally, the specific data set includes the input or output of the AI model.

在一些实施例中，第二AI模型可以为处于激活态的模型，或，第二AI模型可以为处于非激活态的模型。具体的，在第二AI模型为处于非激活态的模型的情况下，在确定第二AI模型的有效性之后，进行AI模型选取时可以从至少两个AI模型中优先选取有效的AI模型，从而有益于AI模型的选取。In some embodiments, the second AI model may be an activated model, or the second AI model may be an inactivated model. Specifically, when the second AI model is an inactivated model, after determining the validity of the second AI model, when performing AI model selection, a valid AI model may be preferentially selected from at least two AI models, thereby facilitating the selection of an AI model.

在一些实施例中，该第四信息包括但不限于以下至少之一：第四特征的概率密度分布的参数，第四特征的置信区间，第四特征的值，第四特征的值及其概率，第四特征的值及其置信度。本申请实施例明确了第四信息所包含的内容，有益于实现AI模型的性能监督。In some embodiments, the fourth information includes but is not limited to at least one of the following: a parameter of the probability density distribution of the fourth feature, a confidence interval of the fourth feature, a value of the fourth feature, a value of the fourth feature and its probability, and a value of the fourth feature and its confidence. The embodiment of the present application clarifies the content of the fourth information, which is beneficial to realizing the performance supervision of the AI model.

示例性的，该第四信息所包含的信息的数量可以是一个或至少两个，也即，该第四信息可以包括但不限于以下至少之一：一个或至少两个第四特征的概率密度分布的参数，一个或至少两个第四特征的置信区间，一个或至少两个第四特征的值，一个或至少两个第四特征的值及其概率，一个或至少两个第四特征的值及其置信度。Exemplarily, the amount of information contained in the fourth information may be one or at least two, that is, the fourth information may include but is not limited to at least one of the following: parameters of the probability density distribution of one or at least two fourth features, confidence intervals of one or at least two fourth features, values of one or at least two fourth features, values of one or at least two fourth features and their probabilities, and values of one or at least two fourth features and their confidence levels.

在一些实施例中，该第四信息为该第二AI模型的输出信息，或者，该第四信息为基于该第二AI模型的输出信息确定的信息。In some embodiments, the fourth information is output information of the second AI model, or the fourth information is based on Information determined by the output information of the second AI model.

示例性的，第二AI模型的输出信息为如下至少之一：第四特征的概率密度分布的参数，第四特征的置信区间，第四特征的值，第四特征的值及其概率，第四特征的值及其置信度。即第四信息为第二AI模型的输出信息。Exemplarily, the output information of the second AI model is at least one of the following: a parameter of the probability density distribution of the fourth feature, a confidence interval of the fourth feature, a value of the fourth feature, a value of the fourth feature and its probability, and a value of the fourth feature and its confidence. That is, the fourth information is the output information of the second AI model.

示例性的，第二AI模型的输出信息为第四特征的值，可以基于第四特征的值确定第四信息，如基于第四特征的值及其他信息(如运动状态信息(速度、加速度等)，信号质量的度量信息(如RSRP、SINR、RSRQ等))确定第四信息。例如，信号质量越差，则基于第四特征的值及其他信息得到的第四特征的不确定性越高；又例如，运动速度越快，则基于第四特征的值及其他信息得到的第四特征的不确定性越高。Exemplarily, the output information of the second AI model is the value of the fourth feature, and the fourth information can be determined based on the value of the fourth feature, such as based on the value of the fourth feature and other information (such as motion state information (speed, acceleration, etc.), signal quality measurement information (such as RSRP, SINR, RSRQ, etc.)). For example, the worse the signal quality, the higher the uncertainty of the fourth feature obtained based on the value of the fourth feature and other information; for another example, the faster the movement speed, the higher the uncertainty of the fourth feature obtained based on the value of the fourth feature and other information.

例如，第二AI模型的输出信息为第四特征的值，可以基于第四特征的值确定如下至少之一：第四特征的概率密度分布的参数，第四特征的置信区间，第四特征的值的概率，第四特征的值的置信度。即第四信息为基于第二AI模型的输出信息确定的信息。For example, if the output information of the second AI model is the value of the fourth feature, at least one of the following can be determined based on the value of the fourth feature: a parameter of the probability density distribution of the fourth feature, a confidence interval of the fourth feature, a probability of the value of the fourth feature, and a confidence level of the value of the fourth feature. That is, the fourth information is information determined based on the output information of the second AI model.

在一些实施例中，该第一AI模型可以用于实现如下功能中的一项：定位，波束管理，CSI预测，移动性管理，CSI压缩。当然，该第一AI模型也可以用于实现其他功能，本申请对此并不限定。In some embodiments, the first AI model can be used to implement one of the following functions: positioning, beam management, CSI prediction, mobility management, CSI compression. Of course, the first AI model can also be used to implement other functions, which is not limited in this application.

具体的，本实施例所述的模型性能监督方法400可以用于实现至少两种不同的功能，也即，本实施例所述的模型性能监督方法400可以适用于公共的基于Ground truth label的AI模型监督框架，从而可以避免针对不同的功能，分别设计性能监督方案。Specifically, the model performance supervision method 400 described in this embodiment can be used to realize at least two different functions. That is, the model performance supervision method 400 described in this embodiment can be applicable to a public AI model supervision framework based on Ground truth label, thereby avoiding the need to design performance supervision solutions for different functions.

在一些实施例中，在第二AI模型用于实现定位功能的情况下，第三特征可以包括但不限于以下至少之一：时域信道脉冲响应，RSRP(如层1RSRP或层3RSRP)，频域信道脉冲响应，接收信号的时域波形。可选地，时域信道脉冲响应包括以下至少之一：时间信息，功率信息，相位信息。可选地，频域信道脉冲响应包括以下至少之一：频率信息(子载波序号及间隔)，功率信息，相位信息。In some embodiments, when the second AI model is used to implement the positioning function, the third feature may include but is not limited to at least one of the following: time domain channel impulse response, RSRP (such as layer 1 RSRP or layer 3 RSRP), frequency domain channel impulse response, time domain waveform of the received signal. Optionally, the time domain channel impulse response includes at least one of the following: time information, power information, phase information. Optionally, the frequency domain channel impulse response includes at least one of the following: frequency information (subcarrier sequence number and interval), power information, phase information.

在一些实施例中，在第二AI模型用于实现定位功能的情况下，第四特征可以包括但不限于以下至少之一：视距TOA、RSTD、AoA、AoD、RSRP、LOS指示、NLOS指示。In some embodiments, when the second AI model is used to implement the positioning function, the fourth feature may include but is not limited to at least one of the following: line-of-sight TOA, RSTD, AoA, AoD, RSRP, LOS indication, NLOS indication.

具体例如，在第二AI模型用于实现定位功能的情况下，第二AI模型的输入(即第三特征)为时域信道脉冲响应，第二AI模型的输出为中间特征量(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，中间特征量包括如下至少之一：视距TOA、RSTD、AoA、AoD、RSRP、LOS指示、NLOS指示等，基于中间特征量的软信息可以进一步确定位置坐标；第二AI模型的输出也可以为位置坐标的软信息。For example, when the second AI model is used to realize the positioning function, the input of the second AI model (i.e., the third feature) is the time domain channel impulse response, and the output of the second AI model is the soft information of the intermediate feature quantity (i.e., the fourth feature) (such as parameters of probability density distribution, confidence interval, confidence level, etc.), and the intermediate feature quantity includes at least one of the following: line of sight TOA, RSTD, AoA, AoD, RSRP, LOS indication, NLOS indication, etc. The position coordinates can be further determined based on the soft information of the intermediate feature quantity; the output of the second AI model may also be the soft information of the position coordinates.

在一些实施例中，在第二AI模型用于实现波束管理功能的情况下，第三特征可以包括历史T1个时刻的波束信息，如序号、角度、L1-RSRP等，第四特征包括未来T2个时刻的波束信息。In some embodiments, when the second AI model is used to implement the beam management function, the third feature may include beam information at T1 historical moments, such as sequence number, angle, L1-RSRP, etc., and the fourth feature includes beam information at T2 future moments.

具体例如，在第二AI模型用于实现波束管理功能的情况下，第二AI模型的输入(即第三特征)为历史T1个时刻的波束信息，如序号、角度、L1-RSRP等，第二AI模型的输出为未来T2个时刻的波束信息(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如每一个时刻的垂直方向的波束和水平方向波束分别对应一组软信息，如L1-RSRP的置信区间、概率密度分布等，如每一个时刻的波束信息对应一组软信息。For example, when the second AI model is used to implement the beam management function, the input of the second AI model (i.e., the third feature) is the beam information at T1 historical moments, such as serial number, angle, L1-RSRP, etc., and the output of the second AI model is the soft information (such as parameters of probability density distribution, confidence interval, confidence level, etc.) of the beam information (i.e., the fourth feature) at T2 future moments. For example, the vertical beam and the horizontal beam at each moment correspond to a set of soft information, such as the confidence interval and probability density distribution of L1-RSRP, etc.; the beam information at each moment corresponds to a set of soft information.

在一些实施例中，在第二AI模型用于实现CSI预测功能的情况下，第三特征可以包括历史T1个时刻的CSI，第四特征可以包括未来T2个时刻的CSI。In some embodiments, when the second AI model is used to implement the CSI prediction function, the third feature may include the CSI at T1 historical moments, and the fourth feature may include the CSI at T2 future moments.

具体例如，在第二AI模型用于实现CSI预测功能的情况下，第二AI模型的输入(即第三特征)为历史T1个时刻的CSI，第二AI模型的输出为未来T2个时刻的CSI(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如每一个时刻的CSI对应一组软信息，如每一个时刻的CSI的每一个维度对应一组软信息，或每一个时刻的CSI对应一组软信息。 For example, when the second AI model is used to implement the CSI prediction function, the input of the second AI model (i.e., the third feature) is the CSI at T1 historical moments, and the output of the second AI model is the soft information (such as parameters of the probability density distribution, confidence interval, confidence level, etc.) of the CSI at T2 moments in the future (i.e., the fourth feature), such as the CSI at each moment corresponds to a set of soft information, such as each dimension of the CSI at each moment corresponds to a set of soft information, or the CSI at each moment corresponds to a set of soft information.

在一些实施例中，在第二AI模型用于实现移动性管理功能的情况下，第三特征可以包括历史T1个时刻的层1RSRP(L1-RSRP)或层3RSRP(L3-RSRP)，第四特征可以包括未来T2个时刻的RSRP或未来T2个时刻的是否发生小区切换的决策。需要说明的是，L3-RSRP为L1-RSRP经过滤波得到的。In some embodiments, when the second AI model is used to implement the mobility management function, the third feature may include the layer 1 RSRP (L1-RSRP) or layer 3 RSRP (L3-RSRP) at the historical T1 moment, and the fourth feature may include the RSRP at the future T2 moments or the decision of whether a cell switching occurs at the future T2 moments. It should be noted that L3-RSRP is obtained by filtering L1-RSRP.

具体例如，在第二AI模型用于实现移动性管理功能的情况下，第二AI模型的输入(即第三特征)为历史T1个时刻的L1-RSRP或L3-RSRP，第二AI模型的输出为未来T2个时刻的RSRP(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)；或第二AI模型的输出为未来T2个时刻的是否发生小区切换的决策(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)，如1表示切换，0表示不切换，软信息为0～1之间的值。For example, when the second AI model is used to implement the mobility management function, the input of the second AI model (i.e., the third feature) is the L1-RSRP or L3-RSRP at T1 historical moments, and the output of the second AI model is the soft information of the RSRP (i.e., the fourth feature) at T2 moments in the future (such as parameters of the probability density distribution, confidence interval, confidence level, etc.); or the output of the second AI model is the soft information of the decision on whether a cell switching will occur (i.e., the fourth feature) at T2 moments in the future (such as parameters of the probability density distribution, confidence interval, confidence level, etc.), such as 1 for switching and 0 for no switching, and the soft information is a value between 0 and 1.

在一些实施例中，在第二AI模型用于实现CSI压缩功能的情况下，第三特征可以包括未压缩的CSI，第四特征可以包括压缩后的CSI。In some embodiments, when the second AI model is used to implement the CSI compression function, the third feature may include uncompressed CSI, and the fourth feature may include compressed CSI.

具体例如，在第二AI模型用于实现CSI压缩功能的情况下，第二AI模型的输入(即第三特征)为未压缩的CSI，第二AI模型的输出为压缩后的CSI(即第四特征)的软信息(如概率密度分布的参数、置信区间、置信度等)。For example, when the second AI model is used to implement the CSI compression function, the input of the second AI model (ie, the third feature) is the uncompressed CSI, and the output of the second AI model is the soft information of the compressed CSI (ie, the fourth feature) (such as parameters of the probability density distribution, confidence interval, confidence level, etc.).

在一些实施例中，在上述S410中，第一设备获取第四信息，包括以下其中一项：In some embodiments, in the above S410, the first device obtains the fourth information, including one of the following:

该第一设备从第二设备获取该第四信息；The first device obtains the fourth information from the second device;

该第一设备通过该第二AI模型的输出信息，获取该第四信息；The first device obtains the fourth information through the output information of the second AI model;

该第一设备从该第二设备接收该第二AI模型的输出信息，根据该第二AI模型的输出信息，获得该第四信息。The first device receives the output information of the second AI model from the second device, and obtains the fourth information according to the output information of the second AI model.

在一些实施例中，在第一设备通过第二AI模型的输出信息，获取第四信息的情况下，第二AI模型可以部署在第一设备侧。In some embodiments, when the first device obtains the fourth information through the output information of the second AI model, the second AI model can be deployed on the first device side.

在一些实施例中，在第一设备从其他设备获取第四信息的情况下，第二AI模型可以部署在其他设备侧。其中，该其他设备可以是第二设备，也可以是除第一设备和第二设备之外的设备。In some embodiments, when the first device obtains the fourth information from another device, the second AI model can be deployed on the other device side. The other device can be the second device or a device other than the first device and the second device.

在一些实施例中，在第一设备从第二设备接收第二AI模型的输出信息，根据第二AI模型的输出信息，获得第四信息的情况下，第二AI模型可以部署在第二设备侧。In some embodiments, when the first device receives output information of the second AI model from the second device and obtains fourth information based on the output information of the second AI model, the second AI model can be deployed on the second device side.

在一些实施例中，在第一设备根据第四信息确定第二AI模型的有效性的情况下，第一设备向第二设备发送第三指示信息，其中，第三指示信息用于指示第二AI模型的有效性。例如，第三指示信息占用1比特；其中，取值0表示第二AI模型有效，取值1表示第二AI模型无效；或，取值1表示第二AI模型有效，取值0表示第二AI模型无效。In some embodiments, when the first device determines the validity of the second AI model according to the fourth information, the first device sends third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model. For example, the third indication information occupies 1 bit; wherein a value of 0 indicates that the second AI model is valid, and a value of 1 indicates that the second AI model is invalid; or, a value of 1 indicates that the second AI model is valid, and a value of 0 indicates that the second AI model is invalid.

可选地，该第三指示信息包括该第二AI模型的标识或该第二AI模型所关联的功能的标识。Optionally, the third indication information includes an identifier of the second AI model or an identifier of a function associated with the second AI model.

在一些实施例中，在第一设备向第二设备发送第四信息的情况下，第一设备从第二设备接收第四指示信息，其中，第四指示信息用于指示第二AI模型的有效性。具体的，第二设备在接收到第四信息之后，可以根据第四信息确定第二AI模型的有效性。例如，第四指示信息占用1比特；其中，取值0表示第二AI模型有效，取值1表示第二AI模型无效；或，取值1表示第二AI模型有效，取值0表示第二AI模型无效。In some embodiments, when the first device sends the fourth information to the second device, the first device receives the fourth indication information from the second device, wherein the fourth indication information is used to indicate the validity of the second AI model. Specifically, after receiving the fourth information, the second device can determine the validity of the second AI model according to the fourth information. For example, the fourth indication information occupies 1 bit; wherein a value of 0 indicates that the second AI model is valid, and a value of 1 indicates that the second AI model is invalid; or, a value of 1 indicates that the second AI model is valid, and a value of 0 indicates that the second AI model is invalid.

可选地，该第四指示信息包括该第二AI模型的标识或该第二AI模型所关联的功能的标识。Optionally, the fourth indication information includes an identifier of the second AI model or an identifier of a function associated with the second AI model.

示例性的，在第二AI模型部署在第一设备侧的情况下，第一设备通过第二AI模型的输出信息获取第四信息，接着，第一设备根据第四信息确定第二AI模型的有效性，最后，第一设备向第二设备发送第三指示信息，其中，第三指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the first device side, the first device obtains the fourth information through the output information of the second AI model. Then, the first device determines the validity of the second AI model based on the fourth information. Finally, the first device sends third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model.

示例性的，在第二AI模型部署在第一设备侧的情况下，第一设备通过第二AI模型的输出信息获取第四信息，接着，第一设备向第二设备发送第四信息，之后，第二设备根据第四信息确定第二AI模型的有效性，最后，第二设备向第一设备发送第四指示信息，其中，第四指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the first device side, the first device obtains the fourth information through the output information of the second AI model, then the first device sends the fourth information to the second device, and then the second device receives the fourth information based on the output information of the second AI model. The validity of the second AI model is determined according to the fourth information. Finally, the second device sends fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

示例性的，在第二AI模型部署在第二设备侧的情况下，第二设备通过第二AI模型的输出信息得到第四信息，以及第二设备向第一设备发送第四信息，接着，第一设备根据第四信息确定第二AI模型的有效性，最后，第一设备向第二设备发送第三指示信息，其中，第三指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the second device side, the second device obtains the fourth information through the output information of the second AI model, and the second device sends the fourth information to the first device. Then, the first device determines the validity of the second AI model based on the fourth information. Finally, the first device sends third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model.

示例性的，在第二AI模型部署在第三设备侧的情况下，第三设备通过第二AI模型的输出信息得到第四信息，以及第三设备向第一设备发送第四信息，接着，第一设备向第二设备发送第四信息，之后，第二设备根据第四信息确定第二AI模型的有效性，最后，第二设备向第一设备发送第四指示信息，其中，第四指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the third device side, the third device obtains the fourth information through the output information of the second AI model, and the third device sends the fourth information to the first device, then the first device sends the fourth information to the second device, and then the second device determines the validity of the second AI model based on the fourth information, and finally, the second device sends fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

示例性的，在第二AI模型部署在第二设备侧的情况下，第一设备从第二设备接收第二AI模型的输出信息，以及第一设备根据第二AI模型的输出信息，获得第四信息，接着，第一设备根据第四信息确定第二AI模型的有效性，最后，第一设备向第二设备发送第三指示信息，其中，第三指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the second device side, the first device receives the output information of the second AI model from the second device, and the first device obtains fourth information based on the output information of the second AI model. Then, the first device determines the validity of the second AI model based on the fourth information. Finally, the first device sends third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model.

示例性的，在第二AI模型部署在第二设备侧的情况下，第一设备从第二设备接收第二AI模型的输出信息，以及第一设备根据第二AI模型的输出信息，获得第四信息，接着，第一设备向第二设备发送第四信息，之后，第二设备根据第四信息确定第二AI模型的有效性，最后，第二设备向第一设备发送第四指示信息，其中，第四指示信息用于指示第二AI模型的有效性。Exemplarily, when the second AI model is deployed on the second device side, the first device receives the output information of the second AI model from the second device, and the first device obtains fourth information based on the output information of the second AI model. Then, the first device sends the fourth information to the second device. Thereafter, the second device determines the validity of the second AI model based on the fourth information. Finally, the second device sends fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

在一些实施例中，该第一设备可以是终端、网络侧设备或第三方服务器。In some embodiments, the first device may be a terminal, a network-side device, or a third-party server.

在一些实施例中，该第二设备可以是终端、网络侧设备或第三方服务器。In some embodiments, the second device may be a terminal, a network-side device, or a third-party server.

在一些实施例中，在该第四信息包括该第四特征的概率密度分布的参数的情况下，该第一设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes a parameter of a probability density distribution of the fourth feature, the first device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的概率密度分布的方差或标准差大于或等于第一阈值的情况下，该第一设备确定该第二AI模型失效；或，When the variance or standard deviation of the probability density distribution of the fourth feature is greater than or equal to the first threshold, the first device determines that the second AI model fails; or,

在该第三特征的N个样本对应的该第四特征的概率密度分布的方差或标准差的均值大于或等于第二阈值的情况下，该第一设备确定该第二AI模型失效；或，When the mean of the variance or standard deviation of the probability density distribution of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the second threshold, the first device determines that the second AI model is invalid; or,

在该第三特征的N个样本中，对应的该第四特征的概率密度分布的方差或标准差大于或等于第一阈值的样本数占总样本数N的比例大于或等于第三阈值的情况下，该第一设备确定该第二AI模型失效；When, among the N samples of the third feature, the proportion of the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold to the total number of samples N is greater than or equal to the third threshold, the first device determines that the second AI model has failed;

其中，N为正整数。Wherein, N is a positive integer.

可选地，N个样本也可以替换为第二AI模型的N次推理过程。Optionally, the N samples may also be replaced by N reasoning processes of the second AI model.

在一些实施例中，第一阈值由协议约定，或，第一阈值由第一设备确定，或，第一阈值由第二设备配置或指示。In some embodiments, the first threshold is agreed upon by a protocol, or the first threshold is determined by the first device, or the first threshold is configured or indicated by the second device.

在一些实施例中，第二阈值由协议约定，或，第二阈值由第一设备确定，或，第二阈值由第二设备配置或指示。In some embodiments, the second threshold is agreed upon by a protocol, or the second threshold is determined by the first device, or the second threshold is configured or indicated by the second device.

在一些实施例中，第三阈值由协议约定，或，第三阈值由第一设备确定，或，第三阈值由第二设备配置或指示。In some embodiments, the third threshold is agreed upon by a protocol, or the third threshold is determined by the first device, or the third threshold is configured or indicated by the second device.

在一些实施例中，在该第四信息包括该第四特征的置信区间的情况下，该第一设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes a confidence interval of the fourth feature, the first device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的置信区间的宽度大于或等于第四阈值的情况下，该第一设备确定该第二AI模型失效；或，When the width of the confidence interval of the fourth feature is greater than or equal to a fourth threshold, the first device determines that the second AI model is invalid; or,

在该第三特征的N个样本对应的该第四特征的置信区间的平均宽度大于或等于第五阈值的情况下，该第一设备确定该第二AI模型失效；或，When the average width of the confidence interval of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the fifth threshold, the first device determines that the second AI model is invalid; or,

在该第三特征的N个样本中，对应的该第四特征的置信区间的宽度大于或等于第四阈值的样本数占总样本数N的比例大于或等于第六阈值的情况下，该第一设备确定该第二AI模型失效；Among the N samples of the third feature, the width of the corresponding confidence interval of the fourth feature is greater than or equal to the fourth When the ratio of the number of samples of the threshold to the total number of samples N is greater than or equal to a sixth threshold, the first device determines that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，第四阈值由协议约定，或，第四阈值由第一设备确定，或，第四阈值由第二设备配置或指示。In some embodiments, the fourth threshold is agreed upon by a protocol, or the fourth threshold is determined by the first device, or the fourth threshold is configured or indicated by the second device.

在一些实施例中，第五阈值由协议约定，或，第五阈值由第一设备确定，或，第五阈值由第二设备配置或指示。In some embodiments, the fifth threshold is agreed upon by a protocol, or the fifth threshold is determined by the first device, or the fifth threshold is configured or indicated by the second device.

在一些实施例中，第六阈值由协议约定，或，第六阈值由第一设备确定，或，第六阈值由第二设备配置或指示。In some embodiments, the sixth threshold is agreed upon by a protocol, or the sixth threshold is determined by the first device, or the sixth threshold is configured or indicated by the second device.

在一些实施例中，在该第四信息包括该第四特征的值及其概率或置信度的情况下，该第一设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes the value of the fourth feature and its probability or confidence, the first device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的概率或置信度小于或等于第七阈值的情况下，该第一设备确定该第二AI模型失效；或，When the probability or confidence of the fourth feature is less than or equal to the seventh threshold, the first device determines that the second AI model fails; or,

在该第三特征的N个样本对应的该第四特征的概率或置信度的均值小于或等于第八阈值的情况下，该第一设备确定该第二AI模型失效；或，When the average of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to the eighth threshold, the first device determines that the second AI model fails; or,

在该第三特征的N个样本中，对应的该第四特征的概率或置信度小于或等于第七阈值的样本数占总样本数N的比例大于或等于第九阈值的情况下，该第一设备确定该第二AI模型失效；When, among the N samples of the third feature, the number of samples whose probability or confidence of the corresponding fourth feature is less than or equal to the seventh threshold accounts for a proportion of the total number of samples N that is greater than or equal to the ninth threshold, the first device determines that the second AI model has failed;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，第七阈值由协议约定，或，第七阈值由第一设备确定，或，第七阈值由第二设备配置或指示。In some embodiments, the seventh threshold is agreed upon by a protocol, or the seventh threshold is determined by the first device, or the seventh threshold is configured or indicated by the second device.

在一些实施例中，第八阈值由协议约定，或，第八阈值由第一设备确定，或，第八阈值由第二设备配置或指示。In some embodiments, the eighth threshold is agreed upon by a protocol, or the eighth threshold is determined by the first device, or the eighth threshold is configured or indicated by the second device.

在一些实施例中，第九阈值由协议约定，或，第九阈值由第一设备确定，或，第九阈值由第二设备配置或指示。In some embodiments, the ninth threshold is agreed upon by a protocol, or the ninth threshold is determined by the first device, or the ninth threshold is configured or indicated by the second device.

因此，在本申请实施例中，可以基于第二AI模型的输出的不确定性或概率分布确定第二AI模型的有效性，从而实现第二AI模型的性能监督。具体的，第二AI模型的输出的不确定性或概率分布可以提高第二AI模型的推理结果的鲁棒性，由于第二AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第二AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第二AI模型的性能监督中参考第二AI模型的输出的不确定性或概率分布，能够判断第二AI模型的有效性，由于可以不需要外部信息的辅助，因而也更容易实现。Therefore, in an embodiment of the present application, the validity of the second AI model can be determined based on the uncertainty or probability distribution of the output of the second AI model, thereby achieving performance supervision of the second AI model. Specifically, the uncertainty or probability distribution of the output of the second AI model can improve the robustness of the reasoning results of the second AI model. Since the reasoning results of the second AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the second AI model (including relevant information about uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the second AI model, the uncertainty or probability distribution of the output of the second AI model is referred to, and the validity of the second AI model can be judged. Since no external information is required, it is also easier to implement.

上文结合图7，详细描述了本申请的第一设备侧实施例，下文结合图8，详细描述本申请的第二设备侧实施例，应理解，第二设备侧实施例与第一设备侧实施例相互对应，类似的描述可以参照第一设备侧实施例。The above, in conjunction with Figure 7, describes in detail the first device side embodiment of the present application. The following, in conjunction with Figure 8, describes in detail the second device side embodiment of the present application. It should be understood that the second device side embodiment corresponds to the first device side embodiment, and similar descriptions can refer to the first device side embodiment.

图8是根据本申请实施例的模型性能监督方法500的示意性流程图，如图8所示，该模型性能监督方法500可以包括如下内容中的至少部分内容：FIG8 is a schematic flow chart of a model performance supervision method 500 according to an embodiment of the present application. As shown in FIG8 , the model performance supervision method 500 may include at least part of the following contents:

S510，第二设备从第一设备接收第四信息，其中，该第四信息用于表征第二AI模型的输出的不确定性或概率分布，该第二AI模型的输入为第三特征；S510: The second device receives fourth information from the first device, wherein the fourth information is used to characterize uncertainty or probability distribution of an output of a second AI model, and an input of the second AI model is a third feature;

S520，该第二设备根据该第四信息确定该第二AI模型的有效性。S520: The second device determines the validity of the second AI model according to the fourth information.

应理解，图8示出了模型性能监督方法500的步骤或操作，但这些步骤或操作仅是示例，本申请实施例还可以执行其他操作或者图8中的各个操作的变形。It should be understood that FIG8 shows the steps or operations of the model performance supervision method 500, but these steps or operations are merely examples, and the embodiments of the present application may also perform other operations or variations of the operations in FIG8.

在一些实施例中，该第四信息包括以下至少之一：第四特征的概率密度分布的参数，第四特征的置信区间，第四特征的值，第四特征的值及其概率，第四特征的值及其置信度。 In some embodiments, the fourth information includes at least one of the following: a parameter of a probability density distribution of the fourth feature, a confidence interval of the fourth feature, a value of the fourth feature, a value of the fourth feature and its probability, and a value of the fourth feature and its confidence.

在一些实施例中，在该第四信息包括该第四特征的概率密度分布的参数的情况下，该第二设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes a parameter of a probability density distribution of the fourth feature, the second device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的概率密度分布的方差或标准差大于或等于第一阈值的情况下，该第二设备确定该第二AI模型失效；或，When the variance or standard deviation of the probability density distribution of the fourth feature is greater than or equal to the first threshold, the second device determines that the second AI model fails; or,

在该第三特征的N个样本对应的该第四特征的概率密度分布的方差或标准差的均值大于或等于第二阈值的情况下，该第二设备确定该第二AI模型失效；或，When the mean of the variance or standard deviation of the probability density distribution of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the second threshold, the second device determines that the second AI model is invalid; or,

在该第三特征的N个样本中，对应的该第四特征的概率密度分布的方差或标准差大于或等于第一阈值的样本数占总样本数N的比例大于或等于第三阈值的情况下，该第二设备确定该第二AI模型失效；When, among the N samples of the third feature, the proportion of the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold to the total number of samples N is greater than or equal to the third threshold, the second device determines that the second AI model has failed;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在该第四信息包括该第四特征的置信区间的情况下，该第二设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes a confidence interval of the fourth feature, the second device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的置信区间的宽度大于或等于第四阈值的情况下，该第二设备确定该第二AI模型失效；或，When the width of the confidence interval of the fourth feature is greater than or equal to the fourth threshold, the second device determines that the second AI model is invalid; or,

在该第三特征的N个样本对应的该第四特征的置信区间的平均宽度大于或等于第五阈值的情况下，该第二设备确定该第二AI模型失效；或，When the average width of the confidence interval of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the fifth threshold, the second device determines that the second AI model is invalid; or,

在该第三特征的N个样本中，对应的该第四特征的置信区间的宽度大于或等于第四阈值的样本数占总样本数N的比例大于或等于第六阈值的情况下，该第二设备确定该第二AI模型失效；When, among the N samples of the third feature, the number of samples whose width of the confidence interval of the corresponding fourth feature is greater than or equal to the fourth threshold accounts for a proportion of the total number of samples N that is greater than or equal to a sixth threshold, the second device determines that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在该第四信息包括该第四特征的值及其概率或置信度的情况下，该第二设备根据该第四信息确定该第二AI模型的有效性，包括：In some embodiments, when the fourth information includes the value of the fourth feature and its probability or confidence, the second device determines the validity of the second AI model according to the fourth information, including:

在该第四特征的概率或置信度小于或等于第七阈值的情况下，该第二设备确定该第二AI模型失效；或，When the probability or confidence of the fourth feature is less than or equal to the seventh threshold, the second device determines that the second AI model fails; or,

在该第三特征的N个样本对应的该第四特征的概率或置信度的均值小于或等于第八阈值的情况下，该第二设备确定该第二AI模型失效；或，When the average of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to the eighth threshold, the second device determines that the second AI model fails; or,

在该第三特征的N个样本中，对应的该第四特征的概率或置信度小于或等于第七阈值的样本数占总样本数N的比例大于或等于第九阈值的情况下，该第二设备确定该第二AI模型失效；When, among the N samples of the third feature, the proportion of the number of samples whose probability or confidence of the corresponding fourth feature is less than or equal to the seventh threshold to the total number of samples N is greater than or equal to the ninth threshold, the second device determines that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，该第二设备向该第一设备发送第四指示信息，其中，该第四指示信息用于指示该第二AI模型的有效性。In some embodiments, the second device sends fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

本申请实施例提供的模型性能监督方法，执行主体可以为模型性能监督装置，或模型性能监督装置中用于执行模型性能监督方法的处理单元。本申请实施例中以模型性能监督装置执行模型性能监督方法为例，说明本申请实施例提供的模型性能监督装置。The model performance supervision method provided in the embodiment of the present application can be executed by a model performance supervision device, or a processing unit in the model performance supervision device for executing the model performance supervision method. Taking the execution of a model performance supervision method by a supervision device as an example, the model performance supervision device provided in an embodiment of the present application is explained.

图9示出了根据本申请实施例的模型性能监督装置600的示意性框图。如图9所示，所述模型性能监督装置600包括：Fig. 9 shows a schematic block diagram of a model performance monitoring device 600 according to an embodiment of the present application. As shown in Fig. 9, the model performance monitoring device 600 includes:

获取单元610，用于获取第一信息，其中，所述第一信息用于表征第一人工智能AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征；An acquisition unit 610 is used to acquire first information, wherein the first information is used to characterize the uncertainty or probability distribution of the output of a first artificial intelligence AI model, and the input of the first AI model is a first feature;

处理单元620，用于根据所述第一信息和第二信息确定第三信息，其中，所述第二信息为所述第一特征对应的标签，所述第三信息用于确定所述第一AI模型的有效性。Processing unit 620 is used to determine third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

在一些实施例中，所述获取单元610获取第一信息，包括以下其中一项：In some embodiments, the acquiring unit 610 acquires the first information, including one of the following:

从第二设备接收所述第一信息；receiving the first information from a second device;

通过所述第一AI模型的输出信息，获取所述第一信息；Obtaining the first information through output information of the first AI model;

从第二设备接收所述第一AI模型的输出信息，根据所述第一AI模型的输出信息，获得所述第一信息。Receive output information of the first AI model from a second device, and obtain the first information according to the output information of the first AI model.

在一些实施例中，所述模型性能监督装置600还包括：收发单元630；In some embodiments, the model performance monitoring device 600 further includes: a transceiver unit 630;

所述处理单元620还用于根据所述第三信息确定所述第一AI模型的有效性；或，The processing unit 620 is further configured to determine the validity of the first AI model according to the third information; or,

所述收发单元630用于向第二设备发送所述第三信息。The transceiver unit 630 is used to send the third information to the second device.

在一些实施例中，在所述模型性能监督装置600根据所述第三信息确定所述第一AI模型的有效性的情况下，所述收发单元630还用于向所述第二设备发送第一指示信息，其中，所述第一指示信息用于指示所述第一AI模型的有效性；或，In some embodiments, when the model performance monitoring device 600 determines the validity of the first AI model according to the third information, the transceiver unit 630 is further used to send first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model; or,

在所述模型性能监督装置600向所述第二设备发送所述第三信息的情况下，所述收发单元630还用于从所述第二设备接收第二指示信息，其中，所述第二指示信息用于指示所述第一AI模型的有效性。When the model performance monitoring device 600 sends the third information to the second device, the transceiver unit 630 is also used to receive second indication information from the second device, wherein the second indication information is used to indicate the validity of the first AI model.

在一些实施例中，所述第一信息包括以下至少之一：In some embodiments, the first information includes at least one of the following:

在一些实施例中，所述概率密度分布的参数包括以下至少之一：概率密度分布的均值或期望，概率密度分布的方差或标准差，概率密度分布的类型指示。In some embodiments, the parameters of the probability density distribution include at least one of the following: a mean or expectation of the probability density distribution, a variance or standard deviation of the probability density distribution, and an indication of the type of the probability density distribution.

在一些实施例中，在所述第一信息包括所述第二特征的概率密度分布的参数的情况下，所述第三信息用于表征所述第一特征的N个样本的第一概率的均值，或，所述第三信息用于表征所述第一特征的N个样本的第一概率的对数的均值；In some embodiments, when the first information includes parameters of the probability density distribution of the second feature, the third information is used to characterize the mean of the first probabilities of N samples of the first feature, or the third information is used to characterize the mean of the logarithms of the first probabilities of N samples of the first feature;

其中，所述第一概率为所述N个样本中每一个样本对应的标签在所述第二特征的概率密度分布下的概率，N为正整数。The first probability is the probability of the label corresponding to each sample in the N samples under the probability density distribution of the second feature, and N is a positive integer.

在一些实施例中，在所述第三信息用于表征所述第一特征的N个样本的第一概率的均值的情况下，所述第三信息基于以下公式确定：
In some embodiments, when the third information is used to characterize the mean of the first probabilities of N samples of the first feature, the third information is determined based on the following formula:

其中，L₁表示所述第三信息，i表示所述N个样本中的第i个样本，表示所述第i个样本对应的第二特征的概率密度分布的标准差或方差，表示所述第i个样本对应的第二特征的概率密度分布的均值，y_i表示所述第i个样本对应的标签，g(·)表示所述第一AI模型的输出所服从的概率密度分布函数。Wherein, _L1 represents the third information, i represents the i-th sample among the N samples, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, represents the mean of the probability density distribution of the second feature corresponding to the ith sample, _yi represents the label corresponding to the ith sample, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在一些实施例中，在所述第三信息用于表征所述第一特征的N个样本的第一概率的对数的均值的情况下，所述第三信息基于以下公式确定：
In some embodiments, when the third information is used to characterize the mean of the logarithm of the first probability of N samples of the first feature, the third information is determined based on the following formula:

其中，L₂表示所述第三信息，i表示所述N个样本中的第i个样本，表示所述第i 个样本对应的第二特征的概率密度分布的均值，表示所述第i个样本对应的第二特征的概率密度分布的标准差或方差，y_i表示所述第i个样本对应的标签，s为正整数，g(·)表示所述第一AI模型的输出所服从的概率密度分布函数。Wherein, L ₂ represents the third information, i represents the i-th sample among the N samples, Indicates the i-th The mean of the probability density distribution of the second feature corresponding to samples, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, s is a positive integer, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在一些实施例中，所述第三信息用于确定所述第一AI模型的有效性，包括：In some embodiments, the third information is used to determine the validity of the first AI model, including:

在所述第三信息大于或等于第一阈值的情况下，所述第一AI模型有效；或，When the third information is greater than or equal to the first threshold, the first AI model is valid; or,

在所述第三信息小于第一阈值的情况下，所述第一AI模型失效。When the third information is less than the first threshold, the first AI model fails.

在一些实施例中，在所述第一信息包括所述第二特征的置信区间的情况下，所述第三信息用于表征所述第一特征的N个样本中，对应的标签位于所述第二特征的置信区间内的样本数与总样本数N的比值，其中，N为正整数。In some embodiments, when the first information includes the confidence interval of the second feature, the third information is used to characterize the ratio of the number of samples whose corresponding labels are within the confidence interval of the second feature among the N samples of the first feature to the total number of samples N, where N is a positive integer.

在一些实施例中，所述第三信息基于以下公式确定：
In some embodiments, the third information is determined based on the following formula:

其中，L₃表示所述第三信息，i表示所述N个样本中的第i个样本，x_i表示所述第i个样本对应的输入信息，y_i表示所述第i个样本对应的标签，当y_i位于所述第二特征的置信区间f(x_i)之内时，否则 Wherein, L ₃ represents the third information, i represents the i-th sample among the N samples, x _i represents the input information corresponding to the i-th sample, y _i represents the label corresponding to the i-th sample, and when y _i is within the confidence interval f(x _i ) of the second feature, otherwise

在所述第三信息大于或等于第二阈值的情况下，所述第一AI模型有效；或，When the third information is greater than or equal to the second threshold, the first AI model is valid; or,

在所述第三信息小于第二阈值的情况下，所述第一AI模型失效。When the third information is less than the second threshold, the first AI model fails.

在一些实施例中，在所述第一信息包括所述第二特征的值及其概率或置信度的情况下，所述第三信息用于表征所述第一特征的N个样本对应的标签与所述第二特征的值之间的加权距离的均值，其中，加权权重基于所述第二特征的概率或置信度确定，N为正整数。In some embodiments, when the first information includes the value of the second feature and its probability or confidence, the third information is used to characterize the mean of the weighted distances between the labels corresponding to N samples of the first feature and the value of the second feature, wherein the weighted weight is determined based on the probability or confidence of the second feature, and N is a positive integer.

其中，L₄表示所述第三信息，i表示所述N个样本中的第i个样本，表示所述第i个样本对应的第二特征的概率密度分布的均值，y_i表示所述第i个样本对应的标签。Wherein, L ₄ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, and _yi represents the label corresponding to the i-th sample.

在所述第三信息小于或等于第三阈值的情况下，所述第一AI模型有效；或，When the third information is less than or equal to a third threshold, the first AI model is valid; or,

在所述第三信息大于第三阈值的情况下，所述第一AI模型失效。When the third information is greater than a third threshold, the first AI model fails.

在一些实施例中，上述收发单元可以是通信接口或收发器，或者是通信芯片或者片上系统的输入输出接口。In some embodiments, the transceiver unit may be a communication interface or a transceiver, or an input/output interface of a communication chip or a system on chip.

应理解，根据本申请实施例的模型性能监督装置600可对应于本申请方法实施例中的第一设备，并且模型性能监督装置600中的各个单元的上述和其它操作或功能分别为了实现图4所示的方法200中第一设备的相应流程，为了简洁，在此不再赘述。It should be understood that the model performance monitoring device 600 according to the embodiment of the present application may correspond to the first device in the method embodiment of the present application, and the above-mentioned and other operations or functions of each unit in the model performance monitoring device 600 are respectively for realizing the corresponding process of the first device in the method 200 shown in Figure 4. For the sake of brevity, they will not be repeated here.

因此，在本申请实施例中，可以基于第一AI模型的输出的不确定性或概率分布及第一AI模型的输入特征对应的标签确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。 Therefore, in an embodiment of the present application, the third information can be determined based on the uncertainty or probability distribution of the output of the first AI model and the label corresponding to the input feature of the first AI model, and the validity of the first AI model can be determined based on the third information, so as to achieve performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning results of the first AI model. Since the reasoning results of the first AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding label can improve the accuracy of the performance supervision of the first AI model.

图10示出了根据本申请实施例的模型性能监督装置700的示意性框图。如图10所示，所述模型性能监督装置700包括：Fig. 10 shows a schematic block diagram of a model performance monitoring device 700 according to an embodiment of the present application. As shown in Fig. 10, the model performance monitoring device 700 includes:

收发单元710，用于从第一设备接收第三信息，其中，所述第三信息基于第一信息和第二信息确定，所述第一信息用于表征第一人工智能AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征，所述第二信息为所述第一特征对应的标签；The transceiver unit 710 is configured to receive third information from the first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first artificial intelligence AI model, the input of the first AI model is the first feature, and the second information is the label corresponding to the first feature;

处理单元720，用于根据所述第三信息确定所述第一AI模型的有效性。The processing unit 720 is used to determine the validity of the first AI model according to the third information.

在一些实施例中，所述处理单元720具体用于：In some embodiments, the processing unit 720 is specifically configured to:

在所述第三信息小于或等于第三阈值的情况下，确定所述第一AI模型有效；或，When the third information is less than or equal to a third threshold, determining that the first AI model is valid; or,

在所述第三信息大于第三阈值的情况下，确定所述第一AI模型失效。When the third information is greater than a third threshold, it is determined that the first AI model is invalid.

在一些实施例中，在所述模型性能监督装置700从所述第一设备接收所述第三信息之前，所述收发单元710还用于向所述第一设备发送所述第一信息。In some embodiments, before the model performance monitoring apparatus 700 receives the third information from the first device, the transceiver unit 710 is further configured to send the first information to the first device.

在一些实施例中，所述收发单元710还用于向所述第一设备发送第二指示信息，其中，所述第二指示信息用于指示所述第一AI模型的有效性。In some embodiments, the transceiver unit 710 is further used to send second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model.

其中，L₂表示所述第三信息，i表示所述N个样本中的第i个样本，表示所述第i个样本对应的第二特征的概率密度分布的均值，表示所述第i个样本对应的第二特征的概率密度分布的标准差或方差，y_i表示所述第i个样本对应的标签，s为正整数，g(·)表示所述第一AI模型的输出所服从的概率密度分布函数。Wherein, L ₂ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, s is a positive integer, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

在所述第三信息大于或等于第一阈值的情况下，确定所述第一AI模型有效；或，When the third information is greater than or equal to the first threshold, determining that the first AI model is valid; or,

在所述第三信息小于第一阈值的情况下，确定所述第一AI模型失效。When the third information is less than the first threshold, it is determined that the first AI model is invalid.

在一些实施例中，在所述第一信息包括所述第二特征的置信区间的情况下，所述第三信息用于表征所述第一特征的N个样本中，对应的标签位于所述第二特征的置信区间内的样本数与总样本数N的比值，其中，N为正整数。In some embodiments, when the first information includes a confidence interval of the second feature, the first The third information is used to characterize the ratio of the number of samples whose corresponding labels are within the confidence interval of the second feature among the N samples of the first feature to the total number of samples N, where N is a positive integer.

在所述第三信息大于或等于第二阈值的情况下，确定所述第一AI模型有效；或，When the third information is greater than or equal to the second threshold, determining that the first AI model is valid; or,

在所述第三信息小于第二阈值的情况下，确定所述第一AI模型失效。When the third information is less than a second threshold, it is determined that the first AI model is invalid.

应理解，根据本申请实施例的模型性能监督装置700可对应于本申请方法实施例中的第二设备，并且模型性能监督装置700中的各个单元的上述和其它操作或功能分别为了实现图6所示的方法300中第二设备的相应流程，为了简洁，在此不再赘述。It should be understood that the model performance monitoring device 700 according to the embodiment of the present application may correspond to the second device in the method embodiment of the present application, and the above-mentioned and other operations or functions of each unit in the model performance monitoring device 700 are respectively for implementing the corresponding processes of the second device in the method 300 shown in Figure 6. For the sake of brevity, they will not be repeated here.

因此，在本申请实施例中，可以基于第一AI模型的输出的不确定性或概率分布及第一AI模型的输入特征对应的标签确定第三信息，以及基于第三信息确定第一AI模型的有效性，从而实现第一AI模型的性能监督。具体的，第一AI模型的输出的不确定性或概率分布可以提高第一AI模型的推理结果的鲁棒性，由于第一AI模型的推理结果涵盖了多个可能的结果及其概率分布，有利于对推理结果的进一步处理和利用，如第一AI模型的推理结果(包含不确定性或概率分布的相关信息)与其他种类的信息结合得到更准确的目标结果，在第一AI模型的性能监督中参考第一AI模型的输出的不确定性或概率分布及对应的标签，能够提升第一AI模型的性能监督的准确性。Therefore, in an embodiment of the present application, the third information can be determined based on the uncertainty or probability distribution of the output of the first AI model and the label corresponding to the input feature of the first AI model, and the validity of the first AI model can be determined based on the third information, so as to achieve performance supervision of the first AI model. Specifically, the uncertainty or probability distribution of the output of the first AI model can improve the robustness of the reasoning results of the first AI model. Since the reasoning results of the first AI model cover multiple possible results and their probability distributions, it is conducive to further processing and utilization of the reasoning results. For example, the reasoning results of the first AI model (including relevant information on uncertainty or probability distribution) are combined with other types of information to obtain a more accurate target result. In the performance supervision of the first AI model, referring to the uncertainty or probability distribution of the output of the first AI model and the corresponding label can improve the accuracy of the performance supervision of the first AI model.

图11示出了根据本申请实施例的模型性能监督装置800的示意性框图。如图11所示，所述模型性能监督装置800包括：Fig. 11 shows a schematic block diagram of a model performance monitoring device 800 according to an embodiment of the present application. As shown in Fig. 11, the model performance monitoring device 800 includes:

获取单元810，用于获取第四信息，其中，所述第四信息用于表征第二人工智能AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；An acquisition unit 810 is used to acquire fourth information, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence AI model, and the input of the second AI model is the third feature;

处理单元820，用于根据所述第四信息确定所述第二AI模型的有效性；或，收发单元830，用于向第二设备发送所述第四信息。The processing unit 820 is used to determine the validity of the second AI model according to the fourth information; or the transceiver unit 830 is used to send the fourth information to the second device.

在一些实施例中，所述获取单元810获取第四信息，包括以下其中一项：In some embodiments, the acquiring unit 810 acquires the fourth information, including one of the following:

从其他设备获取所述第四信息；Acquire the fourth information from other devices;

通过所述第二AI模型的输出信息，获取所述第四信息；Obtaining the fourth information through the output information of the second AI model;

从所述第二设备接收所述第二AI模型的输出信息，根据所述第二AI模型的输出信息，获得所述第四信息。receiving output information of the second AI model from the second device, and information, and obtain the fourth information.

在一些实施例中，在所述模型性能监督装置800根据所述第四信息确定所述第二AI模型的有效性的情况下，所述收发单元830还用于向所述第二设备发送第三指示信息，其中，所述第三指示信息用于指示所述第二AI模型的有效性；或，In some embodiments, when the model performance monitoring device 800 determines the validity of the second AI model according to the fourth information, the transceiver unit 830 is further used to send third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model; or,

在所述模型性能监督装置800向所述第二设备发送所述第四信息的情况下，所述收发单元830还用于从所述第二设备接收第四指示信息，其中，所述第四指示信息用于指示所述第二AI模型的有效性。When the model performance monitoring device 800 sends the fourth information to the second device, the transceiver unit 830 is also used to receive fourth indication information from the second device, wherein the fourth indication information is used to indicate the validity of the second AI model.

在一些实施例中，所述第四信息包括以下至少之一：第四特征的概率密度分布的参数，第四特征的置信区间，第四特征的值，第四特征的值及其概率，第四特征的值及其置信度。In some embodiments, the fourth information includes at least one of the following: a parameter of a probability density distribution of the fourth feature, a confidence interval of the fourth feature, a value of the fourth feature, a value of the fourth feature and its probability, and a value of the fourth feature and its confidence.

在一些实施例中，在所述第四信息包括所述第四特征的概率密度分布的参数的情况下，所述处理单元820具体用于：In some embodiments, when the fourth information includes parameters of the probability density distribution of the fourth feature, the processing unit 820 is specifically configured to:

在所述第四特征的概率密度分布的方差或标准差大于或等于第一阈值的情况下，确定所述第二AI模型失效；或，When the variance or standard deviation of the probability density distribution of the fourth feature is greater than or equal to the first threshold, determining that the second AI model is invalid; or,

在所述第三特征的N个样本对应的所述第四特征的概率密度分布的方差或标准差的均值大于或等于第二阈值的情况下，确定所述第二AI模型失效；或，When the mean of the variance or standard deviation of the probability density distribution of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the second threshold, determining that the second AI model is invalid; or,

在所述第三特征的N个样本中，对应的所述第四特征的概率密度分布的方差或标准差大于或等于第一阈值的样本数占总样本数N的比例大于或等于第三阈值的情况下，确定所述第二AI模型失效；When, among the N samples of the third feature, the proportion of the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold to the total number of samples N is greater than or equal to the third threshold, it is determined that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在所述第四信息包括所述第四特征的置信区间的情况下，所述处理单元820具体用于：In some embodiments, when the fourth information includes a confidence interval of the fourth feature, the processing unit 820 is specifically configured to:

在所述第四特征的置信区间的宽度大于或等于第四阈值的情况下，确定所述第二AI模型失效；或，When the width of the confidence interval of the fourth feature is greater than or equal to a fourth threshold, determining that the second AI model is invalid; or,

在所述第三特征的N个样本对应的所述第四特征的置信区间的平均宽度大于或等于第五阈值的情况下，确定所述第二AI模型失效；或，When the average width of the confidence interval of the fourth feature corresponding to the N samples of the third feature is greater than or equal to a fifth threshold, determining that the second AI model is invalid; or,

在所述第三特征的N个样本中，对应的所述第四特征的置信区间的宽度大于或等于第四阈值的样本数占总样本数N的比例大于或等于第六阈值的情况下，确定所述第二AI模型失效；When, among the N samples of the third feature, the number of samples whose width of the confidence interval of the corresponding fourth feature is greater than or equal to the fourth threshold accounts for a proportion of the total number of samples N that is greater than or equal to the sixth threshold, it is determined that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在所述第四信息包括所述第四特征的值及其概率或置信度的情况下，所述处理单元820具体用于：In some embodiments, when the fourth information includes the value of the fourth feature and its probability or confidence, the processing unit 820 is specifically configured to:

在所述第四特征的概率或置信度小于或等于第七阈值的情况下，确定所述第二AI模型失效；或，When the probability or confidence of the fourth feature is less than or equal to a seventh threshold, determining that the second AI model fails; or,

在所述第三特征的N个样本对应的所述第四特征的概率或置信度的均值小于或等于第八阈值的情况下，确定所述第二AI模型失效；或，When the average of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to an eighth threshold, determining that the second AI model is invalid; or,

在所述第三特征的N个样本中，对应的所述第四特征的概率或置信度小于或等于第七阈值的样本数占总样本数N的比例大于或等于第九阈值的情况下，确定所述第二AI模型失效；When, among the N samples of the third feature, the number of samples whose probability or confidence of the corresponding fourth feature is less than or equal to the seventh threshold accounts for a proportion of the total number of samples N that is greater than or equal to the ninth threshold, it is determined that the second AI model has failed;

其中，N为正整数。Wherein, N is a positive integer.

应理解，根据本申请实施例的模型性能监督装置800可对应于本申请方法实施例中的第一设备，并且模型性能监督装置800中的各个单元的上述和其它操作或功能分别为了实现图7所示的方法400中第一设备的相应流程，为了简洁，在此不再赘述。It should be understood that the model performance monitoring device 800 according to the embodiment of the present application may correspond to the first device in the method embodiment of the present application, and the above and other operations or functions of each unit in the model performance monitoring device 800 are respectively In order to implement the corresponding process of the first device in the method 400 shown in FIG. 7 , it will not be repeated here for the sake of brevity.

图12示出了根据本申请实施例的模型性能监督装置900的示意性框图。如图12所示，所述模型性能监督装置900包括：Fig. 12 shows a schematic block diagram of a model performance monitoring device 900 according to an embodiment of the present application. As shown in Fig. 12, the model performance monitoring device 900 includes:

收发单元910，用于从第一设备接收第四信息，其中，所述第四信息用于表征第二人工智能AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；The transceiver unit 910 is configured to receive fourth information from the first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence AI model, and the input of the second AI model is the third feature;

处理单元920，用于根据所述第四信息确定所述第二AI模型的有效性。The processing unit 920 is used to determine the validity of the second AI model according to the fourth information.

在一些实施例中，所述收发单元910还用于向所述第一设备发送第四指示信息，其中，所述第四指示信息用于指示所述第二AI模型的有效性。In some embodiments, the transceiver unit 910 is further used to send fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

在一些实施例中，在所述第四信息包括所述第四特征的概率密度分布的参数的情况下，所述处理单元920具体用于：In some embodiments, when the fourth information includes parameters of the probability density distribution of the fourth feature, the processing unit 920 is specifically configured to:

在所述第三特征的N个样本中，对应的所述第四特征的概率密度分布的方差或标准差大于或等于第一阈值的样本数占总样本数N的比例大于或等于第三阈值的情况下，确定所述第二AI模型失效；When, among the N samples of the third feature, the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold accounts for a proportion of the total number of samples N that is greater than or equal to the third threshold, it is determined that the second AI model is invalid;

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在所述第四信息包括所述第四特征的置信区间的情况下，所述处理单元920具体用于：In some embodiments, when the fourth information includes a confidence interval of the fourth feature, the processing unit 920 is specifically configured to:

其中，N为正整数。Wherein, N is a positive integer.

在一些实施例中，在所述第四信息包括所述第四特征的值及其概率或置信度的情况下，所述处理单元920具体用于：In some embodiments, when the fourth information includes the value of the fourth feature and its probability or confidence, the processing unit 920 is specifically configured to:

在所述第三特征的N个样本对应的所述第四特征的概率或置信度的均值小于或等于第八阈值的情况下，确定所述第二AI模型失效；或，The mean of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to In the case of the eighth threshold, it is determined that the second AI model fails; or,

其中，N为正整数。Wherein, N is a positive integer.

应理解，根据本申请实施例的模型性能监督装置900可对应于本申请方法实施例中的第二设备，并且模型性能监督装置900中的各个单元的上述和其它操作或功能分别为了实现图8所示的方法500中第二设备的相应流程，为了简洁，在此不再赘述。It should be understood that the model performance monitoring device 900 according to the embodiment of the present application may correspond to the second device in the method embodiment of the present application, and the above-mentioned and other operations or functions of each unit in the model performance monitoring device 900 are respectively for implementing the corresponding processes of the second device in the method 500 shown in Figure 8. For the sake of brevity, they will not be repeated here.

本申请实施例中的模型性能监督装置可以是电子设备，例如具有操作系统的电子设备，也可以是电子设备中的部件，例如集成电路或芯片。该电子设备可以是终端或网络侧设备，也可以为除终端或网络侧设备之外的其他设备。示例性的，终端可以包括但不限于上述所列举的终端11的类型，网络侧设备可以包括但不限于上述所列举的网络侧设备12的类型，其他设备可以为服务器、网络附属存储器(Network Attached Storage，NAS)等，本申请实施例不作具体限定。The model performance monitoring device in the embodiment of the present application can be an electronic device, such as an electronic device with an operating system, or a component in an electronic device, such as an integrated circuit or a chip. The electronic device can be a terminal or a network-side device, or can be a device other than a terminal or a network-side device. Exemplarily, the terminal can include but is not limited to the types of terminals 11 listed above, the network-side device can include but is not limited to the types of network-side devices 12 listed above, and other devices can be servers, network attached storage (NAS), etc., which are not specifically limited in the embodiment of the present application.

本申请实施例提供的模型性能监督装置能够实现图4至图8的方法实施例实现的各个过程，并达到相同的技术效果，为避免重复，这里不再赘述。The model performance monitoring device provided in the embodiment of the present application can implement the various processes implemented by the method embodiments of Figures 4 to 8 and achieve the same technical effect. To avoid repetition, it will not be repeated here.

如图13所示，本申请实施例还提供一种通信设备1000，包括处理器1001和存储器1002，存储器1002上存储有可在所述处理器1001上运行的程序或指令，例如，该通信设备1000为第一设备时，该程序或指令被处理器1001执行时实现上述模型性能监督方法200或模型性能监督方法400实施例的各个步骤，且能达到相同的技术效果。该通信设备1000为第二设备时，该程序或指令被处理器1001执行时实现上述模型性能监督方法300或模型性能监督方法500实施例的各个步骤，且能达到相同的技术效果，为避免重复，这里不再赘述。As shown in FIG13, the embodiment of the present application further provides a communication device 1000, including a processor 1001 and a memory 1002, and the memory 1002 stores a program or instruction that can be run on the processor 1001. For example, when the communication device 1000 is a first device, the program or instruction is executed by the processor 1001 to implement the various steps of the above-mentioned model performance supervision method 200 or model performance supervision method 400 embodiment, and can achieve the same technical effect. When the communication device 1000 is a second device, the program or instruction is executed by the processor 1001 to implement the various steps of the above-mentioned model performance supervision method 300 or model performance supervision method 500 embodiment, and can achieve the same technical effect. To avoid repetition, it will not be repeated here.

本申请实施例还提供一种终端，包括处理器和通信接口，所述通信接口和所述处理器耦合，所述处理器用于运行程序或指令，实现如图4至图8所示方法实施例中第一设备或第二设备执行的步骤。该终端实施例与上述第一设备或第二设备侧方法实施例对应，上述方法实施例的各个实施过程和实现方式均可适用于该终端实施例中，且能达到相同的技术效果。具体地，图14为实现本申请实施例的一种终端的硬件结构示意图。The embodiment of the present application also provides a terminal, including a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run a program or instruction to implement the steps performed by the first device or the second device in the method embodiment shown in Figures 4 to 8. This terminal embodiment corresponds to the above-mentioned first device or second device side method embodiment, and each implementation process and implementation method of the above-mentioned method embodiment can be applied to the terminal embodiment and can achieve the same technical effect. Specifically, Figure 14 is a schematic diagram of the hardware structure of a terminal implementing an embodiment of the present application.

该终端1100包括但不限于：射频单元1101、网络模块1102、音频输出单元1103、输入单元1104、传感器1105、显示单元1106、用户输入单元1107、接口单元1108、存储器1109以及处理器1110等中的至少部分部件。The terminal 1100 includes but is not limited to: a radio frequency unit 1101, a network module 1102, an audio output unit 1103, an input unit 1104, a sensor 1105, a display unit 1106, a user input unit 1107, an interface unit 1108, a memory 1109 and at least some of the components of a processor 1110.

本领域技术人员可以理解，终端1100还可以包括给各个部件供电的电源(比如电池)，电源可以通过电源管理系统与处理器1110逻辑相连，从而通过电源管理系统实现管理充电、放电以及功耗管理等功能。图14中示出的终端结构并不构成对终端的限定，终端可以包括比图示更多或更少的部件，或者组合某些部件，或者不同的部件布置，在此不再赘述。 Those skilled in the art will appreciate that the terminal 1100 may also include a power source (such as a battery) for supplying power to each component, and the power source may be logically connected to the processor 1110 through a power management system, so as to implement functions such as managing charging, discharging, and power consumption management through the power management system. The terminal structure shown in FIG14 does not constitute a limitation on the terminal, and the terminal may include more or fewer components than shown in the figure, or combine certain components, or arrange components differently, which will not be described in detail here.

应理解的是，本申请实施例中，输入单元1104可以包括图形处理单元(Graphics Processing Unit，GPU)11041和麦克风11042，图形处理器11041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。显示单元1106可包括显示面板11061，可以采用液晶显示器、有机发光二极管等形式来配置显示面板11061。用户输入单元1107包括触控面板11071以及其他输入设备11072中的至少一种。触控面板11071，也称为触摸屏。触控面板11071可包括触摸检测装置和触摸控制器两个部分。其他输入设备11072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆，在此不再赘述。It should be understood that in the embodiment of the present application, the input unit 1104 may include a graphics processing unit (GPU) 11041 and a microphone 11042, and the graphics processor 11041 processes the image data of the static picture or video obtained by the image capture device (such as a camera) in the video capture mode or the image capture mode. The display unit 1106 may include a display panel 11061, and the display panel 11061 may be configured in the form of a liquid crystal display, an organic light emitting diode, etc. The user input unit 1107 includes a touch panel 11071 and at least one of other input devices 11072. The touch panel 11071 is also called a touch screen. The touch panel 11071 may include two parts: a touch detection device and a touch controller. Other input devices 11072 may include, but are not limited to, a physical keyboard, function keys (such as a volume control key, a switch key, etc.), a trackball, a mouse, and a joystick, which will not be repeated here.

本申请实施例中，射频单元1101接收来自网络侧设备的下行数据后，可以传输给处理器1110进行处理；另外，射频单元1101可以向网络侧设备发送上行数据。通常，射频单元1101包括但不限于天线、放大器、收发信机、耦合器、低噪声放大器、双工器等。In the embodiment of the present application, after receiving downlink data from the network side device, the RF unit 1101 can transmit the data to the processor 1110 for processing; in addition, the RF unit 1101 can send uplink data to the network side device. Generally, the RF unit 1101 includes but is not limited to an antenna, an amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, etc.

存储器1109可用于存储软件程序或指令以及各种数据。存储器1109可主要包括存储程序或指令的第一存储区和存储数据的第二存储区，其中，第一存储区可存储操作系统、至少一个功能所需的应用程序或指令(比如声音播放功能、图像播放功能等)等。此外，存储器1109可以包括易失性存储器或非易失性存储器。其中，非易失性存储器可以是只读存储器(Read-Only Memory，ROM)、可编程只读存储器(Programmable ROM，PROM)、可擦除可编程只读存储器(Erasable PROM，EPROM)、电可擦除可编程只读存储器(Electrically EPROM，EEPROM)或闪存。易失性存储器可以是随机存取存储器(Random Access Memory，RAM)，静态随机存取存储器(Static RAM，SRAM)、动态随机存取存储器(Dynamic RAM，DRAM)、同步动态随机存取存储器(Synchronous DRAM，SDRAM)、双倍数据速率同步动态随机存取存储器(Double Data Rate SDRAM，DDRSDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM，ESDRAM)、同步连接动态随机存取存储器(Synch link DRAM，SLDRAM)和直接内存总线随机存取存储器(Direct Rambus RAM，DRRAM)。本申请实施例中的存储器1109包括但不限于这些和任意其它适合类型的存储器。The memory 1109 can be used to store software programs or instructions and various data. The memory 1109 may mainly include a first storage area for storing programs or instructions and a second storage area for storing data, wherein the first storage area may store an operating system, an application program or instruction required for at least one function (such as a sound playback function, an image playback function, etc.), etc. In addition, the memory 1109 may include a volatile memory or a non-volatile memory. Among them, the non-volatile memory may be a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or a flash memory. The volatile memory may be a random access memory (RAM), a static random access memory (SRAM), a dynamic random access memory (DRAM), a synchronous dynamic random access memory (SDRAM), a double data rate synchronous dynamic random access memory (DDRSDRAM), an enhanced synchronous dynamic random access memory (ESDRAM), a synchronous link dynamic random access memory (SLDRAM) and a direct memory bus random access memory (DRRAM). The memory 1109 in the embodiment of the present application includes but is not limited to these and any other suitable types of memory.

处理器1110可包括至少一个处理单元；可选的，处理器1110集成应用处理器和调制解调处理器，其中，应用处理器主要处理涉及操作系统、用户界面和应用程序等的操作，调制解调处理器主要处理无线通信信号，如基带处理器。可以理解的是，上述调制解调处理器也可以不集成到处理器1110中。The processor 1110 may include at least one processing unit; optionally, the processor 1110 integrates an application processor and a modem processor, wherein the application processor mainly processes operations related to an operating system, a user interface, and application programs, and the modem processor mainly processes wireless communication signals, such as a baseband processor. It is understandable that the modem processor may not be integrated into the processor 1110.

示例性的，射频单元1101，用于获取第一信息，其中，所述第一信息用于表征第一AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征；处理器1110用于根据所述第一信息和第二信息确定第三信息，其中，所述第二信息为所述第一特征对应的标签，所述第三信息用于确定所述第一AI模型的有效性。Exemplarily, the RF unit 1101 is used to obtain first information, wherein the first information is used to characterize the uncertainty or probability distribution of the output of a first AI model, and the input of the first AI model is a first feature; the processor 1110 is used to determine third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

示例性的，射频单元1101，用于从第一设备接收第三信息，其中，所述第三信息基于第一信息和第二信息确定，所述第一信息用于表征第一人工智能AI模型的输出的不确定性或概率分布，所述第一AI模型的输入为第一特征，所述第二信息为所述第一特征对应的标签；处理器1110用于根据所述第三信息确定所述第一AI模型的有效性。Exemplarily, the radio frequency unit 1101 is used to receive third information from a first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first artificial intelligence AI model, the input of the first AI model is a first feature, and the second information is a label corresponding to the first feature; the processor 1110 is used to determine the validity of the first AI model based on the third information.

示例性的，射频单元1101，用于获取第四信息，其中，所述第四信息用于表征第二人工智能AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；处理器1110用于根据所述第四信息确定所述第一AI模型的有效性；或，射频单元1101，用于向第二设备发送所述第四信息。Exemplarily, the RF unit 1101 is used to obtain fourth information, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence AI model, and the input of the second AI model is the third feature; the processor 1110 is used to determine the validity of the first AI model based on the fourth information; or, the RF unit 1101 is used to send the fourth information to the second device.

示例性的，射频单元1101，用于从第一设备接收第四信息，其中，所述第四信息用于表征第二人工智能AI模型的输出的不确定性或概率分布，所述第二AI模型的输入为第三特征；处理器1110用于根据所述第四信息确定所述第一AI模型的有效性。Exemplarily, the radio frequency unit 1101 is used to receive fourth information from the first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence AI model, and the input of the second AI model is the third feature; the processor 1110 is used to determine the validity of the first AI model based on the fourth information.

可以理解，本实施例中提及的各实现方式的实现过程可以参照方法实施例的相关描述，并达到相同或相应的技术效果，为避免重复，在此不再赘述。 It can be understood that the implementation process of each implementation method mentioned in this embodiment can refer to the relevant description of the method embodiment and achieve the same or corresponding technical effect. To avoid repetition, it will not be repeated here.

本申请实施例还提供一种网络侧设备，包括处理器和通信接口，所述通信接口和所述处理器耦合，所述处理器用于运行程序或指令，实现如图4至图8所示的方法实施例的步骤。该网络侧设备实施例与上述第一设备或第二设备方法实施例对应，上述方法实施例的各个实施过程和实现方式均可适用于该网络侧设备实施例中，且能达到相同的技术效果。The embodiment of the present application also provides a network side device, including a processor and a communication interface, the communication interface is coupled to the processor, and the processor is used to run a program or instruction to implement the steps of the method embodiment shown in Figures 4 to 8. The network side device embodiment corresponds to the first device or second device method embodiment described above, and each implementation process and implementation method of the method embodiment described above can be applied to the network side device embodiment, and can achieve the same technical effect.

具体地，本申请实施例还提供了一种网络侧设备。如图15所示，该网络侧设备1200包括：天线121、射频装置122、基带装置123、处理器124和存储器125。天线121与射频装置122连接。在上行方向上，射频装置122通过天线121接收信息，将接收的信息发送给基带装置123进行处理。在下行方向上，基带装置123对要发送的信息进行处理，并发送给射频装置122，射频装置122对收到的信息进行处理后经过天线121发送出去。Specifically, the embodiment of the present application also provides a network side device. As shown in Figure 15, the network side device 1200 includes: an antenna 121, a radio frequency device 122, a baseband device 123, a processor 124 and a memory 125. The antenna 121 is connected to the radio frequency device 122. In the uplink direction, the radio frequency device 122 receives information through the antenna 121 and sends the received information to the baseband device 123 for processing. In the downlink direction, the baseband device 123 processes the information to be sent and sends it to the radio frequency device 122. The radio frequency device 122 processes the received information and sends it out through the antenna 121.

以上实施例中网络侧设备执行的方法可以在基带装置123中实现，该基带装置123包括基带处理器。The method executed by the network-side device in the above embodiment may be implemented in the baseband device 123, which includes a baseband processor.

基带装置123例如可以包括至少一个基带板，该基带板上设置有至少两个芯片，如图15所示，其中一个芯片例如为基带处理器，通过总线接口与存储器125连接，以调用存储器125中的程序，执行以上方法实施例中所示的网络设备操作。The baseband device 123 may include, for example, at least one baseband board, on which at least two chips are arranged, as shown in Figure 15, one of the chips is, for example, a baseband processor, which is connected to the memory 125 through a bus interface to call the program in the memory 125 to execute the network device operations shown in the above method embodiment.

该网络侧设备还可以包括网络接口126，该接口例如为通用公共无线接口(Common Public Radio Interface，CPRI)。The network side device may also include a network interface 126, which is, for example, a Common Public Radio Interface (CPRI).

具体地，本申请实施例的网络侧设备1200还包括：存储在存储器125上并可在处理器124上运行的指令或程序，处理器124调用存储器125中的指令或程序执行图9至图12中任一项所示各单元执行的方法，并达到相同的技术效果，为避免重复，故不在此赘述。Specifically, the network side device 1200 of the embodiment of the present application also includes: instructions or programs stored in the memory 125 and executable on the processor 124. The processor 124 calls the instructions or programs in the memory 125 to execute the method executed by each unit shown in any one of Figures 9 to 12, and achieves the same technical effect. To avoid repetition, it will not be repeated here.

具体地，本申请实施例还提供了一种网络侧设备。如图16所示，该网络侧设备1300包括：处理器1301、网络接口1302和存储器1303。其中，网络接口1302例如为通用公共无线接口(common public radio interface，CPRI)。Specifically, the embodiment of the present application further provides a network side device. As shown in FIG16 , the network side device 1300 includes: a processor 1301, a network interface 1302, and a memory 1303. Among them, the network interface 1302 is, for example, a common public radio interface (CPRI).

具体地，本申请实施例的网络侧设备1300还包括：存储在存储器1303上并可在处理器1301上运行的指令或程序，处理器1301调用存储器1303中的指令或程序执行图9至图12中任一项所示各单元执行的方法，并达到相同的技术效果，为避免重复，故不在此赘述。Specifically, the network side device 1300 of the embodiment of the present application also includes: instructions or programs stored in the memory 1303 and executable on the processor 1301. The processor 1301 calls the instructions or programs in the memory 1303 to execute the method executed by each unit shown in any one of Figures 9 to 12, and achieves the same technical effect. To avoid repetition, it will not be repeated here.

本申请实施例还提供一种可读存储介质，所述可读存储介质上存储有程序或指令，该程序或指令被处理器执行时实现上述模型性能监督方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。An embodiment of the present application also provides a readable storage medium, on which a program or instruction is stored. When the program or instruction is executed by a processor, the various processes of the above-mentioned model performance supervision method embodiment are implemented, and the same technical effect can be achieved. To avoid repetition, it will not be repeated here.

其中，所述处理器为上述实施例中所述的终端中的处理器。所述可读存储介质，包括计算机可读存储介质，如计算机只读存储器ROM、随机存取存储器RAM、磁碟或者光盘等。在一些示例中，可读存储介质可以是非瞬态的可读存储介质。The processor is the processor in the terminal described in the above embodiment. The readable storage medium includes a computer readable storage medium, such as a computer read-only memory ROM, a random access memory RAM, a magnetic disk or an optical disk. In some examples, the readable storage medium may be a non-transient readable storage medium.

本申请实施例另提供了一种芯片，所述芯片包括处理器和通信接口，所述通信接口和所述处理器耦合，所述处理器用于运行程序或指令，实现上述模型性能监督方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。An embodiment of the present application further provides a chip, which includes a processor and a communication interface, wherein the communication interface is coupled to the processor, and the processor is used to run programs or instructions to implement the various processes of the above-mentioned model performance supervision method embodiment, and can achieve the same technical effect. To avoid repetition, it will not be repeated here.

应理解，本申请实施例提到的芯片还可以称为系统级芯片，系统芯片，芯片系统或片上系统芯片等。It should be understood that the chip mentioned in the embodiments of the present application can also be called a system-level chip, a system chip, a chip system or a system-on-chip chip, etc.

本申请实施例另提供了一种计算机程序/程序产品，所述计算机程序/程序产品被存储在存储介质中，所述计算机程序/程序产品被至少一个处理器执行以实现上述模型性能监督方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。An embodiment of the present application further provides a computer program/program product, which is stored in a storage medium. The computer program/program product is executed by at least one processor to implement the various processes of the above-mentioned model performance supervision method embodiment and can achieve the same technical effect. To avoid repetition, it will not be repeated here.

本申请实施例还提供了一种通信系统，包括：第一设备及第二设备，其中，所述第一设备可用于执行如上所述的模型性能监督方法中由第一设备执行的步骤，所述第二设备可用于执行如上所述的模型性能监督方法中由第二设备执行的步骤。 An embodiment of the present application also provides a communication system, including: a first device and a second device, wherein the first device can be used to execute the steps performed by the first device in the model performance supervision method as described above, and the second device can be used to execute the steps performed by the second device in the model performance supervision method as described above.

需要说明的是，在本文中，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。此外，需要指出的是，本申请实施方式中的方法和装置的范围不限按示出或讨论的顺序来执行功能，还可包括根据所涉及的功能按基本同时的方式或按相反的顺序来执行功能，例如，可以按不同于所描述的次序来执行所描述的方法，并且还可以添加、省去或组合各种步骤。另外，参照某些示例所描述的特征可在其他示例中被组合。It should be noted that, in this article, the terms "comprise", "include" or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device including a series of elements includes not only those elements, but also other elements not explicitly listed, or also includes elements inherent to such process, method, article or device. In the absence of further restrictions, an element defined by the sentence "comprises one..." does not exclude the presence of other identical elements in the process, method, article or device including the element. In addition, it should be pointed out that the scope of the method and device in the embodiment of the present application is not limited to performing functions in the order shown or discussed, and may also include performing functions in a substantially simultaneous manner or in reverse order according to the functions involved, for example, the described method may be performed in an order different from that described, and various steps may also be added, omitted or combined. In addition, the features described with reference to certain examples may be combined in other examples.

通过以上的实施方式的描述，本领域的技术人员可以清楚地了解到上述实施例方法可借助计算机软件产品加必需的通用硬件平台的方式来实现，当然也可以通过硬件。该计算机软件产品存储在存储介质(如ROM、RAM、磁碟、光盘等)中，包括若干指令，用以使得终端或者网络侧设备执行本申请各个实施例所述的方法。Through the description of the above implementation methods, those skilled in the art can clearly understand that the above-mentioned embodiment methods can be implemented by means of a computer software product plus a necessary general hardware platform, and of course, can also be implemented by hardware. The computer software product is stored in a storage medium (such as ROM, RAM, disk, CD, etc.), including several instructions to enable a terminal or a network-side device to execute the methods described in each embodiment of the present application.

上面结合附图对本申请的实施例进行了描述，但是本申请并不局限于上述的具体实施方式，上述的具体实施方式仅仅是示意性的，而不是限制性的，本领域的普通技术人员在本申请的启示下，在不脱离本申请宗旨和权利要求所保护的范围情况下，还可做出很多形式的实施方式，这些实施方式均属于本申请的保护之内。 The embodiments of the present application are described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific implementation methods. The above-mentioned specific implementation methods are merely illustrative and not restrictive. Under the guidance of the present application, ordinary technicians in this field can also make many forms of implementation methods without departing from the purpose of the present application and the scope of protection of the claims, and these implementation methods are all within the protection of the present application.

Claims

A model performance supervision method, comprising:

The first device acquires first information, wherein the first information is used to characterize the uncertainty or probability distribution of the output of a first artificial intelligence (AI) model, and the input of the first AI model is a first feature;

The first device determines third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

The method according to claim 1, wherein

The first device obtains first information, including one of the following:

The first device receives the first information from the second device;

The first device obtains the first information through output information of the first AI model;

The first device receives output information of the first AI model from the second device, and obtains the first information according to the output information of the first AI model.

The method according to claim 1 or 2, further comprising:

The first device determines the validity of the first AI model according to the third information; or,

The first device sends the third information to the second device.

The method according to claim 3, wherein the method further comprises:

When the first device determines the validity of the first AI model according to the third information, the first device sends first indication information to the second device, wherein the first indication information is used to indicate the validity of the first AI model; or

When the first device sends the third information to the second device, the first device receives second indication information from the second device, wherein the second indication information is used to indicate the validity of the first AI model.

The method according to any one of claims 1 to 4, wherein the first information includes at least one of the following:

Parameters of the probability density distribution of the second feature, a confidence interval of the second feature, a value of the second feature, a value of the second feature and its probability, and a value of the second feature and its confidence level.

The method according to claim 5, wherein

The parameters of the probability density distribution include at least one of the following: a mean or expectation of the probability density distribution, a variance or standard deviation of the probability density distribution, and an indication of the type of the probability density distribution.

The method according to claim 5 or 6, wherein:

In the case where the first information includes parameters of the probability density distribution of the second feature, the third information is used to characterize a mean of the first probabilities of N samples of the first feature, or the third information is used to characterize a mean of the logarithms of the first probabilities of the N samples of the first feature;

The first probability is the probability of the label corresponding to each sample in the N samples under the probability density distribution of the second feature, and N is a positive integer.

The method according to claim 7, wherein:

In the case where the third information is used to characterize the mean of the first probabilities of N samples of the first feature, the third information is determined based on the following formula:

Wherein, _L1 represents the third information, i represents the i-th sample among the N samples, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, represents the mean of the probability density distribution of the second feature corresponding to the ith sample, _yi represents the label corresponding to the ith sample, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

The method according to claim 7, wherein:

In the case where the third information is used to characterize the mean of the logarithm of the first probability of N samples of the first feature, the third information is determined based on the following formula:

Wherein, L ₂ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, represents the standard deviation or variance of the probability density distribution of the second feature corresponding to the i-th sample, _yi represents the label corresponding to the i-th sample, s is a positive integer, and g(·) represents the probability density distribution function obeyed by the output of the first AI model.

The method according to any one of claims 7 to 9, wherein

The third information is used to determine the validity of the first AI model, including:

When the third information is greater than or equal to the first threshold, the first AI model is valid; or,

When the third information is less than the first threshold, the first AI model fails.

The method according to claim 5 or 6, wherein:

In the case where the first information includes the confidence interval of the second feature, the third information is used to characterize the ratio of the number of samples whose corresponding labels are within the confidence interval of the second feature among the N samples of the first feature to the total number of samples N, where N is a positive integer.

The method according to claim 11, wherein

The third information is determined based on the following formula:

Wherein, L ₃ represents the third information, i represents the i-th sample among the N samples, x _i represents the input information corresponding to the i-th sample, y _i represents the label corresponding to the i-th sample, and when y _i is within the confidence interval f(x _i ) of the second feature, otherwise

The method according to claim 11 or 12, wherein

When the third information is greater than or equal to the second threshold, the first AI model is valid; or,

When the third information is less than the second threshold, the first AI model fails.

The method according to claim 5 or 6, wherein:

When the first information includes the value of the second feature and its probability or confidence, the third information is used to characterize the mean of the weighted distances between the labels corresponding to N samples of the first feature and the value of the second feature, wherein the weighted weight is determined based on the probability or confidence of the second feature, and N is a positive integer.

The method according to claim 14, wherein

The third information is determined based on the following formula:

Wherein, L ₄ represents the third information, i represents the i-th sample among the N samples, represents the mean of the probability density distribution of the second feature corresponding to the i-th sample, and _yi represents the label corresponding to the i-th sample.

The method according to claim 14 or 15, wherein

When the third information is less than or equal to a third threshold, the first AI model is valid; or,

When the third information is greater than a third threshold, the first AI model fails.

A model performance supervision method, comprising:

The second device receives third information from the first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first artificial intelligence AI model, the input of the first AI model is a first feature, and the second information is a label corresponding to the first feature;

The second device determines the validity of the first AI model based on the third information.

The method according to claim 17, wherein before the second device receives the third information from the first device, the method further comprises:

The second device sends the first information to the first device.

The method according to claim 17 or 18, wherein the method further comprises:

The second device sends second indication information to the first device, wherein the second indication information is used to indicate the validity of the first AI model.

The method according to any one of claims 17 to 19, wherein the first information includes at least one of the following:

The method according to claim 20, wherein

The method according to claim 20 or 21, wherein

The method according to claim 22, wherein

The method according to any one of claims 22 to 24, wherein

The second device determines the validity of the first AI model according to the third information, including:

When the third information is greater than or equal to the first threshold, the second device determines that the first AI model is valid; or,

When the third information is less than the first threshold, the second device determines that the first AI model is invalid.

The method according to claim 20 or 21, wherein

The method according to claim 26, wherein

The third information is determined based on the following formula:

The method according to claim 26 or 27, wherein

When the third information is greater than or equal to a second threshold, the second device determines that the first AI model is valid; or,

When the third information is less than a second threshold, the second device determines that the first AI model is invalid.

The method according to claim 20 or 21, wherein

The method according to claim 29, wherein

The third information is determined based on the following formula:

The method according to claim 29 or 30, wherein

When the third information is less than or equal to a third threshold, the second device determines that the first AI model is valid; or,

When the third information is greater than a third threshold, the second device determines that the first AI model is invalid.

A model performance supervision method, comprising:

The first device obtains fourth information, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence AI model, and the input of the second AI model is the third feature;

The first device determines the validity of the second AI model based on the fourth information; or, the first device sends the fourth information to the second device.

The method according to claim 32, wherein

The first device obtains fourth information, including one of the following:

The first device obtains the fourth information from other devices;

The first device obtains the fourth information through the output information of the second AI model;

The first device receives the output information of the second AI model from the second device, and obtains the fourth information according to the output information of the second AI model.

The method according to claim 32 or 33, further comprising:

When the first device determines the validity of the second AI model according to the fourth information, the first device sends third indication information to the second device, wherein the third indication information is used to indicate the validity of the second AI model; or

When the first device sends the fourth information to the second device, the first device receives fourth indication information from the second device, wherein the fourth indication information is used to indicate the validity of the second AI model.

The method according to any one of claims 32 to 34, wherein

The fourth information includes at least one of the following: a parameter of a probability density distribution of the fourth feature, a confidence interval of the fourth feature, a value of the fourth feature, a value of the fourth feature and its probability, and a value of the fourth feature and its confidence.

The method according to claim 35, wherein

The method according to claim 35 or 36, wherein

In a case where the fourth information includes a parameter of a probability density distribution of the fourth feature, the first device determines the validity of the second AI model according to the fourth information, including:

When the variance or standard deviation of the probability density distribution of the fourth feature is greater than or equal to the first threshold, the first device determines that the second AI model fails; or,

When the mean of the variance or standard deviation of the probability density distribution of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the second threshold, the first device determines that the second AI model fails; or,

When, among the N samples of the third feature, the proportion of the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold to the total number of samples N is greater than or equal to the third threshold, the first device determines that the second AI model has failed;

Wherein, N is a positive integer.

The method according to claim 35 or 36, wherein

In a case where the fourth information includes a confidence interval of the fourth feature, the first device determines the validity of the second AI model according to the fourth information, including:

When the width of the confidence interval of the fourth feature is greater than or equal to a fourth threshold, the first device determines that the second AI model is invalid; or,

When the average width of the confidence interval of the fourth feature corresponding to the N samples of the third feature is greater than or equal to a fifth threshold, the first device determines that the second AI model is invalid; or,

When, among the N samples of the third feature, the number of samples whose width of the confidence interval of the corresponding fourth feature is greater than or equal to the fourth threshold accounts for a proportion of the total number of samples N that is greater than or equal to a sixth threshold, the first device determines that the second AI model has failed;

Wherein, N is a positive integer.

The method according to claim 35 or 36, wherein

In a case where the fourth information includes a value of the fourth feature and a probability or confidence level thereof, the first device determines the validity of the second AI model according to the fourth information, including:

When the probability or confidence of the fourth feature is less than or equal to a seventh threshold, the first device determines that the second AI model fails; or,

When the average of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to an eighth threshold, the first device determines that the second AI model fails; or,

Among the N samples of the third feature, the probability or confidence of the corresponding fourth feature is less than or equal to When the ratio of the number of samples of the seventh threshold to the total number of samples N is greater than or equal to the ninth threshold, the first device determines that the second AI model is invalid;

Wherein, N is a positive integer.

A model performance supervision method, comprising:

The second device receives fourth information from the first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence (AI) model, and the input of the second AI model is the third feature;

The second device determines the validity of the second AI model based on the fourth information.

The method according to claim 40, wherein the method further comprises:

The second device sends fourth indication information to the first device, wherein the fourth indication information is used to indicate the validity of the second AI model.

The method according to claim 40 or 41, wherein

The method according to claim 42, wherein

The method according to claim 42 or 43, wherein

In a case where the fourth information includes a parameter of a probability density distribution of the fourth feature, the second device determining the validity of the second AI model according to the fourth information includes:

When the variance or standard deviation of the probability density distribution of the fourth feature is greater than or equal to the first threshold, the second device determines that the second AI model fails; or,

When the mean of the variance or standard deviation of the probability density distribution of the fourth feature corresponding to the N samples of the third feature is greater than or equal to the second threshold, the second device determines that the second AI model is invalid; or,

When, among the N samples of the third feature, the proportion of the number of samples whose variance or standard deviation of the probability density distribution of the corresponding fourth feature is greater than or equal to the first threshold to the total number of samples N is greater than or equal to the third threshold, the second device determines that the second AI model has failed;

Wherein, N is a positive integer.

The method according to claim 42 or 43, wherein

In a case where the fourth information includes a confidence interval of the fourth feature, the second device determines the validity of the second AI model according to the fourth information, including:

When the width of the confidence interval of the fourth feature is greater than or equal to a fourth threshold, the second device determines that the second AI model is invalid; or,

When the average width of the confidence interval of the fourth feature corresponding to the N samples of the third feature is greater than or equal to a fifth threshold, the second device determines that the second AI model is invalid; or,

When, among the N samples of the third feature, the number of samples whose width of the confidence interval of the corresponding fourth feature is greater than or equal to the fourth threshold accounts for a proportion of the total number of samples N that is greater than or equal to a sixth threshold, the second device determines that the second AI model has failed;

Wherein, N is a positive integer.

The method according to claim 42 or 43, wherein

In a case where the fourth information includes the value of the fourth feature and the probability or confidence thereof, the second device determines the validity of the second AI model according to the fourth information, including:

When the probability or confidence of the fourth feature is less than or equal to a seventh threshold, the second device determines that the second AI model fails; or,

When the average of the probability or confidence of the fourth feature corresponding to the N samples of the third feature is less than or equal to an eighth threshold, the second device determines that the second AI model fails; or,

Among the N samples of the third feature, the probability or confidence of the corresponding fourth feature is less than or equal to When the ratio of the number of samples of the seventh threshold to the total number of samples N is greater than or equal to the ninth threshold, the second device determines that the second AI model is invalid;

Wherein, N is a positive integer.

A model performance monitoring device, comprising:

An acquisition unit, configured to acquire first information, wherein the first information is used to characterize the uncertainty or probability distribution of an output of a first artificial intelligence (AI) model, and an input of the first AI model is a first feature;

A processing unit, used to determine third information based on the first information and the second information, wherein the second information is a label corresponding to the first feature, and the third information is used to determine the validity of the first AI model.

A model performance monitoring device, comprising:

a transceiver unit, configured to receive third information from a first device, wherein the third information is determined based on the first information and the second information, the first information is used to characterize the uncertainty or probability distribution of the output of the first artificial intelligence (AI) model, the input of the first AI model is a first feature, and the second information is a label corresponding to the first feature;

A processing unit is used to determine the validity of the first AI model based on the third information.

A model performance monitoring device, comprising:

an acquisition unit, configured to acquire fourth information, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence (AI) model, the input of the second AI model being the third feature;

A processing unit, used to determine the validity of the second AI model according to the fourth information; or a transceiver unit, used to send the fourth information to the second device.

A model performance monitoring device, comprising:

A transceiver unit, configured to receive fourth information from the first device, wherein the fourth information is used to characterize the uncertainty or probability distribution of the output of the second artificial intelligence (AI) model, and the input of the second AI model is the third feature;

A processing unit, configured to determine the validity of the second AI model based on the fourth information.

A model performance supervision device, the model performance supervision device is a first device, the model performance supervision device includes a processor and a memory, the memory stores a program or instruction that can be run on the processor, the program or instruction when executed by the processor implements the steps of the model performance supervision method as described in any one of claims 1 to 16, or the program or instruction when executed by the processor implements the steps of the model performance supervision method as described in any one of claims 32 to 39.

A model performance supervision device, the model performance supervision device is a second device, the model performance supervision device includes a processor and a memory, the memory stores a program or instruction that can be run on the processor, and the program or instruction when executed by the processor implements the steps of the model performance supervision method as described in any one of claims 17 to 31, or the program or instruction when executed by the processor implements the steps of the model performance supervision method as described in any one of claims 40 to 46.

A readable storage medium storing a program or instruction, wherein the program or instruction, when executed by a processor, implements the method of the model performance supervision method as described in any one of claims 1 to 16, or implements the steps of the model performance supervision method as described in any one of claims 17 to 31, or implements the steps of the model performance supervision method as described in any one of claims 32 to 39, or implements the steps of the model performance supervision method as described in any one of claims 40 to 46.