[go: up one dir, main page]

WO2025189861A1 - Communication method and related apparatus - Google Patents

Communication method and related apparatus

Info

Publication number
WO2025189861A1
WO2025189861A1 PCT/CN2024/138059 CN2024138059W WO2025189861A1 WO 2025189861 A1 WO2025189861 A1 WO 2025189861A1 CN 2024138059 W CN2024138059 W CN 2024138059W WO 2025189861 A1 WO2025189861 A1 WO 2025189861A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
information
parameter
threshold
communication device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/138059
Other languages
French (fr)
Chinese (zh)
Inventor
徐晨
张公正
李榕
王坚
童文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of WO2025189861A1 publication Critical patent/WO2025189861A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/045Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/02Arrangements for optimising operational condition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W8/00Network data management
    • H04W8/22Processing or transfer of terminal data, e.g. status or physical capabilities
    • H04W8/24Transfer of terminal data

Definitions

  • the present application relates to the field of communications, and in particular to a communication method and related devices.
  • Wireless communication can be a transmission communication between two or more communication nodes without propagating through conductors or cables.
  • the communication nodes generally include network devices and terminal devices.
  • communication nodes generally possess both signal transceiver capabilities and computing capabilities.
  • network devices with computing capabilities primarily provide computing power to support signal transceiver capabilities (e.g., processing both sending and receiving signals), enabling communication between the network device and other communication nodes.
  • communication nodes may have excess computing power beyond just supporting the aforementioned communication tasks. Therefore, how to utilize this computing power is a pressing technical issue.
  • the present application provides a communication method and related devices for improving the robustness of continuous learning.
  • the present application provides a communication method, which is applied to a first communication device.
  • the first communication device may be a communication device (such as a terminal device or a network device), or the first communication device may be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the first communication device may also be a logic module or software that can realize all or part of the functions of the communication device.
  • the first communication device obtains first information, and the first information is used to indicate performance monitoring information of at least one artificial intelligence (AI) model, and the at least one AI model is associated with a first AI model deployed on the first communication device; the first communication device performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain the updated model parameters of the first AI model.
  • AI artificial intelligence
  • the first communication device can perform noise perturbation processing on the model parameters of the first AI model to obtain updated model parameters of the first AI model.
  • the computing power of the communication node can be applied to AI model processing, and the AI model can be updated based on the performance monitoring information of the AI model.
  • the process of the first communication device performing the AI model parameter update is triggered based on the performance monitoring information of at least one AI model, and during the AI model parameter update process, the first communication device obtains the updated AI model parameters through noise perturbation processing.
  • the solution can be applied to the scenario of continuous learning, and by performing noise perturbation processing on the AI model parameters, the problem of reduced or even disappearance of the plasticity of the AI model during long-term continuous learning can be slowed down or avoided, which can improve the adaptability of continuous learning and enhance the robustness of continuous learning.
  • AI model can be replaced by other terms, such as neural network, neural network model, AI neural network model, machine learning model, or AI processing model.
  • noise disturbance processing can be replaced by other terms, such as noise addition processing, disturbance processing, noise disturbance processing, noise interference processing, or random disturbance processing.
  • the method further includes: the first communication device sending second information based on the first information, the second information being obtained by performing noise disturbance processing based on the first parameter, and the second information being used to update a second AI model; wherein the second AI model is used to be deployed in the second communication device.
  • the first communication device can also send second information obtained by performing noise perturbation processing based on the first parameter based on the first information, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by performing noise perturbation processing based on the first parameter, and update the second AI model based on the second information.
  • the problem of reduced or even disappearance of the plasticity of the second AI model during long-term continuous learning can be alleviated or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.
  • the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device.
  • the first information is used to indicate performance monitoring information of at least one AI model, and when the performance monitoring information indicated by the first information satisfies a certain condition (e.g., the first condition and/or the second condition described below), the first information can be used to trigger noise perturbation processing of model parameters of the first AI model and/or the second AI model.
  • a certain condition e.g., the first condition and/or the second condition described below
  • the first information can be used to trigger noise perturbation processing of model parameters of the first AI model and/or the second AI model.
  • the at least one AI model may include the first AI model and/or the second AI model.
  • the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.
  • the first AI model is associated with the second AI model, including any of the following:
  • the first AI model is a public model and the second AI model is a personalized model
  • the first AI model is a personalized model and the second AI model is a public model;
  • the input of the first AI model includes the output of the second AI model
  • the input of the second AI model includes the output of the first AI model
  • the first AI model is a teacher model and the second AI model is a student model; or,
  • the first AI model is a student model and the second AI model is a teacher model.
  • the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.
  • the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.
  • the first parameter includes one or more of the following:
  • the updated model parameters of the second AI model used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.
  • the first communication device sending the second information based on the first information includes: when a first condition is met, the first communication device sending the second information based on the first information; wherein the first condition includes at least one of the following:
  • the training accuracy of the at least one AI model is lower than a first threshold
  • the test accuracy of the at least one AI model is lower than a second threshold
  • the system performance of the AI model system in which the at least one AI model resides is lower than a third threshold
  • the system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold
  • a change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold
  • the neural network weight distribution of the at least one AI model is greater than a sixth threshold
  • the change in the neural network weight of the at least one AI model is lower than a seventh threshold.
  • the first communication device when the first condition is met, can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain the second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.
  • the method further includes: the first communication device receiving indication information indicating a second parameter, where the second parameter is used to process the first parameter to obtain the second information.
  • the first communication device can also determine the second parameter through the received indication information, and process the first parameter based on the second parameter to obtain the second information, that is, the first communication device can determine the second parameter based on the indication of other communication devices (for example, the recipient of the second information), so that the recipient of the second information can obtain the corresponding second information through the specified second parameter.
  • the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.
  • the method further includes: the first communication device sending request information for requesting the second parameter.
  • the first communication device can determine the second parameter through an interactive process of the request information and the above indication information.
  • the second parameter is a preconfigured parameter.
  • the method further includes: the first communication device receiving indication information indicating a third parameter, where the third parameter is used to process the model parameters of the first AI model to obtain updated model parameters of the first AI model.
  • the first communication device can also receive the third parameter determined by the received indication information, and perform noise disturbance processing on the model parameters of the first AI model based on the third parameter, that is, the first communication device can determine the third parameter based on the indication of other communication devices (such as the recipient of the second information), so that the first communication device can obtain the noise disturbance processing through the specified third parameter.
  • the third parameter can be used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model. Accordingly, the third parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.
  • the method further includes: the first communication device sending request information for requesting the third parameter.
  • the first communication device can determine the third parameter through an interactive process of the request information and the above indication information.
  • the third parameter is a preconfigured parameter.
  • the first communications device performs noise perturbation processing on model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model, including: when a second condition is met, the first communications device performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model;
  • the second condition includes at least one of the following:
  • the training accuracy of the at least one AI model is lower than an eighth threshold
  • the test accuracy of the at least one AI model is lower than a ninth threshold
  • the system performance of the AI model system in which the at least one AI model resides is lower than a tenth threshold
  • the system performance of the communication system in which the communication device deploys the at least one AI model is located is lower than an eleventh threshold
  • the change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold
  • the neural network weight distribution of the at least one AI model is greater than a thirteenth threshold
  • the change in the neural network weight of the at least one AI model is lower than a fourteenth threshold.
  • the first communication device can determine that the current operation of the first AI model may be abnormal. To this end, the first communication device can process the model parameters of the first AI model to obtain updated model parameters of the first AI model to update the first AI model. In this way, the problem of the first AI model's plasticity being reduced or even eliminated during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.
  • the first communication device obtains the first information, including: the first communication device receives the first information; or, the first communication device obtains measurement parameters of the first AI model and/or the second AI model, and determines the first information based on the measurement parameters of the first AI model and/or the second AI model.
  • the first communication device can obtain the first information through the above multiple methods to improve the flexibility of implementing the solution.
  • the second aspect of the present application provides a communication method, which is applied to a first communication device, which may be a communication device (such as a terminal device or a network device), or the first communication device may be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the first communication device may also be a logic module or software that can realize all or part of the functions of the communication device.
  • a communication device such as a terminal device or a network device
  • the first communication device may be a partial component in the communication device (such as a processor, a chip or a chip system, etc.)
  • the first communication device may also be a logic module or software that can realize all or part of the functions of the communication device.
  • the first communication device obtains first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in the second communication device; the first communication device sends second information based on the first information, which is obtained by performing noise perturbation processing based on the first parameter, and the second information is used to update the second AI model.
  • the first communication device can send the second information obtained by adding noise and perturbation based on the first parameter, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by adding noise and perturbation based on the first parameter, and update the second AI model based on the second information.
  • the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be slowed down or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.
  • the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device.
  • the first information is used to indicate performance monitoring information of at least one AI model, and when the performance monitoring information indicated by the first information is used to trigger a certain condition (such as the first condition and/or the second condition described below), the first information can be used to perform noise perturbation processing on the model parameters of the first AI model and/or the second AI model.
  • the at least one AI model may include the first AI model and/or the second AI model. In this way, the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.
  • the first AI model is associated with the second AI model, including any of the following:
  • the first AI model is a public model and the second AI model is a personalized model
  • the first AI model is a personalized model and the second AI model is a public model;
  • the input of the first AI model includes the output of the second AI model
  • the input of the second AI model includes the output of the first AI model
  • the first AI model is a teacher model and the second AI model is a student model; or,
  • the first AI model is a student model and the second AI model is a teacher model.
  • the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.
  • the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.
  • the first parameter includes one or more of the following:
  • the updated model parameters of the second AI model used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.
  • the first communication device sending the second information based on the first information includes: when a first condition is met, the first communication device sending the second information based on the first information;
  • the first condition includes at least one of the following:
  • the training accuracy of the at least one AI model is lower than a first threshold
  • the test accuracy of the at least one AI model is lower than a second threshold
  • the system performance of the AI model system in which the at least one AI model resides is lower than a third threshold
  • the system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold
  • a change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold
  • the neural network weight distribution of the at least one AI model is greater than a sixth threshold
  • the change in the neural network weight of the at least one AI model is lower than a seventh threshold.
  • the first communication device when the first condition is met, can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain the second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.
  • the method further includes: the first communication device receiving indication information indicating a second parameter, where the second parameter is used to process the first parameter to obtain the second information.
  • the first communication device can also determine the second parameter through the received indication information, and process the first parameter based on the second parameter to obtain the second information, that is, the first communication device can determine the second parameter based on the indication of other communication devices (for example, the recipient of the second information), so that the recipient of the second information can obtain the corresponding second information through the specified second parameter.
  • the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.
  • the method further includes: the first communication device sending request information for requesting the second parameter.
  • the first communication device can determine the second parameter through an interactive process of the request information and the above indication information.
  • the second parameter is a preconfigured parameter.
  • the first communication device obtains the first information, including: the first communication device receives the first information; or, the first communication device obtains measurement parameters of the first AI model and/or the second AI model, and determines the first information based on the measurement parameters of the first AI model and/or the second AI model.
  • the first communication device can obtain the first information through the above multiple methods to improve the flexibility of implementing the solution.
  • the third aspect of the present application provides a communication method, which is applied to a second communication device, which can be a communication device (such as a terminal device or a network device), or the second communication device can be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the second communication device can also be a logic module or software that can implement all or part of the functions of the communication device.
  • the second communication device receives second information, which is obtained by performing noise perturbation processing based on the first parameter; wherein the second information is used to update a second AI model deployed in the second communication device; the second communication device updates the second AI model based on the second information.
  • the second information received by the second communication device is obtained by performing noise perturbation processing based on the first parameter, and the second communication device can update the second AI model based on the second information.
  • the second information used to update the second AI model is obtained by performing noise perturbation processing based on the first parameter.
  • the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.
  • the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.
  • the first parameter includes one or more of the following:
  • the updated model parameters of the second AI model used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.
  • the present application provides a communication device, which is a first communication device and includes a transceiver unit; the processing unit is used to obtain first information, where the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a first AI model deployed in the first communication device; the processing unit is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the first aspect and achieve corresponding technical effects.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the first aspect and achieve corresponding technical effects.
  • the present application provides a communication device, which is a first communication device and includes a transceiver unit and a processing unit; the processing unit is used to obtain first information, and the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in a second communication device; the transceiver unit is used to send second information based on the first information, and the second information is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the second aspect and achieve corresponding technical effects.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the second aspect and achieve corresponding technical effects.
  • the present application provides a communication device, which is a second communication device and includes a transceiver unit and a processing unit.
  • the transceiver unit is used to receive second information, and the second information is obtained by performing noise disturbance processing based on the first parameter; wherein the second information is used to update a second AI model deployed in the second communication device; and the processing unit is used to update the second AI model based on the second information.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the third aspect and achieve corresponding technical effects.
  • the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the third aspect and achieve corresponding technical effects.
  • the present application provides a communication device, comprising at least one processor coupled to a memory; the memory is configured to store programs or instructions; the at least one processor is configured to execute the programs or instructions, so that the communication device implements the method described in any possible implementation of any one of the first to third aspects.
  • the communication device may include the memory.
  • the present application provides a communication device comprising at least one logic circuit and an input/output interface; the logic circuit is used to execute the method described in any possible implementation of any one of the first to third aspects.
  • the present application provides a communication system, which includes the above-mentioned first communication device and second communication device.
  • the present application provides a computer-readable storage medium for storing one or more computer-executable instructions.
  • the processor executes the method described in any possible implementation of any one of the first to third aspects above.
  • a computer program product (or computer program) is provided.
  • the processor executes the method described in any possible implementation of any one of the first to third aspects above.
  • a twelfth aspect of the present application provides a chip system, which includes at least one processor for supporting a communication device to implement the method described in any possible implementation of any one of the first to third aspects above.
  • the chip system may further include a memory for storing program instructions and data necessary for the communication device.
  • the chip system may be composed of a chip or may include a chip and other discrete components.
  • the chip system may further include an interface circuit for providing program instructions and/or data to the at least one processor.
  • the technical effects brought about by any design method in the fourth to twelfth aspects can refer to the technical effects brought about by the different design methods in the above-mentioned first to third aspects, and will not be repeated here.
  • FIGS. 1a to 1c are schematic diagrams of a communication system provided by this application.
  • FIGS. 2a to 2g are schematic diagrams of the AI processing process involved in this application.
  • FIG3 is an interactive schematic diagram of the communication method provided by this application.
  • FIGS. 4a to 4c are schematic diagrams of the model processing provided by this application.
  • FIG5 is an interactive diagram of the communication method provided by this application.
  • 6 to 10 are schematic diagrams of the communication device provided in this application.
  • Terminal device It can be a wireless terminal device that can receive network device scheduling and instruction information.
  • the wireless terminal device can be a device that provides voice and/or data connectivity to the user, or a handheld device with wireless connection function, or other processing device connected to a wireless modem.
  • Terminal devices can communicate with one or more core networks or the Internet via a radio access network (RAN).
  • Terminal devices can be mobile terminal devices, such as mobile phones (also known as "cellular" phones, mobile phones), computers, and data cards.
  • mobile phones also known as "cellular" phones, mobile phones
  • computers and data cards.
  • they can be portable, pocket-sized, handheld, computer-built-in, or vehicle-mounted mobile devices that exchange voice and/or data with the radio access network.
  • PCS personal communication service
  • SIP session initiation protocol
  • WLL wireless local loop
  • PDAs personal digital assistants
  • tablets tablets or pads
  • computers with wireless transceiver capabilities, and other devices.
  • Wireless terminal equipment can also be called system, subscriber unit, subscriber station, mobile station/mobile station (MS), remote station, access point (AP), remote terminal (REMOTE terminal), access terminal (ACCESS terminal), user terminal (USER terminal), user agent (USER agent), subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.
  • the terminal device may also be a wearable device.
  • Wearable devices may also be referred to as wearable smart devices or smart wearable devices, etc., which are a general term for wearable devices that are intelligently designed and developed using wearable technology for daily wear, such as glasses, gloves, watches, clothing, and shoes.
  • a wearable device is a portable device that is worn directly on the body or integrated into the user's clothes or accessories. Wearable devices are not only hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction.
  • wearable smart devices include those that are fully functional, large in size, and can achieve complete or partial functions without relying on smartphones, such as smart watches or smart glasses, etc., as well as those that only focus on a certain type of application function and need to be used in conjunction with other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
  • the terminal may also be a drone, a robot, a terminal in device-to-device (D2D) communication, a terminal in vehicle to everything (V2X), a virtual reality (VR) terminal device, an augmented reality (AR) terminal device, a wireless terminal in industrial control, a wireless terminal in self-driving, a wireless terminal in telemedicine (telehealth services), a wireless terminal in a smart grid, a wireless terminal in transportation safety, a wireless terminal in a smart city, a wireless terminal in a smart home, etc.
  • D2D device-to-device
  • V2X vehicle to everything
  • VR virtual reality
  • AR augmented reality
  • a wireless terminal in industrial control a wireless terminal in self-driving
  • a wireless terminal in telemedicine telehealth services
  • a wireless terminal in a smart grid a wireless terminal in transportation safety
  • a wireless terminal in a smart city a wireless terminal in a smart home, etc.
  • the terminal device may also be a terminal device in a communication system evolved after the fifth-generation (5G) communication system (e.g., 5G Advanced or sixth-generation (6G) communication system), or a terminal device in a future-evolved public land mobile network (PLMN).
  • 5G Advanced or 6G networks may further expand the form and functionality of 5G communication terminals.
  • 6G terminals include, but are not limited to, vehicles, cellular network terminals (with integrated satellite terminal functionality), drones, and Internet of Things (IoT) devices.
  • the terminal device may also obtain artificial intelligence (AI) services provided by the network device.
  • AI artificial intelligence
  • the terminal device may also have AI processing capabilities.
  • a network device can be a RAN node (or device) that connects a terminal device to a wireless network, which can also be called a base station.
  • RAN equipment are: base station, evolved NodeB (eNodeB), gNB (gNodeB) in a 5G communication system, transmission reception point (TRP), evolved NodeB (eNB), radio network controller (RNC), NodeB (NB), home base station (e.g., home evolved NodeB, or home NodeB, HNB), baseband unit (BBU), or wireless fidelity (Wi-Fi) access point (AP).
  • a network device can include a central unit (CU) node, a distributed unit (DU) node, or a RAN device including a CU node and a DU node.
  • CU central unit
  • DU distributed unit
  • RAN device including a CU node and a DU node.
  • a RAN node can be a macro base station, micro base station, indoor base station, relay node, donor node, or wireless controller in a cloud radio access network (CRAN) scenario.
  • a RAN node can also be a server, wearable device, vehicle, or onboard device.
  • the access network device in vehicle-to-everything (V2X) technology can be a roadside unit (RSU).
  • a RAN node can be a CU, DU, CU-control plane (CP), CU-user plane (UP), or radio unit (RU).
  • the CU and DU can be separate or included in the same network element, such as a baseband unit (BBU).
  • BBU baseband unit
  • the RU can be included in a radio frequency device or radio unit, such as a remote radio unit (RRU), an active antenna unit (AAU), a radio head (RH), or a remote radio head (RRH).
  • RRU remote radio unit
  • AAU active antenna unit
  • RH radio head
  • RRH remote radio head
  • CU or CU-CP and CU-UP
  • DU or RU may have different names, but those skilled in the art can understand their meanings.
  • O-CU open CU
  • DU may also be called O-DU
  • CU-CP may also be called O-CU-CP
  • CU-UP may also be called O-CU-UP
  • RU may also be called O-RU.
  • this application uses CU, CU-CP, CU-UP, DU and RU as examples for description.
  • Any unit among the CU (or CU-CP, CU-UP), DU and RU in this application can be implemented by a software module, a hardware module, or a combination of a software module and a hardware module.
  • This protocol layer may include a control plane protocol layer and a user plane protocol layer.
  • the control plane protocol layer may include at least one of the following: radio resource control (RRC) layer, packet data convergence protocol (PDCP) layer, radio link control (RLC) layer, media access control (MAC) layer, or physical (PHY) layer.
  • the user plane protocol layer may include at least one of the following: service data adaptation protocol (SDAP) layer, PDCP layer, RLC layer, MAC layer, or physical layer.
  • SDAP service data adaptation protocol
  • the network device may be any other device that provides wireless communication functionality to the terminal device.
  • the embodiments of this application do not limit the specific technology and device form used by the network device. For ease of description, the embodiments of this application do not limit this.
  • the network equipment may also include core network equipment, such as the mobility management entity (MME), home subscriber server (HSS), serving gateway (S-GW), policy and charging rules function (PCRF), and public data network gateway (PDN gateway or P-GW) in the fourth generation (4G) network; and the access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in the 5G network.
  • MME mobility management entity
  • HSS home subscriber server
  • S-GW serving gateway
  • PDN gateway or P-GW public data network gateway
  • the core network equipment may also include other core network equipment in the 5G network and the next generation network of the 5G network.
  • the above-mentioned network device may also have a network node with AI capabilities, which can provide AI services for terminals or other network devices.
  • a network node with AI capabilities can be an AI node on the network side (access network or core network), a computing power node, a RAN node with AI capabilities, a core network element with AI capabilities, etc.
  • the apparatus for implementing the function of the network device may be a network device, or may be a device capable of supporting the network device in implementing the function, such as a chip system, which may be provided in the network device.
  • the technical solutions provided in the embodiments of the present application are described by taking the network device as an example of the apparatus for implementing the function of the network device.
  • Configuration and pre-configuration are used simultaneously.
  • Configuration refers to the network device/server sending some parameter configuration information or parameter values to the terminal through messages or signaling, so that the terminal can determine the communication parameters or resources during transmission based on these values or information.
  • Pre-configuration is similar to configuration, and can be parameter information or parameter values pre-negotiated between the network device/server and the terminal device, or parameter information or parameter values used by the base station/network device or terminal device as specified in the standard protocol, or parameter information or parameter values pre-stored in the base station/server or terminal device. This application does not limit this.
  • “Sending” and “receiving” in the embodiments of the present application indicate the direction of signal transmission.
  • sending information to XX can be understood as the destination of the information being XX, which can include direct sending through the air interface, as well as indirect sending through the air interface by other units or modules.
  • Receiviving information from YY can be understood as the source of the information being YY, which can include direct receiving from YY through the air interface, as well as indirect receiving from YY through the air interface from other units or modules.
  • “Sending” can also be understood as the “output” of the chip interface, and “receiving” can also be understood as the “input” of the chip interface.
  • sending and receiving can be performed between devices, for example, between a network device and a terminal device, or can be performed within a device, for example, sending or receiving between components, modules, chips, software modules or hardware modules within the device through a bus, wiring or interface.
  • information may be processed between the source and destination of information transmission, such as coding, modulation, etc., but the destination can understand the valid information from the source. Similar expressions in this application can be understood similarly and will not be repeated.
  • indication may include direct indication and indirect indication, and may also include explicit indication and implicit indication.
  • the information indicated by a certain information is called information to be indicated.
  • information to be indicated In the specific implementation process, there are many ways to indicate the information to be indicated, such as but not limited to, directly indicating the information to be indicated, such as the information to be indicated itself or the index of the information to be indicated.
  • the information to be indicated may also be indirectly indicated by indicating other information, wherein the other information is associated with the information to be indicated; or only a part of the information to be indicated may be indicated, while the other part of the information to be indicated is known or agreed in advance.
  • the indication of specific information may be achieved by means of the arrangement order of each information agreed in advance (such as predefined by the protocol), thereby reducing the indication overhead to a certain extent.
  • the present application does not limit the specific method of indication. It is understandable that for the sender of the indication information, the indication information can be used to indicate the information to be indicated, and for the receiver of the indication information, the indication information can be used to determine the information to be indicated.
  • the communication system includes at least one network device and/or at least one terminal device.
  • Figure 1a is a schematic diagram of a communication system in this application.
  • Figure 1a exemplarily illustrates a network device and six terminal devices, namely terminal device 1, terminal device 2, terminal device 3, terminal device 4, terminal device 5, and terminal device 6.
  • terminal device 1 is a smart teacup
  • terminal device 2 is a smart air conditioner
  • terminal device 3 is a smart gas pump
  • terminal device 4 is a vehicle
  • terminal device 5 is a mobile phone
  • terminal device 6 is a printer.
  • the sending entity of AI configuration information can be a network device.
  • the receiving entity of AI configuration information can be terminal devices 1-6.
  • the network device and terminal devices 1-6 form a communication system.
  • terminal devices 1-6 can send data to the network device, and the network device needs to receive data sent by terminal devices 1-6.
  • the network device can send configuration information to terminal devices 1-6.
  • terminal devices 4 and 6 can also form a communication system.
  • Terminal device 5 acts as a network device, i.e., the sending entity of AI configuration information
  • terminal devices 4 and 6 act as terminal devices, i.e., the receiving entities of AI configuration information.
  • terminal device 5 sends AI configuration information to terminal devices 4 and 6, respectively, and receives data from terminal devices 4 and 6.
  • terminal devices 4 and 6 receive AI configuration information from terminal device 5 and send data to terminal device 5.
  • different devices may also execute AI-related services.
  • the base station can perform communication-related services and AI-related services with one or more terminal devices, and different terminal devices can also perform communication-related services and AI-related services.
  • an AI network element can be introduced into the communication system provided in this application to implement some or all AI-related operations.
  • the AI network element can also be referred to as an AI node, AI device, AI entity, AI module, AI model, or AI unit, etc.
  • the AI network element can be a network element built into the communication system.
  • the AI network element can be an AI module built into: an access network device, a core network device, a cloud server, or an operation, administration, and maintenance (OAM) system to implement AI-related functions.
  • OAM operation, administration, and maintenance
  • the OAM can serve as a network manager for a core network device and/or as a network manager for an access network device.
  • the AI network element can also be an independently set network element in the communication system.
  • the terminal or the chip built into the terminal can also include an AI entity to implement AI-related functions.
  • AI can imbue machines with human intelligence. For example, it can use computer hardware and software to simulate certain intelligent human behaviors.
  • Machine learning methods can be used to achieve artificial intelligence.
  • a machine uses training data to learn (or train) a model. This model represents the mapping from input to output.
  • the learned model can be used for inference (or prediction), meaning that the model can be used to predict the output corresponding to a given input. This output can also be called an inference result (or prediction result).
  • Machine learning can include supervised learning, unsupervised learning, and reinforcement learning. Among them, unsupervised learning can also be called unsupervised learning.
  • Supervised learning uses machine learning algorithms to learn the mapping relationship between sample values and sample labels based on collected sample values and sample labels, and then expresses this learned mapping relationship using an AI model.
  • the process of training a machine learning model is the process of learning this mapping relationship.
  • sample values are input into the model to obtain the model's predicted values.
  • the model parameters are optimized by calculating the error between the model's predicted values and the sample labels (ideal values).
  • the learned mapping can be used to predict new sample labels.
  • the mapping relationship learned by supervised learning can include linear mappings or nonlinear mappings. Based on the type of label, the learning task can be divided into classification tasks and regression tasks.
  • Unsupervised learning uses algorithms to discover inherent patterns in collected sample values.
  • One type of unsupervised learning algorithm uses the samples themselves as supervisory signals, meaning the model learns the mapping from one sample to another. This is called self-supervised learning.
  • the model parameters are optimized by calculating the error between the model's predictions and the samples themselves.
  • Self-supervised learning can be used in signal compression and decompression recovery applications. Common algorithms include autoencoders and generative adversarial networks.
  • Reinforcement learning unlike supervised learning, is a type of algorithm that learns problem-solving strategies through interaction with the environment. Unlike supervised and unsupervised learning, reinforcement learning problems lack explicit label data for "correct" actions. Instead, the algorithm must interact with the environment to obtain reward signals from the environment, and then adjust its decision-making actions to maximize the reward signal value. For example, in downlink power control, the reinforcement learning model adjusts the downlink transmit power of each user based on the overall system throughput fed back by the wireless network, hoping to achieve higher system throughput. The goal of reinforcement learning is also to learn the mapping between environmental states and optimal (e.g., optimal) decision-making actions. However, because the labels for "correct actions" cannot be obtained in advance, network optimization cannot be achieved by calculating the error between actions and "correct actions.” Reinforcement learning training is achieved through iterative interaction with the environment.
  • NN neural network
  • Traditional communication systems require extensive expert knowledge to design communication modules.
  • deep learning communication systems based on neural networks can automatically discover implicit patterns in massive data sets and establish mapping relationships between data, achieving performance superior to traditional modeling methods.
  • each neuron performs a weighted sum operation on its input values and outputs the result through an activation function.
  • FIG. 2a it is a schematic diagram of a neuron structure.
  • w i is used as the weight of xi to weight xi .
  • the bias for weighted summation of input values according to the weights is, for example, b. There can be many forms of activation functions.
  • the output of the neuron is:
  • b can be a decimal, an integer (eg, 0, a positive integer, or a negative integer), or a complex number, etc.
  • the activation functions of different neurons in a neural network can be the same or different.
  • neural networks generally include multiple layers, each of which may include one or more neurons. Increasing the depth and/or width of a neural network can improve its expressive power, providing more powerful information extraction and abstract modeling capabilities for complex systems.
  • the depth of a neural network can refer to the number of layers it comprises, and the number of neurons in each layer can be referred to as the width of that layer.
  • a neural network includes an input layer and an output layer. The input layer processes the input information received by the neural network through neurons, passing the processing results to the output layer, which then obtains the output of the neural network.
  • a neural network includes an input layer, a hidden layer, and an output layer. The input layer processes the input information received by the neural network through neurons, passing the processing results to an intermediate hidden layer. The hidden layer performs calculations on the received processing results to obtain a calculation result, which is then passed to the output layer or the next adjacent hidden layer, which ultimately obtains the output of the neural network.
  • a neural network can include one hidden layer or multiple hidden layers connected in sequence, without limitation.
  • DNN deep neural network
  • FNNs feedforward neural networks
  • CNNs convolutional neural networks
  • RNNs recurrent neural networks
  • Figure 2b is a schematic diagram of an FNN network.
  • a characteristic of FNN networks is that neurons in adjacent layers are fully connected. This characteristic typically requires a large amount of storage space and results in high computational complexity.
  • CNN is a type of neural network specifically designed to process data with a grid-like structure.
  • time series data e.g., discrete sampling on the time axis
  • image data e.g., discrete sampling on the two-dimensional axis
  • CNNs do not utilize all input information at once for computation. Instead, they use a fixed-size window to intercept a portion of the information for convolution operations, significantly reducing the computational complexity of model parameters.
  • each window can use a different convolution kernel, enabling CNNs to better extract features from the input data.
  • RNNs are a type of DNN that utilizes feedback time series information.
  • RNN inputs include the current input value and its own output value at the previous moment.
  • RNNs are suitable for capturing temporally correlated sequence features and are particularly well-suited for applications such as speech recognition and channel coding.
  • a loss function can be defined. This function describes the gap or discrepancy between the model's output and the desired target value. Loss functions can be expressed in a variety of forms, and their specific form is not restricted. The model training process can be viewed as adjusting some or all of the model's parameters to keep the loss function below a threshold or meet the target.
  • a model may also be referred to as an AI model, rule, or other name.
  • An AI model can be considered a specific method for implementing an AI function.
  • An AI model represents a mapping relationship or function between the input and output of a model.
  • AI functions may include one or more of the following: data collection, model training (or model learning), model information release, model inference (or model reasoning, inference, or prediction, etc.), model monitoring or model verification, or inference result release, etc.
  • AI functions may also be referred to as AI (related) operations, or AI-related functions.
  • Fully connected neural network also called multilayer perceptron (MLP).
  • an MLP consists of an input layer (left), an output layer (right), and multiple hidden layers (center).
  • Each layer of the MLP contains several nodes, called neurons. Neurons in adjacent layers are connected to each other.
  • w is the weight matrix
  • b is the bias vector
  • f is the activation function
  • m is the index of the neural network layer
  • m is greater than or equal to 1
  • m is less than or equal to M, where M is the total number of neural network layers.
  • a neural network can be understood as a mapping from an input data set to an output data set.
  • Neural networks are typically initialized randomly, and the process of obtaining this mapping from random w and b using existing data is called neural network training.
  • a specific method of training is to use a loss function to evaluate the output results of the neural network.
  • the error can be backpropagated, and the neural network parameters (including w and b) can be iteratively optimized using gradient descent until the loss function reaches a minimum, which is the "better point (e.g., optimal point)" in Figure 2d. It is understood that the neural network parameters corresponding to the "better point (e.g., optimal point)" in Figure 2d can be used as the neural network parameters in the trained AI model information.
  • the gradient descent process can be expressed as:
  • is the parameter to be optimized (including w and b)
  • L is the loss function
  • is the learning rate, which controls the step size of gradient descent.
  • the backpropagation process utilizes the chain rule for partial derivatives.
  • the gradient of the previous layer parameters can be recursively calculated from the gradient of the next layer parameters, which can be expressed as:
  • wij is the weight of node j connecting to node i
  • si is the weighted sum of the inputs on node i.
  • the FL architecture is the most widely used training architecture in the current FL field.
  • the FedAvg algorithm is the basic algorithm of FL. Its algorithm flow is roughly as follows:
  • the center initializes the model to be trained And broadcast it to all client devices.
  • the central node aggregates and collects the local training results from all (or some) clients. Assume that the client set that uploads the local model in round t is The center will use the number of samples of the corresponding client as the weight to perform weighted averaging to obtain a new global model. The specific update rule is: The center then sends the latest version of the global model Broadcast to all client devices for a new round of training.
  • the central node In addition to reporting local models You can also use the local gradient of training After reporting, the central node averages the local gradients and updates the global model according to the direction of the average gradient.
  • Distributed nodes collect local datasets, perform local training, and report the local training results (models or gradients) to the central node.
  • the central node itself does not have a dataset; it is only responsible for fusing the training results of distributed nodes to obtain a global model and send it to the distributed nodes.
  • decentralized learning Different from federated learning, decentralized learning is another distributed learning architecture.
  • the design goal f(x) of a decentralized learning system is generally the mean of the goals fi (x) of each node, that is, Where p is the number of distributed nodes, x is the parameter to be optimized. In machine learning, x is the parameter of the machine learning (such as neural network) model.
  • Each node uses local data and local target fi (x) to calculate the local gradient Then it is sent to the neighboring nodes that can be communicated with. After any node receives the gradient information sent by its neighbor, it can update the parameter x of the local model according to the following formula:
  • wireless communication systems e.g., the systems shown in Figures 1a, 1b, or 1c.
  • communication nodes generally have both signal transceiver capabilities and computing capabilities.
  • network devices with computing capabilities primarily provide computing power to support signal transceiver capabilities (e.g., performing signal transmission and reception processing) to enable communication between the network device and other communication nodes.
  • communication nodes may have excess computing power beyond just supporting the aforementioned communication tasks. Therefore, how to utilize this computing power is a pressing technical issue.
  • a communication node can serve as a participating node in an AI learning system, and the computing power of the communication node can be applied to a certain link of the AI learning system (e.g., the AI learning system described in FIG2f or FIG2g).
  • an AI node can establish a data set according to a task, complete the training of the model offline, and perform reasoning after online deployment.
  • the AI model may have the problem of disappearing plasticity, that is, the AI model will not be able to learn the knowledge of new training samples, resulting in the performance of the AI model becoming worse and worse, resulting in a decrease in the robustness of continuous learning.
  • a method of selectively reinitializing some parameters during training can be used to ensure that the neural network can maintain the ability to learn new samples.
  • this optimization method it is necessary to design complex functions and count the life cycles of various parameters to select the reset parameter set, which is highly complex; at the same time, reinitializing the parameters will greatly affect the instantaneous performance of the neural network, resulting in the above-mentioned optimization method not being able to effectively solve the above-mentioned problem.
  • FIG3 is a schematic diagram of an implementation of the communication method provided in this application.
  • the method includes the following steps.
  • the communication device can be a communication device (e.g., a terminal device or a network device), or a chip, a baseband chip, a modem chip, a system-on-chip (SoC) chip including a modem core, a system-in-package (SIP) chip, a communication module, a chip system, a processor, a logic module, or software in the communication device.
  • a communication device e.g., a terminal device or a network device
  • SoC system-on-chip
  • SIP system-in-package
  • a second communication device sends first information, and a first communication device receives the first information accordingly.
  • the first information is used to indicate performance monitoring information of at least one AI model associated with a first AI model deployed on the first communication device.
  • the first communication device performs noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • AI model can be replaced by other terms, such as neural network, neural network model, AI neural network model, machine learning model, or AI processing model.
  • noise disturbance processing can be replaced by other terms, such as noise addition processing, disturbance processing, noise disturbance processing, noise interference processing, or random disturbance processing.
  • the first information acquired by the first communication device is used to indicate performance monitoring information of at least one AI model, wherein the at least one AI model includes the first AI model and/or the second AI model, the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device.
  • the first information can be used to indicate the performance monitoring information of the at least one AI model, and in S302, the first communication device can perform noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • the first information when the performance monitoring information indicated by the first information meets a certain condition (e.g., the first condition and/or the second condition described below), the first information can be used to trigger noise perturbation processing of the model parameters of the first AI model and/or the second AI model.
  • the at least one AI model can include the first AI model and/or the second AI model. In this way, the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.
  • the first communication device in addition to obtaining the first information through interaction with the second communication device, the first communication device can also obtain the first information through other means.
  • the first communication device can obtain the first information locally.
  • the first communication device can obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model and/or the second AI model.
  • the measured parameters of an AI model may include the output of the AI model.
  • the first communications device may determine performance monitoring information of the AI model based on the output of the AI model.
  • the performance monitoring information may be determined by a mathematical relationship (e.g., at least one of a difference, variance, standard deviation, etc.) between the output of the AI model and a preconfigured tag.
  • the measurement parameter of the AI model may include communication performance information for communication based on the communication parameter (e.g., one or more of the bit error rate, the block error rate, the reference signal received power (RSRP), and the signal to interference plus noise ratio (SINR)).
  • the communication parameter e.g., one or more of the bit error rate, the block error rate, the reference signal received power (RSRP), and the signal to interference plus noise ratio (SINR)
  • the first communication device may determine the performance monitoring information of the AI model based on the communication performance information and the preconfigured performance information.
  • the performance monitoring information may be determined by a mathematical relationship (e.g., at least one of the difference, variance, standard deviation, etc.) between the communication performance information and the preconfigured performance information.
  • the first communication device can obtain the measurement parameters of the AI model in a variety of ways.
  • the first communication device can obtain the measurement parameters of the first AI model based on the output of the local first AI model.
  • the first communication device can obtain the measurement parameters of the first AI model based on the communication process with other communication devices.
  • the first communication device can obtain the measurement parameters of the second AI model based on the communication process with a second communication device.
  • the first AI model is associated with the second AI model, including any one of the following methods A to F:
  • Method A The first AI model is a public model and the second AI model is a personalized model.
  • Method B The first AI model is a personalized model and the second AI model is a public model.
  • Method C The input of the first AI model includes the output of the second AI model.
  • Mode D The input of the second AI model includes the output of the first AI model.
  • Method E The first AI model is a teacher model and the second AI model is a student model.
  • Method F The first AI model is a student model and the second AI model is a teacher model.
  • Methods A and B can be understood as the federated learning scenario shown in Figure 2f.
  • federated learning there are central nodes and distributed nodes.
  • the central node stores the public model and periodically aggregates the personalized models uploaded by the distributed nodes.
  • the distributed nodes periodically download the public model and fine-tune it using local data to obtain personalized models.
  • the first communication device can be the central node and the second communication device can be a distributed node.
  • the second communication device can be the central node and the first communication device can be a distributed node.
  • FIG. 4a it is a schematic diagram of an implementation of federated learning, taking the network device as the central node and the terminal device as the distributed node as an example.
  • the network device can save the public model
  • the terminal device 1 to the terminal device 3 can save their own personalized models.
  • the two can interact through the model download process and the model upload process.
  • some or all AI models in the terminal device can participate in federated learning.
  • the AI models of terminal device 1 and terminal device 2 can all participate in federated learning
  • the AI model of terminal device 3 can include a part filled with a black pattern that participates in federated learning, and a part filled with a blank that does not participate in federated learning.
  • Each device shown in Figure 4a can serve as a first communication device to perform noise perturbation on the model parameters of a locally deployed AI model.
  • noise perturbations to the model parameters of a public or personalized model, the robustness of continuous federated learning to distributional changes is increased.
  • the parameters for the noise perturbation process may be part or all of the model parameters of the AI model, for example, the part or all of the model parameters are determined by the above
  • e i represents the i-th parameter of the model.
  • a and b represent the lower and upper bounds.
  • Methods C and D can be understood as split learning scenarios.
  • the AI model can be split into two or more parts.
  • Figure 4b shows a schematic diagram of split learning.
  • the AI model is split into two parts, which can be an encoder and a decoder.
  • the encoder and decoder communicate intermediate features and intermediate gradients.
  • data enters the encoding neural network, obtains intermediate features, and transmits them to the decoding neural network.
  • the decoding neural network calculates the loss based on the label values and updates the decoding neural network in reverse order. Simultaneously, the decoding neural network transmits the intermediate gradients to the encoding neural network, which then updates the encoding neural network based on the received intermediate gradients.
  • the encoder or decoder shown in Figure 4b can be used as the first AI model in the first communication device to perform noise perturbation processing.
  • noise can be added to some or all of the model parameters of the encoder or decoder, thereby increasing the robustness of continuous split learning to distribution changes.
  • the noise perturbation method for some or all of the model parameters of the encoder or decoder please refer to the description of the previous example.
  • Methods E and F can be understood as knowledge distillation scenarios.
  • Knowledge distillation can enhance the loss of smaller or simpler models (such as student models or knowledge learning models) through the output of larger or more complex models (such as teacher models or knowledge transfer models), thereby achieving better convergence performance or faster convergence speed.
  • This can be considered a model compression technology.
  • Figure 4c is a schematic diagram of knowledge distillation.
  • This example takes the models involved in the knowledge distillation process as an example, including a teacher model and a student model.
  • the teacher model does not participate in training, but this is not limited here, that is, there is no distinction between specific teacher models and student models.
  • the teacher model can also be guided by the student model to update, or the two models can be updated at the same time, or there can be multiple teacher models or multiple student models. The following only takes the most common single teacher guiding single student model learning as an example.
  • the teacher model and the student model respectively infer and obtain inference results.
  • the knowledge distillation loss KD loss
  • the device deployed with the student model can obtain the task loss (task Loss) according to the label, and then the device can update the model parameters of the student model according to the distillation loss and task loss.
  • the teacher model or student model shown in FIG4c can be used as the first AI model in the first communication device to perform noise perturbation processing.
  • noise can be added to some or all model parameters of the teacher model or student model, thereby increasing the robustness of the continuous knowledge distillation process to distribution changes.
  • the noise perturbation method for some or all model parameters of the teacher model or student model can refer to the description of the previous example.
  • the first communication device when the second condition is met, performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model; wherein the second condition includes at least one of the following:
  • the training accuracy of the at least one AI model is lower than a first threshold
  • the test accuracy of the at least one AI model is lower than a second threshold
  • the system performance of the AI model system in which the at least one AI model resides is lower than a third threshold
  • the system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold
  • a change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold
  • the neural network weight distribution of the at least one AI model is greater than a sixth threshold
  • the change in the neural network weight of the at least one AI model is lower than a seventh threshold.
  • the first communication device may determine the values of the first to seventh thresholds in a variety of ways.
  • one or more of the first to seventh thresholds may be preconfigured.
  • one or more of the first to seventh thresholds may be configured for the first communication device by another device (e.g., a second communication device, a network device, or a server).
  • the first communication device may determine that the current operation of the first AI model may be abnormal. To this end, the first communication device may process the model parameters of the first AI model to obtain updated model parameters of the first AI model to update the first AI model. In this way, the problem of the first AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.
  • the first communication device can perform noise perturbation processing on the model parameters of the first AI model to obtain updated model parameters of the first AI model.
  • the computing power of the communication node can be applied to AI model processing, and the AI model can be updated based on the performance monitoring information of the AI model.
  • the process of the first communication device performing the AI model parameter update is triggered based on the performance monitoring information of at least one AI model, and during the AI model parameter update process, the first communication device obtains the updated AI model parameters through noise perturbation processing.
  • the solution can be applied to the scenario of continuous learning, and by performing noise perturbation processing on the AI model parameters, the problem of reduced or even disappearance of the plasticity of the AI model during long-term continuous learning can be slowed down or avoided, which can improve the adaptability of continuous learning and enhance the robustness of continuous learning.
  • the method further includes: the first communication device receives indication information indicating a third parameter, and the third parameter is used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model.
  • the first communication device can also receive the third parameter determined by the received indication information, and perform noise perturbation processing on the model parameters of the first AI model based on the third parameter, that is, the first communication device can determine the third parameter based on the indication of other communication devices (such as the recipient of the second information), so that the first communication device can obtain the noise perturbation processing through the specified third parameter.
  • the second communication device may send indication information indicating the third parameter to the first communication device based on the first information, that is, the second communication device may send indication information indicating the third parameter under the condition that it is determined that the above-mentioned second condition is met, so that the first communication device obtains the third parameter and executes S302.
  • the indication information indicating the third parameter may include at least one of the third parameter and an index of the third parameter (the index may be implemented in a manner shown in Table 2 or Table 3 below).
  • the third parameter can be used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model.
  • the third parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter.
  • the third parameter can be the set described above. ⁇ , One or more of .
  • the method further includes: the first communication device sending request information for requesting the third parameter.
  • the first communication device can determine the third parameter through an interaction between the request information and the indication information.
  • the first communication device can send the request information for requesting the third parameter based on the first information. That is, the first communication device can send the request information for requesting the third parameter upon determining that the second condition is met, thereby obtaining the third parameter and executing S302.
  • the third parameter is a preconfigured parameter.
  • the third parameter includes the set
  • the set This can be determined from Table 2 below.
  • can be determined by the following Table 3.
  • the first communication device can perform noise perturbation processing on the model parameters of the first AI model deployed on the first communication device based on the performance monitoring information of at least one AI model to improve the robustness of continuous learning.
  • the at least one AI model can be associated with the first AI model or associated with the second AI model deployed on the second communication device.
  • the first communication device can also perform noise perturbation processing on the model parameters of the second AI model deployed on the second communication device based on the performance monitoring information of at least one AI model to improve the robustness of continuous learning.
  • the process shown in Figure 5 will be described below.
  • FIG5 is another schematic diagram of the communication method provided in this application.
  • a second communication device sends first information, and a first communication device receives the first information accordingly.
  • the first information is used to indicate performance monitoring information of at least one AI model associated with a first AI model deployed on the first communication device.
  • S501 is optional.
  • the first communication device in addition to obtaining the first information through interaction with the second communication device, the first communication device can also obtain the first information through other means.
  • the specific implementation process please refer to the above S301 and related descriptions.
  • the first communication device sends second information based on the first information, and the second communication device receives the second information accordingly.
  • the second information is obtained by performing noise perturbation processing based on the first parameter, and the second information is used to update a second AI model; the second AI model is deployed on the second communication device.
  • the first parameter includes one or more of the following: model parameters of the updated second AI model, gradient information used to update the second AI model, intermediate feature information used to update the second AI model, intermediate gradient information used to update the second AI model, inference results of the first AI model, distillation loss information corresponding to the second AI model, and task loss information of the second AI model.
  • the second information sent by the first communication device in S502 may be implemented in a variety of ways, which will be described in detail below.
  • the second information is obtained by processing a quantized result of the first parameter.
  • the second communication device when the second information is obtained by processing based on the quantization result of the first parameter, after the second communication device receives the second information in S502, the second communication device can perform analysis processing on the received second information (for example, the analysis processing may include one or more of channel equalization, demodulation processing, decoding processing, digital domain processing, etc.), and update the second AI model based on the result of the analysis processing.
  • the analysis processing may include one or more of channel equalization, demodulation processing, decoding processing, digital domain processing, etc.
  • the second information is obtained by processing the transmission parameters of the first parameter.
  • the implementation process can be understood as an over-the-air calculation process, that is, after the second communication device receives the second information in S502, the second communication device may not perform digital domain processing on the received second information, but directly update the second AI model based on the original information of the received second information.
  • the first communication device converts one or more parameter values contained in the first parameter into one or more first symbols; after performing noise perturbation processing on the one or more first symbols to obtain one or more second symbols, the one or more second symbols are mapped to air interface resources (e.g., at least one of time domain resources, frequency domain resources, and air domain resources) to generate the second information.
  • air interface resources e.g., at least one of time domain resources, frequency domain resources, and air domain resources
  • the first communication device does not perform source encoding on the first parameter at the application layer and then send it back to the physical layer as a bit stream, which is then channel-coded and symbol-modulated by the physical layer and then generated and sent as a signal, but converts the first parameter into one or more symbols before transmitting. Therefore, the method of transmitting the first parameter on the wireless air interface can increase the transmission speed of the first parameter, thereby improving communication efficiency, and increasing the processing speed of the second communication device for model update based on the second information.
  • the transmission parameters of the first parameter may include one or more of a transmit power parameter, a precoding parameter, and an indication of whether to use method 2 for processing (the indication information can be used to control whether one or more communication devices use method 2 to transmit information, so as to control the signal-to-noise ratio of the signal after superposition in the air by controlling the number of accessed communication devices).
  • the first parameter may include the model parameter of the updated second AI model, which is used to update one or more items of the gradient information of the second AI model.
  • the first communication device may be a network device.
  • the first communication device may, during the process of issuing the public model (i.e., the first parameters include the model parameters or update gradients of the public model), perform noise perturbation processing on the model parameters or update gradients of the public model to obtain second information, and transmit the second information in S502.
  • the first communication device can be any terminal device.
  • the first communication device can perform noise perturbation processing on the model parameters or update gradients of the personalized model to obtain second information, and transmit the second information in S502.
  • lossless transmission can be changed to lossy transmission, that is, the model parameters or update gradients will be superimposed with transmission noise during uploading and downloading to achieve the effect of noise disturbance processing.
  • the first parameter may include intermediate feature information for updating the second AI model, and one or more items of intermediate gradient information for updating the second AI model.
  • the first communication device can add noise disturbance when transmitting the intermediate features and intermediate gradients, so that the transmission mode of the intermediate features and intermediate gradients of the encoder or decoder is changed from lossless transmission to lossy transmission, that is, the intermediate features and intermediate gradients will be superimposed with transmission noise during uploading/downloading to achieve the effect of noise disturbance processing.
  • the first parameter may include one or more of the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.
  • These inputs can be intermediate features of the AI model or the output of the AI model.
  • the distillation loss can be calculated on the same node as the teacher model, on the same node as the student model, or separately on a different node. Therefore, for knowledge distillation, lossy transmission can be used to transmit ft and/or fs to achieve the effect of noise perturbation.
  • the first communication device when a first condition is met, sends the second information based on the first information; wherein the first condition includes at least one of the following:
  • the training accuracy of the at least one AI model is lower than an eighth threshold
  • the test accuracy of the at least one AI model is lower than a ninth threshold
  • the system performance of the AI model system in which the at least one AI model resides is lower than a tenth threshold
  • the system performance of the communication system in which the communication device deploys the at least one AI model is located is lower than an eleventh threshold
  • the change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold
  • the neural network weight distribution of the at least one AI model is greater than a thirteenth threshold
  • the change in the neural network weight of the at least one AI model is lower than a fourteenth threshold.
  • the first condition and the second condition may be the same, for example, the i-th (i is 1 to 7) threshold value from the first to the seventh threshold value is the same as the i-th threshold value from the eighth to the fourteenth threshold value.
  • the first condition and the second condition may be different, for example, the i-th (i is 1 to 7) threshold value from the first to seventh threshold values is partially different or completely different from the i-th threshold value from the eighth to fourteenth threshold values.
  • the first communication device may determine the values of the eighth to fourteenth thresholds in a variety of ways.
  • one or more of the eighth to fourteenth thresholds may be preconfigured.
  • one or more of the eighth to fourteenth thresholds may be configured for the first communication device by another device (e.g., a second communication device, a network device, or a server).
  • the first communication device when the first condition is met, can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.
  • the first communication device can send the second information obtained by performing noise perturbation processing based on the first parameter based on the first information, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by performing noise perturbation processing based on the first parameter, and update the second AI model based on the second information.
  • the problem of reduced or even disappearance of the plasticity of the second AI model during long-term continuous learning can be slowed down or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.
  • S302 shown in FIG5 can be combined with the method shown in FIG3 above.
  • S502 shown in FIG5 can also be executed.
  • the execution order of S302 and S502 is not limited. That is, S302 can be executed first and then S502, or S502 can be executed first and then S302.
  • the method further includes: the first communication device receiving indication information indicating a second parameter, the second parameter being used to process the first parameter to obtain the second information.
  • the first communication device may also determine the second parameter based on the received indication information, and process the first parameter based on the second parameter to obtain the second information. That is, the first communication device may determine the second parameter based on an indication from another communication device (e.g., a recipient of the second information), so that the recipient of the second information can obtain the corresponding second information using the specified second parameter.
  • the second communication device may send indication information indicating the second parameter to the first communication device based on the first information, that is, the second communication device may send indication information indicating the second parameter under the condition that it is determined that the above-mentioned first condition is met, so that the first communication device obtains the second parameter and executes S502.
  • the indication information indicating the second parameter may include at least one of the second parameter and an index of the second parameter (the index may be implemented in a manner as shown in Table 2 or Table 3 above).
  • the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.
  • the method further includes: the first communication device sending request information for requesting the second parameter.
  • the first communication device can determine the second parameter through an interaction between the request information and the indication information.
  • the first communication device can send the request information for requesting the second parameter based on the first information. That is, the first communication device can send the request information for requesting the second parameter upon determining that the first condition is met, thereby obtaining the second parameter and executing S502.
  • the second parameter is a preconfigured parameter.
  • an embodiment of the present application provides a communication device 600.
  • This communication device 600 can implement the functions of the first communication device (or second communication device) in the above-described method embodiment, thereby also achieving the beneficial effects of the above-described method embodiment.
  • the communication device 600 can be the first communication device (or second communication device), or it can be an integrated circuit or component within the first communication device (or second communication device), such as a chip, a baseband chip, a modem chip, a SoC chip including a modem core, a system-in-package (SIP) chip, a communication module, a chip system, a processor, etc.
  • SIP system-in-package
  • the transceiver unit 602 may include a sending unit and a receiving unit, which are respectively used to perform sending and receiving.
  • the communication device 600 when the communication device 600 is used to execute the method executed by the first communication device in Figure 3 and related embodiments, the communication device 600 includes a processing unit 601; the processing unit 601 is used to obtain first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with the first AI model deployed in the first communication device; the processing unit 601 is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • first information which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with the first AI model deployed in the first communication device
  • the processing unit 601 is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • the communication device 600 when the communication device 600 is used to execute the method executed by the first communication device in Figure 5 and related embodiments, the communication device 600 includes a processing unit 601; the processing unit 601 is used to obtain first information, and the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in a second communication device; the transceiver unit 602 is used to send second information based on the first information, and the second information is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.
  • the communication device 600 when the communication device 600 is used to execute the method executed by the second communication device in Figure 5 and related embodiments, the communication device 600 includes a processing unit 601; the transceiver unit 602 is used to receive second information, which is obtained by noise disturbance processing based on the first parameter; wherein the second information is used to update the second AI model deployed in the second communication device; the processing unit 601 is used to update the second AI model based on the second information.
  • the functions of the processing unit 601 can be implemented by one or more processors.
  • the processor can include a modem chip, or a SoC chip or SIP chip containing a modem core.
  • the functions of the transceiver unit 602 can be implemented by a transceiver circuit.
  • the communication device 600 when the communication device 600 is a circuit or chip responsible for communication functions in a terminal, such as a modem chip or a SoC chip or SIP chip containing a modem core, the functions of the processing unit 601 can be implemented by a circuit system including one or more processors or processor cores in the above chip.
  • the functions of the transceiver unit 602 can be implemented by an interface circuit or data transceiver circuit on the above chip.
  • Fig. 7 is another schematic structural diagram of a communication device 700 provided in this application.
  • the communication device 700 includes a logic circuit 701 and an input/output interface 702.
  • the communication device 700 may be a chip or an integrated circuit.
  • the transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the input/output interface 702 in Figure 7 , which may include an input interface and an output interface.
  • the communication interface may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.
  • the logic circuit 701 when the communication device 700 is used to execute the method executed by the first communication device in Figure 3 and related embodiments, the logic circuit 701 is used to obtain first information, where the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with the first AI model deployed in the first communication device; the logic circuit 701 is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.
  • the logic circuit 701 is used to obtain first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model deployed in a second communication device; the input and output interface 702 is used to send second information based on the first information, which is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.
  • the input-output interface 702 is used to receive second information, which is obtained by noise disturbance processing based on the first parameter; wherein the second information is used to update the second AI model deployed in the second communication device; and the logic circuit 701 is used to update the second AI model based on the second information.
  • the logic circuit 701 and the input/output interface 702 may also execute other steps executed by the first communication device or the second communication device in any embodiment and achieve corresponding beneficial effects, which will not be described in detail here.
  • the processing unit 601 shown in FIG. 6 may be the logic circuit 701 in FIG. 7 .
  • the logic circuit 701 may be a processing device, and the functions of the processing device may be partially or entirely implemented by software.
  • the functions of the processing device may be partially or entirely implemented by software.
  • the processing device may include a memory and a processor, wherein the memory is used to store a computer program, and the processor reads and executes the computer program stored in the memory to perform corresponding processing and/or steps in any one of the method embodiments.
  • the processing device may include only a processor.
  • a memory for storing the computer program is located outside the processing device, and the processor is connected to the memory via circuits/wires to read and execute the computer program stored in the memory.
  • the memory and processor may be integrated or physically separate.
  • the processing device may be one or more chips, or one or more integrated circuits.
  • the processing device may be one or more field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), system-on-chips (SoCs), central processing units (CPUs), network processors (NPs), digital signal processing circuits (DSPs), microcontrollers (MCUs), programmable logic devices (PLDs), or other integrated chips, or any combination of the above chips or processors.
  • FPGAs field-programmable gate arrays
  • ASICs application-specific integrated circuits
  • SoCs system-on-chips
  • CPUs central processing units
  • NPs network processors
  • DSPs digital signal processing circuits
  • MCUs microcontrollers
  • PLDs programmable logic devices
  • FIG 8 shows the communication device 800 involved in the above-mentioned embodiments provided in an embodiment of the present application.
  • the communication device 800 can specifically be a communication device serving as a terminal device in the above-mentioned embodiments.
  • the example shown in Figure 8 is that the terminal device is implemented through the terminal device (or a component in the terminal device).
  • the communication device 800 may include but is not limited to at least one processor 801 and a communication port 802 .
  • the transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the communication port 802 shown in Figure 8 , which may include an input interface and an output interface.
  • the communication port 802 may also be a transceiver circuit, which may include an input interface circuit and an output interface circuit.
  • the communication device 800 may also include at least one of a memory 803 and a bus 804.
  • the at least one processor 801 is used to control and process the actions of the communication device 800.
  • the processor 801 may be a central processing unit (CPU), a general-purpose processor (GPPC), a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or other programmable logic device (PLD), a transistor logic device (TLD), a hardware component, or any combination thereof. It may implement or execute the various exemplary logic blocks, modules, and circuits described in conjunction with the disclosure of this application.
  • the processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, and the like.
  • the communication device 800 shown in Figure 8 can be specifically used to implement the steps implemented by the terminal device in the aforementioned method embodiment and achieve the corresponding technical effects of the terminal device.
  • the specific implementation methods of the communication device shown in Figure 8 can refer to the description in the aforementioned method embodiment and will not be repeated here.
  • FIG. 9 is a structural diagram of the communication device 900 involved in the above-mentioned embodiments provided in an embodiment of the present application.
  • the communication device 900 can specifically be a communication device as a network device in the above-mentioned embodiments.
  • the example shown in Figure 9 is that the network device is implemented through the network device (or a component in the network device), wherein the structure of the communication device can refer to the structure shown in Figure 9.
  • the communication device 900 includes at least one processor 911 and at least one network interface 914. Further optionally, the communication device also includes at least one memory 912, at least one transceiver 913 and one or more antennas 915.
  • the processor 911, the memory 912, the transceiver 913 and the network interface 914 are connected, for example, via a bus. In an embodiment of the present application, the connection may include various interfaces, transmission lines or buses, etc., which are not limited in this embodiment.
  • the antenna 915 is connected to the transceiver 913.
  • the network interface 914 is used to enable the communication device to communicate with other communication devices through a communication link.
  • the network interface 914 may include a network interface between the communication device and the core network device, such as an S1 interface, and the network interface may include a network interface between the communication device and other communication devices (such as other network devices or core network devices), such as an X2 or Xn interface.
  • the transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the network interface 914 shown in Figure 9.
  • the network interface 914 may include an input interface and an output interface.
  • the network interface 914 may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.
  • Processor 911 is primarily used to process communication protocols and communication data, control the entire communication device, execute software programs, and process software program data, for example, to support the communication device in performing the actions described in the embodiments.
  • the communication device may include a baseband processor and a central processing unit.
  • the baseband processor is primarily used to process communication protocols and communication data, while the central processing unit is primarily used to control the entire terminal device, execute software programs, and process software program data.
  • Processor 911 in Figure 9 may integrate the functions of both a baseband processor and a central processing unit. Those skilled in the art will appreciate that the baseband processor and the central processing unit may also be independent processors interconnected via a bus or other technology.
  • a terminal device may include multiple baseband processors to accommodate different network standards, multiple central processing units to enhance its processing capabilities, and various components of the terminal device may be connected via various buses.
  • the baseband processor may also be referred to as a baseband processing circuit or a baseband processing chip.
  • the central processing unit may also be referred to as a central processing circuit or a central processing chip.
  • the functionality for processing communication protocols and communication data may be built into the processor or stored in memory as a software program, which is executed by the processor to implement the baseband processing functionality.
  • the memory is primarily used to store software programs and data.
  • Memory 912 can exist independently and be connected to processor 911. Alternatively, memory 912 and processor 911 can be integrated together, for example, within a single chip.
  • Memory 912 can store program code for executing the technical solutions of the embodiments of the present application, and execution is controlled by processor 911. The various computer program codes executed can also be considered drivers for processor 911.
  • Figure 9 shows only one memory and one processor. In an actual terminal device, there may be multiple processors and multiple memories.
  • the memory may also be referred to as a storage medium or a storage device.
  • the memory may be a storage element on the same chip as the processor, i.e., an on-chip storage element, or an independent storage element, which is not limited in the present embodiment.
  • the transceiver 913 can be used to support the reception or transmission of radio frequency signals between the communication device and the terminal, and the transceiver 913 can be connected to the antenna 915.
  • the transceiver 913 includes a transmitter Tx and a receiver Rx. Specifically, one or more antennas 915 can receive radio frequency signals.
  • the receiver Rx of the transceiver 913 is used to receive the radio frequency signal from the antenna, convert the radio frequency signal into a digital baseband signal or a digital intermediate frequency signal, and provide the digital baseband signal or digital intermediate frequency signal to the processor 911 so that the processor 911 can further process the digital baseband signal or digital intermediate frequency signal, such as demodulation and decoding.
  • the transmitter Tx in the transceiver 913 is also used to receive a modulated digital baseband signal or digital intermediate frequency signal from the processor 911, convert the modulated digital baseband signal or digital intermediate frequency signal into a radio frequency signal, and send the radio frequency signal through one or more antennas 915.
  • the receiver Rx can selectively perform one or more stages of down-mixing and analog-to-digital conversion on the RF signal to obtain a digital baseband signal or a digital intermediate frequency signal.
  • the order of the down-mixing and analog-to-digital conversion processes is adjustable.
  • the transmitter Tx can selectively perform one or more stages of up-mixing and digital-to-analog conversion on the modulated digital baseband signal or digital intermediate frequency signal to obtain a RF signal.
  • the order of the up-mixing and digital-to-analog conversion processes is adjustable.
  • the digital baseband signal and the digital intermediate frequency signal may be collectively referred to as digital signals.
  • the transceiver 913 may also be referred to as a transceiver unit, a transceiver, a transceiver device, etc.
  • a device in the transceiver unit that implements a receiving function may be referred to as a receiving unit
  • a device in the transceiver unit that implements a transmitting function may be referred to as a transmitting unit. That is, the transceiver unit includes a receiving unit and a transmitting unit.
  • the receiving unit may also be referred to as a receiver, an input port, a receiving circuit, etc.
  • the transmitting unit may be referred to as a transmitter, a transmitter, or a transmitting circuit, etc.
  • the communication device 900 shown in Figure 9 can be specifically used to implement the steps implemented by the network device in the aforementioned method embodiment and achieve the corresponding technical effects of the network device.
  • the specific implementation methods of the communication device 900 shown in Figure 9 can refer to the description in the aforementioned method embodiment and will not be repeated here.
  • FIG10 is a schematic structural diagram of the communication device involved in the above-mentioned embodiment provided in an embodiment of the present application.
  • the communication device 100 includes, for example, modules, units, elements, circuits, or interfaces, which are appropriately configured together to implement the technical solutions provided in this application.
  • the communication device 100 can be the terminal device or network device described above, or a component (such as a chip) in these devices, used to implement the method described in the following method embodiment.
  • the communication device 100 includes one or more processors 101.
  • the processor 101 can be a general-purpose processor or a dedicated processor.
  • it can be a baseband processor or a central processing unit.
  • the baseband processor can be used to process communication protocols and communication data
  • the central processing unit can be used to control the communication device (such as a RAN node, terminal, or chip, etc.), execute software programs, and process data of software programs.
  • the processor 101 may include a program 103 (sometimes also referred to as code or instructions), which may be executed on the processor 101 to cause the communication device 100 to perform the methods described in the following embodiments.
  • the communication device 100 includes circuitry (not shown in FIG10 ).
  • the communication device 100 may include one or more memories 102 on which a program 104 (sometimes also referred to as code or instructions) is stored.
  • the program 104 can be run on the processor 101, so that the communication device 100 executes the method described in the above method embodiment.
  • the processor 101 and/or the memory 102 may include AI modules 107 and 108, which are used to implement AI-related functions.
  • the AI module may be implemented through software, hardware, or a combination of software and hardware.
  • the AI module may include a wireless intelligent control (RIC) module.
  • the AI module may be a near-real-time RIC or a non-real-time RIC.
  • data may be stored in the processor 101 and/or the memory 102.
  • the processor and the memory may be provided separately or integrated together.
  • the communication device 100 may further include a transceiver 105 and/or an antenna 106.
  • the processor 101 may also be sometimes referred to as a processing unit, and controls the communication device (e.g., a RAN node or terminal).
  • the transceiver 105 may also be sometimes referred to as a transceiver unit, a transceiver, a transceiver circuit, or a transceiver, and is configured to implement the transceiver functions of the communication device through the antenna 106.
  • the processing unit 601 shown in FIG6 may be the processor 101.
  • the transceiver unit 602 shown in FIG6 may be a communication interface, which may be the transceiver 105 shown in FIG10 .
  • the transceiver 105 may include an input interface and an output interface.
  • the transceiver 105 may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.
  • An embodiment of the present application further provides a computer-readable storage medium, which is used to store one or more computer-executable instructions.
  • the processor executes the method described in the possible implementation methods of the first communication device or the second communication device in the aforementioned embodiment.
  • An embodiment of the present application also provides a computer program product (or computer program).
  • the processor executes the method that may be implemented by the above-mentioned first communication device or second communication device.
  • An embodiment of the present application also provides a chip system, which includes at least one processor for supporting a communication device to implement the functions involved in the possible implementation methods of the above-mentioned communication device.
  • the chip system also includes an interface circuit, which provides program instructions and/or data to the at least one processor.
  • the chip system may also include a memory, which is used to store the necessary program instructions and data for the communication device.
  • the chip system can be composed of chips, or it can include chips and other discrete devices, wherein the communication device can specifically be the first communication device or the second communication device in the aforementioned method embodiment.
  • An embodiment of the present application further provides a communication system, wherein the network system architecture includes the first communication device and/or the second communication device in any of the above embodiments.
  • the disclosed systems, devices and methods can be implemented in other ways.
  • the device embodiments described above are merely schematic.
  • the division of the units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed.
  • Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms. Whether a function is performed in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.
  • the units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed across multiple network units. Some or all of these units may be selected to achieve the purpose of this embodiment according to actual needs.
  • the functional units in the various embodiments of the present application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or in the form of a software functional unit. If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or the contributing part or all or part of the technical solution can be embodied in the form of a software product.
  • the computer software product is stored in a storage medium and includes several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the method described in the various embodiments of the present application.
  • the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), disk or optical disk, and other media that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Monitoring And Testing Of Transmission In General (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

A communication method and a related apparatus. In the method, after a first communication apparatus acquires performance monitoring information of at least one AI model by means of first information, the first communication apparatus may perform noise perturbation processing on a model parameter of a first AI model, so as to obtain an updated model parameter of the first AI model. By using the method, the computing power of a communication node can be applied to the processing of an AI model, and the AI model can be updated by means of performance monitoring information of the AI model. In addition, by means of the process of performing noise perturbation processing on a parameter of the AI model, the problem of the plasticity of the AI model being reduced or even disappeared during a long-time continual learning process can be mitigated or avoided, the adaptability of continual learning can be improved, and the robustness of continual learning can also be improved.

Description

一种通信方法及相关装置A communication method and related device

本申请要求于2024年03月15日提交国家知识产权局、申请号为202410303028.6、申请名称为“一种通信方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application filed with the State Intellectual Property Office on March 15, 2024, with application number 202410303028.6 and application name “A communication method and related device”, the entire contents of which are incorporated by reference into this application.

技术领域Technical Field

本申请涉及通信领域,尤其涉及一种通信方法及相关装置。The present application relates to the field of communications, and in particular to a communication method and related devices.

背景技术Background Art

无线通信可以是两个或两个以上的通信节点间不经由导体或缆线传播而进行的传输通讯,该通信节点一般包括网络设备和终端设备。Wireless communication can be a transmission communication between two or more communication nodes without propagating through conductors or cables. The communication nodes generally include network devices and terminal devices.

目前,在无线通信系统中,通信节点一般具备信号收发能力和计算能力。以具备计算能力的网络设备为例,网络设备的计算能力主要是为信号收发能力提供算力支持(例如:对信号进行发送处理和接收处理),以实现网络设备与其它通信节点的通信。Currently, in wireless communication systems, communication nodes generally possess both signal transceiver capabilities and computing capabilities. For example, network devices with computing capabilities primarily provide computing power to support signal transceiver capabilities (e.g., processing both sending and receiving signals), enabling communication between the network device and other communication nodes.

然而,在通信网络中,通信节点的计算能力除了为上述通信任务提供算力支持之外,还可能具备富余的计算能力。为此,如何利用这些计算能力,是一个亟待解决的技术问题。However, in communication networks, communication nodes may have excess computing power beyond just supporting the aforementioned communication tasks. Therefore, how to utilize this computing power is a pressing technical issue.

发明内容Summary of the Invention

本申请提供了一种通信方法及相关装置,用于提升持续学习的鲁棒性。The present application provides a communication method and related devices for improving the robustness of continuous learning.

本申请第一方面提供了一种通信方法,该方法应用于第一通信装置,该第一通信装置可以是通信设备(如终端设备或网络设备),或者,该第一通信装置可以是通信设备中的部分组件(例如处理器、芯片或芯片系统等),或者该第一通信装置还可以是能实现全部或部分通信设备功能的逻辑模块或软件。在该方法中,第一通信装置获取第一信息,该第一信息用于指示至少一个人工智能(artificial intelligence,AI)模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型;该第一通信装置基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。In a first aspect, the present application provides a communication method, which is applied to a first communication device. The first communication device may be a communication device (such as a terminal device or a network device), or the first communication device may be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the first communication device may also be a logic module or software that can realize all or part of the functions of the communication device. In this method, the first communication device obtains first information, and the first information is used to indicate performance monitoring information of at least one artificial intelligence (AI) model, and the at least one AI model is associated with a first AI model deployed on the first communication device; the first communication device performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain the updated model parameters of the first AI model.

基于上述方案,第一通信装置通过第一信息获取至少一个AI模型的性能监测信息之后,该第一通信装置可以对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。通过这种方式,能够使得通信节点的算力能够应用于AI模型的处理,并且,通过AI模型的性能检测信息能够实现对AI模型的更新。Based on the above solution, after the first communication device obtains performance monitoring information of at least one AI model through the first information, the first communication device can perform noise perturbation processing on the model parameters of the first AI model to obtain updated model parameters of the first AI model. In this way, the computing power of the communication node can be applied to AI model processing, and the AI model can be updated based on the performance monitoring information of the AI model.

此外,第一通信装置执行AI模型参数更新的过程是基于至少一个AI模型的性能监测信息触发的,并且,在该AI模型参数更新的过程中,第一通信装置是通过加噪扰动处理得到更新后的AI模型参数的。通过这种方式,使得方案能够应用于持续学习的场景,并且,通过对AI模型参数进行加噪扰动处理的过程,可以减缓或避免AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。In addition, the process of the first communication device performing the AI model parameter update is triggered based on the performance monitoring information of at least one AI model, and during the AI model parameter update process, the first communication device obtains the updated AI model parameters through noise perturbation processing. In this way, the solution can be applied to the scenario of continuous learning, and by performing noise perturbation processing on the AI model parameters, the problem of reduced or even disappearance of the plasticity of the AI model during long-term continuous learning can be slowed down or avoided, which can improve the adaptability of continuous learning and enhance the robustness of continuous learning.

本申请中,AI模型,可以替换为其它术语,例如神经网络、神经网络模型、AI神经网络模型、机器学习模型、或AI处理模型等。In this application, AI model can be replaced by other terms, such as neural network, neural network model, AI neural network model, machine learning model, or AI processing model.

本申请中,加噪扰动处理,可以替换为其它术语,例如加噪处理、扰动处理、噪声扰动处理、噪声干扰处理、或随机扰动处理等。In this application, noise disturbance processing can be replaced by other terms, such as noise addition processing, disturbance processing, noise disturbance processing, noise interference processing, or random disturbance processing.

在第一方面的一种可能的实现方式中,该方法还包括:该第一通信装置基于该第一信息发送第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新第二AI模型;其中,该第二AI模型用于部署在第二通信装置。In a possible implementation of the first aspect, the method further includes: the first communication device sending second information based on the first information, the second information being obtained by performing noise disturbance processing based on the first parameter, and the second information being used to update a second AI model; wherein the second AI model is used to be deployed in the second communication device.

基于上述方案,第一通信装置还可以基于第一信息发送基于第一参数进行加噪扰动处理得到的第二信息,使得该第二信息的接收方(例如第二通信装置)能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, the first communication device can also send second information obtained by performing noise perturbation processing based on the first parameter based on the first information, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by performing noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of reduced or even disappearance of the plasticity of the second AI model during long-term continuous learning can be alleviated or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.

在第一方面的一种可能的实现方式中,该至少一个AI模型包括该第一AI模型和/或第二AI模型;其中,该第一AI模型关联于第二AI模型,该第二AI模型用于部署在第二通信装置。In a possible implementation manner of the first aspect, the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device.

基于上述方案,第一信息用于指示至少一个AI模型的性能监测信息,并且,该第一信息指示的性能监测信息满足某个条件(例如后文描述的第一条件和/或第二条件)的情况下,该第一信息可以用于触发第一AI模型和/或第二AI模型的模型参数的加噪扰动处理。相应的,该至少一个AI模型可以包括该第一AI模型和/或第二AI模型,通过这种方式,第一通信装置能够基于存在关联关系的AI模型的性能触发AI模型的加噪扰动处理。Based on the above solution, the first information is used to indicate performance monitoring information of at least one AI model, and when the performance monitoring information indicated by the first information satisfies a certain condition (e.g., the first condition and/or the second condition described below), the first information can be used to trigger noise perturbation processing of model parameters of the first AI model and/or the second AI model. Accordingly, the at least one AI model may include the first AI model and/or the second AI model. In this way, the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.

可选地,该第一AI模型关联于该第二AI模型,包括下述任一项:Optionally, the first AI model is associated with the second AI model, including any of the following:

该第一AI模型为公共模型且该第二AI模型为个性化模型;The first AI model is a public model and the second AI model is a personalized model;

该第一AI模型为个性化模型且该第二AI模型为公共模型;The first AI model is a personalized model and the second AI model is a public model;

该第一AI模型的输入包括该第二AI模型的输出;The input of the first AI model includes the output of the second AI model;

该第二AI模型的输入包括该第一AI模型的输出;The input of the second AI model includes the output of the first AI model;

该第一AI模型为教师模型且该第二AI模型为学生模型;或,The first AI model is a teacher model and the second AI model is a student model; or,

该第一AI模型为学生型且该第二AI模型为教师模型。The first AI model is a student model and the second AI model is a teacher model.

在第一方面的一种可能的实现方式中,该第二信息是基于该第一参数的量化结果进行处理得到的,或,该第二信息是对该第一参数的传输参数进行处理得到的。In a possible implementation manner of the first aspect, the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.

基于上述方案,第二信息可以通过多种方式进行加噪扰动处理得到的,以提升方案实现的灵活性。Based on the above solution, the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.

可选地,该第一参数包括以下一项或多项:Optionally, the first parameter includes one or more of the following:

更新后的第二AI模型的模型参数,用于更新该第二AI模型的梯度信息,用于更新该第二AI模型的中间特征信息,用于更新该第二AI模型的中间梯度信息,该第一AI模型的推理结果,该第二AI模型对应的蒸馏损失信息,该第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.

在第一方面的一种可能的实现方式中,第一通信装置基于该第一信息发送第二信息,包括:满足第一条件时,该第一通信装置基于该第一信息发送第二信息;其中,该第一条件包括以下至少一项:In a possible implementation of the first aspect, the first communication device sending the second information based on the first information includes: when a first condition is met, the first communication device sending the second information based on the first information; wherein the first condition includes at least one of the following:

该至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold;

该至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold;

该至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model resides is lower than a third threshold;

部署该至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold;

该至少一个AI模型的神经网络输入输出分布变化大于第五阈值;A change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold;

该至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold;

该至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold.

基于上述方案,在满足第一条件时,第一通信装置可以确定第二AI模型当前的运行可能存在异常,为此,该第一通信装置可以发送第二信息,使得第二通信装置能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, when the first condition is met, the first communication device can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain the second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.

在第一方面的一种可能的实现方式中,该方法还包括:该第一通信装置接收指示第二参数的指示信息,该第二参数用于对该第一参数进行处理得到该第二信息。In a possible implementation manner of the first aspect, the method further includes: the first communication device receiving indication information indicating a second parameter, where the second parameter is used to process the first parameter to obtain the second information.

基于上述方案,第一通信装置还可以通过接收的指示信息确定该第二参数,并基于该第二参数对第一参数进行处理得到第二信息,即该第一通信装置可以基于其它通信装置(例如第二信息的接收方)的指示以实现第二参数的确定,以便于该第二信息的接收方能够通过指定的第二参数获得相应的第二信息。Based on the above scheme, the first communication device can also determine the second parameter through the received indication information, and process the first parameter based on the second parameter to obtain the second information, that is, the first communication device can determine the second parameter based on the indication of other communication devices (for example, the recipient of the second information), so that the recipient of the second information can obtain the corresponding second information through the specified second parameter.

应理解,第二参数可以用于对第一参数进行加噪扰动处理得到第二信息,相应的,该第二参数可以理解为加噪扰动参数,加噪参数,或扰动参数等。It should be understood that the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.

在第一方面的一种可能的实现方式中,该方法还包括:该第一通信装置发送用于请求该第二参数的请求信息。In a possible implementation manner of the first aspect, the method further includes: the first communication device sending request information for requesting the second parameter.

基于上述方案,第一通信装置可以通过请求信息和上述指示信息的交互过程,以实现第二参数的确定。Based on the above solution, the first communication device can determine the second parameter through an interactive process of the request information and the above indication information.

可选地,该第二参数为预配置的参数。Optionally, the second parameter is a preconfigured parameter.

在第一方面的一种可能的实现方式中,该方法还包括:该第一通信装置接收指示第三参数的指示信息,该第三参数用于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数。In a possible implementation of the first aspect, the method further includes: the first communication device receiving indication information indicating a third parameter, where the third parameter is used to process the model parameters of the first AI model to obtain updated model parameters of the first AI model.

基于上述方案,第一通信装置还可以接收通过接收的指示信息确定该第三参数,并基于该第三参数对第一AI模型的模型参数进行加噪扰动处理,即该第一通信装置可以基于其它通信装置(例如第二信息的接收方)的指示以实现第三参数的确定,以便于该第一通信装置能够通过指定的第三参数获得实现加噪扰动处理。Based on the above scheme, the first communication device can also receive the third parameter determined by the received indication information, and perform noise disturbance processing on the model parameters of the first AI model based on the third parameter, that is, the first communication device can determine the third parameter based on the indication of other communication devices (such as the recipient of the second information), so that the first communication device can obtain the noise disturbance processing through the specified third parameter.

应理解,第三参数可以用于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数,相应的,该第三参数可以理解为加噪扰动参数,加噪参数,或扰动参数等。It should be understood that the third parameter can be used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model. Accordingly, the third parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.

在第一方面的一种可能的实现方式中,该方法还包括:该第一通信装置发送用于请求该第三参数的请求信息。In a possible implementation manner of the first aspect, the method further includes: the first communication device sending request information for requesting the third parameter.

基于上述方案,第一通信装置可以通过请求信息和上述指示信息的交互过程,以实现第三参数的确定。Based on the above solution, the first communication device can determine the third parameter through an interactive process of the request information and the above indication information.

可选地,该第三参数为预配置的参数。Optionally, the third parameter is a preconfigured parameter.

在第一方面的一种可能的实现方式中,第一通信装置基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数,包括:满足第二条件时,该第一通信装置基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数;In a possible implementation of the first aspect, the first communications device performs noise perturbation processing on model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model, including: when a second condition is met, the first communications device performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model;

该第二条件包括以下至少一项:The second condition includes at least one of the following:

该至少一个AI模型的训练精度低于第八阈值;The training accuracy of the at least one AI model is lower than an eighth threshold;

该至少一个AI模型的测试精度低于第九阈值;The test accuracy of the at least one AI model is lower than a ninth threshold;

该至少一个AI模型所在的AI模型系统的系统性能低于第十阈值;The system performance of the AI model system in which the at least one AI model resides is lower than a tenth threshold;

部署该至少一个AI模型的通信装置所在的通信系统的系统性能低于第十一阈值;The system performance of the communication system in which the communication device deploys the at least one AI model is located is lower than an eleventh threshold;

该至少一个AI模型的神经网络输入输出分布变化大于第十二阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold;

该至少一个AI模型的神经网络权重分布大于第十三阈值;The neural network weight distribution of the at least one AI model is greater than a thirteenth threshold;

该至少一个AI模型的神经网络权重变化量低于第十四阈值。The change in the neural network weight of the at least one AI model is lower than a fourteenth threshold.

基于上述方案,在满足第二条件时,第一通信装置可以确定第一AI模型当前的运行可能存在异常,为此,该第一通信装置可以于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数,以实现对第一AI模型的更新。通过这种方式,可以减缓或避免第一AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, when the second condition is met, the first communication device can determine that the current operation of the first AI model may be abnormal. To this end, the first communication device can process the model parameters of the first AI model to obtain updated model parameters of the first AI model to update the first AI model. In this way, the problem of the first AI model's plasticity being reduced or even eliminated during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.

在第一方面的一种可能的实现方式中,该第一通信装置获取第一信息,包括:该第一通信装置接收该第一信息;或,该第一通信装置获取该第一AI模型和/或该第二AI模型的测量参数,并基于该第一AI模型和/或该第二AI模型的测量参数确定该第一信息。In a possible implementation of the first aspect, the first communication device obtains the first information, including: the first communication device receives the first information; or, the first communication device obtains measurement parameters of the first AI model and/or the second AI model, and determines the first information based on the measurement parameters of the first AI model and/or the second AI model.

基于上述方案,第一通信装置可以通过上述多种方式获得第一信息,以提升方案实现的灵活性。Based on the above solution, the first communication device can obtain the first information through the above multiple methods to improve the flexibility of implementing the solution.

本申请第二方面提供了一种通信方法,该方法应用于第一通信装置,该第一通信装置可以是通信设备(如终端设备或网络设备),或者,该第一通信装置可以是通信设备中的部分组件(例如处理器、芯片或芯片系统等),或者该第一通信装置还可以是能实现全部或部分通信设备功能的逻辑模块或软件。在该方法中,第一通信装置获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;该第一通信装置基于该第一信息发送第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新该第二AI模型。The second aspect of the present application provides a communication method, which is applied to a first communication device, which may be a communication device (such as a terminal device or a network device), or the first communication device may be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the first communication device may also be a logic module or software that can realize all or part of the functions of the communication device. In this method, the first communication device obtains first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in the second communication device; the first communication device sends second information based on the first information, which is obtained by performing noise perturbation processing based on the first parameter, and the second information is used to update the second AI model.

基于上述方案,第一通信装置通过第一信息获取至少一个AI模型的性能监测信息之后,该第一通信装置可以发送基于第一参数进行加噪扰动处理得到的第二信息,使得该第二信息的接收方(例如第二通信装置)能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, after the first communication device obtains the performance monitoring information of at least one AI model through the first information, the first communication device can send the second information obtained by adding noise and perturbation based on the first parameter, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by adding noise and perturbation based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be slowed down or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.

在第二方面的一种可能的实现方式中,该至少一个AI模型包括该第一AI模型和/或第二AI模型;其中,该第一AI模型关联于第二AI模型,该第二AI模型用于部署在第二通信装置。In a possible implementation of the second aspect, the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device.

基于上述方案,第一信息用于指示至少一个AI模型的性能监测信息,并且,该第一信息用于触发指示的性能监测信息满足某个条件(例如后文描述的第一条件和/或第二条件)的情况下,该第一信息可以第一AI模型和/或第二AI模型的模型参数的加噪扰动处理。相应的,该至少一个AI模型可以包括该第一AI模型和/或第二AI模型,通过这种方式,第一通信装置能够基于存在关联关系的AI模型的性能触发AI模型的加噪扰动处理。Based on the above solution, the first information is used to indicate performance monitoring information of at least one AI model, and when the performance monitoring information indicated by the first information is used to trigger a certain condition (such as the first condition and/or the second condition described below), the first information can be used to perform noise perturbation processing on the model parameters of the first AI model and/or the second AI model. Accordingly, the at least one AI model may include the first AI model and/or the second AI model. In this way, the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.

可选地,该第一AI模型关联于该第二AI模型,包括下述任一项:Optionally, the first AI model is associated with the second AI model, including any of the following:

该第一AI模型为公共模型且该第二AI模型为个性化模型;The first AI model is a public model and the second AI model is a personalized model;

该第一AI模型为个性化模型且该第二AI模型为公共模型;The first AI model is a personalized model and the second AI model is a public model;

该第一AI模型的输入包括该第二AI模型的输出;The input of the first AI model includes the output of the second AI model;

该第二AI模型的输入包括该第一AI模型的输出;The input of the second AI model includes the output of the first AI model;

该第一AI模型为教师模型且该第二AI模型为学生模型;或,The first AI model is a teacher model and the second AI model is a student model; or,

该第一AI模型为学生型且该第二AI模型为教师模型。The first AI model is a student model and the second AI model is a teacher model.

在第二方面的一种可能的实现方式中,该第二信息是基于该第一参数的量化结果进行处理得到的,或,该第二信息是对该第一参数的传输参数进行处理得到的。In a possible implementation manner of the second aspect, the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.

基于上述方案,第二信息可以通过多种方式进行加噪扰动处理得到的,以提升方案实现的灵活性。Based on the above solution, the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.

可选地,该第一参数包括以下一项或多项:Optionally, the first parameter includes one or more of the following:

更新后的第二AI模型的模型参数,用于更新该第二AI模型的梯度信息,用于更新该第二AI模型的中间特征信息,用于更新该第二AI模型的中间梯度信息,该第一AI模型的推理结果,该第二AI模型对应的蒸馏损失信息,该第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.

在第二方面的一种可能的实现方式中,第一通信装置基于该第一信息发送第二信息,包括:满足第一条件时,该第一通信装置基于该第一信息发送第二信息;In a possible implementation of the second aspect, the first communication device sending the second information based on the first information includes: when a first condition is met, the first communication device sending the second information based on the first information;

该第一条件包括以下至少一项:The first condition includes at least one of the following:

该至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold;

该至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold;

该至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model resides is lower than a third threshold;

部署该至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold;

该至少一个AI模型的神经网络输入输出分布变化大于第五阈值;A change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold;

该至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold;

该至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold.

基于上述方案,在满足第一条件时,第一通信装置可以确定第二AI模型当前的运行可能存在异常,为此,该第一通信装置可以发送第二信息,使得第二通信装置能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, when the first condition is met, the first communication device can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain the second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.

在第二方面的一种可能的实现方式中,该方法还包括:该第一通信装置接收指示第二参数的指示信息,该第二参数用于对该第一参数进行处理得到该第二信息。In a possible implementation manner of the second aspect, the method further includes: the first communication device receiving indication information indicating a second parameter, where the second parameter is used to process the first parameter to obtain the second information.

基于上述方案,第一通信装置还可以通过接收的指示信息确定该第二参数,并基于该第二参数对第一参数进行处理得到第二信息,即该第一通信装置可以基于其它通信装置(例如第二信息的接收方)的指示以实现第二参数的确定,以便于该第二信息的接收方能够通过指定的第二参数获得相应的第二信息。Based on the above scheme, the first communication device can also determine the second parameter through the received indication information, and process the first parameter based on the second parameter to obtain the second information, that is, the first communication device can determine the second parameter based on the indication of other communication devices (for example, the recipient of the second information), so that the recipient of the second information can obtain the corresponding second information through the specified second parameter.

应理解,第二参数可以用于对第一参数进行加噪扰动处理得到第二信息,相应的,该第二参数可以理解为加噪扰动参数,加噪参数,或扰动参数等。It should be understood that the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.

在第二方面的一种可能的实现方式中,该方法还包括:该第一通信装置发送用于请求该第二参数的请求信息。In a possible implementation manner of the second aspect, the method further includes: the first communication device sending request information for requesting the second parameter.

基于上述方案,第一通信装置可以通过请求信息和上述指示信息的交互过程,以实现第二参数的确定。Based on the above solution, the first communication device can determine the second parameter through an interactive process of the request information and the above indication information.

可选地,该第二参数为预配置的参数。Optionally, the second parameter is a preconfigured parameter.

在第二方面的一种可能的实现方式中,该第一通信装置获取第一信息,包括:该第一通信装置接收该第一信息;或,该第一通信装置获取该第一AI模型和/或该第二AI模型的测量参数,并基于该第一AI模型和/或该第二AI模型的测量参数确定该第一信息。In a possible implementation of the second aspect, the first communication device obtains the first information, including: the first communication device receives the first information; or, the first communication device obtains measurement parameters of the first AI model and/or the second AI model, and determines the first information based on the measurement parameters of the first AI model and/or the second AI model.

基于上述方案,第一通信装置可以通过上述多种方式获得第一信息,以提升方案实现的灵活性。Based on the above solution, the first communication device can obtain the first information through the above multiple methods to improve the flexibility of implementing the solution.

本申请第三方面提供了一种通信方法,该方法应用于第二通信装置,该第二通信装置可以是通信设备(如,终端设备或网络设备),或者,该第二通信装置可以是通信设备中的部分组件(例如处理器、芯片或芯片系统等),或者该第二通信装置还可以是能实现全部或部分通信设备功能的逻辑模块或软件。在该方法中,第二通信装置接收第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的;其中,所述第二信息用于更新部署在所述第二通信装置的第二AI模型;该第二通信装置基于该第二信息更新该第二AI模型。The third aspect of the present application provides a communication method, which is applied to a second communication device, which can be a communication device (such as a terminal device or a network device), or the second communication device can be a partial component in the communication device (such as a processor, a chip or a chip system, etc.), or the second communication device can also be a logic module or software that can implement all or part of the functions of the communication device. In this method, the second communication device receives second information, which is obtained by performing noise perturbation processing based on the first parameter; wherein the second information is used to update a second AI model deployed in the second communication device; the second communication device updates the second AI model based on the second information.

基于上述方案,第二通信装置接收的第二信息是基于第一参数进行加噪扰动处理得到的,并且,该第二通信装置能够基于该第二信息更新第二AI模型。换言之,第二通信装置在更新第二AI模型的过程中,用于更新第二AI模型的第二信息是基于第一参数进行加噪扰动处理得到的。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the above solution, the second information received by the second communication device is obtained by performing noise perturbation processing based on the first parameter, and the second communication device can update the second AI model based on the second information. In other words, in the process of updating the second AI model by the second communication device, the second information used to update the second AI model is obtained by performing noise perturbation processing based on the first parameter. In this way, the problem of reduced or even disappearance of the plasticity of the second AI model during long-term continuous learning can be alleviated or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.

在第三方面的一种可能的实现方式中,该第二信息是基于该第一参数的量化结果进行处理得到的,或,该第二信息是对该第一参数的传输参数进行处理得到的。In a possible implementation manner of the third aspect, the second information is obtained by processing a quantization result of the first parameter, or the second information is obtained by processing a transmission parameter of the first parameter.

基于上述方案,第二信息可以通过多种方式进行加噪扰动处理得到的,以提升方案实现的灵活性。Based on the above solution, the second information can be obtained by performing noise disturbance processing in a variety of ways to improve the flexibility of the solution implementation.

可选地,该第一参数包括以下一项或多项:Optionally, the first parameter includes one or more of the following:

更新后的第二AI模型的模型参数,用于更新该第二AI模型的梯度信息,用于更新该第二AI模型的中间特征信息,用于更新该第二AI模型的中间梯度信息,该第一AI模型的推理结果,该第二AI模型对应的蒸馏损失信息,该第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.

本申请第四方面提供了一种通信装置,该通信装置为第一通信装置,该通信装置包括收发单元;该处理单元用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型;该处理单元还用于基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。In a fourth aspect, the present application provides a communication device, which is a first communication device and includes a transceiver unit; the processing unit is used to obtain first information, where the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a first AI model deployed in the first communication device; the processing unit is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.

本申请第四方面中,通信装置的组成模块还可以用于执行第一方面的各个可能实现方式中所执行的步骤,并实现相应的技术效果,具体均可以参阅第一方面,此处不再赘述。In the fourth aspect of the present application, the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the first aspect and achieve corresponding technical effects. For details, please refer to the first aspect and will not be repeated here.

本申请第五方面提供了一种通信装置,该通信装置为第一通信装置,该通信装置包括收发单元和处理单元;该处理单元用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;该收发单元用于基于该第一信息发送第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新该第二AI模型。In a fifth aspect, the present application provides a communication device, which is a first communication device and includes a transceiver unit and a processing unit; the processing unit is used to obtain first information, and the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in a second communication device; the transceiver unit is used to send second information based on the first information, and the second information is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.

本申请第五方面中,通信装置的组成模块还可以用于执行第二方面的各个可能实现方式中所执行的步骤,并实现相应的技术效果,具体均可以参阅第二方面,此处不再赘述。In the fifth aspect of this application, the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the second aspect and achieve corresponding technical effects. For details, please refer to the second aspect and will not be repeated here.

本申请第六方面提供了一种通信装置,该通信装置为第二通信装置,该通信装置包括收发单元和处理单元,该收发单元用于接收第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的;其中,该第二信息用于更新部署在所述第二通信装置的第二AI模型;该处理单元用于基于该第二信息更新该第二AI模型。In a sixth aspect, the present application provides a communication device, which is a second communication device and includes a transceiver unit and a processing unit. The transceiver unit is used to receive second information, and the second information is obtained by performing noise disturbance processing based on the first parameter; wherein the second information is used to update a second AI model deployed in the second communication device; and the processing unit is used to update the second AI model based on the second information.

本申请第六方面中,通信装置的组成模块还可以用于执行第三方面的各个可能实现方式中所执行的步骤,并实现相应的技术效果,具体均可以参阅第三方面,此处不再赘述。In the sixth aspect of this application, the constituent modules of the communication device can also be used to execute the steps performed in each possible implementation method of the third aspect and achieve corresponding technical effects. For details, please refer to the third aspect and will not be repeated here.

本申请第七方面提供了一种通信装置,包括至少一个处理器,所述至少一个处理器与存储器耦合;该存储器用于存储程序或指令;该至少一个处理器用于执行该程序或指令,以使该通信装置实现前述第一方面至第三方面任一方面中的任意一种可能的实现方式所述的方法。可选的,所述通信装置可以包括所述存储器。In a seventh aspect, the present application provides a communication device, comprising at least one processor coupled to a memory; the memory is configured to store programs or instructions; the at least one processor is configured to execute the programs or instructions, so that the communication device implements the method described in any possible implementation of any one of the first to third aspects. Optionally, the communication device may include the memory.

本申请第八方面提供了一种通信装置,包括至少一个逻辑电路和输入输出接口;该逻辑电路用于执行如前述第一方面至第三方面任一方面中的任意一种可能的实现方式所述的方法。In an eighth aspect, the present application provides a communication device comprising at least one logic circuit and an input/output interface; the logic circuit is used to execute the method described in any possible implementation of any one of the first to third aspects.

本申请第九方面提供了一种通信系统,该通信系统包括上述第一通信装置以及第二通信装置。In a ninth aspect, the present application provides a communication system, which includes the above-mentioned first communication device and second communication device.

本申请第十方面提供一种计算机可读存储介质,该存储介质用于存储一个或多个计算机执行指令,当计算机执行指令被处理器执行时,该处理器执行如上述第一方面至第三方面中任一方面的任意一种可能的实现方式所述的方法。In a tenth aspect, the present application provides a computer-readable storage medium for storing one or more computer-executable instructions. When the computer-executable instructions are executed by a processor, the processor executes the method described in any possible implementation of any one of the first to third aspects above.

本申请第十一方面提供一种计算机程序产品(或称计算机程序),当计算机程序产品中的计算机程序被该处理器执行时,该处理器执行上述第一方面至第三方面中任一方面的任意一种可能的实现方式所述的方法。In an eleventh aspect of the present application, a computer program product (or computer program) is provided. When the computer program in the computer program product is executed by the processor, the processor executes the method described in any possible implementation of any one of the first to third aspects above.

本申请第十二方面提供了一种芯片系统,该芯片系统包括至少一个处理器,用于支持通信装置实现上述第一方面至第三方面中任一方面的任意一种可能的实现方式所述的方法。A twelfth aspect of the present application provides a chip system, which includes at least one processor for supporting a communication device to implement the method described in any possible implementation of any one of the first to third aspects above.

在一种可能的设计中,该芯片系统还可以包括存储器,存储器,用于保存该通信装置必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包含芯片和其他分立器件。可选的,所述芯片系统还包括接口电路,所述接口电路为所述至少一个处理器提供程序指令和/或数据。In one possible design, the chip system may further include a memory for storing program instructions and data necessary for the communication device. The chip system may be composed of a chip or may include a chip and other discrete components. Optionally, the chip system may further include an interface circuit for providing program instructions and/or data to the at least one processor.

其中,第四方面至第十二方面中任一种设计方式所带来的技术效果可参见上述第一方面至第三方面中不同设计方式所带来的技术效果,在此不再赘述。Among them, the technical effects brought about by any design method in the fourth to twelfth aspects can refer to the technical effects brought about by the different design methods in the above-mentioned first to third aspects, and will not be repeated here.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1a至图1c为本申请提供的通信系统的示意图;Figures 1a to 1c are schematic diagrams of a communication system provided by this application;

图2a至图2g为本申请涉及的AI处理过程的示意图;Figures 2a to 2g are schematic diagrams of the AI processing process involved in this application;

图3为本申请提供的通信方法的一个交互示意图;FIG3 is an interactive schematic diagram of the communication method provided by this application;

图4a至图4c为本申请提供的模型处理的示意图;Figures 4a to 4c are schematic diagrams of the model processing provided by this application;

图5为本申请提供的通信方法的一个交互示意图;FIG5 is an interactive diagram of the communication method provided by this application;

图6至图10为本申请提供的通信装置的示意图。6 to 10 are schematic diagrams of the communication device provided in this application.

具体实施方式DETAILED DESCRIPTION

首先,对本申请实施例中的部分用语进行解释说明,以便于本领域技术人员理解。First, some of the terms used in the embodiments of the present application are explained to facilitate understanding by those skilled in the art.

(1)终端设备:可以是能够接收网络设备调度和指示信息的无线终端设备,无线终端设备可以是指向用户提供语音和/或数据连通性的设备,或具有无线连接功能的手持式设备,或连接到无线调制解调器的其他处理设备。(1) Terminal device: It can be a wireless terminal device that can receive network device scheduling and instruction information. The wireless terminal device can be a device that provides voice and/or data connectivity to the user, or a handheld device with wireless connection function, or other processing device connected to a wireless modem.

终端设备可以经无线接入网(radio access network,RAN)与一个或多个核心网或者互联网进行通信,终端设备可以是移动终端设备,如移动电话(或称为“蜂窝”电话,手机(mobile phone))、计算机和数据卡,例如,可以是便携式、袖珍式、手持式、计算机内置的或者车载的移动装置,它们与无线接入网交换语音和/或数据。例如,个人通信业务(personal communication service,PCS)电话、无绳电话、会话发起协议(session initiation protocol,SIP)话机、无线本地环路(wireless local loop,WLL)站、个人数字助理(personal digital assistant,PDA)、平板电脑(tablet或pad)、带无线收发功能的电脑等设备。无线终端设备也可以称为系统、订户单元(subscriber unit)、订户站(subscriber station),移动站/移动台(mobile station,MS)、远程站(remote station)、接入点(access point,AP)、远程终端(remote terminal)、接入终端(access terminal)、用户终端(user terminal)、用户代理(user agent)、用户站(subscriber station,SS)、用户端设备(customer premises equipment,CPE)、终端(terminal)、用户设备(user equipment,UE)、移动终端(mobile terminal,MT)等。Terminal devices can communicate with one or more core networks or the Internet via a radio access network (RAN). Terminal devices can be mobile terminal devices, such as mobile phones (also known as "cellular" phones, mobile phones), computers, and data cards. For example, they can be portable, pocket-sized, handheld, computer-built-in, or vehicle-mounted mobile devices that exchange voice and/or data with the radio access network. For example, personal communication service (PCS) phones, cordless phones, session initiation protocol (SIP) phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), tablets (tablets or pads), computers with wireless transceiver capabilities, and other devices. Wireless terminal equipment can also be called system, subscriber unit, subscriber station, mobile station/mobile station (MS), remote station, access point (AP), remote terminal (REMOTE terminal), access terminal (ACCESS terminal), user terminal (USER terminal), user agent (USER agent), subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.

作为示例而非限定,在本申请实施例中,该终端设备还可以是可穿戴设备。可穿戴设备也可以称为穿戴式智能设备或智能穿戴式设备等,是应用穿戴式技术对日常穿戴进行智能化设计、开发出可以穿戴的设备的总称,如眼镜、手套、手表、服饰及鞋等。可穿戴设备即直接穿在身上,或是整合到用户的衣服或配件的一种便携式设备。可穿戴设备不仅仅是一种硬件设备,更是通过软件支持以及数据交互、云端交互来实现强大的功能。广义穿戴式智能设备包括功能全、尺寸大、可不依赖智能手机实现完整或者部分的功能,例如:智能手表或智能眼镜等,以及只专注于某一类应用功能,需要和其它设备如智能手机配合使用,如各类进行体征监测的智能手环、智能头盔、智能首饰等。As an example and not a limitation, in the embodiments of the present application, the terminal device may also be a wearable device. Wearable devices may also be referred to as wearable smart devices or smart wearable devices, etc., which are a general term for wearable devices that are intelligently designed and developed using wearable technology for daily wear, such as glasses, gloves, watches, clothing, and shoes. A wearable device is a portable device that is worn directly on the body or integrated into the user's clothes or accessories. Wearable devices are not only hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction. Broadly speaking, wearable smart devices include those that are fully functional, large in size, and can achieve complete or partial functions without relying on smartphones, such as smart watches or smart glasses, etc., as well as those that only focus on a certain type of application function and need to be used in conjunction with other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.

终端还可以是无人机、机器人、设备到设备通信(device-to-device,D2D)中的终端、车到一切(vehicle to everything,V2X)中的终端、虚拟现实(virtual reality,VR)终端设备、增强现实(augmented reality,AR)终端设备、工业控制(industrial control)中的无线终端、无人驾驶(self driving)中的无线终端、远程医疗(telemedicine或telehealth services)中的无线终端、智能电网(smart grid)中的无线终端、运输安全(transportation safety)中的无线终端、智慧城市(smart city)中的无线终端、智慧家庭(smart home)中的无线终端等。The terminal may also be a drone, a robot, a terminal in device-to-device (D2D) communication, a terminal in vehicle to everything (V2X), a virtual reality (VR) terminal device, an augmented reality (AR) terminal device, a wireless terminal in industrial control, a wireless terminal in self-driving, a wireless terminal in telemedicine (telehealth services), a wireless terminal in a smart grid, a wireless terminal in transportation safety, a wireless terminal in a smart city, a wireless terminal in a smart home, etc.

此外,终端设备也可以是第五代(5th generation,5G)通信系统之后演进的通信系统(例如5GAdvanced或第六代(6th generation,6G)通信系统等)中的终端设备或者未来演进的公共陆地移动网络(public land mobile network,PLMN)中的终端设备等。示例性的,5G Advanced或6G网络可以进一步扩展5G通信终端的形态和功能,6G终端包括但不限于车、蜂窝网络终端(融合卫星终端功能)、无人机、物联网(internet of things,IoT)设备。Furthermore, the terminal device may also be a terminal device in a communication system evolved after the fifth-generation (5G) communication system (e.g., 5G Advanced or sixth-generation (6G) communication system), or a terminal device in a future-evolved public land mobile network (PLMN). For example, 5G Advanced or 6G networks may further expand the form and functionality of 5G communication terminals. 6G terminals include, but are not limited to, vehicles, cellular network terminals (with integrated satellite terminal functionality), drones, and Internet of Things (IoT) devices.

在本申请实施例中,上述终端设备还可以获得网络设备提供的人工智能(artificial intelligence,AI)服务。可选地,终端设备还可以具有AI处理能力。In an embodiment of the present application, the terminal device may also obtain artificial intelligence (AI) services provided by the network device. Optionally, the terminal device may also have AI processing capabilities.

(2)网络设备:可以是无线网络中的设备,例如网络设备可以为将终端设备接入到无线网络的RAN节点(或设备),又可以称为基站。目前,一些RAN设备的举例为:基站(base station)、演进型基站(evolved NodeB,eNodeB)、5G通信系统中的基站gNB(gNodeB)、传输接收点(transmission reception point,TRP)、演进型节点B(evolved Node B,eNB)、无线网络控制器(radio network controller,RNC)、节点B(Node B,NB)、家庭基站(例如,home evolved Node B,或home Node B,HNB)、基带单元(base band unit,BBU),或无线保真(wireless fidelity,Wi-Fi)接入点(access point,AP)等。另外,在一种网络结构中,网络设备可以包括集中单元(central unit,CU)节点、或分布单元(distributed unit,DU)节点、或包括CU节点和DU节点的RAN设备。(2) Network equipment: It can be a device in a wireless network. For example, a network device can be a RAN node (or device) that connects a terminal device to a wireless network, which can also be called a base station. Currently, some examples of RAN equipment are: base station, evolved NodeB (eNodeB), gNB (gNodeB) in a 5G communication system, transmission reception point (TRP), evolved NodeB (eNB), radio network controller (RNC), NodeB (NB), home base station (e.g., home evolved NodeB, or home NodeB, HNB), baseband unit (BBU), or wireless fidelity (Wi-Fi) access point (AP). In addition, in a network structure, a network device can include a central unit (CU) node, a distributed unit (DU) node, or a RAN device including a CU node and a DU node.

可选的,RAN节点还可以是宏基站、微基站或室内站、中继节点或施主节点、或者是云无线接入网络(cloud radio access network,CRAN)场景下的无线控制器。RAN节点还可以是服务器,可穿戴设备,车辆或车载设备等。例如,车辆外联(vehicle to everything,V2X)技术中的接入网设备可以为路侧单元(road side unit,RSU)。Alternatively, a RAN node can be a macro base station, micro base station, indoor base station, relay node, donor node, or wireless controller in a cloud radio access network (CRAN) scenario. A RAN node can also be a server, wearable device, vehicle, or onboard device. For example, the access network device in vehicle-to-everything (V2X) technology can be a roadside unit (RSU).

在另一种可能的场景中,由多个RAN节点协作协助终端实现无线接入,不同RAN节点分别实现基站的部分功能。例如,RAN节点可以是CU,DU,CU-控制面(control plane,CP),CU-用户面(user plane,UP),或者无线单元(radio unit,RU)等。CU和DU可以是单独设置,或者也可以包括在同一个网元中,例如基带单元(BBU)中。RU可以包括在射频设备或者射频单元中,例如包括在射频拉远单元(remote radio unit,RRU)、有源天线处理单元(active antenna unit,AAU)、射频头(radio head,RH)或远程射频头(remote radio head,RRH)中。In another possible scenario, multiple RAN nodes collaborate to assist terminals in achieving wireless access, with different RAN nodes implementing portions of the base station's functionality. For example, a RAN node can be a CU, DU, CU-control plane (CP), CU-user plane (UP), or radio unit (RU). The CU and DU can be separate or included in the same network element, such as a baseband unit (BBU). The RU can be included in a radio frequency device or radio unit, such as a remote radio unit (RRU), an active antenna unit (AAU), a radio head (RH), or a remote radio head (RRH).

在不同系统中,CU(或CU-CP和CU-UP)、DU或RU也可以有不同的名称,但是本领域的技术人员可以理解其含义。例如,在开放式接入网(open RAN,O-RAN或ORAN)系统中,CU也可以称为O-CU(开放式CU),DU也可以称为O-DU,CU-CP也可以称为O-CU-CP,CU-UP也可以称为O-CU-UP,RU也可以称为O-RU。为描述方便,本申请中以CU,CU-CP,CU-UP、DU和RU为例进行描述。本申请中的CU(或CU-CP、CU-UP)、DU和RU中的任一单元,可以是通过软件模块、硬件模块、或者软件模块与硬件模块结合来实现。In different systems, CU (or CU-CP and CU-UP), DU or RU may have different names, but those skilled in the art can understand their meanings. For example, in an open access network (open RAN, O-RAN or ORAN) system, CU may also be called O-CU (open CU), DU may also be called O-DU, CU-CP may also be called O-CU-CP, CU-UP may also be called O-CU-UP, and RU may also be called O-RU. For the convenience of description, this application uses CU, CU-CP, CU-UP, DU and RU as examples for description. Any unit among the CU (or CU-CP, CU-UP), DU and RU in this application can be implemented by a software module, a hardware module, or a combination of a software module and a hardware module.

接入网设备和终端设备之间的通信遵循一定的协议层结构。该协议层可以包括控制面协议层和用户面协议层。控制面协议层可以包括以下至少一项:无线资源控制(radio resource control,RRC)层、分组数据汇聚层协议(packet data convergence protocol,PDCP)层、无线链路控制(radio link control,RLC)层、媒体接入控制(media access control,MAC)层、或物理(physical,PHY)层等。用户面协议层可以包括以下至少一项:业务数据适配协议(service data adaptation protocol,SDAP)层、PDCP层、RLC层、MAC层、或物理层等。Communication between access network equipment and terminal devices follows a specific protocol layer structure. This protocol layer may include a control plane protocol layer and a user plane protocol layer. The control plane protocol layer may include at least one of the following: radio resource control (RRC) layer, packet data convergence protocol (PDCP) layer, radio link control (RLC) layer, media access control (MAC) layer, or physical (PHY) layer. The user plane protocol layer may include at least one of the following: service data adaptation protocol (SDAP) layer, PDCP layer, RLC layer, MAC layer, or physical layer.

对于ORAN系统中的网元及其可实现的协议层功能对应关系,可参照下表1。For the correspondence between network elements in the ORAN system and their achievable protocol layer functions, please refer to Table 1 below.

表1
Table 1

网络设备可以是其它为终端设备提供无线通信功能的装置。本申请的实施例对网络设备所采用的具体技术和具体设备形态不做限定。为方便描述,本申请实施例并不限定。The network device may be any other device that provides wireless communication functionality to the terminal device. The embodiments of this application do not limit the specific technology and device form used by the network device. For ease of description, the embodiments of this application do not limit this.

网络设备还可以包括核心网设备,核心网设备例如包括第四代(4th generation,4G)网络中的移动性管理实体(mobility management entity,MME),归属用户服务器(home subscriber server,HSS),服务网关(serving gateway,S-GW),策略和计费规则功能(policy and charging rules function,PCRF),公共数据网网关(public data network gateway,PDN gateway或P-GW);5G网络中的访问和移动管理功能(access and mobility management function,AMF)、用户面功能(user plane function,UPF)或会话管理功能(session management function,SMF)等网元。此外,该核心网设备还可以包括5G网络以及5G网络的下一代网络中的其他核心网设备。The network equipment may also include core network equipment, such as the mobility management entity (MME), home subscriber server (HSS), serving gateway (S-GW), policy and charging rules function (PCRF), and public data network gateway (PDN gateway or P-GW) in the fourth generation (4G) network; and the access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in the 5G network. In addition, the core network equipment may also include other core network equipment in the 5G network and the next generation network of the 5G network.

本申请实施例中,上述网络设备还可以具有AI能力的网络节点,可以为终端或其他网络设备提供AI服务,例如,可以为网络侧(接入网或核心网)的AI节点、算力节点、具有AI能力的RAN节点、具有AI能力的核心网网元等。In an embodiment of the present application, the above-mentioned network device may also have a network node with AI capabilities, which can provide AI services for terminals or other network devices. For example, it can be an AI node on the network side (access network or core network), a computing power node, a RAN node with AI capabilities, a core network element with AI capabilities, etc.

本申请实施例中,用于实现网络设备的功能的装置可以是网络设备,也可以是能够支持网络设备实现该功能的装置,例如芯片系统,该装置可以被设置在网络设备中。在本申请实施例提供的技术方案中,以用于实现网络设备的功能的装置是网络设备为例,描述本申请实施例提供的技术方案。In the embodiments of the present application, the apparatus for implementing the function of the network device may be a network device, or may be a device capable of supporting the network device in implementing the function, such as a chip system, which may be provided in the network device. In the technical solutions provided in the embodiments of the present application, the technical solutions provided in the embodiments of the present application are described by taking the network device as an example of the apparatus for implementing the function of the network device.

(3)配置与预配置:在本申请中,会同时用到配置与预配置。其中,配置是指网络设备/服务器通过消息或信令将一些参数的配置信息或参数的取值发送给终端,以便终端根据这些取值或信息来确定通信的参数或传输时的资源。预配置与配置类似,可以是网络设备/服务器预先与终端设备协商好的参数信息或参数值,也可以是标准协议规定的基站/网络设备或终端设备采用的参数信息或参数值,还可以是预先存储在基站/服务器或终端设备的参数信息或参数值。本申请对此不做限定。(3) Configuration and pre-configuration: In this application, configuration and pre-configuration are used simultaneously. Configuration refers to the network device/server sending some parameter configuration information or parameter values to the terminal through messages or signaling, so that the terminal can determine the communication parameters or resources during transmission based on these values or information. Pre-configuration is similar to configuration, and can be parameter information or parameter values pre-negotiated between the network device/server and the terminal device, or parameter information or parameter values used by the base station/network device or terminal device as specified in the standard protocol, or parameter information or parameter values pre-stored in the base station/server or terminal device. This application does not limit this.

进一步地,这些取值和参数,是可以变化或更新的。Furthermore, these values and parameters can be changed or updated.

(4)本申请实施例中的术语“系统”和“网络”可被互换使用。“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A、同时存在A和B、单独存在B的情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如“A,B和C中的至少一项”包括A,B,C,AB,AC,BC或ABC。以及,除非有特别说明,本申请实施例提及“第一”、“第二”等序数词是用于对多个对象进行区分,不用于限定多个对象的顺序、时序、优先级或者重要程度。(4) The terms "system" and "network" in the embodiments of the present application can be used interchangeably. "Multiple" refers to two or more. "And/or" describes the association relationship of associated objects, indicating that three relationships can exist. For example, A and/or B can mean: A exists alone, A and B exist at the same time, and B exists alone, where A and B can be singular or plural. The character "/" generally indicates that the previous and next associated objects are in an "or" relationship. "At least one of the following" or similar expressions refers to any combination of these items, including any combination of single or plural items. For example, "at least one of A, B and C" includes A, B, C, AB, AC, BC or ABC. In addition, unless otherwise specified, the ordinal numbers such as "first" and "second" mentioned in the embodiments of the present application are used to distinguish multiple objects, and are not used to limit the order, timing, priority or importance of multiple objects.

(5)本申请实施例中的“发送”和“接收”,表示信号传递的走向。例如,“向XX发送信息”可以理解为该信息的目的端是XX,可以包括通过空口直接发送,也包括其他单元或模块通过空口间接发送。“接收来自YY的信息”可以理解为该信息的源端是YY,可以包括通过空口直接从YY接收,也可以包括通过空口从其他单元或模块间接地从YY接收。“发送”也可以理解为芯片接口的“输出”,“接收”也可以理解为芯片接口的“输入”。(5) “Sending” and “receiving” in the embodiments of the present application indicate the direction of signal transmission. For example, “sending information to XX” can be understood as the destination of the information being XX, which can include direct sending through the air interface, as well as indirect sending through the air interface by other units or modules. “Receiving information from YY” can be understood as the source of the information being YY, which can include direct receiving from YY through the air interface, as well as indirect receiving from YY through the air interface from other units or modules. “Sending” can also be understood as the “output” of the chip interface, and “receiving” can also be understood as the “input” of the chip interface.

换言之,发送和接收可以是在设备之间进行的,例如,网络设备和终端设备之间进行的,也可以是在设备内进行的,例如,通过总线、走线或接口在设备内的部件之间、模组之间、芯片之间、软件模块或者硬件模块之间发送或接收。In other words, sending and receiving can be performed between devices, for example, between a network device and a terminal device, or can be performed within a device, for example, sending or receiving between components, modules, chips, software modules or hardware modules within the device through a bus, wiring or interface.

可以理解的是,信息在信息发送的源端和目的端之间可能会被进行必要的处理,比如编码、调制等,但目的端可以理解来自源端的有效信息。本申请中类似的表述可以做相似的理解,不再赘述。It is understandable that information may be processed between the source and destination of information transmission, such as coding, modulation, etc., but the destination can understand the valid information from the source. Similar expressions in this application can be understood similarly and will not be repeated.

(6)在本申请实施例中,“指示”可以包括直接指示和间接指示,也可以包括显式指示和隐式指示。将某一信息(如下文所述的指示信息)所指示的信息称为待指示信息,则具体实现过程中,对待指示信息进行指示的方式有很多种,例如但不限于,可以直接指示待指示信息,如待指示信息本身或者该待指示信息的索引等。也可以通过指示其他信息来间接指示待指示信息,其中该其他信息与待指示信息之间存在关联关系;还可以仅仅指示待指示信息的一部分,而待指示信息的其他部分则是已知的或者提前约定的,例如可以借助预先约定(例如协议预定义)的各个信息的排列顺序来实现对特定信息的指示,从而在一定程度上降低指示开销。本申请对于指示的具体方式不作限定。可以理解的是,对于该指示信息的发送方来说,该指示信息可用于指示待指示信息,对于指示信息的接收方来说,该指示信息可用于确定待指示信息。(6) In the embodiments of the present application, "indication" may include direct indication and indirect indication, and may also include explicit indication and implicit indication. The information indicated by a certain information (such as the indication information described below) is called information to be indicated. In the specific implementation process, there are many ways to indicate the information to be indicated, such as but not limited to, directly indicating the information to be indicated, such as the information to be indicated itself or the index of the information to be indicated. The information to be indicated may also be indirectly indicated by indicating other information, wherein the other information is associated with the information to be indicated; or only a part of the information to be indicated may be indicated, while the other part of the information to be indicated is known or agreed in advance. For example, the indication of specific information may be achieved by means of the arrangement order of each information agreed in advance (such as predefined by the protocol), thereby reducing the indication overhead to a certain extent. The present application does not limit the specific method of indication. It is understandable that for the sender of the indication information, the indication information can be used to indicate the information to be indicated, and for the receiver of the indication information, the indication information can be used to determine the information to be indicated.

本申请中,除特殊说明外,各个实施例之间相同或相似的部分可以互相参考。在本申请中各个实施例、以及各实施例中的各个方法/设计/实现方式中,如果没有特殊说明以及逻辑冲突,不同的实施例之间、以及各实施例中的各个方法/设计/实现方式之间的术语和/或描述具有一致性、且可以相互引用,不同的实施例、以及各实施例中的各个方法/设计/实现方式中的技术特征根据其内在的逻辑关系可以组合形成新的实施例、方法、或实现方式。以下所述的本申请实施方式并不构成对本申请保护范围的限定。In this application, unless otherwise specified, the same or similar parts between the various embodiments can refer to each other. In the various embodiments of this application, and the various methods/designs/implementations in each embodiment, if there is no special explanation and logical conflict, the terms and/or descriptions between different embodiments and the various methods/designs/implementations in each embodiment are consistent and can be referenced to each other. The technical features in different embodiments and the various methods/designs/implementations in each embodiment can be combined to form new embodiments, methods, or implementations according to their inherent logical relationships. The following description of the implementation methods of this application does not constitute a limitation on the scope of protection of this application.

本申请可以应用于长期演进(long term evolution,LTE)系统、新无线(new radio,NR)系统,或者是5G之后演进的通信系统(例如6G等)。其中,该通信系统中包括至少一个网络设备和/或至少一个终端设备。This application can be applied to a long-term evolution (LTE) system, a new radio (NR) system, or a communication system evolved after 5G (e.g., 6G). The communication system includes at least one network device and/or at least one terminal device.

请参阅图1a,为本申请中通信系统的一种示意图。图1a中,示例性的示出了一个网络设备和6个终端设备,6个终端设备分别为终端设备1、终端设备2、终端设备3、终端设备4、终端设备5以及终端设备6等。在图1a所示的示例中,是以终端设备1为智能茶杯,终端设备2为智能空调,终端设备3为智能加油机,终端设备4为交通工具,终端设备5为手机,终端设备6为打印机进行举例说明的。Please refer to Figure 1a, which is a schematic diagram of a communication system in this application. Figure 1a exemplarily illustrates a network device and six terminal devices, namely terminal device 1, terminal device 2, terminal device 3, terminal device 4, terminal device 5, and terminal device 6. In the example shown in Figure 1a, terminal device 1 is a smart teacup, terminal device 2 is a smart air conditioner, terminal device 3 is a smart gas pump, terminal device 4 is a vehicle, terminal device 5 is a mobile phone, and terminal device 6 is a printer.

如图1a所示,AI配置信息的发送实体可以为网络设备。AI配置信息的接收实体可以为终端设备1-终端设备6,此时,网络设备和终端设备1-终端设备6组成一个通信系统,在该通信系统中,终端设备1-终端设备6可以发送数据给网络设备,网络设备需要接收终端设备1-终端设备6发送的数据。同时,网络设备可以向终端设备1-终端设备6发送配置信息。As shown in Figure 1a, the sending entity of AI configuration information can be a network device. The receiving entity of AI configuration information can be terminal devices 1-6. In this case, the network device and terminal devices 1-6 form a communication system. In this communication system, terminal devices 1-6 can send data to the network device, and the network device needs to receive data sent by terminal devices 1-6. At the same time, the network device can send configuration information to terminal devices 1-6.

示例性的,在图1a中,终端设备4-终端设备6也可以组成一个通信系统。其中,终端设备5作为网络设备,即AI配置信息的发送实体;终端设备4和终端设备6作为终端设备,即AI配置信息的接收实体。例如车联网系统中,终端设备5分别向终端设备4和终端设备6发送AI配置信息,并且接收终端设备4和终端设备6发送的数据;相应的,终端设备4和终端设备6接收终端设备5发送的AI配置信息,并向终端设备5发送数据。For example, in Figure 1a, terminal devices 4 and 6 can also form a communication system. Terminal device 5 acts as a network device, i.e., the sending entity of AI configuration information; terminal devices 4 and 6 act as terminal devices, i.e., the receiving entities of AI configuration information. For example, in a connected vehicle system, terminal device 5 sends AI configuration information to terminal devices 4 and 6, respectively, and receives data from terminal devices 4 and 6. Correspondingly, terminal devices 4 and 6 receive AI configuration information from terminal device 5 and send data to terminal device 5.

以图1a所示通信系统为例,不同的设备之间(包括网络设备与网络设备之间,网络设备与终端设备之间,和/或,终端设备和终端设备之间)除了执行通信相关业务之外,还有可能执行AI相关业务。Taking the communication system shown in Figure 1a as an example, in addition to executing communication-related services, different devices (including between network devices, between network devices and terminal devices, and/or between terminal devices) may also execute AI-related services.

如图1b所示,以网络设备为基站为例,基站可以与一个或多个终端设备之间可以执行通信相关业务和AI相关业务,不同终端设备之间也可以执行通信相关业务和AI相关业务。As shown in Figure 1b, taking the network device as a base station as an example, the base station can perform communication-related services and AI-related services with one or more terminal devices, and different terminal devices can also perform communication-related services and AI-related services.

如图1c所示,以终端设备包括电视和手机为例,电视和手机之间也可以执行通信相关业务和AI相关业务。As shown in Figure 1c, taking the terminal devices including a TV and a mobile phone as an example, communication-related services and AI-related services can also be performed between the TV and the mobile phone.

本申请提供的技术方案可以应用于无线通信系统(例如图1a、图1b或图1c所示系统),例如本申请提供的通信系统中可以引入AI网元来实现部分或全部AI相关的操作。AI网元也可以称为AI节点、AI设备、AI实体、AI模块、AI模型、或AI单元等。所述AI网元可以是内置在通信系统的网元中。例如,AI网元可以是内置在:接入网设备、核心网设备、云服务器、或操作管理维护(operation,administration and maintenance,OAM)中的AI模块,用以实现AI相关的功能。所述OAM可以作为核心网设备的网管和/或作为接入网设备的网管。或者,所述AI网元也可以是通信系统中独立设置的网元。可选的,终端或终端内置的芯片中也可以包括AI实体,用于实现AI相关的功能。The technical solution provided in this application can be applied to wireless communication systems (for example, the systems shown in Figures 1a, 1b, or 1c). For example, an AI network element can be introduced into the communication system provided in this application to implement some or all AI-related operations. The AI network element can also be referred to as an AI node, AI device, AI entity, AI module, AI model, or AI unit, etc. The AI network element can be a network element built into the communication system. For example, the AI network element can be an AI module built into: an access network device, a core network device, a cloud server, or an operation, administration, and maintenance (OAM) system to implement AI-related functions. The OAM can serve as a network manager for a core network device and/or as a network manager for an access network device. Alternatively, the AI network element can also be an independently set network element in the communication system. Optionally, the terminal or the chip built into the terminal can also include an AI entity to implement AI-related functions.

下面将本申请中可能涉及到的AI进行简要介绍。The following is a brief introduction to the AI that may be involved in this application.

AI可以让机器具有人类的智能,例如可以让机器应用计算机的软硬件来模拟人类某些智能行为。为了实现人工智能,可以采用机器学习方法。机器学习方法中,机器利用训练数据学习(或训练)得到模型。该模型表征了从输入到输出之间的映射。学习得到的模型可以用于进行推理(或预测),即可以利用该模型预测出给定输入所对应的输出。其中,该输出还可以称为推理结果(或预测结果)。AI can imbue machines with human intelligence. For example, it can use computer hardware and software to simulate certain intelligent human behaviors. Machine learning methods can be used to achieve artificial intelligence. In machine learning, a machine uses training data to learn (or train) a model. This model represents the mapping from input to output. The learned model can be used for inference (or prediction), meaning that the model can be used to predict the output corresponding to a given input. This output can also be called an inference result (or prediction result).

机器学习可以包括监督学习、无监督学习、和强化学习。其中,无监督学习还可以称为非监督学习。Machine learning can include supervised learning, unsupervised learning, and reinforcement learning. Among them, unsupervised learning can also be called unsupervised learning.

监督学习依据已采集到的样本值和样本标签,利用机器学习算法学习样本值到样本标签的映射关系,并用AI模型来表达学到的映射关系。训练机器学习模型的过程就是学习这种映射关系的过程。在训练过程中,将样本值输入模型得到模型的预测值,通过计算模型的预测值与样本标签(理想值)之间的误差来优化模型参数。映射关系学习完成后,就可以利用学到的映射来预测新的样本标签。监督学习学到的映射关系可以包括线性映射或非线性映射。根据标签的类型可将学习的任务分为分类任务和回归任务。Supervised learning uses machine learning algorithms to learn the mapping relationship between sample values and sample labels based on collected sample values and sample labels, and then expresses this learned mapping relationship using an AI model. The process of training a machine learning model is the process of learning this mapping relationship. During training, sample values are input into the model to obtain the model's predicted values. The model parameters are optimized by calculating the error between the model's predicted values and the sample labels (ideal values). Once the mapping relationship is learned, the learned mapping can be used to predict new sample labels. The mapping relationship learned by supervised learning can include linear mappings or nonlinear mappings. Based on the type of label, the learning task can be divided into classification tasks and regression tasks.

无监督学习依据采集到的样本值,利用算法自行发掘样本的内在模式。无监督学习中有一类算法将样本自身作为监督信号,即模型学习从样本到样本的映射关系,称为自监督学习。训练时,通过计算模型的预测值与样本本身之间的误差来优化模型参数。自监督学习可用于信号压缩及解压恢复的应用,常见的算法包括自编码器和对抗生成型网络等。Unsupervised learning uses algorithms to discover inherent patterns in collected sample values. One type of unsupervised learning algorithm uses the samples themselves as supervisory signals, meaning the model learns the mapping from one sample to another. This is called self-supervised learning. During training, the model parameters are optimized by calculating the error between the model's predictions and the samples themselves. Self-supervised learning can be used in signal compression and decompression recovery applications. Common algorithms include autoencoders and generative adversarial networks.

强化学习不同于监督学习,是一类通过与环境进行交互来学习解决问题的策略的算法。与监督、无监督学习不同,强化学习问题并没有明确的“正确的”动作标签数据,算法需要与环境进行交互,获取环境反馈的奖励信号,进而调整决策动作以获得更大的奖励信号数值。如下行功率控制中,强化学习模型根据无线网络反馈的系统总吞吐率,调整各个用户的下行发送功率,进而期望获得更高的系统吞吐率。强化学习的目标也是学习环境状态与较优(例如最优)决策动作之间的映射关系。但因为无法事先获得“正确动作”的标签,所以不能通过计算动作与“正确动作”之间的误差来优化网络。强化学习的训练是通过与环境的迭代交互而实现的。Reinforcement learning, unlike supervised learning, is a type of algorithm that learns problem-solving strategies through interaction with the environment. Unlike supervised and unsupervised learning, reinforcement learning problems lack explicit label data for "correct" actions. Instead, the algorithm must interact with the environment to obtain reward signals from the environment, and then adjust its decision-making actions to maximize the reward signal value. For example, in downlink power control, the reinforcement learning model adjusts the downlink transmit power of each user based on the overall system throughput fed back by the wireless network, hoping to achieve higher system throughput. The goal of reinforcement learning is also to learn the mapping between environmental states and optimal (e.g., optimal) decision-making actions. However, because the labels for "correct actions" cannot be obtained in advance, network optimization cannot be achieved by calculating the error between actions and "correct actions." Reinforcement learning training is achieved through iterative interaction with the environment.

神经网络(neural network,NN)是机器学习技术中的一种具体的模型。根据通用近似定理,神经网络在理论上可以逼近任意连续函数,从而使得神经网络具备学习任意映射的能力。传统的通信系统需要借助丰富的专家知识来设计通信模块,而基于神经网络的深度学习通信系统可以从大量的数据集中自动发现隐含的模式结构,建立数据之间的映射关系,获得优于传统建模方法的性能。A neural network (NN) is a specific model in machine learning technology. According to the universal approximation theorem, NNs can theoretically approximate any continuous function, enabling them to learn arbitrary mappings. Traditional communication systems require extensive expert knowledge to design communication modules. However, deep learning communication systems based on neural networks can automatically discover implicit patterns in massive data sets and establish mapping relationships between data, achieving performance superior to traditional modeling methods.

神经网络的思想来源于大脑组织的神经元结构。例如,每个神经元都对其输入值进行加权求和运算,通过一个激活函数输出运算结果。The idea of a neural network is derived from the neuronal structure of the brain. For example, each neuron performs a weighted sum operation on its input values and outputs the result through an activation function.

如图2a所示,为神经元结构的一种示意图。假设神经元的输入为x=[x0,x1,…,xn],与各个输入对应的权值分别为w=[w0,w1,…,wn],其中,n为正整数,wi和xi可以是小数、整数(例如0、正整数或负整数等)、或复数等各种可能的类型。wi作为xi的权值,用于对xi进行加权。根据权值对输入值进行加权求和的偏置例如为b。激活函数的形式可以有多种,假设一个神经元的激活函数为:y=f(z)=max(0,z),则该神经元的输出为:再例如,一个神经元的激活函数为:y=f(z)=z,则该神经元的输出为: 其中,b可以是小数、整数(例如0、正整数或负整数)、或复数等各种可能的类型。神经网络中不同神经元的激活函数可以相同或不同。As shown in Figure 2a, it is a schematic diagram of a neuron structure. Assume that the input of the neuron is x = [x 0 , x 1 ,…, x n ], and the weights corresponding to each input are w = [w 0 , w 1 ,…, w n ], where n is a positive integer, and w i and xi can be various possible types such as decimals, integers (such as 0, positive integers or negative integers, etc.), or complex numbers. w i is used as the weight of xi to weight xi . The bias for weighted summation of input values according to the weights is, for example, b. There can be many forms of activation functions. Assuming that the activation function of a neuron is: y = f(z) = max(0,z), the output of the neuron is: For another example, if the activation function of a neuron is: y = f(z) = z, then the output of the neuron is: b can be a decimal, an integer (eg, 0, a positive integer, or a negative integer), or a complex number, etc. The activation functions of different neurons in a neural network can be the same or different.

此外,神经网络一般包括多个层,每层可包括一个或多个神经元。通过增加神经网络的深度和/或宽度,能够提高该神经网络的表达能力,为复杂系统提供更强大的信息提取和抽象建模能力。其中,神经网络的深度可以是指神经网络包括的层数,每层包括的神经元个数可以称为该层的宽度。在一种实现方式中,神经网络包括输入层和输出层。神经网络的输入层将接收到的输入信息经过神经元处理,将处理结果传递给输出层,由输出层得到神经网络的输出结果。在另一种实现方式中,神经网络包括输入层、隐藏层和输出层。神经网络的输入层将接收到的输入信息经过神经元处理,将处理结果传递给中间的隐藏层,隐藏层对接收的处理结果进行计算,得到计算结果,隐藏层将计算结果传递给输出层或者下一个相邻的隐藏层,最终由输出层得到神经网络的输出结果。其中,一个神经网络可以包括一个隐藏层,或者包括多个依次连接的隐藏层,不予限制。Furthermore, neural networks generally include multiple layers, each of which may include one or more neurons. Increasing the depth and/or width of a neural network can improve its expressive power, providing more powerful information extraction and abstract modeling capabilities for complex systems. The depth of a neural network can refer to the number of layers it comprises, and the number of neurons in each layer can be referred to as the width of that layer. In one implementation, a neural network includes an input layer and an output layer. The input layer processes the input information received by the neural network through neurons, passing the processing results to the output layer, which then obtains the output of the neural network. In another implementation, a neural network includes an input layer, a hidden layer, and an output layer. The input layer processes the input information received by the neural network through neurons, passing the processing results to an intermediate hidden layer. The hidden layer performs calculations on the received processing results to obtain a calculation result, which is then passed to the output layer or the next adjacent hidden layer, which ultimately obtains the output of the neural network. A neural network can include one hidden layer or multiple hidden layers connected in sequence, without limitation.

神经网络例如为深度神经网络(deep neural network,DNN)。根据网络的构建方式,DNN可以包括前馈神经网络(feedforward neural network,FNN)、卷积神经网络(convolutional neural networks,CNN)和递归神经网络(recurrent neural network,RNN)。An example of a neural network is a deep neural network (DNN). Depending on how the network is constructed, DNNs can include feedforward neural networks (FNNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs).

图2b为一种FNN网络示意图。FNN网络的特点为相邻层的神经元之间两两完全相连。该特点使得FNN通常需要大量的存储空间、导致较高的计算复杂度。Figure 2b is a schematic diagram of an FNN network. A characteristic of FNN networks is that neurons in adjacent layers are fully connected. This characteristic typically requires a large amount of storage space and results in high computational complexity.

CNN是一种专门来处理具有类似网格结构的数据的神经网络。例如,时间序列数据(例如时间轴离散采样)和图像数据(例如二维离散采样)都可以认为是类似网格结构的数据。CNN并不一次性利用全部的输入信息做运算,而是采用一个固定大小的窗截取部分信息做卷积运算,这就大大降低了模型参数的计算量。另外根据窗截取的信息类型的不同(如同一副图中的人和物为不同类型信息),每个窗可以采用不同的卷积核运算,这使得CNN能更好的提取输入数据的特征。CNN is a type of neural network specifically designed to process data with a grid-like structure. For example, time series data (e.g., discrete sampling on the time axis) and image data (e.g., discrete sampling on the two-dimensional axis) can both be considered grid-like data. CNNs do not utilize all input information at once for computation. Instead, they use a fixed-size window to intercept a portion of the information for convolution operations, significantly reducing the computational complexity of model parameters. Furthermore, depending on the type of information intercepted by the window (e.g., people and objects in an image represent different types of information), each window can use a different convolution kernel, enabling CNNs to better extract features from the input data.

RNN是一类利用反馈时间序列信息的DNN网络。RNN的输入包括当前时刻的新的输入值和自身在前一时刻的输出值。RNN适合获取在时间上具有相关性的序列特征,特别适用于语音识别、信道编译码等应用。RNNs are a type of DNN that utilizes feedback time series information. RNN inputs include the current input value and its own output value at the previous moment. RNNs are suitable for capturing temporally correlated sequence features and are particularly well-suited for applications such as speech recognition and channel coding.

在上述机器学习的模型训练过程中,可以定义损失函数(loss function)。损失函数描述了模型的输出值和理想目标值之间的差距或差异。损失函数可以通过多种形式体现,对于损失函数的具体形式不予限制。模型训练过程可以看作以下过程:通过调整模型的部分或全部参数,使得损失函数的值小于门限值或者满足目标需求。During the machine learning model training process, a loss function can be defined. This function describes the gap or discrepancy between the model's output and the desired target value. Loss functions can be expressed in a variety of forms, and their specific form is not restricted. The model training process can be viewed as adjusting some or all of the model's parameters to keep the loss function below a threshold or meet the target.

模型还可以被称为AI模型、规则或者其他名称等。AI模型可以认为是实现AI功能的具体方法。AI模型表征了模型的输入和输出之间的映射关系或者函数。AI功能可以包括以下一项或多项:数据收集、模型训练(或模型学习)、模型信息发布、模型推断(或称为模型推理、推理、或预测等)、模型监控或模型校验、或推理结果发布等。AI功能还可以称为AI(相关的)操作、或AI相关的功能。A model may also be referred to as an AI model, rule, or other name. An AI model can be considered a specific method for implementing an AI function. An AI model represents a mapping relationship or function between the input and output of a model. AI functions may include one or more of the following: data collection, model training (or model learning), model information release, model inference (or model reasoning, inference, or prediction, etc.), model monitoring or model verification, or inference result release, etc. AI functions may also be referred to as AI (related) operations, or AI-related functions.

下面将结合附图,对全连接神经网络的实现过程进行示例性描述。The following is an illustrative description of the implementation process of the fully connected neural network with reference to the accompanying drawings.

1.全连接神经网络,又叫多层感知机(multilayer perceptron,MLP)。1. Fully connected neural network, also called multilayer perceptron (MLP).

如图2c所示,一个MLP包含一个输入层(左侧),一个输出层(右侧),及多个隐藏层(中间)。其中,MLP的每层包含若干个节点,称为神经元。其中,相邻两层的神经元间两两相连。As shown in Figure 2c, an MLP consists of an input layer (left), an output layer (right), and multiple hidden layers (center). Each layer of the MLP contains several nodes, called neurons. Neurons in adjacent layers are connected to each other.

可选的,考虑相邻两层的神经元,下一层的神经元的输出h为所有与之相连的上一层神经元x的加权和并经过激活函数,可以表示为:
h=f(wx+b)。
Alternatively, considering neurons in two adjacent layers, the output h of the neurons in the next layer is the weighted sum of all neurons x in the previous layer connected to it and passes through the activation function, which can be expressed as:
h=f(wx+b).

其中,w为权重矩阵,b为偏置向量,f为激活函数。Among them, w is the weight matrix, b is the bias vector, and f is the activation function.

进一步可选的,神经网络的输出可以递归表达为:
y=fm(wmfm-1(…)+bm)。
Alternatively, the output of the neural network can be recursively expressed as:
y=f m (w m f m-1 (…)+b m ).

其中,m是神经网络层的索引,m大于或等于1,且m小于或等于M,其中M为神经网络的总层数。Where m is the index of the neural network layer, m is greater than or equal to 1, and m is less than or equal to M, where M is the total number of neural network layers.

换言之,可以将神经网络理解为一个从输入数据集合到输出数据集合的映射关系。而通常神经网络都是随机初始化的,用已有数据从随机的w和b得到这个映射关系的过程被称为神经网络的训练。In other words, a neural network can be understood as a mapping from an input data set to an output data set. Neural networks are typically initialized randomly, and the process of obtaining this mapping from random w and b using existing data is called neural network training.

可选的,训练的具体方式为采用损失函数对神经网络的输出结果进行评价。Optionally, a specific method of training is to use a loss function to evaluate the output results of the neural network.

如图2d所示,可以将误差反向传播,通过梯度下降的方法即能迭代优化神经网络参数(包括w和b),直到损失函数达到最小值,即图2d中的“较优点(例如最优点)”。可以理解的是,图2d中的“较优点(例如最优点)”对应的神经网络参数可以作为训练好的AI模型信息中的神经网络参数。As shown in Figure 2d, the error can be backpropagated, and the neural network parameters (including w and b) can be iteratively optimized using gradient descent until the loss function reaches a minimum, which is the "better point (e.g., optimal point)" in Figure 2d. It is understood that the neural network parameters corresponding to the "better point (e.g., optimal point)" in Figure 2d can be used as the neural network parameters in the trained AI model information.

进一步可选的,梯度下降的过程可以表示为:
Alternatively, the gradient descent process can be expressed as:

其中,θ为待优化参数(包括w和b),L为损失函数,η为学习率,控制梯度下降的步长,表示求导运算,表示对L求θ的导数。Among them, θ is the parameter to be optimized (including w and b), L is the loss function, and η is the learning rate, which controls the step size of gradient descent. represents the derivative operation, represents the derivative of θ with respect to L.

进一步可选的,反向传播的过程利用到求偏导的链式法则。Optionally, the backpropagation process utilizes the chain rule for partial derivatives.

如图2e所示,前一层参数的梯度可以由后一层参数的梯度递推计算得到,可以表达为:
As shown in Figure 2e, the gradient of the previous layer parameters can be recursively calculated from the gradient of the next layer parameters, which can be expressed as:

其中,wij为节点j连接节点i的权重,si为节点i上的输入加权和。Among them, wij is the weight of node j connecting to node i, and si is the weighted sum of the inputs on node i.

2.联邦学习(federated learning,FL)。2. Federated learning (FL).

联邦学习这一概念的提出有效地解决了当前人工智能发展所面临的困境,其在充分保障用户数据隐私和安全的前提下,通过促使各个边缘设备和中心端服务器协同合作来高效地完成模型的学习任务。The concept of federated learning effectively solves the current difficulties faced by the development of artificial intelligence. On the premise of fully protecting user data privacy and security, it efficiently completes the model learning task by promoting the collaboration between various edge devices and central servers.

如图2f所示,FL架构是当前FL领域最为广泛的训练架构,FedAvg算法是FL的基础算法,其算法流程大致如下:As shown in Figure 2f, the FL architecture is the most widely used training architecture in the current FL field. The FedAvg algorithm is the basic algorithm of FL. Its algorithm flow is roughly as follows:

(1)中心端初始化待训练模型并将其广播发送给所有客户端设备。(1) The center initializes the model to be trained And broadcast it to all client devices.

(2)在第t∈[1,T]轮中,客户端k∈[1,K]基于局部数据集对接收到的全局模型进行E个epoch的训练以得到本地训练结果将其上报给中心节点。(2) In the round t∈[1,T], client k∈[1,K] based on the local dataset For the received global model Perform E epochs of training to obtain local training results Report it to the central node.

(3)中心节点汇总收集来自全部(或部分)客户端的本地训练结果,假设第t轮上传局部模型的客户端集合为中心端将以对应客户端的样本数为权重进行加权求均得到新的全局模型,具体更新法则为其后中心端再将最新版本的全局模型广播发送给所有客户端设备进行新一轮的训练。(3) The central node aggregates and collects the local training results from all (or some) clients. Assume that the client set that uploads the local model in round t is The center will use the number of samples of the corresponding client as the weight to perform weighted averaging to obtain a new global model. The specific update rule is: The center then sends the latest version of the global model Broadcast to all client devices for a new round of training.

(4)重复步骤(2)和(3)直至模型最终收敛或训练轮数达到上限。(4) Repeat steps (2) and (3) until the model finally converges or the number of training rounds reaches the upper limit.

除了上报本地模型还可以将训练的本地梯度进行上报,中心节点将本地梯度求平均,并根据这个平均梯度的方向更新全局模型。In addition to reporting local models You can also use the local gradient of training After reporting, the central node averages the local gradients and updates the global model according to the direction of the average gradient.

可以看到,在FL框架中,数据集存在于分布式节点处,即分布式节点收集本地的数据集,并进行本地训练,将训练得到的本地结果(模型或梯度)上报给中心节点。中心节点本身没有数据集,只负责将分布式节点的训练结果进行融合处理,得到全局模型,并下发给分布式节点。As you can see, in the FL framework, datasets exist on distributed nodes. Distributed nodes collect local datasets, perform local training, and report the local training results (models or gradients) to the central node. The central node itself does not have a dataset; it is only responsible for fusing the training results of distributed nodes to obtain a global model and send it to the distributed nodes.

3.去中心式学习。与联邦学习不同,另一种分布式学习架构——去中心式学习。3. Decentralized learning: Different from federated learning, decentralized learning is another distributed learning architecture.

如图2g所示,考虑没有中心节点的完全分布式系统。去中心式学习系统的设计目标f(x)一般是各节点目标fi(x)的均值,即其中p是分布式节点数量,x是待优化参数,在机器学习中,x就是机器学习(如神经网络)模型的参数。各节点利用本地数据和本地目标fi(x)计算本地梯度然后将其发送给通信可达的邻居节点。任一节点收到其邻点发来的梯度信息后,可以按照下式更新本地模型的参数x:
As shown in Figure 2g, consider a fully distributed system without a central node. The design goal f(x) of a decentralized learning system is generally the mean of the goals fi (x) of each node, that is, Where p is the number of distributed nodes, x is the parameter to be optimized. In machine learning, x is the parameter of the machine learning (such as neural network) model. Each node uses local data and local target fi (x) to calculate the local gradient Then it is sent to the neighboring nodes that can be communicated with. After any node receives the gradient information sent by its neighbor, it can update the parameter x of the local model according to the following formula:

其中,表示第i个节点中第k+1(k为自然数)次更新后的本地模型的参数,表示第i个节点中第k次更新后的本地模型的参数(若k为0,则表示为第i个节点的未参与更新的本地模型的参数),αk表示调优系数,Pi是节点i的邻居节点集合,|Pi|表示节点i的邻居节点集合中的元素数量,即节点i的邻居节点数量。通过节点间的信息交互,去中心式学习系统最终将学到一个统一的模型。in, represents the parameters of the local model after the k+1th (k is a natural number) update in the i-th node, Represents the parameters of the local model after the kth update in the i-th node (if k is 0, it means (where αk is the parameter of the local model of node i that is not updated), αk represents the tuning coefficient, Pi is the set of neighboring nodes of node i, and | Pi | represents the number of elements in the set of neighboring nodes of node i, that is, the number of neighboring nodes of node i. Through information exchange between nodes, the decentralized learning system will eventually learn a unified model.

本申请提供的技术方案可以应用于无线通信系统(例如图1a或图1b或图1c所示系统),在无线通信系统中,通信节点一般具备信号收发能力和计算能力。以具备计算能力的网络设备为例,网络设备的计算能力主要是为信号收发能力提供算力支持(例如:对信号进行发送处理和接收处理),以实现网络设备与其它通信节点的通信任务。The technical solutions provided in this application can be applied to wireless communication systems (e.g., the systems shown in Figures 1a, 1b, or 1c). In wireless communication systems, communication nodes generally have both signal transceiver capabilities and computing capabilities. For example, network devices with computing capabilities primarily provide computing power to support signal transceiver capabilities (e.g., performing signal transmission and reception processing) to enable communication between the network device and other communication nodes.

然而,在通信网络中,通信节点的计算能力除了为上述通信任务提供算力支持之外,还可能具备富余的计算能力。为此,如何利用这些计算能力,是一个亟待解决的技术问题。However, in communication networks, communication nodes may have excess computing power beyond just supporting the aforementioned communication tasks. Therefore, how to utilize this computing power is a pressing technical issue.

在一种可能的实现方式中,通信节点可以作为AI学习系统的参与节点,将该通信节点的算力应用于AI学习系统(例如图2f或图2g所述AI学习系统)的某一个环节。一般来说,在AI学习系统中,AI节点可以根据任务建立数据集,离线完成模型的训练,在线部署后可以进行推理。然而,当进行长时间的持续学习后,AI模型有可能会出现可塑性消失的问题,即AI模型将不能学到新的训练样本的知识,导致该AI模型的性能将越来越差,导致持续学习的鲁棒性下降。作为一种可能的优化方式,可以在训练的同时选择性重新初始化部分参数的方法,从而保证神经网络可以保持学习新样本的能力。然而,在该优化方式中,需要设计复杂的函数及统计各个参数生命周期来选取重置参数集合,复杂度较高;同时,重新初始化参数会较大影响神经网络瞬时的性能,导致该上述优化方式并未能有效解决上述问题。In one possible implementation, a communication node can serve as a participating node in an AI learning system, and the computing power of the communication node can be applied to a certain link of the AI learning system (e.g., the AI learning system described in FIG2f or FIG2g). Generally speaking, in an AI learning system, an AI node can establish a data set according to a task, complete the training of the model offline, and perform reasoning after online deployment. However, after a long period of continuous learning, the AI model may have the problem of disappearing plasticity, that is, the AI model will not be able to learn the knowledge of new training samples, resulting in the performance of the AI model becoming worse and worse, resulting in a decrease in the robustness of continuous learning. As a possible optimization method, a method of selectively reinitializing some parameters during training can be used to ensure that the neural network can maintain the ability to learn new samples. However, in this optimization method, it is necessary to design complex functions and count the life cycles of various parameters to select the reset parameter set, which is highly complex; at the same time, reinitializing the parameters will greatly affect the instantaneous performance of the neural network, resulting in the above-mentioned optimization method not being able to effectively solve the above-mentioned problem.

为了解决上述问题,本申请提供了一种通信方法及相关设备,下面将结合附图进行详细介绍。In order to solve the above problems, the present application provides a communication method and related equipment, which will be described in detail below with reference to the accompanying drawings.

请参阅图3,为本申请提供的通信方法的一个实现示意图,该方法包括如下步骤。Please refer to FIG3 , which is a schematic diagram of an implementation of the communication method provided in this application. The method includes the following steps.

需要说明的是,在下文中,图3和图5中以第一通信装置和其它通信装置(例如第二通信装置)作为该交互示意的执行主体为例来示意该方法,但本申请并不限制该交互示意的执行主体。例如,通信装置可以为通信设备(例如终端设备或网络设备),或者,通信设备中的芯片、基带(baseband)芯片、调制解调(modem)芯片、包含modem核的片上系统(system on chip,SoC)芯片、系统级封装(system in package,SIP)芯片、通信模组、芯片系统、处理器、逻辑模块或软件等。It should be noted that, in the following, FIG3 and FIG5 illustrate the method by taking the first communication device and other communication devices (e.g., the second communication device) as the execution subjects of the interaction diagram as examples, but the present application does not limit the execution subjects of the interaction diagram. For example, the communication device can be a communication device (e.g., a terminal device or a network device), or a chip, a baseband chip, a modem chip, a system-on-chip (SoC) chip including a modem core, a system-in-package (SIP) chip, a communication module, a chip system, a processor, a logic module, or software in the communication device.

S301.第二通信装置发送第一信息,相应的,第一通信装置接收该第一信息。其中,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型。S301. A second communication device sends first information, and a first communication device receives the first information accordingly. The first information is used to indicate performance monitoring information of at least one AI model associated with a first AI model deployed on the first communication device.

S302.第一通信装置基于第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。S302. The first communication device performs noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.

本申请中,AI模型,可以替换为其它术语,例如神经网络、神经网络模型、AI神经网络模型、机器学习模型、或AI处理模型等。In this application, AI model can be replaced by other terms, such as neural network, neural network model, AI neural network model, machine learning model, or AI processing model.

本申请中,加噪扰动处理,可以替换为其它术语,例如加噪处理、扰动处理、噪声扰动处理、噪声干扰处理、或随机扰动处理等。In this application, noise disturbance processing can be replaced by other terms, such as noise addition processing, disturbance processing, noise disturbance processing, noise interference processing, or random disturbance processing.

在一种可能的实现方式中,第一通信装置获取的第一信息用于指示至少一个AI模型的性能监测信息,其中,该至少一个AI模型包括该第一AI模型和/或第二AI模型,该第一AI模型关联于第二AI模型,该第二AI模型用于部署在第二通信装置。其中,第一信息可以用于指示至少一个AI模型的性能监测信息,并且,在S302中第一通信装置可以基于第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。换言之,该第一信息指示的性能监测信息满足某个条件(例如后文描述的第一条件和/或第二条件)的情况下,该第一信息可以用于触发第一AI模型和/或第二AI模型的模型参数的加噪扰动处理。相应的,该至少一个AI模型可以包括该第一AI模型和/或第二AI模型,通过这种方式,第一通信装置能够基于存在关联关系的AI模型的性能触发AI模型的加噪扰动处理。In one possible implementation, the first information acquired by the first communication device is used to indicate performance monitoring information of at least one AI model, wherein the at least one AI model includes the first AI model and/or the second AI model, the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. The first information can be used to indicate the performance monitoring information of the at least one AI model, and in S302, the first communication device can perform noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model. In other words, when the performance monitoring information indicated by the first information meets a certain condition (e.g., the first condition and/or the second condition described below), the first information can be used to trigger noise perturbation processing of the model parameters of the first AI model and/or the second AI model. Accordingly, the at least one AI model can include the first AI model and/or the second AI model. In this way, the first communication device can trigger noise perturbation processing of the AI model based on the performance of the associated AI model.

需要说明的是,S301是可选的,对于第一通信装置而言,该第一通信装置除了通过与第二通信装置之间的交互的方式获取第一信息之外,还可以通过其它方式获取该第一信息。示例性的,该第一通信装置可以在本地获取该第一信息。例如,该第一通信装置可以获取该第一AI模型和/或该第二AI模型的测量参数,并基于该第一AI模型和/或该第二AI模型的测量参数确定该第一信息。It should be noted that S301 is optional. For the first communication device, in addition to obtaining the first information through interaction with the second communication device, the first communication device can also obtain the first information through other means. Exemplarily, the first communication device can obtain the first information locally. For example, the first communication device can obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model and/or the second AI model.

作为一种示例,AI模型(例如第一AI模型和/或第二AI模型)的测量参数可以包括该AI模型的输出。换言之,第一通信装置可以基于AI模型的输出确定该AI模型的性能监测信息。例如,该性能监测信息可以是通过该AI模型的输出与预配置的标签之间的数学关系(例如差值、方差、标准差等中的至少一个)确定的。As an example, the measured parameters of an AI model (e.g., the first AI model and/or the second AI model) may include the output of the AI model. In other words, the first communications device may determine performance monitoring information of the AI model based on the output of the AI model. For example, the performance monitoring information may be determined by a mathematical relationship (e.g., at least one of a difference, variance, standard deviation, etc.) between the output of the AI model and a preconfigured tag.

作为另一种示例,AI模型(例如第一AI模型和/或第二AI模型)的输出为通信参数(例如编码和/或解码的码率、加扰和/或解扰所使用的序列,调制和/或解调阶数、信道参数、通信速率、误码率等中的至少一个)的情况下,该AI模型的测量参数可以包括基于该通信参数进行通信的通信性能信息(例如误码率、误块率、参考信号接收信号质量(reference signal received power,RSRP)、信号与干扰加噪声比(signal to interference plus noise ratio,SINR)中的一项或多项)。换言之,第一通信装置可以基于通信性能信息与预配置的性能信息确定该AI模型的性能监测信息。例如,该性能监测信息可以是通过该通信性能信息与预配置的性能信息之间的数学关系(例如差值、方差、标准差等中的至少一个)确定的。As another example, when the output of an AI model (e.g., the first AI model and/or the second AI model) is a communication parameter (e.g., at least one of the encoding and/or decoding code rate, the sequence used for scrambling and/or descrambling, the modulation and/or demodulation order, the channel parameter, the communication rate, the bit error rate, etc.), the measurement parameter of the AI model may include communication performance information for communication based on the communication parameter (e.g., one or more of the bit error rate, the block error rate, the reference signal received power (RSRP), and the signal to interference plus noise ratio (SINR)). In other words, the first communication device may determine the performance monitoring information of the AI model based on the communication performance information and the preconfigured performance information. For example, the performance monitoring information may be determined by a mathematical relationship (e.g., at least one of the difference, variance, standard deviation, etc.) between the communication performance information and the preconfigured performance information.

相应的,第一通信装置可以通过多种方式获得AI模型的测量参数。例如,第一通信装置基于本地的第一AI模型的输出获得该第一AI模型的测量参数。又如,第一通信装置基于与其他通信装置的通信过程,获得该第一AI模型的测量参数。又如,第一通信装置基于与第二通信装置的通信过程,获得该第二AI模型的测量参数。Accordingly, the first communication device can obtain the measurement parameters of the AI model in a variety of ways. For example, the first communication device can obtain the measurement parameters of the first AI model based on the output of the local first AI model. In another example, the first communication device can obtain the measurement parameters of the first AI model based on the communication process with other communication devices. In another example, the first communication device can obtain the measurement parameters of the second AI model based on the communication process with a second communication device.

在一种可能的实现方式中,该第一AI模型关联于该第二AI模型,包括下述方式A至方式F中的任一项:In a possible implementation, the first AI model is associated with the second AI model, including any one of the following methods A to F:

方式A.第一AI模型为公共模型且该第二AI模型为个性化模型。Method A: The first AI model is a public model and the second AI model is a personalized model.

方式B.第一AI模型为个性化模型且该第二AI模型为公共模型。Method B: The first AI model is a personalized model and the second AI model is a public model.

方式C.第一AI模型的输入包括该第二AI模型的输出。Method C: The input of the first AI model includes the output of the second AI model.

方式D.第二AI模型的输入包括该第一AI模型的输出。Mode D: The input of the second AI model includes the output of the first AI model.

方式E.第一AI模型为教师模型且该第二AI模型为学生模型。Method E: The first AI model is a teacher model and the second AI model is a student model.

方式F.第一AI模型为学生型且该第二AI模型为教师模型。Method F: The first AI model is a student model and the second AI model is a teacher model.

方式A和方式B可以理解为图2f所示的联邦学习场景。在联邦学习中,存在中心节点和分布式节点。中心节点保存公共模型,周期性对分布式节点上传的个性化模型进行聚合。分布式节点周期性下载公共模型,并采用本地数据对公共模型进行微调得到个性化模型。在方式A中,第一通信装置可以为中心节点且第二通信装置可以为分布式节点。在方式B中,第二通信装置可以为中心节点且第一通信装置可以为分布式节点。Methods A and B can be understood as the federated learning scenario shown in Figure 2f. In federated learning, there are central nodes and distributed nodes. The central node stores the public model and periodically aggregates the personalized models uploaded by the distributed nodes. The distributed nodes periodically download the public model and fine-tune it using local data to obtain personalized models. In Method A, the first communication device can be the central node and the second communication device can be a distributed node. In Method B, the second communication device can be the central node and the first communication device can be a distributed node.

作为一种示例,如图4a所示,为联邦学习的一种实现示意图,以网络设备为中心节点且终端设备为分布式节点为例。其中,网络设备可以保存公共模型,终端设备1至终端设备3可以保存各自的个性化模型,两者可以通过模型下载过程和模型上传过程进行交互。其中,终端设备中的部分或全部AI模型可以参与联邦学习。在图4a所示示例中,终端设备1和终端设备2的AI模型可以全部参与联邦学习,终端设备3的AI模型可以包括参与联邦学习的带黑色图案填充的部分,以及,未参与联邦学习的空白填充的部分。As an example, as shown in Figure 4a, it is a schematic diagram of an implementation of federated learning, taking the network device as the central node and the terminal device as the distributed node as an example. Among them, the network device can save the public model, and the terminal device 1 to the terminal device 3 can save their own personalized models. The two can interact through the model download process and the model upload process. Among them, some or all AI models in the terminal device can participate in federated learning. In the example shown in Figure 4a, the AI models of terminal device 1 and terminal device 2 can all participate in federated learning, and the AI model of terminal device 3 can include a part filled with a black pattern that participates in federated learning, and a part filled with a blank that does not participate in federated learning.

图4a所示各个设备均可以作为第一通信装置,对本地部署的AI模型的模型参数进行加噪扰动处理。换言之,通过在公共模型或个性化模型的模型参数上加噪声扰动,从而增加持续联邦学习对分布变化的鲁棒性。Each device shown in Figure 4a can serve as a first communication device to perform noise perturbation on the model parameters of a locally deployed AI model. In other words, by adding noise perturbations to the model parameters of a public or personalized model, the robustness of continuous federated learning to distributional changes is increased.

作为加噪扰动处理的一种示例,对于任一AI模型,可以定义噪声扰动的标准差σi,模型参数集合(表示集合,pi表示集合中的元素),pi=1表示对该参数加噪声扰动(i取值范围为0至N-1,N为参与噪声扰动的神经网络参数的参数量),值为(即噪声ni服从正态分布,均值为0,方差为),pi=0表示无需进行扰动。As an example of noise perturbation processing, for any AI model, the standard deviation of the noise perturbation σ i can be defined, and the model parameter set ( represents a set, pi represents an element in the set), pi = 1 represents adding noise perturbation to the parameter (i ranges from 0 to N-1, N is the number of parameters of the neural network parameters involved in the noise perturbation), and the value is (i.e., the noise n i obeys Normal distribution, mean 0, variance ), p i = 0 means no disturbance is required.

可选地,加噪声扰动处理的参数可以为AI模型的部分或全部模型参数,例如,该部分或全部模型参数由上述的集合确定。如AI模型一共有1000个(即N=1000)参数,集合为{1,0,0,1…},集合中的1000个元素的序列对应这1000个参数,集合中取值为1的pi所在的位置对应的模型参数进行加噪扰动处理。Optionally, the parameters for the noise perturbation process may be part or all of the model parameters of the AI model, for example, the part or all of the model parameters are determined by the above The set is determined. For example, if the AI model has 1000 (i.e. N=1000) parameters, The set is {1,0,0,1…}, The sequence of 1000 elements in the collection corresponds to these 1000 parameters, The model parameters corresponding to the position where the value of pi in the set is 1 are subjected to noise disturbance processing.

作为加噪扰动处理的另一种示例,对于任一模型参数集合为集合的AI模型,加噪扰动处理满足:
ei=ei+ni
As another example of noise perturbation processing, for any model parameter set The AI model of the collection, noise disturbance processing satisfies:
e i =e i +n i ;

其中,ei表示模型的第i个参数,在pi=1时,(即噪声ni服从正态分布,均值为0,方差为);在pi=0时,ni=0。Where, e i represents the i-th parameter of the model. When p i = 1, (i.e., the noise n i obeys Normal distribution, mean 0, variance ); when p i =0, n i =0.

可选地,上述噪声ni服从正态分布之外,还可以有其它的实现方式。例如,ni服从均匀分布a,b表示下界和上界。Alternatively, in addition to the normal distribution, other implementations are possible. For example, a and b represent the lower and upper bounds.

方式C和方式D可以理解为拆分学习场景。在拆分学习中,AI模型可以拆分为两个或两个以上的部分。Methods C and D can be understood as split learning scenarios. In split learning, the AI model can be split into two or more parts.

作为一种示例,图4b为拆分学习的一种示意图。在该示例中,以AI模型拆分成两个部分为例,这两个部分可以为编码器和解码器,编码器和解码器之间通信中间特征以及中间梯度。训练时数据进入编码神经网络,获得中间特征后传输给解码神经网络。解码神经网络根据标签值计算损失,并反向更新解码神经网络,同时解码神经网络将中间梯度传输给编码神经网络,编码神经网络根据接收的中间梯度更新编码神经网络。As an example, Figure 4b shows a schematic diagram of split learning. In this example, the AI model is split into two parts, which can be an encoder and a decoder. The encoder and decoder communicate intermediate features and intermediate gradients. During training, data enters the encoding neural network, obtains intermediate features, and transmits them to the decoding neural network. The decoding neural network calculates the loss based on the label values and updates the decoding neural network in reverse order. Simultaneously, the decoding neural network transmits the intermediate gradients to the encoding neural network, which then updates the encoding neural network based on the received intermediate gradients.

图4b所示编码器或解码器可以作为第一通信装置中的第一AI模型,执行加噪扰动处理。换言之,可以在编码器或解码器的部分或全部模型参数上增加噪声,从而增加持续拆分学习对分布变化的鲁棒性。类似地,对于编码器或解码器的部分或全部模型参数的加噪扰动方式可以参考前文示例的描述。The encoder or decoder shown in Figure 4b can be used as the first AI model in the first communication device to perform noise perturbation processing. In other words, noise can be added to some or all of the model parameters of the encoder or decoder, thereby increasing the robustness of continuous split learning to distribution changes. Similarly, for the noise perturbation method for some or all of the model parameters of the encoder or decoder, please refer to the description of the previous example.

方式E和方式F可以理解为知识蒸馏场景。其中,知识蒸馏可以通过较大或较复杂的模型(例如教师模型或知识传授模型等)的输出对较小或较简洁的模型(例如学生模型或知识学习模型等)的损失进行增强,从而获得更好的收敛性能或者更快的收敛速度,可以被认为是一种模型压缩技术。Methods E and F can be understood as knowledge distillation scenarios. Knowledge distillation can enhance the loss of smaller or simpler models (such as student models or knowledge learning models) through the output of larger or more complex models (such as teacher models or knowledge transfer models), thereby achieving better convergence performance or faster convergence speed. This can be considered a model compression technology.

作为一种示例,图4c为知识蒸馏的一种示意图。该示例以知识蒸馏过程涉及的模型包括教师模型和学生模型为例。一般来说,教师模型不参与训练,但这里不做限定,即不区分具体的教师模型和学生模型,教师模型也可以被学生模型指导更新,或是两个模型同时进行更新,或是存在多个教师模型或多个学生模型。下面仅以最常见的单教师指导单学生模型学习举例。在图4c中,教师模型和学生模型分别推理得到推理结果,根据教师模型和学生模型的推理结果可以获得知识蒸馏损失(knowledge distillation loss,KD loss),部署了学生模型的设备可以根据标签可以获得任务损失(task Loss),然后该设备可以根据蒸馏损失和任务损失更新该学生模型的模型参数。As an example, Figure 4c is a schematic diagram of knowledge distillation. This example takes the models involved in the knowledge distillation process as an example, including a teacher model and a student model. Generally speaking, the teacher model does not participate in training, but this is not limited here, that is, there is no distinction between specific teacher models and student models. The teacher model can also be guided by the student model to update, or the two models can be updated at the same time, or there can be multiple teacher models or multiple student models. The following only takes the most common single teacher guiding single student model learning as an example. In Figure 4c, the teacher model and the student model respectively infer and obtain inference results. According to the inference results of the teacher model and the student model, the knowledge distillation loss (KD loss) can be obtained. The device deployed with the student model can obtain the task loss (task Loss) according to the label, and then the device can update the model parameters of the student model according to the distillation loss and task loss.

图4c所示教师模型或学生模型可以作为第一通信装置中的第一AI模型,执行加噪扰动处理。换言之,可以在教师模型或学生模型的部分或全部模型参数上增加噪声,从而增加持续知识蒸馏过程对分布变化的鲁棒性。类似地,对于教师模型或学生模型的部分或全部模型参数的加噪扰动方式可以参考前文示例的描述。The teacher model or student model shown in FIG4c can be used as the first AI model in the first communication device to perform noise perturbation processing. In other words, noise can be added to some or all model parameters of the teacher model or student model, thereby increasing the robustness of the continuous knowledge distillation process to distribution changes. Similarly, the noise perturbation method for some or all model parameters of the teacher model or student model can refer to the description of the previous example.

在一种可能的实现方式中,在S302中,满足第二条件时,该第一通信装置基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数;其中,该第二条件包括以下至少一项:In one possible implementation, in S302, when the second condition is met, the first communication device performs noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model; wherein the second condition includes at least one of the following:

该至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold;

该至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold;

该至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model resides is lower than a third threshold;

部署该至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold;

该至少一个AI模型的神经网络输入输出分布变化大于第五阈值;A change in the neural network input-output distribution of the at least one AI model is greater than a fifth threshold;

该至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold;

该至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold.

可选地,第一通信装置可以通过多种方式确定第一阈值至第七阈值的取值。例如,第一阈值至第七阈值中的一个或多个可以是预配置的。又如,第一阈值至第七阈值中的一个或多个可以是其它设备(例如第二通信装置、网络设备或服务器等)向第一通信装置配置的。Optionally, the first communication device may determine the values of the first to seventh thresholds in a variety of ways. For example, one or more of the first to seventh thresholds may be preconfigured. In another example, one or more of the first to seventh thresholds may be configured for the first communication device by another device (e.g., a second communication device, a network device, or a server).

其中,在满足第二条件时,第一通信装置可以确定第一AI模型当前的运行可能存在异常,为此,该第一通信装置可以于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数,以实现对第一AI模型的更新。通过这种方式,可以减缓或避免第一AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。When the second condition is met, the first communication device may determine that the current operation of the first AI model may be abnormal. To this end, the first communication device may process the model parameters of the first AI model to obtain updated model parameters of the first AI model to update the first AI model. In this way, the problem of the first AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability and robustness of continuous learning can be improved.

基于图3所示方案,第一通信装置通过第一信息获取至少一个AI模型的性能监测信息之后,在S302中,该第一通信装置可以对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。通过这种方式,能够使得通信节点的算力能够应用于AI模型的处理,并且,通过AI模型的性能检测信息能够实现对AI模型的更新。Based on the solution shown in Figure 3, after the first communication device obtains performance monitoring information of at least one AI model through the first information, in S302, the first communication device can perform noise perturbation processing on the model parameters of the first AI model to obtain updated model parameters of the first AI model. In this way, the computing power of the communication node can be applied to AI model processing, and the AI model can be updated based on the performance monitoring information of the AI model.

此外,在S302中,第一通信装置执行AI模型参数更新的过程是基于至少一个AI模型的性能监测信息触发的,并且,在该AI模型参数更新的过程中,第一通信装置是通过加噪扰动处理得到更新后的AI模型参数的。通过这种方式,使得方案能够应用于持续学习的场景,并且,通过对AI模型参数进行加噪扰动处理的过程,可以减缓或避免AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。In addition, in S302, the process of the first communication device performing the AI model parameter update is triggered based on the performance monitoring information of at least one AI model, and during the AI model parameter update process, the first communication device obtains the updated AI model parameters through noise perturbation processing. In this way, the solution can be applied to the scenario of continuous learning, and by performing noise perturbation processing on the AI model parameters, the problem of reduced or even disappearance of the plasticity of the AI model during long-term continuous learning can be slowed down or avoided, which can improve the adaptability of continuous learning and enhance the robustness of continuous learning.

在图3所示方法的一种可能的实现方式中,在S302之前,该方法还包括:该第一通信装置接收指示第三参数的指示信息,该第三参数用于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数。具体地,第一通信装置还可以接收通过接收的指示信息确定该第三参数,并基于该第三参数对第一AI模型的模型参数进行加噪扰动处理,即该第一通信装置可以基于其它通信装置(例如第二信息的接收方)的指示以实现第三参数的确定,以便于该第一通信装置能够通过指定的第三参数获得实现加噪扰动处理。In a possible implementation of the method shown in FIG3 , before S302, the method further includes: the first communication device receives indication information indicating a third parameter, and the third parameter is used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model. Specifically, the first communication device can also receive the third parameter determined by the received indication information, and perform noise perturbation processing on the model parameters of the first AI model based on the third parameter, that is, the first communication device can determine the third parameter based on the indication of other communication devices (such as the recipient of the second information), so that the first communication device can obtain the noise perturbation processing through the specified third parameter.

示例性的,第二通信装置可以基于第一信息向第一通信装置发送指示第三参数的指示信息,即该第二通信装置可以在确定满足上述第二条件的条件下,发送指示第三参数的指示信息,以使得第一通信装置获得第三参数并执行S302。Exemplarily, the second communication device may send indication information indicating the third parameter to the first communication device based on the first information, that is, the second communication device may send indication information indicating the third parameter under the condition that it is determined that the above-mentioned second condition is met, so that the first communication device obtains the third parameter and executes S302.

可选地,指示第三参数的指示信息可以包括第三参数,第三参数的索引(该索引可以参考下文表2或表3所示方式实现)中的至少一项。Optionally, the indication information indicating the third parameter may include at least one of the third parameter and an index of the third parameter (the index may be implemented in a manner shown in Table 2 or Table 3 below).

应理解,第三参数可以用于对该第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数,相应的,该第三参数可以理解为加噪扰动参数,加噪参数,或扰动参数等。例如,第三参数可以为上文描述的集合σ,中的一项或多项。It should be understood that the third parameter can be used to process the model parameters of the first AI model to obtain the updated model parameters of the first AI model. Accordingly, the third parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter. For example, the third parameter can be the set described above. σ, One or more of .

可选地,第一通信装置接收指示第三参数的指示信息之前,该方法还包括:该第一通信装置发送用于请求该第三参数的请求信息。从而,第一通信装置可以通过请求信息和上述指示信息的交互过程,以实现第三参数的确定。示例性的,第一通信装置可以基于第一信息发送用于请求该第三参数的请求信息,即该第一通信装置可以在确定满足上述第二条件的条件下,发送用于请求该第三参数的请求信息,以获得第三参数并执行S302。Optionally, before the first communication device receives the indication information indicating the third parameter, the method further includes: the first communication device sending request information for requesting the third parameter. Thus, the first communication device can determine the third parameter through an interaction between the request information and the indication information. Exemplarily, the first communication device can send the request information for requesting the third parameter based on the first information. That is, the first communication device can send the request information for requesting the third parameter upon determining that the second condition is met, thereby obtaining the third parameter and executing S302.

可选地,该第三参数为预配置的参数。Optionally, the third parameter is a preconfigured parameter.

作为一种示例,以第三参数包括集合为例,集合可以通过下述表2确定。As an example, the third parameter includes the set For example, the set This can be determined from Table 2 below.

表2

Table 2

作为一种示例,以第三参数包括σ为例,σ可以通过下述表3确定。As an example, taking the third parameter including σ as an example, σ can be determined by the following Table 3.

表3
Table 3

由上述图3及相关描述可知,第一通信装置可以基于至少一个AI模型的性能监测信息对部署于该第一通信装置的第一AI模型的模型参数进行加噪扰动处理,以提升持续学习的鲁棒性。其中,该至少一个AI模型可以是关联于第一AI模型,也可以是关联于部署于第二通信装置的第二AI模型的。换言之,第一通信装置也可以基于至少一个AI模型的性能监测信息,对部署于该第二通信装置的第二AI模型的模型参数进行加噪扰动处理,以提升持续学习的鲁棒性。下面将通过图5所示过程进行描述。As can be seen from Figure 3 and the related description above, the first communication device can perform noise perturbation processing on the model parameters of the first AI model deployed on the first communication device based on the performance monitoring information of at least one AI model to improve the robustness of continuous learning. The at least one AI model can be associated with the first AI model or associated with the second AI model deployed on the second communication device. In other words, the first communication device can also perform noise perturbation processing on the model parameters of the second AI model deployed on the second communication device based on the performance monitoring information of at least one AI model to improve the robustness of continuous learning. The process shown in Figure 5 will be described below.

请参阅图5,为本申请提供的通信方法的另一个示意图。Please refer to FIG5 , which is another schematic diagram of the communication method provided in this application.

S501.第二通信装置发送第一信息,相应的,第一通信装置接收该第一信息。其中,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型。S501. A second communication device sends first information, and a first communication device receives the first information accordingly. The first information is used to indicate performance monitoring information of at least one AI model associated with a first AI model deployed on the first communication device.

需要说明的是,S501为可选,对于第一通信装置而言,该第一通信装置除了通过与第二通信装置之间的交互的方式获取第一信息之外,还可以通过其它方式获取该第一信息。具体实现过程可以参考前文S301及相关描述。It should be noted that S501 is optional. For the first communication device, in addition to obtaining the first information through interaction with the second communication device, the first communication device can also obtain the first information through other means. For the specific implementation process, please refer to the above S301 and related descriptions.

S502.第一通信装置基于第一信息发送第二信息,相应的,第二通信装置接收该第二信息。其中,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新第二AI模型;其中,该第二AI模型用于部署在第二通信装置。S502. The first communication device sends second information based on the first information, and the second communication device receives the second information accordingly. The second information is obtained by performing noise perturbation processing based on the first parameter, and the second information is used to update a second AI model; the second AI model is deployed on the second communication device.

可选地,该第一参数包括以下一项或多项:更新后的第二AI模型的模型参数,用于更新该第二AI模型的梯度信息,用于更新该第二AI模型的中间特征信息,用于更新该第二AI模型的中间梯度信息,该第一AI模型的推理结果,该第二AI模型对应的蒸馏损失信息,该第二AI模型的任务损失信息。Optionally, the first parameter includes one or more of the following: model parameters of the updated second AI model, gradient information used to update the second AI model, intermediate feature information used to update the second AI model, intermediate gradient information used to update the second AI model, inference results of the first AI model, distillation loss information corresponding to the second AI model, and task loss information of the second AI model.

在一种可能的实现方式中,第一通信装置在S502中发送的第二信息可以通过多种方式实现,下面将进行详细介绍。In a possible implementation, the second information sent by the first communication device in S502 may be implemented in a variety of ways, which will be described in detail below.

方式一,第二信息是基于该第一参数的量化结果进行处理得到的。In a first approach, the second information is obtained by processing a quantized result of the first parameter.

可选地,在方式一中,在第二信息是基于该第一参数的量化结果进行处理得到的情况下,第二通信装置在S502中接收第二信息之后,该第二通信装置可以接收的第二信息进行解析处理(例如解析处理可以包括信道均衡、解调处理、解码处理、数字域处理等一项或多项),而基于解析处理的结果对第二AI模型进行更新。Optionally, in method one, when the second information is obtained by processing based on the quantization result of the first parameter, after the second communication device receives the second information in S502, the second communication device can perform analysis processing on the received second information (for example, the analysis processing may include one or more of channel equalization, demodulation processing, decoding processing, digital domain processing, etc.), and update the second AI model based on the result of the analysis processing.

方式二,该第二信息是对该第一参数的传输参数进行处理得到的。In a second approach, the second information is obtained by processing the transmission parameters of the first parameter.

可选地,在方式二中,在第二信息是对该第一参数的传输参数进行处理得到的情况下,该实现过程可以理解为空中计算的处理过程,即第二通信装置在S502中接收第二信息之后,该第二通信装置可以不对接收的第二信息进行数字域处理,而直接基于接收的第二信息的原始信息对第二AI模型进行更新。例如,第一通信装置将第一参数包含的一个或多个参数值转换为一个或多个第一符号;将一个或多个第一符号进行加噪扰动处理得到一个或多个第二符号之后,将该一个或多个第二符号映射到空口资源(例如时域资源、频域资源、空域资源中的至少一个)上,生成第二信息。在该示例中,第一通信装置不是将第一参数在应用层做信源编码,再以比特流发回物理层,由物理层进行信道编码、符号调制,再生成信号发送,而是将第一参数转换为一个或多个符号再进行传输。从而,在无线空口上传输该第一参数的方式,可提高第一参数的传输速度,以提升通信效率,并提升第二通信装置基于该第二信息进行模型更新的处理速度。Optionally, in the second method, when the second information is obtained by processing the transmission parameter of the first parameter, the implementation process can be understood as an over-the-air calculation process, that is, after the second communication device receives the second information in S502, the second communication device may not perform digital domain processing on the received second information, but directly update the second AI model based on the original information of the received second information. For example, the first communication device converts one or more parameter values contained in the first parameter into one or more first symbols; after performing noise perturbation processing on the one or more first symbols to obtain one or more second symbols, the one or more second symbols are mapped to air interface resources (e.g., at least one of time domain resources, frequency domain resources, and air domain resources) to generate the second information. In this example, the first communication device does not perform source encoding on the first parameter at the application layer and then send it back to the physical layer as a bit stream, which is then channel-coded and symbol-modulated by the physical layer and then generated and sent as a signal, but converts the first parameter into one or more symbols before transmitting. Therefore, the method of transmitting the first parameter on the wireless air interface can increase the transmission speed of the first parameter, thereby improving communication efficiency, and increasing the processing speed of the second communication device for model update based on the second information.

可选地,第一参数的传输参数可以包括发送功率参数、预编码参数、是否使用方式二进行处理的指示信息(该指示信息可以用于控制一个或多个通信装置是否使用方式二的方式进行信息传输,以通过控制接入的通信装置的数量来控制空中叠加后的信号的信噪比)中的一项或多项。Optionally, the transmission parameters of the first parameter may include one or more of a transmit power parameter, a precoding parameter, and an indication of whether to use method 2 for processing (the indication information can be used to control whether one or more communication devices use method 2 to transmit information, so as to control the signal-to-noise ratio of the signal after superposition in the air by controlling the number of accessed communication devices).

下面将以方式二为例,对AI模型的不同实现方式下,第二信息的发送过程进行示例性描述。The following will take method 2 as an example to exemplify the process of sending the second information under different implementation methods of the AI model.

作为一种示例,第一AI模型和第二AI模型通过前文方式A或方式B所示方式实现的情况下,第一参数可以包括更新后的第二AI模型的模型参数,用于更新该第二AI模型的梯度信息中的一项或多项。As an example, when the first AI model and the second AI model are implemented by the method shown in method A or method B above, the first parameter may include the model parameter of the updated second AI model, which is used to update one or more items of the gradient information of the second AI model.

例如,以图4a所示过程为例,第一通信装置可以为网络设备。其中,该第一通信装置可以在公共模型(即第一参数包括公共模型的模型参数或更新梯度)下发的过程中,对该公共模型的模型参数或更新梯度进行加噪扰动处理得到第二信息,并在S502中发送该第二信息。For example, taking the process shown in FIG4a as an example, the first communication device may be a network device. The first communication device may, during the process of issuing the public model (i.e., the first parameters include the model parameters or update gradients of the public model), perform noise perturbation processing on the model parameters or update gradients of the public model to obtain second information, and transmit the second information in S502.

又如,以图4a所示过程为例,第一通信装置可以为任一终端设备。其中,该第一通信装置可以在个性化模型(即第一参数包括个性化模型的模型参数或更新梯度)上传的过程中,对该个性化模型的模型参数或更新梯度进行加噪扰动处理得到第二信息,并在S502中发送该第二信息。For another example, using the process shown in FIG4a as an example, the first communication device can be any terminal device. During the process of uploading the personalized model (i.e., the first parameters include model parameters or update gradients of the personalized model), the first communication device can perform noise perturbation processing on the model parameters or update gradients of the personalized model to obtain second information, and transmit the second information in S502.

换言之,在模型下载或模型上传过程中,可以由无损传输变为有损传输,即模型参数或更新梯度在上传下载时会叠加传输噪声,以实现加噪扰动处理的效果。In other words, during the model download or upload process, lossless transmission can be changed to lossy transmission, that is, the model parameters or update gradients will be superimposed with transmission noise during uploading and downloading to achieve the effect of noise disturbance processing.

作为另一种示例,第一AI模型和第二AI模型通过前文方式C或方式D所示方式实现的情况下,第一参数可以包括用于更新该第二AI模型的中间特征信息,用于更新该第二AI模型的中间梯度信息中的一项或多项。As another example, when the first AI model and the second AI model are implemented by the method shown in Method C or Method D above, the first parameter may include intermediate feature information for updating the second AI model, and one or more items of intermediate gradient information for updating the second AI model.

类似地,以图4b所示过程为例,第一通信装置可以在中间特征与中间梯度传输时增加噪声扰动,使得编码器或解码器的中间特征和中间梯度的传输方式从无损传输变为了有损传输,即中间特征和中间梯度在上传/下载时会叠加传输噪声,以实现加噪扰动处理的效果。Similarly, taking the process shown in Figure 4b as an example, the first communication device can add noise disturbance when transmitting the intermediate features and intermediate gradients, so that the transmission mode of the intermediate features and intermediate gradients of the encoder or decoder is changed from lossless transmission to lossy transmission, that is, the intermediate features and intermediate gradients will be superimposed with transmission noise during uploading/downloading to achieve the effect of noise disturbance processing.

作为另一种示例,第一AI模型和第二AI模型通过前文方式E或方式F所示方式实现的情况下,第一参数可以包括第一AI模型的推理结果,该第二AI模型对应的蒸馏损失信息,该第二AI模型的任务损失信息中的一项或多项。As another example, when the first AI model and the second AI model are implemented by the method shown in Method E or Method F above, the first parameter may include one or more of the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model.

类似地,以图4c所示过程为例,假设蒸馏损失的计算公式为lossKD=L(ft,fs),ft与fs分别为计算蒸馏损失时教师模型的输入和学生模型的输入,如AI模型的中间特征,或AI模型的输出的结果。对于知识蒸馏的训练过程中,蒸馏损失的计算可以与教师模型在一个节点,也可以与学生模型在一个节点,也可以单独在一个其他节点。故对于知识蒸馏来说,可以使用有损传输来传输ft和或fs,以实现加噪扰动处理的效果。Similarly, using the process shown in Figure 4c as an example, assume that the formula for calculating the distillation loss is loss KD = L( ft , fs ), where ft and fs are the inputs to the teacher model and student model, respectively, when calculating the distillation loss. These inputs can be intermediate features of the AI model or the output of the AI model. During knowledge distillation training, the distillation loss can be calculated on the same node as the teacher model, on the same node as the student model, or separately on a different node. Therefore, for knowledge distillation, lossy transmission can be used to transmit ft and/or fs to achieve the effect of noise perturbation.

在一种可能的实现方式中,在S502中,满足第一条件时,该第一通信装置基于该第一信息发送第二信息;其中,该第一条件包括以下至少一项:In a possible implementation, in S502, when a first condition is met, the first communication device sends the second information based on the first information; wherein the first condition includes at least one of the following:

该至少一个AI模型的训练精度低于第八阈值;The training accuracy of the at least one AI model is lower than an eighth threshold;

该至少一个AI模型的测试精度低于第九阈值;The test accuracy of the at least one AI model is lower than a ninth threshold;

该至少一个AI模型所在的AI模型系统的系统性能低于第十阈值;The system performance of the AI model system in which the at least one AI model resides is lower than a tenth threshold;

部署该至少一个AI模型的通信装置所在的通信系统的系统性能低于第十一阈值;The system performance of the communication system in which the communication device deploys the at least one AI model is located is lower than an eleventh threshold;

该至少一个AI模型的神经网络输入输出分布变化大于第十二阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold;

该至少一个AI模型的神经网络权重分布大于第十三阈值;The neural network weight distribution of the at least one AI model is greater than a thirteenth threshold;

该至少一个AI模型的神经网络权重变化量低于第十四阈值。The change in the neural network weight of the at least one AI model is lower than a fourteenth threshold.

可选地,第一条件与第二条件可以是相同的,例如,第一阈值至第七阈值中的第i(i取值为1至7)个,与第八阈值至第十四阈值中的第i个是相同的。Optionally, the first condition and the second condition may be the same, for example, the i-th (i is 1 to 7) threshold value from the first to the seventh threshold value is the same as the i-th threshold value from the eighth to the fourteenth threshold value.

可选地,第一条件与第二条件可以是不同的,例如,第一阈值至第七阈值中的第i(i取值为1至7)个,与第八阈值至第十四阈值中的第i个是部分不同的或全部不同的。Optionally, the first condition and the second condition may be different, for example, the i-th (i is 1 to 7) threshold value from the first to seventh threshold values is partially different or completely different from the i-th threshold value from the eighth to fourteenth threshold values.

可选地,第一通信装置可以通过多种方式确定第八阈值至第十四阈值的取值。例如,第八阈值至第十四阈值中的一个或多个可以是预配置的。又如,第八阈值至第十四阈值中的一个或多个可以是其它设备(例如第二通信装置、网络设备或服务器等)向第一通信装置配置的。Optionally, the first communication device may determine the values of the eighth to fourteenth thresholds in a variety of ways. For example, one or more of the eighth to fourteenth thresholds may be preconfigured. In another example, one or more of the eighth to fourteenth thresholds may be configured for the first communication device by another device (e.g., a second communication device, a network device, or a server).

其中,在满足第一条件时,第一通信装置可以确定第二AI模型当前的运行可能存在异常,为此,该第一通信装置可以发送第二信息,使得第二通信装置能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Among them, when the first condition is met, the first communication device can determine that the current operation of the second AI model may be abnormal. To this end, the first communication device can send second information, so that the second communication device can obtain second information obtained by noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, the problem of the second AI model's plasticity being reduced or even disappearing during long-term continuous learning can be alleviated or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.

基于图5所示方案,在S502中,第一通信装置可以基于第一信息发送基于第一参数进行加噪扰动处理得到的第二信息,使得该第二信息的接收方(例如第二通信装置)能够获得基于第一参数进行加噪扰动处理得到的第二信息,并基于该第二信息更新第二AI模型。通过这种方式,相比于不做加噪扰动处理而直接发送第一参数的方式,可以减缓或避免第二AI模型在长时间的持续学习过程中导致可塑性降低甚至消失的问题,能够提高持续学习的适应能力,并提升持续学习的鲁棒性。Based on the scheme shown in Figure 5, in S502, the first communication device can send the second information obtained by performing noise perturbation processing based on the first parameter based on the first information, so that the recipient of the second information (such as the second communication device) can obtain the second information obtained by performing noise perturbation processing based on the first parameter, and update the second AI model based on the second information. In this way, compared with the method of directly sending the first parameter without performing noise perturbation processing, the problem of reduced or even disappearance of the plasticity of the second AI model during long-term continuous learning can be slowed down or avoided, and the adaptability of continuous learning can be improved, and the robustness of continuous learning can be enhanced.

需要说明的是,图5所示方法与前文图3所示方法可以相互结合。例如,在图3所示S301之后,还可以执行图5所示的S502,其中,S302和S502的执行顺序不做限定,即可以先执行S302后执行S502,也可以先执行S502后执行S302。It should be noted that the method shown in FIG5 can be combined with the method shown in FIG3 above. For example, after S301 shown in FIG3, S502 shown in FIG5 can also be executed. The execution order of S302 and S502 is not limited. That is, S302 can be executed first and then S502, or S502 can be executed first and then S302.

在图5所示方法的一种可能的实现方式中,在S502之前,该方法还包括:该第一通信装置接收指示第二参数的指示信息,该第二参数用于对该第一参数进行处理得到该第二信息。具体地,第一通信装置还可以通过接收的指示信息确定该第二参数,并基于该第二参数对第一参数进行处理得到第二信息,即该第一通信装置可以基于其它通信装置(例如第二信息的接收方)的指示以实现第二参数的确定,以便于该第二信息的接收方能够通过指定的第二参数获得相应的第二信息。In one possible implementation of the method shown in FIG5 , before S502, the method further includes: the first communication device receiving indication information indicating a second parameter, the second parameter being used to process the first parameter to obtain the second information. Specifically, the first communication device may also determine the second parameter based on the received indication information, and process the first parameter based on the second parameter to obtain the second information. That is, the first communication device may determine the second parameter based on an indication from another communication device (e.g., a recipient of the second information), so that the recipient of the second information can obtain the corresponding second information using the specified second parameter.

示例性的,第二通信装置可以基于第一信息向第一通信装置发送指示第二参数的指示信息,即该第二通信装置可以在确定满足上述第一条件的条件下,发送指示第二参数的指示信息,以使得第一通信装置获得第二参数并执行S502。Exemplarily, the second communication device may send indication information indicating the second parameter to the first communication device based on the first information, that is, the second communication device may send indication information indicating the second parameter under the condition that it is determined that the above-mentioned first condition is met, so that the first communication device obtains the second parameter and executes S502.

可选地,指示第二参数的指示信息可以包括第二参数,第二参数的索引(该索引可以参考上文表2或表3所示方式实现)中的至少一项。Optionally, the indication information indicating the second parameter may include at least one of the second parameter and an index of the second parameter (the index may be implemented in a manner as shown in Table 2 or Table 3 above).

应理解,第二参数可以用于对第一参数进行加噪扰动处理得到第二信息,相应的,该第二参数可以理解为加噪扰动参数,加噪参数,或扰动参数等。It should be understood that the second parameter can be used to perform noise disturbance processing on the first parameter to obtain the second information. Accordingly, the second parameter can be understood as a noise disturbance parameter, a noise parameter, or a disturbance parameter, etc.

可选地,在第一通信装置接收指示第二参数的指示信息之前,该方法还包括:该第一通信装置发送用于请求该第二参数的请求信息。从而,第一通信装置可以通过请求信息和上述指示信息的交互过程,以实现第二参数的确定。示例性的,第一通信装置可以基于第一信息发送用于请求该第二参数的请求信息,即该第一通信装置可以在确定满足上述第一条件的条件下,发送用于请求该第二参数的请求信息,以获得第二参数并执行S502。Optionally, before the first communication device receives the indication information indicating the second parameter, the method further includes: the first communication device sending request information for requesting the second parameter. Thus, the first communication device can determine the second parameter through an interaction between the request information and the indication information. Exemplarily, the first communication device can send the request information for requesting the second parameter based on the first information. That is, the first communication device can send the request information for requesting the second parameter upon determining that the first condition is met, thereby obtaining the second parameter and executing S502.

可选地,该第二参数为预配置的参数。Optionally, the second parameter is a preconfigured parameter.

请参阅图6,本申请实施例提供了一种通信装置600,该通信装置600可以实现上述方法实施例中第一通信装置(或第二通信装置)的功能,因此也能实现上述方法实施例所具备的有益效果。在本申请实施例中,该通信装置600可以是第一通信装置(或第二通信装置),也可以是第一通信装置(或第二通信装置)内部的集成电路或者元件等,例如芯片、基带芯片、modem芯片、包含modem核的SoC芯片、系统级封装(system in package,SIP)芯片、通信模组、芯片系统、处理器等。Referring to FIG. 6 , an embodiment of the present application provides a communication device 600. This communication device 600 can implement the functions of the first communication device (or second communication device) in the above-described method embodiment, thereby also achieving the beneficial effects of the above-described method embodiment. In this embodiment of the present application, the communication device 600 can be the first communication device (or second communication device), or it can be an integrated circuit or component within the first communication device (or second communication device), such as a chip, a baseband chip, a modem chip, a SoC chip including a modem core, a system-in-package (SIP) chip, a communication module, a chip system, a processor, etc.

需要说明的是,收发单元602可以包括发送单元和接收单元,分别用于执行发送和接收。It should be noted that the transceiver unit 602 may include a sending unit and a receiving unit, which are respectively used to perform sending and receiving.

一种可能的实现方式中,当该通信装置600为用于执行图3及相关实施例中第一通信装置所执行的方法时,该通信装置600包括处理单元601;该处理单元601用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型;该处理单元601还用于基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。In one possible implementation, when the communication device 600 is used to execute the method executed by the first communication device in Figure 3 and related embodiments, the communication device 600 includes a processing unit 601; the processing unit 601 is used to obtain first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with the first AI model deployed in the first communication device; the processing unit 601 is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.

一种可能的实现方式中,当该通信装置600为用于执行图5及相关实施例中第一通信装置所执行的方法时,该通信装置600包括处理单元601;该处理单元601用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;该收发单元602用于基于该第一信息发送第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新该第二AI模型。In one possible implementation, when the communication device 600 is used to execute the method executed by the first communication device in Figure 5 and related embodiments, the communication device 600 includes a processing unit 601; the processing unit 601 is used to obtain first information, and the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model for deployment in a second communication device; the transceiver unit 602 is used to send second information based on the first information, and the second information is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.

一种可能的实现方式中,当该通信装置600为用于执行图5及相关实施例中第二通信装置所执行的方法时,该通信装置600包括处理单元601;该收发单元602用于接收第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的;其中,该第二信息用于更新部署在该第二通信装置的第二AI模型;该处理单元601用于基于该第二信息更新该第二AI模型。In one possible implementation, when the communication device 600 is used to execute the method executed by the second communication device in Figure 5 and related embodiments, the communication device 600 includes a processing unit 601; the transceiver unit 602 is used to receive second information, which is obtained by noise disturbance processing based on the first parameter; wherein the second information is used to update the second AI model deployed in the second communication device; the processing unit 601 is used to update the second AI model based on the second information.

在一种可能的设计中,当该通信装置600是终端设备或终端中的通信模组时,该处理单元601的功能可以由一个或多个处理器实现。具体的该处理器可以包括modem芯片,或包含modem核的SoC芯片或SIP芯片。收发单元602的功能可以由收发机电路来实现。In one possible design, when the communication device 600 is a terminal device or a communication module in a terminal, the functions of the processing unit 601 can be implemented by one or more processors. Specifically, the processor can include a modem chip, or a SoC chip or SIP chip containing a modem core. The functions of the transceiver unit 602 can be implemented by a transceiver circuit.

在一种可能的设计中,当该通信装置600是终端中负责通信功能的电路或芯片,如modem芯片或包含modem核的SoC芯片或SIP芯片时,该处理单元601的功能可以由上述芯片中包括一个或多个处理器或处理器核的电路系统来实现。收发单元602功能可以由上述芯片上的接口电路或数据收发电路来实现。In one possible design, when the communication device 600 is a circuit or chip responsible for communication functions in a terminal, such as a modem chip or a SoC chip or SIP chip containing a modem core, the functions of the processing unit 601 can be implemented by a circuit system including one or more processors or processor cores in the above chip. The functions of the transceiver unit 602 can be implemented by an interface circuit or data transceiver circuit on the above chip.

需要说明的是,上述通信装置600的单元的信息执行过程等内容,具体可参见本申请前述所示的方法实施例中的叙述,此处不再赘述。It should be noted that, for details on the information execution process of the units of the above-mentioned communication device 600, please refer to the description in the method embodiment shown above in this application, and no further details will be given here.

请参阅图7,为本申请提供的通信装置700的另一种示意性结构图,通信装置700包括逻辑电路701和输入输出接口702。其中,通信装置700可以为芯片或集成电路。Please refer to Fig. 7, which is another schematic structural diagram of a communication device 700 provided in this application. The communication device 700 includes a logic circuit 701 and an input/output interface 702. The communication device 700 may be a chip or an integrated circuit.

其中,图6所示收发单元602可以包括通信接口,该通信接口可以包括图7中的输入输出接口702,该输入输出接口702可以包括输入接口和输出接口。或者,该通信接口也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。The transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the input/output interface 702 in Figure 7 , which may include an input interface and an output interface. Alternatively, the communication interface may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.

一种可能的实现方式中,当该通信装置700为用于执行图3及相关实施例中第一通信装置所执行的方法时,该逻辑电路701用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于部署在该第一通信装置的第一AI模型;该逻辑电路701还用于基于该第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。In one possible implementation, when the communication device 700 is used to execute the method executed by the first communication device in Figure 3 and related embodiments, the logic circuit 701 is used to obtain first information, where the first information is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with the first AI model deployed in the first communication device; the logic circuit 701 is also used to perform noise disturbance processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model.

一种可能的实现方式中,当该通信装置700为用于执行图5及相关实施例中第一通信装置所执行的方法时,该逻辑电路701用于获取第一信息,该第一信息用于指示至少一个AI模型的性能监测信息,该至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;该输入输出接口702用于基于该第一信息发送第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的,该第二信息用于更新该第二AI模型。In one possible implementation, when the communication device 700 is used to execute the method executed by the first communication device in Figure 5 and related embodiments, the logic circuit 701 is used to obtain first information, which is used to indicate performance monitoring information of at least one AI model, and the at least one AI model is associated with a second AI model deployed in a second communication device; the input and output interface 702 is used to send second information based on the first information, which is obtained by noise disturbance processing based on the first parameter, and the second information is used to update the second AI model.

一种可能的实现方式中,当该通信装置700为用于执行图5及相关实施例中第二通信装置所执行的方法时,该输入输出接口702用于接收第二信息,该第二信息是基于第一参数进行加噪扰动处理得到的;其中,该第二信息用于更新部署在第二通信装置的第二AI模型;该逻辑电路701用于基于该第二信息更新该第二AI模型。In one possible implementation, when the communication device 700 is used to execute the method executed by the second communication device in Figure 5 and related embodiments, the input-output interface 702 is used to receive second information, which is obtained by noise disturbance processing based on the first parameter; wherein the second information is used to update the second AI model deployed in the second communication device; and the logic circuit 701 is used to update the second AI model based on the second information.

其中,逻辑电路701和输入输出接口702还可以执行任一实施例中第一通信装置或第二通信装置执行的其他步骤并实现对应的有益效果,此处不再赘述。The logic circuit 701 and the input/output interface 702 may also execute other steps executed by the first communication device or the second communication device in any embodiment and achieve corresponding beneficial effects, which will not be described in detail here.

在一种可能的实现方式中,图6所示处理单元601可以为图7中的逻辑电路701。In a possible implementation, the processing unit 601 shown in FIG. 6 may be the logic circuit 701 in FIG. 7 .

可选的,逻辑电路701可以是一个处理装置,处理装置的功能可以部分或全部通过软件实现。其中,处理装置的功能可以部分或全部通过软件实现。Optionally, the logic circuit 701 may be a processing device, and the functions of the processing device may be partially or entirely implemented by software. The functions of the processing device may be partially or entirely implemented by software.

可选的,处理装置可以包括存储器和处理器,其中,存储器用于存储计算机程序,处理器读取并执行存储器中存储的计算机程序,以执行任意一个方法实施例中的相应处理和/或步骤。Optionally, the processing device may include a memory and a processor, wherein the memory is used to store a computer program, and the processor reads and executes the computer program stored in the memory to perform corresponding processing and/or steps in any one of the method embodiments.

可选地,处理装置可以仅包括处理器。用于存储计算机程序的存储器位于处理装置之外,处理器通过电路/电线与存储器连接,以读取并执行存储器中存储的计算机程序。其中,存储器和处理器可以集成在一起,或者也可以是物理上互相独立的。Alternatively, the processing device may include only a processor. A memory for storing the computer program is located outside the processing device, and the processor is connected to the memory via circuits/wires to read and execute the computer program stored in the memory. The memory and processor may be integrated or physically separate.

可选地,该处理装置可以是一个或多个芯片,或一个或多个集成电路。例如,处理装置可以是一个或多个现场可编程门阵列(field-programmable gate array,FPGA)、专用集成芯片(application specific integrated circuit,ASIC)、系统芯片(system on chip,SoC)、中央处理器(central processor unit,CPU)、网络处理器(network processor,NP)、数字信号处理电路(digital signal processor,DSP)、微控制器(micro controller unit,MCU),可编程逻辑控制器(programmable logic device,PLD)或其它集成芯片,或者上述芯片或者处理器的任意组合等。Optionally, the processing device may be one or more chips, or one or more integrated circuits. For example, the processing device may be one or more field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), system-on-chips (SoCs), central processing units (CPUs), network processors (NPs), digital signal processing circuits (DSPs), microcontrollers (MCUs), programmable logic devices (PLDs), or other integrated chips, or any combination of the above chips or processors.

请参阅图8,为本申请的实施例提供的上述实施例中所涉及的通信装置800,该通信装置800具体可以为上述实施例中的作为终端设备的通信装置,图8所示示例为终端设备通过终端设备(或者终端设备中的部件)实现。Please refer to Figure 8, which shows the communication device 800 involved in the above-mentioned embodiments provided in an embodiment of the present application. The communication device 800 can specifically be a communication device serving as a terminal device in the above-mentioned embodiments. The example shown in Figure 8 is that the terminal device is implemented through the terminal device (or a component in the terminal device).

其中,该通信装置800的一种可能的逻辑结构示意图,该通信装置800可以包括但不限于至少一个处理器801以及通信端口802。Herein, a possible logical structure diagram of the communication device 800 is shown. The communication device 800 may include but is not limited to at least one processor 801 and a communication port 802 .

其中,图6所示收发单元602可以包括通信接口,该通信接口可以包括图8中的通信端口802,该通信端口802可以包括输入接口和输出接口。或者,该通信端口802也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。The transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the communication port 802 shown in Figure 8 , which may include an input interface and an output interface. Alternatively, the communication port 802 may also be a transceiver circuit, which may include an input interface circuit and an output interface circuit.

进一步可选的,该通信装置800还可以包括存储器803、总线804中的至少一个,在本申请的实施例中,该至少一个处理器801用于对通信装置800的动作进行控制处理。Further optionally, the communication device 800 may also include at least one of a memory 803 and a bus 804. In an embodiment of the present application, the at least one processor 801 is used to control and process the actions of the communication device 800.

此外,处理器801可以是中央处理器单元,通用处理器,数字信号处理器,专用集成电路,现场可编程门阵列或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。该处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,数字信号处理器和微处理器的组合等等。所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Furthermore, the processor 801 may be a central processing unit (CPU), a general-purpose processor (GPPC), a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or other programmable logic device (PLD), a transistor logic device (TLD), a hardware component, or any combination thereof. It may implement or execute the various exemplary logic blocks, modules, and circuits described in conjunction with the disclosure of this application. The processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, and the like. Those skilled in the art will clearly understand that, for ease and brevity of description, the specific operating processes of the systems, devices, and units described above may refer to the corresponding processes in the aforementioned method embodiments and will not be further described herein.

需要说明的是,图8所示通信装置800具体可以用于实现前述方法实施例中终端设备所实现的步骤,并实现终端设备对应的技术效果,图8所示通信装置的具体实现方式,均可以参考前述方法实施例中的叙述,此处不再一一赘述。It should be noted that the communication device 800 shown in Figure 8 can be specifically used to implement the steps implemented by the terminal device in the aforementioned method embodiment and achieve the corresponding technical effects of the terminal device. The specific implementation methods of the communication device shown in Figure 8 can refer to the description in the aforementioned method embodiment and will not be repeated here.

请参阅图9,为本申请的实施例提供的上述实施例中所涉及的通信装置900的结构示意图,该通信装置900具体可以为上述实施例中的作为网络设备的通信装置,图9所示示例为网络设备通过网络设备(或者网络设备中的部件)实现,其中,该通信装置的结构可以参考图9所示的结构。Please refer to Figure 9, which is a structural diagram of the communication device 900 involved in the above-mentioned embodiments provided in an embodiment of the present application. The communication device 900 can specifically be a communication device as a network device in the above-mentioned embodiments. The example shown in Figure 9 is that the network device is implemented through the network device (or a component in the network device), wherein the structure of the communication device can refer to the structure shown in Figure 9.

通信装置900包括至少一个处理器911以及至少一个网络接口914。进一步可选的,该通信装置还包括至少一个存储器912、至少一个收发器913和一个或多个天线915。处理器911、存储器912、收发器913和网络接口914相连,例如通过总线相连,在本申请实施例中,该连接可包括各类接口、传输线或总线等,本实施例对此不做限定。天线915与收发器913相连。网络接口914用于使得通信装置通过通信链路,与其它通信设备通信。例如网络接口914可以包括通信装置与核心网设备之间的网络接口,例如S1接口,网络接口可以包括通信装置和其他通信装置(例如其他网络设备或者核心网设备)之间的网络接口,例如X2或者Xn接口。The communication device 900 includes at least one processor 911 and at least one network interface 914. Further optionally, the communication device also includes at least one memory 912, at least one transceiver 913 and one or more antennas 915. The processor 911, the memory 912, the transceiver 913 and the network interface 914 are connected, for example, via a bus. In an embodiment of the present application, the connection may include various interfaces, transmission lines or buses, etc., which are not limited in this embodiment. The antenna 915 is connected to the transceiver 913. The network interface 914 is used to enable the communication device to communicate with other communication devices through a communication link. For example, the network interface 914 may include a network interface between the communication device and the core network device, such as an S1 interface, and the network interface may include a network interface between the communication device and other communication devices (such as other network devices or core network devices), such as an X2 or Xn interface.

其中,图6所示收发单元602可以包括通信接口,该通信接口可以包括图9中的网络接口914,该网络接口914可以包括输入接口和输出接口。或者,该网络接口914也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。The transceiver unit 602 shown in Figure 6 may include a communication interface, which may include the network interface 914 shown in Figure 9. The network interface 914 may include an input interface and an output interface. Alternatively, the network interface 914 may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.

处理器911主要用于对通信协议以及通信数据进行处理,以及对整个通信装置进行控制,执行软件程序,处理软件程序的数据,例如用于支持通信装置执行实施例中所描述的动作。通信装置可以包括基带处理器和中央处理器,基带处理器主要用于对通信协议以及通信数据进行处理,中央处理器主要用于对整个终端设备进行控制,执行软件程序,处理软件程序的数据。图9中的处理器911可以集成基带处理器和中央处理器的功能,本领域技术人员可以理解,基带处理器和中央处理器也可以是各自独立的处理器,通过总线等技术互联。本领域技术人员可以理解,终端设备可以包括多个基带处理器以适应不同的网络制式,终端设备可以包括多个中央处理器以增强其处理能力,终端设备的各个部件可以通过各种总线连接。该基带处理器也可以表述为基带处理电路或者基带处理芯片。该中央处理器也可以表述为中央处理电路或者中央处理芯片。对通信协议以及通信数据进行处理的功能可以内置在处理器中,也可以以软件程序的形式存储在存储器中,由处理器执行软件程序以实现基带处理功能。Processor 911 is primarily used to process communication protocols and communication data, control the entire communication device, execute software programs, and process software program data, for example, to support the communication device in performing the actions described in the embodiments. The communication device may include a baseband processor and a central processing unit. The baseband processor is primarily used to process communication protocols and communication data, while the central processing unit is primarily used to control the entire terminal device, execute software programs, and process software program data. Processor 911 in Figure 9 may integrate the functions of both a baseband processor and a central processing unit. Those skilled in the art will appreciate that the baseband processor and the central processing unit may also be independent processors interconnected via a bus or other technology. Those skilled in the art will appreciate that a terminal device may include multiple baseband processors to accommodate different network standards, multiple central processing units to enhance its processing capabilities, and various components of the terminal device may be connected via various buses. The baseband processor may also be referred to as a baseband processing circuit or a baseband processing chip. The central processing unit may also be referred to as a central processing circuit or a central processing chip. The functionality for processing communication protocols and communication data may be built into the processor or stored in memory as a software program, which is executed by the processor to implement the baseband processing functionality.

存储器主要用于存储软件程序和数据。存储器912可以是独立存在,与处理器911相连。可选的,存储器912可以和处理器911集成在一起,例如集成在一个芯片之内。其中,存储器912能够存储执行本申请实施例的技术方案的程序代码,并由处理器911来控制执行,被执行的各类计算机程序代码也可被视为是处理器911的驱动程序。The memory is primarily used to store software programs and data. Memory 912 can exist independently and be connected to processor 911. Alternatively, memory 912 and processor 911 can be integrated together, for example, within a single chip. Memory 912 can store program code for executing the technical solutions of the embodiments of the present application, and execution is controlled by processor 911. The various computer program codes executed can also be considered drivers for processor 911.

图9仅示出了一个存储器和一个处理器。在实际的终端设备中,可以存在多个处理器和多个存储器。存储器也可以称为存储介质或者存储设备等。存储器可以为与处理器处于同一芯片上的存储元件,即片内存储元件,或者为独立的存储元件,本申请实施例对此不做限定。Figure 9 shows only one memory and one processor. In an actual terminal device, there may be multiple processors and multiple memories. The memory may also be referred to as a storage medium or a storage device. The memory may be a storage element on the same chip as the processor, i.e., an on-chip storage element, or an independent storage element, which is not limited in the present embodiment.

收发器913可以用于支持通信装置与终端之间射频信号的接收或者发送,收发器913可以与天线915相连。收发器913包括发射机Tx和接收机Rx。具体地,一个或多个天线915可以接收射频信号,该收发器913的接收机Rx用于从天线接收该射频信号,并将射频信号转换为数字基带信号或数字中频信号,并将该数字基带信号或数字中频信号提供给该处理器911,以便处理器911对该数字基带信号或数字中频信号做进一步的处理,例如解调处理和译码处理。此外,收发器913中的发射机Tx还用于从处理器911接收经过调制的数字基带信号或数字中频信号,并将该经过调制的数字基带信号或数字中频信号转换为射频信号,并通过一个或多个天线915发送该射频信号。具体地,接收机Rx可以选择性地对射频信号进行一级或多级下混频处理和模数转换处理以得到数字基带信号或数字中频信号,该下混频处理和模数转换处理的先后顺序是可调整的。发射机Tx可以选择性地对经过调制的数字基带信号或数字中频信号时进行一级或多级上混频处理和数模转换处理以得到射频信号,该上混频处理和数模转换处理的先后顺序是可调整的。数字基带信号和数字中频信号可以统称为数字信号。The transceiver 913 can be used to support the reception or transmission of radio frequency signals between the communication device and the terminal, and the transceiver 913 can be connected to the antenna 915. The transceiver 913 includes a transmitter Tx and a receiver Rx. Specifically, one or more antennas 915 can receive radio frequency signals. The receiver Rx of the transceiver 913 is used to receive the radio frequency signal from the antenna, convert the radio frequency signal into a digital baseband signal or a digital intermediate frequency signal, and provide the digital baseband signal or digital intermediate frequency signal to the processor 911 so that the processor 911 can further process the digital baseband signal or digital intermediate frequency signal, such as demodulation and decoding. In addition, the transmitter Tx in the transceiver 913 is also used to receive a modulated digital baseband signal or digital intermediate frequency signal from the processor 911, convert the modulated digital baseband signal or digital intermediate frequency signal into a radio frequency signal, and send the radio frequency signal through one or more antennas 915. Specifically, the receiver Rx can selectively perform one or more stages of down-mixing and analog-to-digital conversion on the RF signal to obtain a digital baseband signal or a digital intermediate frequency signal. The order of the down-mixing and analog-to-digital conversion processes is adjustable. The transmitter Tx can selectively perform one or more stages of up-mixing and digital-to-analog conversion on the modulated digital baseband signal or digital intermediate frequency signal to obtain a RF signal. The order of the up-mixing and digital-to-analog conversion processes is adjustable. The digital baseband signal and the digital intermediate frequency signal may be collectively referred to as digital signals.

收发器913也可以称为收发单元、收发机、收发装置等。可选的,可以将收发单元中用于实现接收功能的器件视为接收单元,将收发单元中用于实现发送功能的器件视为发送单元,即收发单元包括接收单元和发送单元,接收单元也可以称为接收机、输入口、接收电路等,发送单元可以称为发射机、发射器或者发射电路等。The transceiver 913 may also be referred to as a transceiver unit, a transceiver, a transceiver device, etc. Optionally, a device in the transceiver unit that implements a receiving function may be referred to as a receiving unit, and a device in the transceiver unit that implements a transmitting function may be referred to as a transmitting unit. That is, the transceiver unit includes a receiving unit and a transmitting unit. The receiving unit may also be referred to as a receiver, an input port, a receiving circuit, etc., and the transmitting unit may be referred to as a transmitter, a transmitter, or a transmitting circuit, etc.

需要说明的是,图9所示通信装置900具体可以用于实现前述方法实施例中网络设备所实现的步骤,并实现网络设备对应的技术效果,图9所示通信装置900的具体实现方式,均可以参考前述方法实施例中的叙述,此处不再一一赘述。It should be noted that the communication device 900 shown in Figure 9 can be specifically used to implement the steps implemented by the network device in the aforementioned method embodiment and achieve the corresponding technical effects of the network device. The specific implementation methods of the communication device 900 shown in Figure 9 can refer to the description in the aforementioned method embodiment and will not be repeated here.

请参阅图10,为本申请的实施例提供的上述实施例中所涉及的通信装置的结构示意图。Please refer to FIG10 , which is a schematic structural diagram of the communication device involved in the above-mentioned embodiment provided in an embodiment of the present application.

可以理解的是,通信装置100包括例如模块、单元、元件、电路、或接口等,以适当地配置在一起以执行本申请提供的技术方案。所述通信装置100可以是前文描述的终端设备或网络设备,也可以是这些设备中的部件(例如芯片),用以实现下述方法实施例中描述的方法。通信装置100包括一个或多个处理器101。所述处理器101可以是通用处理器或者专用处理器等。例如可以是基带处理器、或中央处理器。基带处理器可以用于对通信协议以及通信数据进行处理,中央处理器可以用于对通信装置(如,RAN节点、终端、或芯片等)进行控制,执行软件程序,处理软件程序的数据。It can be understood that the communication device 100 includes, for example, modules, units, elements, circuits, or interfaces, which are appropriately configured together to implement the technical solutions provided in this application. The communication device 100 can be the terminal device or network device described above, or a component (such as a chip) in these devices, used to implement the method described in the following method embodiment. The communication device 100 includes one or more processors 101. The processor 101 can be a general-purpose processor or a dedicated processor. For example, it can be a baseband processor or a central processing unit. The baseband processor can be used to process communication protocols and communication data, and the central processing unit can be used to control the communication device (such as a RAN node, terminal, or chip, etc.), execute software programs, and process data of software programs.

可选的,在一种设计中,处理器101可以包括程序103(有时也可以称为代码或指令),所述程序103可以在所述处理器101上被运行,使得所述通信装置100执行下述实施例中描述的方法。在又一种可能的设计中,通信装置100包括电路(图10未示出)。Optionally, in one design, the processor 101 may include a program 103 (sometimes also referred to as code or instructions), which may be executed on the processor 101 to cause the communication device 100 to perform the methods described in the following embodiments. In yet another possible design, the communication device 100 includes circuitry (not shown in FIG10 ).

可选的,所述通信装置100中可以包括一个或多个存储器102,其上存有程序104(有时也可以称为代码或指令),所述程序104可在所述处理器101上被运行,使得所述通信装置100执行上述方法实施例中描述的方法。Optionally, the communication device 100 may include one or more memories 102 on which a program 104 (sometimes also referred to as code or instructions) is stored. The program 104 can be run on the processor 101, so that the communication device 100 executes the method described in the above method embodiment.

可选的,所述处理器101和/或存储器102中可以包括AI模块107,108,所述AI模块用于实现AI相关的功能。所述AI模块可以是通过软件,硬件,或软硬结合的方式实现。例如,AI模块可以包括无线智能控制(radio intelligence control,RIC)模块。例如AI模块可以是近实时RIC或者非实时RIC。Optionally, the processor 101 and/or the memory 102 may include AI modules 107 and 108, which are used to implement AI-related functions. The AI module may be implemented through software, hardware, or a combination of software and hardware. For example, the AI module may include a wireless intelligent control (RIC) module. For example, the AI module may be a near-real-time RIC or a non-real-time RIC.

可选的,所述处理器101和/或存储器102中还可以存储有数据。所述处理器和存储器可以单独设置,也可以集成在一起。Optionally, data may be stored in the processor 101 and/or the memory 102. The processor and the memory may be provided separately or integrated together.

可选的,所述通信装置100还可以包括收发器105和/或天线106。所述处理器101有时也可以称为处理单元,对通信装置(例如RAN节点或终端)进行控制。所述收发器105有时也可以称为收发单元、收发机、收发电路、或者收发器等,用于通过天线106实现通信装置的收发功能。Optionally, the communication device 100 may further include a transceiver 105 and/or an antenna 106. The processor 101 may also be sometimes referred to as a processing unit, and controls the communication device (e.g., a RAN node or terminal). The transceiver 105 may also be sometimes referred to as a transceiver unit, a transceiver, a transceiver circuit, or a transceiver, and is configured to implement the transceiver functions of the communication device through the antenna 106.

其中,图6所示处理单元601可以是处理器101。图6所示收发单元602可以为通信接口,该通信接口可以是图10中的收发器105,该收发器105可以包括输入接口和输出接口。或者,该收发器105也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。The processing unit 601 shown in FIG6 may be the processor 101. The transceiver unit 602 shown in FIG6 may be a communication interface, which may be the transceiver 105 shown in FIG10 . The transceiver 105 may include an input interface and an output interface. Alternatively, the transceiver 105 may be a transceiver circuit, which may include an input interface circuit and an output interface circuit.

本申请实施例还提供一种计算机可读存储介质,该存储介质用于存储一个或多个计算机执行指令,当计算机执行指令被处理器执行时,该处理器执行如前述实施例中第一通信装置或第二通信装置可能的实现方式所述的方法。An embodiment of the present application further provides a computer-readable storage medium, which is used to store one or more computer-executable instructions. When the computer-executable instructions are executed by a processor, the processor executes the method described in the possible implementation methods of the first communication device or the second communication device in the aforementioned embodiment.

本申请实施例还提供一种计算机程序产品(或称计算机程序),当计算机程序产品被该处理器执行时,该处理器执行上述第一通信装置或第二通信装置可能实现方式的方法。An embodiment of the present application also provides a computer program product (or computer program). When the computer program product is executed by the processor, the processor executes the method that may be implemented by the above-mentioned first communication device or second communication device.

本申请实施例还提供了一种芯片系统,该芯片系统包括至少一个处理器,用于支持通信装置实现上述通信装置可能的实现方式中所涉及的功能。可选的,所述芯片系统还包括接口电路,所述接口电路为所述至少一个处理器提供程序指令和/或数据。在一种可能的设计中,该芯片系统还可以包括存储器,存储器,用于保存该通信装置必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包含芯片和其他分立器件,其中,该通信装置具体可以为前述方法实施例中第一通信装置或第二通信装置。An embodiment of the present application also provides a chip system, which includes at least one processor for supporting a communication device to implement the functions involved in the possible implementation methods of the above-mentioned communication device. Optionally, the chip system also includes an interface circuit, which provides program instructions and/or data to the at least one processor. In one possible design, the chip system may also include a memory, which is used to store the necessary program instructions and data for the communication device. The chip system can be composed of chips, or it can include chips and other discrete devices, wherein the communication device can specifically be the first communication device or the second communication device in the aforementioned method embodiment.

本申请实施例还提供了一种通信系统,该网络系统架构包括上述任一实施例中的第一通信装置和/或第二通信装置。An embodiment of the present application further provides a communication system, wherein the network system architecture includes the first communication device and/or the second communication device in any of the above embodiments.

在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。某个功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。本领域技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are merely schematic. For example, the division of the units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms. Whether a function is performed in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed across multiple network units. Some or all of these units may be selected to achieve the purpose of this embodiment according to actual needs.

另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。In addition, the functional units in the various embodiments of the present application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or in the form of a software functional unit. If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or the contributing part or all or part of the technical solution can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the method described in the various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), disk or optical disk, and other media that can store program code.

Claims (72)

一种通信方法,其特征在于,所述方法应用第一通信装置,所述方法包括:A communication method, characterized in that the method uses a first communication device, and the method includes: 获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于部署在所述第一通信装置的第一AI模型;Obtaining first information, where the first information is used to indicate performance monitoring information of at least one artificial intelligence (AI) model, where the at least one AI model is associated with a first AI model deployed on the first communication device; 基于所述第一信息对所述第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。Noise disturbance processing is performed on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model. 根据权利要求1所述的方法,其特征在于,还包括:The method according to claim 1, further comprising: 基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新第二AI模型;其中,所述第二AI模型用于部署在第二通信装置。Second information is sent based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update a second AI model; wherein the second AI model is used to be deployed in a second communication device. 根据权利要求2所述的方法,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The method according to claim 2 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求2或3所述的方法,其特征在于,所述第一参数包括以下一项或多项:The method according to claim 2 or 3, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求2至4任一项所述的方法,其特征在于,所述基于所述第一信息发送第二信息,包括:The method according to any one of claims 2 to 4, wherein sending the second information based on the first information comprises: 满足第一条件时,基于所述第一信息发送第二信息;When a first condition is met, sending second information based on the first information; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求2至5任一项所述的方法,其特征在于,还包括:The method according to any one of claims 2 to 5, further comprising: 接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。Indication information indicating the second parameter is received, where the second parameter is used to process the first parameter to obtain the second information. 根据权利要求6所述的方法,其特征在于,还包括:The method according to claim 6, further comprising: 发送用于请求所述第二参数的请求信息。Sending a request message for requesting the second parameter. 根据权利要求1至7任一项所述的方法,其特征在于,还包括:The method according to any one of claims 1 to 7, further comprising: 接收指示第三参数的指示信息,所述第三参数用于对所述第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数。Indication information indicating a third parameter is received, where the third parameter is used to process the model parameters of the first AI model to obtain updated model parameters of the first AI model. 根据权利要求8所述的方法,其特征在于,还包括:The method according to claim 8, further comprising: 发送用于请求所述第三参数的请求信息。Sending request information for requesting the third parameter. 根据权利要求1至9任一项所述的方法,其特征在于,所述基于所述第一信息对所述第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数,包括:The method according to any one of claims 1 to 9, wherein the performing noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model comprises: 满足第二条件时,基于所述第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数;When the second condition is met, performing noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model; 所述第二条件包括以下至少一项:The second condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第八阈值;The training accuracy of the at least one AI model is lower than an eighth threshold; 所述至少一个AI模型的测试精度低于第九阈值;The test accuracy of the at least one AI model is lower than a ninth threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第十阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a tenth threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第十一阈值;The system performance of the communication system in which the communication device deploying the at least one AI model is located is lower than an eleventh threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第十二阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold; 所述至少一个AI模型的神经网络权重分布大于第十三阈值;The neural network weight distribution of the at least one AI model is greater than a thirteenth threshold; 所述至少一个AI模型的神经网络权重变化量低于第十四阈值。The change in the neural network weight of the at least one AI model is lower than the fourteenth threshold. 根据权利要求1至10任一项所述的方法,其特征在于,所述获取第一信息,包括:The method according to any one of claims 1 to 10, wherein obtaining the first information comprises: 接收所述第一信息;或,receiving the first information; or, 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求1至11任一项所述的方法,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The method according to any one of claims 1 to 11, characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种通信方法,其特征在于,应用于第一通信装置,所述方法包括:A communication method, characterized in that it is applied to a first communication device, the method comprising: 获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;Obtaining first information, the first information being used to indicate performance monitoring information of at least one artificial intelligence (AI) model, the at least one AI model being associated with a second AI model deployed on a second communication device; 基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新所述第二AI模型。Second information is sent based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update the second AI model. 根据权利要求13所述的方法,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The method according to claim 13 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求13或14所述的方法,其特征在于,所述第一参数包括以下一项或多项:The method according to claim 13 or 14, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求13至15任一项所述的方法,其特征在于,所述基于所述第一信息发送第二信息包括:满足第一条件时,基于所述第一信息发送第二信息;The method according to any one of claims 13 to 15, wherein sending the second information based on the first information comprises: sending the second information based on the first information when a first condition is met; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求13至16任一项所述的方法,其特征在于,还包括:The method according to any one of claims 13 to 16, further comprising: 接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。Indication information indicating the second parameter is received, where the second parameter is used to process the first parameter to obtain the second information. 根据权利要求17所述的方法,其特征在于,还包括:The method according to claim 17, further comprising: 发送用于请求所述第二参数的请求信息。Sending a request message for requesting the second parameter. 根据权利要求13至18任一项所述的方法,其特征在于,所述获取第一信息,包括:The method according to any one of claims 13 to 18, wherein obtaining the first information comprises: 接收所述第一信息;或,receiving the first information; or, 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求13至19任一项所述的方法,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The method according to any one of claims 13 to 19, characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种通信方法,其特征在于,应用于第二通信装置,所述方法包括:A communication method, characterized in that it is applied to a second communication device, the method comprising: 接收第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的;其中,所述第二信息用于更新部署在所述第二通信装置的第二人工智能AI模型;receiving second information, where the second information is obtained by performing noise perturbation processing based on the first parameter; wherein the second information is used to update a second artificial intelligence (AI) model deployed in the second communication device; 基于所述第二信息更新所述第二AI模型。The second AI model is updated based on the second information. 根据权利要求21所述的方法,其特征在于,所述第二信息是基于第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The method according to claim 21 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求21或22所述的方法,其特征在于,所述第一参数包括以下一项或多项:The method according to claim 21 or 22, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 一种第一通信装置,其特征在于,包括处理单元;A first communication device, characterized by comprising a processing unit; 所述处理单元,用于获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于部署在所述第一通信装置的第一AI模型;The processing unit is configured to obtain first information, where the first information is used to indicate performance monitoring information of at least one artificial intelligence (AI) model, where the at least one AI model is associated with a first AI model deployed on the first communication device; 基于所述第一信息对所述第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。Noise disturbance processing is performed on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model. 根据权利要求24所述的装置,其特征在于,所述装置还包括收发单元;The device according to claim 24, further comprising a transceiver unit; 所述收发单元,用于基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新第二AI模型;其中,所述第二AI模型用于部署在第二通信装置。The transceiver unit is used to send second information based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update a second AI model; wherein the second AI model is used to be deployed in a second communication device. 根据权利要求25所述的装置,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 25 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求25或26所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 25 or 26, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求25至27任一项所述的装置,其特征在于,The device according to any one of claims 25 to 27, characterized in that 所述收发单元,还用于:The transceiver unit is further configured to: 满足第一条件时,基于所述第一信息发送第二信息;When a first condition is met, sending second information based on the first information; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求25至28任一项所述的装置,其特征在于,The device according to any one of claims 25 to 28, characterized in that 所述收发单元,还用于接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。The transceiver unit is further configured to receive indication information indicating the second parameter, where the second parameter is used to process the first parameter to obtain the second information. 根据权利要求29所述的装置,其特征在于,The device according to claim 29, characterized in that 所述收发单元,还用于发送用于请求所述第二参数的请求信息。The transceiver unit is further configured to send request information for requesting the second parameter. 根据权利要求24至30任一项所述的装置,其特征在于,The device according to any one of claims 24 to 30, characterized in that 所述收发单元,还用于接收指示第三参数的指示信息,所述第三参数用于对所述第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数。The transceiver unit is further configured to receive indication information indicating a third parameter, where the third parameter is used to process the model parameters of the first AI model to obtain updated model parameters of the first AI model. 根据权利要求31所述的装置,其特征在于,The device according to claim 31, characterized in that 所述收发单元,还用于发送用于请求所述第三参数的请求信息。The transceiver unit is further configured to send request information for requesting the third parameter. 根据权利要求24至32任一项所述的装置,其特征在于,The device according to any one of claims 24 to 32, characterized in that 所述处理单元,还用于:The processing unit is further configured to: 满足第二条件时,基于所述第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数;When the second condition is met, performing noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model; 所述第二条件包括以下至少一项:The second condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第八阈值;The training accuracy of the at least one AI model is lower than an eighth threshold; 所述至少一个AI模型的测试精度低于第九阈值;The test accuracy of the at least one AI model is lower than a ninth threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第十阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a tenth threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第十一阈值;The system performance of the communication system in which the communication device deploying the at least one AI model is located is lower than an eleventh threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第十二阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold; 所述至少一个AI模型的神经网络权重分布大于第十三阈值;The neural network weight distribution of the at least one AI model is greater than a thirteenth threshold; 所述至少一个AI模型的神经网络权重变化量低于第十四阈值。The change in the neural network weight of the at least one AI model is lower than the fourteenth threshold. 根据权利要求24至33任一项所述的装置,其特征在于,The device according to any one of claims 24 to 33, characterized in that 所述处理单元,还用于接收所述第一信息;或,The processing unit is further configured to receive the first information; or 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求24至34任一项所述的装置,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The device according to any one of claims 24 to 34, characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种第一通信装置,其特征在于,包括处理单元;A first communication device, characterized by comprising a processing unit; 所述处理单元,用于获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;The processing unit is configured to obtain first information indicating performance monitoring information of at least one artificial intelligence (AI) model, where the at least one AI model is associated with a second AI model deployed on a second communication device; 基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新所述第二AI模型。Second information is sent based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update the second AI model. 根据权利要求36所述的装置,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 36 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求36或37所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 36 or 37, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求36至38任一项所述的装置,其特征在于,所述收发单元,还用于:The device according to any one of claims 36 to 38, wherein the transceiver unit is further configured to: 满足第一条件时,基于所述第一信息发送第二信息;When a first condition is met, sending second information based on the first information; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求36至39任一项所述的装置,其特征在于,The device according to any one of claims 36 to 39, characterized in that 所述收发单元,还用于接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。The transceiver unit is further configured to receive indication information indicating the second parameter, where the second parameter is used to process the first parameter to obtain the second information. 根据权利要求40所述的装置,其特征在于,The device according to claim 40, characterized in that 所述收发单元,还用于发送用于请求所述第二参数的请求信息。The transceiver unit is further configured to send request information for requesting the second parameter. 根据权利要求36至41任一项所述的装置,其特征在于,The device according to any one of claims 36 to 41, characterized in that 所述处理单元,还用于接收所述第一信息;或,The processing unit is further configured to receive the first information; or 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求36至42任一项所述的装置,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The device according to any one of claims 36 to 42, characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种第二通信装置,其特征在于,包括处理单元和收发单元;A second communication device, characterized by comprising a processing unit and a transceiver unit; 所述收发单元,用于接收第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的;其中,所述第二信息用于更新部署在所述第二通信装置的第二人工智能AI模型;The transceiver unit is configured to receive second information, where the second information is obtained by performing noise perturbation processing based on the first parameter; wherein the second information is used to update a second artificial intelligence (AI) model deployed in the second communication device; 所述处理单元,用于基于所述第二信息更新所述第二AI模型。The processing unit is configured to update the second AI model based on the second information. 根据权利要求44所述的装置,其特征在于,所述第二信息是基于第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 44 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求44或45所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 44 or 45, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 一种第一通信装置,其特征在于,包括处理器;A first communication device, comprising a processor; 所述处理器,用于获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于部署在所述第一通信装置的第一AI模型;The processor is configured to obtain first information, where the first information is used to indicate performance monitoring information of at least one artificial intelligence (AI) model, where the at least one AI model is associated with a first AI model deployed on the first communication device; 基于所述第一信息对所述第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数。Noise disturbance processing is performed on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model. 根据权利要求47所述的装置,其特征在于,所述装置还包括通信端口;The device according to claim 47, characterized in that the device further comprises a communication port; 所述通信端口,还用于基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新第二AI模型;其中,所述第二AI模型用于部署在第二通信装置。The communication port is further used to send second information based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update a second AI model; wherein the second AI model is used to be deployed in a second communication device. 根据权利要求48所述的装置,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 48 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求48或49所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 48 or 49, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求48至50任一项所述的装置,其特征在于,所述通信端口,还用于:The device according to any one of claims 48 to 50, wherein the communication port is further used to: 满足第一条件时,基于所述第一信息发送第二信息;When a first condition is met, sending second information based on the first information; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求48至51任一项所述的装置,其特征在于,The device according to any one of claims 48 to 51, characterized in that 所述通信端口,还用于接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。The communication port is further used to receive indication information indicating the second parameter, and the second parameter is used to process the first parameter to obtain the second information. 根据权利要求52所述的装置,其特征在于,The device according to claim 52, characterized in that 所述通信端口,还用于发送用于请求所述第二参数的请求信息。The communication port is further used to send request information for requesting the second parameter. 根据权利要求47至53任一项所述的装置,其特征在于,The device according to any one of claims 47 to 53, characterized in that 所述通信端口,还用于接收指示第三参数的指示信息,所述第三参数用于对所述第一AI模型的模型参数进行处理得到更新后的第一AI模型的模型参数。The communication port is further used to receive indication information indicating a third parameter, where the third parameter is used to process the model parameters of the first AI model to obtain updated model parameters of the first AI model. 根据权利要求54所述的装置,其特征在于,The device according to claim 54, characterized in that 所述通信端口,还用于发送用于请求所述第三参数的请求信息。The communication port is further used to send request information for requesting the third parameter. 根据权利要求47至55任一项所述的装置,其特征在于,The device according to any one of claims 47 to 55, characterized in that 所述处理器,还用于:The processor is further configured to: 满足第二条件时,基于所述第一信息对第一AI模型的模型参数进行加噪扰动处理,得到更新后的第一AI模型的模型参数;When the second condition is met, performing noise perturbation processing on the model parameters of the first AI model based on the first information to obtain updated model parameters of the first AI model; 所述第二条件包括以下至少一项:The second condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第八阈值;The training accuracy of the at least one AI model is lower than an eighth threshold; 所述至少一个AI模型的测试精度低于第九阈值;The test accuracy of the at least one AI model is lower than a ninth threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第十阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a tenth threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第十一阈值;The system performance of the communication system in which the communication device deploying the at least one AI model is located is lower than an eleventh threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第十二阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a twelfth threshold; 所述至少一个AI模型的神经网络权重分布大于第十三阈值;The neural network weight distribution of the at least one AI model is greater than a thirteenth threshold; 所述至少一个AI模型的神经网络权重变化量低于第十四阈值。The change in the neural network weight of the at least one AI model is lower than the fourteenth threshold. 根据权利要求47至56任一项所述的装置,其特征在于,The device according to any one of claims 47 to 56, characterized in that 所述处理器,还用于接收所述第一信息;或,The processor is further configured to receive the first information; or 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求47至57任一项所述的装置,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The device according to any one of claims 47 to 57, characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种第一通信装置,其特征在于,包括处理器和通信端口;A first communication device, characterized by comprising a processor and a communication port; 所述处理器,用于获取第一信息,所述第一信息用于指示至少一个人工智能AI模型的性能监测信息,所述至少一个AI模型关联于用于部署在第二通信装置的第二AI模型;The processor is configured to obtain first information indicating performance monitoring information of at least one artificial intelligence (AI) model, where the at least one AI model is associated with a second AI model deployed on a second communication device; 所述通信端口,用于基于所述第一信息发送第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的,所述第二信息用于更新所述第二AI模型。The communication port is used to send second information based on the first information, where the second information is obtained by performing noise disturbance processing based on the first parameter, and the second information is used to update the second AI model. 根据权利要求59所述的装置,其特征在于,所述第二信息是基于所述第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 59 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求59或60所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 59 or 60, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求59至61任一项所述的装置,其特征在于,The device according to any one of claims 59 to 61, characterized in that 所述通信端口,还用于:The communication port is also used for: 满足第一条件时,基于所述第一信息发送第二信息;When a first condition is met, sending second information based on the first information; 所述第一条件包括以下至少一项:The first condition includes at least one of the following: 所述至少一个AI模型的训练精度低于第一阈值;The training accuracy of the at least one AI model is lower than a first threshold; 所述至少一个AI模型的测试精度低于第二阈值;The test accuracy of the at least one AI model is lower than a second threshold; 所述至少一个AI模型所在的AI模型系统的系统性能低于第三阈值;The system performance of the AI model system in which the at least one AI model is located is lower than a third threshold; 部署所述至少一个AI模型的通信装置所在的通信系统的系统性能低于第四阈值;The system performance of the communication system where the communication device deploying the at least one AI model is located is lower than a fourth threshold; 所述至少一个AI模型的神经网络输入输出分布变化大于第五阈值;The change in the neural network input and output distribution of the at least one AI model is greater than a fifth threshold; 所述至少一个AI模型的神经网络权重分布大于第六阈值;The neural network weight distribution of the at least one AI model is greater than a sixth threshold; 所述至少一个AI模型的神经网络权重变化量低于第七阈值。The change in the neural network weight of the at least one AI model is lower than a seventh threshold. 根据权利要求59至62任一项所述的装置,其特征在于,The device according to any one of claims 59 to 62, characterized in that 所述通信端口,还用于接收指示所述第二参数的指示信息,所述第二参数用于对所述第一参数进行处理得到所述第二信息。The communication port is further used to receive indication information indicating the second parameter, and the second parameter is used to process the first parameter to obtain the second information. 根据权利要求63所述的装置,其特征在于,The device according to claim 63, characterized in that 所述通信端口,还用于发送用于请求所述第二参数的请求信息。The communication port is further used to send request information for requesting the second parameter. 根据权利要求59至64任一项所述的装置,其特征在于,The device according to any one of claims 59 to 64, characterized in that 所述处理器,还用于接收所述第一信息;或,The processor is further configured to receive the first information; or 获取所述第一AI模型和/或所述第二AI模型的测量参数,并基于所述第一AI模型的测量参数确定所述第一信息。Obtain measurement parameters of the first AI model and/or the second AI model, and determine the first information based on the measurement parameters of the first AI model. 根据权利要求59至65任一项所述的装置,其特征在于,所述至少一个AI模型包括所述第一AI模型和/或第二AI模型;其中,所述第一AI模型关联于第二AI模型,所述第二AI模型用于部署在第二通信装置。The device according to any one of claims 59 to 65 is characterized in that the at least one AI model includes the first AI model and/or the second AI model; wherein the first AI model is associated with the second AI model, and the second AI model is used to be deployed on the second communication device. 一种第二通信装置,其特征在于,包括处理器和通信端口;A second communication device, characterized by comprising a processor and a communication port; 所述通信端口,用于接收第二信息,所述第二信息是基于第一参数进行加噪扰动处理得到的;其中,所述第二信息用于更新部署在所述第二通信装置的第二人工智能AI模型;The communication port is configured to receive second information, the second information being obtained by performing noise perturbation processing based on the first parameter; wherein the second information is used to update a second artificial intelligence (AI) model deployed in the second communication device; 所述处理器,用于基于所述第二信息更新所述第二AI模型。The processor is configured to update the second AI model based on the second information. 根据权利要求67所述的装置,其特征在于,所述第二信息是基于第一参数的量化结果进行处理得到的,或,所述第二信息是对所述第一参数的传输参数进行处理得到的。The device according to claim 67 is characterized in that the second information is obtained by processing the quantization result of the first parameter, or the second information is obtained by processing the transmission parameter of the first parameter. 根据权利要求67或68所述的装置,其特征在于,所述第一参数包括以下一项或多项:The device according to claim 67 or 68, wherein the first parameter includes one or more of the following: 更新后的第二AI模型的模型参数,用于更新所述第二AI模型的梯度信息,用于更新所述第二AI模型的中间特征信息,用于更新所述第二AI模型的中间梯度信息,所述第一AI模型的推理结果,所述第二AI模型对应的蒸馏损失信息,所述第二AI模型的任务损失信息。The updated model parameters of the second AI model, used to update the gradient information of the second AI model, used to update the intermediate feature information of the second AI model, used to update the intermediate gradient information of the second AI model, the inference result of the first AI model, the distillation loss information corresponding to the second AI model, and the task loss information of the second AI model. 根据权利要求67至69中任一项所述的装置,其特征在于,所述装置为芯片或芯片系统。The device according to any one of claims 67 to 69 is characterized in that the device is a chip or a chip system. 一种可读存储介质,其特征在于,所述存储介质中存储有计算机程序或指令,当所述计算机程序或指令被计算机执行时,实现如权利要求1至23中任一项所述的方法。A readable storage medium, characterized in that a computer program or instruction is stored in the storage medium, and when the computer program or instruction is executed by a computer, the method according to any one of claims 1 to 23 is implemented. 一种计算机程序产品,其特征在于,包括指令,当所述指令在计算机上运行时,使得所述计算机执行如权利要求1至23中任一项所述的方法。A computer program product, characterized by comprising instructions, which, when the instructions are executed on a computer, cause the computer to perform the method according to any one of claims 1 to 23.
PCT/CN2024/138059 2024-03-15 2024-12-10 Communication method and related apparatus Pending WO2025189861A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202410303028.6 2024-03-15
CN202410303028.6A CN120711414A (en) 2024-03-15 2024-03-15 Communication method and related device

Publications (1)

Publication Number Publication Date
WO2025189861A1 true WO2025189861A1 (en) 2025-09-18

Family

ID=97062764

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/138059 Pending WO2025189861A1 (en) 2024-03-15 2024-12-10 Communication method and related apparatus

Country Status (2)

Country Link
CN (1) CN120711414A (en)
WO (1) WO2025189861A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114091679A (en) * 2020-08-24 2022-02-25 华为技术有限公司 Method for updating machine learning model and communication device
WO2023279366A1 (en) * 2021-07-09 2023-01-12 Oppo广东移动通信有限公司 Noise reduction method based on transfer learning, terminal device, network device and storage medium
CN115983409A (en) * 2022-11-11 2023-04-18 北京大学 Federated learning training method, device, system and equipment based on differential privacy
CN116367091A (en) * 2023-04-16 2023-06-30 北京佰才邦技术股份有限公司 A communication method, device and system
CN116437416A (en) * 2021-12-30 2023-07-14 华为技术有限公司 Communication method and communication device
CN117581581A (en) * 2023-08-30 2024-02-20 北京小米移动软件有限公司 Communication method, terminal, network equipment, and communication system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114091679A (en) * 2020-08-24 2022-02-25 华为技术有限公司 Method for updating machine learning model and communication device
WO2023279366A1 (en) * 2021-07-09 2023-01-12 Oppo广东移动通信有限公司 Noise reduction method based on transfer learning, terminal device, network device and storage medium
CN116437416A (en) * 2021-12-30 2023-07-14 华为技术有限公司 Communication method and communication device
CN115983409A (en) * 2022-11-11 2023-04-18 北京大学 Federated learning training method, device, system and equipment based on differential privacy
CN116367091A (en) * 2023-04-16 2023-06-30 北京佰才邦技术股份有限公司 A communication method, device and system
CN117581581A (en) * 2023-08-30 2024-02-20 北京小米移动软件有限公司 Communication method, terminal, network equipment, and communication system

Also Published As

Publication number Publication date
CN120711414A (en) 2025-09-26

Similar Documents

Publication Publication Date Title
WO2025189861A1 (en) Communication method and related apparatus
WO2025190244A1 (en) Communication method and related apparatus
WO2025179919A1 (en) Communication method and related apparatus
WO2025189831A1 (en) Communication method and related apparatus
WO2025190246A1 (en) Communication method and related apparatus
WO2025190252A1 (en) Communication method and related apparatus
WO2025167443A1 (en) Communication method and related device
WO2025118980A1 (en) Communication method and related device
WO2025019990A1 (en) Communication method and related device
WO2025175756A1 (en) Communication method and related device
WO2025208880A1 (en) Communication method, and related apparatus
WO2025025193A1 (en) Communication method and related device
WO2025190248A1 (en) Communication method and related apparatus
WO2025092160A1 (en) Communication method and related device
WO2025092159A1 (en) Communication method and related device
WO2025189860A1 (en) Communication method and related apparatus
WO2025179920A1 (en) Communication method and related apparatus
WO2025103115A1 (en) Communication method and related device
WO2025086262A1 (en) Communication method and related apparatus
WO2025139534A1 (en) Communication method and related device
WO2025227699A1 (en) Communication method and related apparatus
WO2025107835A1 (en) Communication method and related device
WO2025118759A1 (en) Communication method and related device
WO2025059907A1 (en) Communication method and related device
WO2025140282A1 (en) Communication method and related device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24929305

Country of ref document: EP

Kind code of ref document: A1