WO2025237119A1 - Communication method and related apparatus - Google Patents
Communication method and related apparatusInfo
- Publication number
- WO2025237119A1 WO2025237119A1 PCT/CN2025/093003 CN2025093003W WO2025237119A1 WO 2025237119 A1 WO2025237119 A1 WO 2025237119A1 CN 2025093003 W CN2025093003 W CN 2025093003W WO 2025237119 A1 WO2025237119 A1 WO 2025237119A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- data
- model
- training
- receiving end
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Definitions
- This application relates to the field of communication technology, and in particular to a communication method and related apparatus.
- Wireless communication can be a transmission communication between two or more communication devices that does not propagate through conductors or cables.
- the two or more communication devices include network devices and terminal devices, or the two or more communication devices include different terminal devices.
- AI models (which can also be simply referred to as models in this application embodiment) suitable for various scenarios or tasks have emerged.
- AI models are usually trained on one side of the network device or the terminal device, and then the trained model is sent to the other end for deployment.
- This application provides a communication method for collaboratively training a neural network model between a sending end and a receiving end. This method is flexible and applicable to various scenarios and tasks, thus improving compatibility.
- inventions of this application propose a communication method applied to a sending end.
- the sending end can be a communication device (such as a terminal device or network device), or it can be a component within the communication device (such as a processor, chip, or chip system), or it can be a logic module or software capable of implementing all or part of the functions of the communication device.
- the method includes: first, the sending end sending first information to a receiving end, the first information including one or more first data and one or more first training label data, the first data including data output from the intermediate layer of a first model, the first training label data being used to update the first model; second, the sending end receiving second information sent by the receiving end, the second information indicating model update data output in reverse by the intermediate layer of the first model.
- model may be replaced with other terms, such as neural network model, artificial intelligence (AI) model, AI network, AI neural network model, neural network, machine learning model, or AI processing model, etc.
- AI artificial intelligence
- processing of updating the first model in the embodiments of this application includes, but is not limited to, one or more of the following: training, iteration, optimization, fine-tuning, tuning, or improvement.
- the first model in the embodiments of this application may include one or more models.
- the intermediate layers of the first model may also be referred to as the hidden layers of the first model, which refers to a series of processing layers connecting the input layer and the output layer in the first model.
- the sending end sends first information to the receiving end.
- the first information includes one or more first data points and one or more first training label data points.
- the first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model.
- the sending end receives second information sent by the receiving end.
- the second information instructs the intermediate layers of the first model to output model update data in reverse order.
- the first information includes a first training label.
- the first information includes multiple first training labels.
- the first information sent by the sending end to the receiving end may be first information obtained after data compression processing.
- the sending end compresses and sends the first information to the receiving end.
- the second information sent from the receiving end to the sending end may be second information obtained through data compression.
- the receiving end compresses and sends the second information to the sending end.
- the sending end can also directly send the first information
- the receiving end can also directly send the second information
- the first training label data includes: index information of the first training label in the training label set, or the first training label, and the training label set includes one or more training labels.
- the first training label can be implemented in multiple ways, which improves the flexibility of the solution.
- the first information further includes: grouping indication information, which indicates the grouping method of one or more first data included in the first information; and/or, group number information, which indicates the number of one or more first data included in the first information.
- the first information can be realized through various grouping methods, thus effectively improving the generalization of the model.
- the second information includes: a plurality of second data, wherein each second data corresponds to a first data; or, the second information includes: a weighted value of the plurality of second data, and the first information further includes the weight of the first data.
- the second information can carry the second data in a variety of ways, thus effectively improving the generalization of the model.
- the method further includes: the sending end acquiring a first training sample, the training label corresponding to the first training sample being a first training label; the sending end using the first training sample to process the first model multiple times, outputting one or more first data, wherein the processing method and/or processing parameters are different each time.
- the method further includes:
- the sending end acquires multiple first training samples, and the training label corresponding to the first training sample is the first training label.
- the sending end uses multiple first training samples to process the first model once or multiple times, and outputs one or more first data, wherein the first training samples used in each processing are different.
- the sending end acquires multiple first training samples, including:
- the sending end obtains public training samples
- the sending end processes the common training samples to generate multiple first training samples, and the multiple first training samples are different from each other.
- the sending end can obtain multiple first training samples in various ways, which improves the flexibility of the solution implementation.
- the method further includes: the sending end sending third information to the receiving end, the third information including any one or more of the following:
- the third data, the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
- the third information sent by the sending end to the receiving end may be third information obtained through data compression.
- the sending end compresses and sends the third information to the receiving end.
- the sending end can also directly send this third information.
- the third information includes: third data, input location information of the third data in the first model, dimension information of the third data, identification information of the first model, and first type indication information, wherein the third data includes: first channel information, first environment information, and multimodal information.
- the sending end in addition to sending the first information to the receiver, can also send other information, which improves the generalization ability of the model and enhances the flexibility of implementation. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
- the method further includes:
- the sending end receives a fourth message sent by the receiving end, which includes any one or more of the following:
- the fourth data, the second type indication information, the termination interaction instruction of the first model, the dimension information of the fourth data, or the identification information of the first model, the second type indication information is used to indicate the data type of the fourth data
- the termination interaction instruction of the first model is used to instruct the sender to stop sending information related to the first model.
- the fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiver during the training of the first model, or the performance information of the updated first model.
- the fourth information sent from the receiving end to the sending end may be fourth information obtained through data compression.
- the receiving end compresses and sends the fourth information to the sending end.
- the receiving end can also directly send this fourth piece of information.
- the fourth information includes: fourth data, second type indication information, termination interaction instruction of the first model, dimension information of the fourth data, and identification information of the first model, wherein the fourth data includes: second channel information, second environment information, multimodal information, training residual information generated by the receiver during the training of the first model, and performance information of the updated first model.
- the receiving end can send other information besides the second information to the sending end, improving the model's generalization ability and implementation flexibility. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
- the first model includes one or more splitting positions; the first data includes: data output by intermediate layers corresponding to one or more splitting positions in the first model; and/or, the second data includes: model update data output in reverse from one or more splitting positions in the first model.
- the first model since the first model can output feature data or update data at multiple segmentation positions, it enriches the amount of data exchanged between the sender and receiver and improves the generalization of the model.
- the method further includes: the sending end sending configuration information to the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
- Model information wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
- Segmentation information wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
- training information which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
- the sending end receives configuration information sent by the receiving end.
- the sending end and the receiving end can support the adjustment of the trained model within a training cycle, thereby improving training efficiency.
- the configuration information further includes: first compression configuration information, and/or, second compression configuration information, wherein: the first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information, and the second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
- the data exchanged between the sending end and the receiving end supports multiple compression methods, which can effectively reduce the amount of data transmitted and improve communication efficiency.
- the first information is carried in a radio resource control (RRC) message or a media access control (MAC) message.
- RRC radio resource control
- MAC media access control
- the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message;
- RRC Radio Resource Control
- MAC Media Access Control
- the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the receiving end can be a communication device (such as a terminal device or network device), or it can be a component within the communication device (such as a processor, chip, or chip system), or it can be a logic module or software capable of implementing all or part of the functions of the communication device.
- the method includes: the receiving end receiving first information sent by a sending end, the first information including one or more first data items and one or more first training label data items, the first data including data output from the intermediate layer of a first model, the first training label data being used to update the first model; the receiving end sending second information to the sending end, the second information indicating model update data output in reverse from the intermediate layer of the first model.
- model may be replaced with other terms, such as neural network model, artificial intelligence (AI) model, AI network, AI neural network model, neural network, machine learning model, or AI processing model, etc.
- AI artificial intelligence
- processing of updating the first model in the embodiments of this application includes, but is not limited to, one or more of the following: training, iteration, optimization, fine-tuning, tuning, or improvement.
- first model in the embodiments of this application may include one or more models.
- the sending end sends first information to the receiving end.
- the first information includes one or more first data points and one or more first training label data points.
- the first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model.
- the sending end receives second information sent by the receiving end.
- the second information instructs the intermediate layers of the first model to output model update data in reverse order.
- the first training label data includes: index information of the first training label in the training label set, or the first training label, and the training label set includes one or more training labels.
- the first training label can be implemented in multiple ways, which improves the flexibility of the solution.
- the first information further includes: grouping indication information, which indicates the grouping method of one or more first data included in the first information; and/or, group number information, which indicates the number of one or more first data included in the first information.
- the first information can be realized through various grouping methods, thus effectively improving the generalization of the model.
- the second information includes: a plurality of second data, wherein each second data corresponds to a first data; or, the second information includes: a weighted value of a plurality of second data, and the first information further includes the weight of the first data.
- the second information can carry the second data in a variety of ways, thus effectively improving the generalization of the model.
- the method further includes: the receiving end receiving third information sent by the sending end, the third information including any one or more of the following:
- the third data the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
- the sending end in addition to sending the first information to the receiver, can also send other information, which improves the generalization ability of the model and enhances the flexibility of implementation. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
- the method further includes:
- the receiving end can send other information besides the second information to the sending end, improving the model's generalization ability and implementation flexibility. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
- the first model includes one or more segmentation positions;
- the first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model;
- the second data includes: model update data output in reverse from one or more split positions in the first model.
- the first model since the first model can output feature data or update data at multiple segmentation positions, it enriches the amount of data exchanged between the sender and receiver and improves the generalization of the model.
- the method further includes: receiving configuration information from the receiver, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
- Model information wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
- Segmentation information wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
- training information which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
- the receiving end sends configuration information to the sending end.
- the sending end and the receiving end can support the adjustment of the trained model within a training cycle, thereby improving training efficiency.
- the configuration information also includes:
- First compression configuration information and/or, second compression configuration information, wherein:
- the first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
- the second compression configuration information is used to indicate the compression method and compression parameters of the second and/or fourth information.
- the data exchanged between the sending end and the receiving end supports multiple compression methods, which can effectively reduce the amount of data transmitted and improve communication efficiency.
- the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the third aspect of this application provides a communication device, which is a transmitting end or a receiving end.
- the device includes a transceiver unit and a processing unit.
- the constituent modules of the communication device can also be used to perform the steps performed in various possible implementations of the first aspect and achieve the corresponding technical effects. For details, please refer to the first aspect, which will not be repeated here.
- the fourth aspect of this application provides a communication device, which is a transmitting end or a receiving end.
- the device includes a transceiver unit and a processing unit.
- the constituent modules of the communication device can also be used to perform the steps performed in various possible implementations of the second aspect and achieve the corresponding technical effects. For details, please refer to the second aspect, which will not be repeated here.
- the fifth aspect of this application provides a communication device including at least one processor coupled to a memory; the memory is used to store a program or instructions; the at least one processor is used to execute the program or instructions to enable the device to implement the method described in any possible implementation of the first or second aspect.
- the sixth aspect of this application provides a communication device including at least one logic circuit and an input/output interface; the logic circuit is used to perform the method described in any of the possible implementations of the first or second aspect described above.
- the seventh aspect of this application provides a communication system that includes the aforementioned transmitting end or receiving end.
- the communication system may also include other communication devices that communicate with the transmitting end or the receiving end.
- An eighth aspect of this application provides a computer-readable storage medium for storing one or more computer-executable instructions that, when executed by a processor, perform the method as described in any possible implementation of the first or second aspect above.
- the ninth aspect of this application provides a computer program product (or computer program) that, when executed by a processor, performs the method described in any possible implementation of the first or second aspect described above.
- the tenth aspect of this application provides a chip or chip system including at least one processor for supporting a communication device in implementing the method described in any possible implementation of the first or second aspect above.
- the chip or chip system may further include a memory for storing program instructions and data necessary for the communication device.
- the chip system may be composed of chips or may include chips and other discrete devices.
- the chip system may also include interface circuitry that provides program instructions and/or data to the at least one processor.
- FIGS 1a to 1c are schematic diagrams of the communication system provided in this application.
- FIGS. 2a to 2e are schematic diagrams of the AI processing involved in this application.
- Figure 3 is a schematic diagram of a communication scenario in an embodiment of this application.
- FIG. 4 is a schematic diagram of another communication scenario in the embodiments of this application.
- FIG. 5 is a flowchart illustrating one embodiment of the communication method in this application.
- Figure 6 is a schematic diagram of the structure of the first information in an embodiment of this application.
- Figure 7 is a schematic diagram of another structure of the first information in an embodiment of this application.
- Figure 8 is a schematic diagram of a structure of the second information in an embodiment of this application.
- Figure 9 is a schematic diagram of another structure of the second information in an embodiment of this application.
- Figure 10 is a schematic diagram of the segmentation position of a first model in an embodiment of this application.
- Figure 11 is a schematic diagram of the segmentation position of another first model in an embodiment of this application.
- Figure 12 is a schematic diagram of another structure of the first model in the embodiments of this application.
- Figure 13 is a schematic diagram of a forward interaction data structure in an embodiment of this application.
- Figure 14 is a schematic diagram of another structure of forward interaction data in an embodiment of this application.
- Figure 15 is a schematic diagram of a structure of the second information in an embodiment of this application.
- Figure 16 is a schematic diagram of the structure of MAC message carrying forward interaction data in an embodiment of this application.
- Figure 17 is a schematic diagram of the structure of MAC messages carrying reverse interaction data in an embodiment of this application.
- Figure 18 is a structural schematic diagram of a communication device 1800 according to an embodiment of this application.
- Figure 19 is another schematic structural diagram of the communication device 1900 provided in this application.
- Figure 20 is a schematic diagram of the structure of the communication device 2000 provided in an embodiment of this application.
- Figure 21 is a schematic diagram of the structure of the communication device 2100 provided in an embodiment of this application.
- Terminal device can be a wireless terminal device that can receive network device scheduling and instruction information.
- the wireless terminal device can be a device that provides voice and/or data connectivity to the user, or a handheld device with wireless connection function, or other processing device connected to a wireless modem.
- Terminal devices can communicate with one or more core networks or the Internet via a radio access network (RAN).
- Terminal devices can be mobile terminal devices, such as mobile phones (or "cellular" phones), computers, and data cards.
- mobile phones or "cellular" phones
- computers and data cards.
- they can be portable, pocket-sized, handheld, computer-embedded, or vehicle-mounted mobile devices that exchange voice and/or data with the RAN.
- Examples include personal communication service (PCS) phones, cordless phones, session initiation protocol (SIP) phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), tablets, and computers with wireless transceiver capabilities.
- PCS personal communication service
- SIP session initiation protocol
- WLL wireless local loop
- PDAs personal digital assistants
- tablets and computers with wireless transceiver capabilities.
- Wireless terminal equipment can also be referred to as a system, subscriber unit, subscriber station, mobile station, mobile station (MS), remote station, access point (AP), remote terminal, access terminal, user terminal, user agent, subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.
- the terminal device can also be a wearable device.
- Wearable devices also known as wearable smart devices or smart wearable devices, are a general term for devices that utilize wearable technology to intelligently design and develop everyday wearables, such as glasses, gloves, watches, clothing, and shoes.
- Wearable devices are portable devices that are worn directly on the body or integrated into the user's clothing or accessories.
- Wearable devices are not merely hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction.
- wearable smart devices include those that are feature-rich, large in size, and can achieve complete or partial functions without relying on a smartphone, such as smartwatches or smart glasses, as well as those that focus on a specific type of application function and require the use of other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
- Terminals can also be drones, robots, devices in device-to-device (D2D) communication, vehicles to everything (V2X) communication, virtual reality (VR) terminal devices, augmented reality (AR) terminal devices, wireless terminals in industrial control, wireless terminals in self-driving, wireless terminals in remote medical care, wireless terminals in smart grids, wireless terminals in transportation safety, wireless terminals in smart cities, and wireless terminals in smart homes, etc.
- D2D device-to-device
- V2X vehicles to everything
- VR virtual reality
- AR augmented reality
- wireless terminals in industrial control wireless terminals in self-driving
- wireless terminals in remote medical care wireless terminals in smart grids
- wireless terminals in transportation safety wireless terminals in smart cities, and wireless terminals in smart homes, etc.
- terminal devices can also be terminal devices in communication systems evolved from fifth-generation (5G) communication systems (such as 5G Advanced or sixth-generation (6G) communication systems), or terminal devices in future public land mobile networks (PLMNs).
- 5G Advanced or 6G networks can further expand the form and function of 5G communication terminals; 6G terminals include, but are not limited to, vehicles, cellular network terminals (integrating satellite terminal functions), drones, and Internet of Things (IoT) devices.
- 5G fifth-generation
- 6G sixth-generation
- PLMNs public land mobile networks
- 5G Advanced or 6G networks can further expand the form and function of 5G communication terminals
- 6G terminals include, but are not limited to, vehicles, cellular network terminals (integrating satellite terminal functions), drones, and Internet of Things (IoT) devices.
- IoT Internet of Things
- the terminal device can also obtain artificial intelligence (AI) services provided by the network device.
- AI artificial intelligence
- the terminal device can also have AI processing capabilities.
- Network equipment This can be equipment in a wireless network.
- network equipment can be a RAN node (or device) that connects terminal devices to the wireless network, and can also be called a base station.
- RAN equipment include: base station, evolved NodeB (eNodeB), gNB (gNodeB) in 5G communication systems, transmission reception point (TRP), evolved Node B (eNB), radio network controller (RNC), Node B (NB), home base station (e.g., home evolved Node B, or home Node B, HNB), base band unit (BBU), or wireless fidelity (Wi-Fi) access point (AP), etc.
- network devices may include centralized unit (CU) nodes, distributed unit (DU) nodes, or RAN devices that include both CU and DU nodes.
- RAN nodes can also be macro base stations, micro base stations or indoor stations, relay nodes or donor nodes, or radio controllers in cloud radio access network (CRAN) scenarios.
- RAN nodes can also be servers, wearable devices, vehicles, or in-vehicle equipment.
- the access network equipment in vehicle-to-everything (V2X) technology can be a roadside unit (RSU).
- V2X vehicle-to-everything
- RSU roadside unit
- RAN nodes collaborate to assist the terminal in achieving wireless access, with different RAN nodes each implementing some of the base station's functions.
- RAN nodes can be CUs, DUs, CUs (control plane, CP), CUs (user plane, UP), or radio units (RUs).
- CUs and DUs can be set up separately or included in the same network element, such as a baseband unit (BBU).
- RUs can be included in radio frequency equipment or radio frequency units, such as remote radio units (RRUs), active antenna units (AAUs), radio heads (RHs), or remote radio heads (RRHs).
- RRUs remote radio units
- AAUs active antenna units
- RHs radio heads
- RRHs remote radio heads
- CU or CU-CP and CU-UP
- DU or RU
- RU may have different names, but those skilled in the art will understand their meaning.
- O-CU open CU
- DU can also be called O-DU
- CU-CP can also be called O-CU-CP
- CU-UP can also be called O-CU-UP
- RU can also be called O-RU.
- this application uses CU, CU-CP, CU-UP, DU, and RU as examples.
- Any of the units among CU (or CU-CP, CU-UP), DU, and RU in this application can be implemented through software modules, hardware modules, or a combination of software modules and hardware modules.
- This protocol layer may include a control plane protocol layer and a user plane protocol layer.
- the control plane protocol layer may include at least one of the following: radio resource control (RRC) layer, packet data convergence protocol (PDCP) layer, radio link control (RLC) layer, media access control (MAC) layer, or physical (PHY) layer, etc.
- the user plane protocol layer may include at least one of the following: service data adaptation protocol (SDAP) layer, PDCP layer, RLC layer, MAC layer, or physical layer, etc.
- SDAP service data adaptation protocol
- Network devices can be other devices that provide wireless communication functions for terminal devices.
- the embodiments of this application do not limit the specific technology or form of the network device. For ease of description, the embodiments of this application are not limited.
- Network equipment may also include core network equipment, such as the Mobility Management Entity (MME), Home Subscriber Server (HSS), Serving Gateway (S-GW), Policy and Charging Rules Function (PCRF), and Public Data Network Gateway (PDN Gateway) in 4G networks; and access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in 5G networks.
- MME Mobility Management Entity
- HSS Home Subscriber Server
- S-GW Serving Gateway
- PCRF Policy and Charging Rules Function
- PDN Gateway Public Data Network Gateway
- AMF access and mobility management function
- UPF user plane function
- SMF Public Data Network Gateway
- the network device may also have network nodes with AI capabilities, which can provide AI services to terminals or other network devices.
- network nodes with AI capabilities can provide AI services to terminals or other network devices.
- it may be an AI node, computing node, RAN node with AI capabilities, or core network element with AI capabilities on the network side (access network or core network).
- the device for implementing the function of the network device can be the network device itself, or it can be a device capable of supporting the network device in implementing that function, such as a chip system, which can be installed in the network device.
- a network device being used to implement the function of the network device is used to describe the technical solutions provided in this application embodiment.
- Configuration and Pre-configuration In this application, both configuration and pre-configuration are used. Configuration refers to the network device/server sending configuration information or parameter values to the terminal via messages or signaling, so that the terminal can determine communication parameters or resources for transmission based on these values or information. Pre-configuration is similar to configuration; it can be parameter information or parameter values pre-negotiated between the network device/server and the terminal device, parameter information or parameter values specified by standard protocols for use by the base station/network device or terminal device, or parameter information or parameter values pre-stored in the base station/server or terminal device. This application does not limit this.
- “send” and “receive” indicate the direction of signal transmission.
- “send information to XX” can be understood as the destination of the information being XX, which may include sending directly through the air interface or sending indirectly through the air interface by other units or modules.
- “Receive information from YY” can be understood as the source of the information being YY, which may include receiving directly from YY through the air interface or receiving indirectly from YY through the air interface by other units or modules.
- “Send” can also be understood as the "output” of the chip interface, and “receive” can also be understood as the "input” of the chip interface.
- sending and receiving can occur between devices, such as between network devices and terminal devices, or within a device, such as between components, modules, chips, software modules, or hardware modules within the device via buses, wiring, or interfaces.
- instruction may include direct instruction and indirect instruction, as well as explicit instruction and implicit instruction.
- the information indicated by a certain piece of information (hereinafter referred to as instruction information) is called the information to be instructed.
- instruction information The information indicated by a certain piece of information (hereinafter referred to as instruction information) is called the information to be instructed.
- instruction information indicates the information to be instructed.
- there are many ways to indicate the information to be instructed such as, but not limited to, directly indicating the information to be instructed, such as the information to be instructed itself or its index. It can also indirectly indicate the information to be instructed by indicating other information, where there is an association between the other information and the information to be instructed; or it can only indicate a part of the information to be instructed, while the other parts of the information to be instructed are known or pre-agreed upon.
- the instruction of specific information can be achieved by using a pre-agreed (e.g., protocol predefined) arrangement order of various information, thereby reducing the instruction overhead to a certain extent.
- a pre-agreed e.g., protocol predefined
- This application does not limit the specific method of instruction. It is understood that for the sender of the instruction information, the instruction information can be used to indicate the information to be instructed, and for the receiver of the instruction information, the instruction information can be used to determine the information to be instructed.
- the communication system includes at least one network device and/or at least one terminal device.
- Figure 1a is a schematic diagram of a communication system according to this application.
- Figure 1a exemplarily shows one network device and six terminal devices, namely terminal device 1, terminal device 2, terminal device 3, terminal device 4, terminal device 5, and terminal device 6.
- terminal device 1 is a smart teacup
- terminal device 2 is a smart air conditioner
- terminal device 3 is a smart gas pump
- terminal device 4 is a vehicle
- terminal device 5 is a mobile phone
- terminal device 6 is a printer.
- the entity sending the AI configuration information can be a network device.
- the entity receiving the AI configuration information can be terminal devices 1-6.
- the network device and terminal devices 1-6 form a communication system.
- terminal devices 1-6 can send data to the network device, and the network device needs to receive the data sent by terminal devices 1-6.
- the network device can send configuration information to terminal devices 1-6.
- terminal devices 4 and 6 can also form a communication system.
- Terminal device 5 acts as a network device, i.e., the entity sending AI configuration information
- terminal devices 4 and 6 act as terminal devices, i.e., the entities receiving AI configuration information.
- V2X vehicle-to-everything
- terminal device 5 sends AI configuration information to terminal devices 4 and 6 respectively, and receives data sent by terminal devices 4 and 6; correspondingly, terminal devices 4 and 6 receive the AI configuration information sent by terminal device 5 and send data back to terminal device 5.
- V2X vehicle-to-everything
- different devices may also perform AI-related services.
- the base station can perform communication-related services and AI-related services with one or more terminal devices, and different terminal devices can also perform communication-related services and AI-related services.
- communication-related services and AI-related services can also be performed between televisions and mobile phones.
- AI network elements can be introduced into the communication system provided in this application to implement some or all AI-related operations.
- AI network elements can also be called AI nodes, AI devices, AI entities, AI modules, AI models, or AI units, etc.
- AI network elements can be built into network elements within the communication system.
- an AI network element can be an AI module built into: access network equipment, core network equipment, cloud servers, or operation, administration, and maintenance (OAM) systems to implement AI-related functions.
- OAM can act as the network management system for core network equipment and/or access network equipment.
- AI network elements can also be independently configured network elements within the communication system.
- the terminal or its built-in chip can also include an AI entity to implement AI-related functions.
- AI can endow machines with human-like intelligence, for example, allowing them to use computer hardware and software to simulate certain intelligent human behaviors.
- machine learning methods can be employed.
- machines learn (or train) a model using training data. This model represents the mapping between inputs and outputs.
- the learned model can be used for reasoning (or prediction), that is, it can be used to predict the output corresponding to a given input. This output can also be called the reasoning result (or prediction result).
- Machine learning can include supervised learning, unsupervised learning, and reinforcement learning. Unsupervised learning can also be called learning without supervision.
- Supervised learning based on collected sample values and labels, uses machine learning algorithms to learn the mapping relationship between sample values and labels, and then expresses this learned mapping relationship using an AI model.
- the process of training the machine learning model is the process of learning this mapping relationship.
- sample values are input into the model to obtain the model's predicted values, and the model parameters are optimized by calculating the error between the model's predicted values and the sample labels (ideal values).
- the mapping relationship learned in supervised learning can include linear or non-linear mappings.
- the learning task can be divided into classification tasks and regression tasks.
- Unsupervised learning relies on collected sample values to discover inherent patterns within the samples themselves.
- One type of unsupervised learning algorithm uses the samples themselves as supervisory signals, meaning the model learns the mapping relationship from sample to sample; this is called self-supervised learning.
- model parameters are optimized by calculating the error between the model's predictions and the samples themselves.
- Self-supervised learning can be used for signal compression and decompression recovery applications; common algorithms include autoencoders and generative adversarial networks.
- Reinforcement learning unlike supervised learning, is a type of algorithm that learns problem-solving strategies through interaction with the environment. Unlike supervised and unsupervised learning, reinforcement learning problems do not have explicit "correct" action labels.
- the algorithm needs to interact with the environment to obtain reward signals from the environment, and then adjust its decision actions to obtain a larger reward signal value. For example, in downlink power control, the reinforcement learning model adjusts the downlink transmission power of each user based on the total system throughput feedback from the wireless network, aiming to achieve a higher system throughput.
- the goal of reinforcement learning is also to learn the mapping relationship between the environment state and a better (e.g., optimal) decision action.
- the network cannot be optimized by calculating the error between the action and the "correct action.” Reinforcement learning training is achieved through iterative interaction with the environment.
- Neural networks are a specific model in machine learning techniques. According to the general approximation theorem, neural networks can theoretically approximate any continuous function, thus enabling them to learn arbitrary mappings.
- Traditional communication systems rely on extensive expert knowledge to design communication modules, while deep learning communication systems based on neural networks can automatically discover hidden pattern structures from large datasets, establish mapping relationships between data, and achieve performance superior to traditional modeling methods.
- each neuron performs a weighted summation of its input values and outputs the result through an activation function.
- Figure 2a shows a schematic diagram of a neuron structure.
- w ⁇ sub>i ⁇ /sub> is used as the weight for xi , and is used to weight xi .
- the bias for the weighted summation of the input values based on the weights is, for example, b.
- b can be any possible type, such as a decimal, an integer (e.g., 0, a positive integer, or a negative integer), or a complex number.
- the activation functions of different neurons in a neural network can be the same or different.
- neural networks generally consist of multiple layers, each of which may include one or more neurons. Increasing the depth and/or width of a neural network can improve its expressive power, providing more powerful information extraction and abstract modeling capabilities for complex systems.
- the depth of a neural network can refer to the number of layers it includes, and the number of neurons in each layer can be called the width of that layer.
- a neural network includes an input layer and an output layer. The input layer processes the received input information through neurons and passes the processing result to the output layer, which then obtains the output of the neural network.
- a neural network includes an input layer, hidden layers, and an output layer. The input layer processes the received input information through neurons and passes the processing result to the hidden layer. The hidden layer calculates the received processing result and passes the calculation result to the output layer or the next adjacent hidden layer, ultimately obtaining the output of the neural network.
- a neural network may include one hidden layer or multiple sequentially connected hidden layers, without limitation.
- DNNs deep neural networks
- DNNs can include feedforward neural networks (FNNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs).
- FNNs feedforward neural networks
- CNNs convolutional neural networks
- RNNs recurrent neural networks
- Figure 2b is a schematic diagram of an FNN network.
- a characteristic of FNN networks is that neurons in adjacent layers are completely connected pairwise. This characteristic makes FNNs typically require a large amount of storage space, leading to high computational complexity.
- CNNs are neural networks specifically designed to process data with a grid-like structure. For example, time-series data (e.g., discrete sampling along a time axis) and image data (e.g., two-dimensional discrete sampling) can both be considered grid-like data.
- CNNs do not use all the input information at once for computation; instead, they use a fixed-size window to extract a portion of the information for convolution operations, which significantly reduces the computational cost of model parameters.
- each window can use different convolution kernels, allowing CNNs to better extract features from the input data.
- RNNs are a type of distributed neural network (DNN) that utilizes feedback time-series information.
- the input to an RNN includes the current input value and its own output value from the previous time step.
- RNNs are well-suited for acquiring temporally correlated sequence features, and are particularly applicable to applications such as speech recognition and channel coding/decoding.
- a loss function can be defined.
- the loss function describes the difference or discrepancy between the model's output value and the ideal target value.
- the loss function can be expressed in various forms, and there are no restrictions on its specific form.
- the model training process can be viewed as follows: by adjusting some or all of the model's parameters, the value of the loss function is made to be less than a threshold value or to meet the target requirement.
- a model can also be called an AI model, a rule, or other names.
- An AI model can be considered a specific method for implementing AI functions.
- An AI model represents the mapping relationship or function between the model's input and output.
- AI functions can include one or more of the following: data collection, model training (or model learning), model information dissemination, model inference (or model reasoning, inference, or prediction, etc.), model monitoring or model validation, or inference result publication, etc.
- AI functions can also be called AI (related) operations or AI-related functions.
- a fully connected neural network is also called a multilayer perceptron (MLP).
- MLP multilayer perceptron
- an MLP consists of an input layer (left side), an output layer (right side), and multiple hidden layers (middle).
- Each layer of an MLP contains several nodes, called neurons. Neurons in adjacent layers are connected pairwise.
- w is the weight matrix
- b is the bias vector
- f is the activation function
- n is the index of the neural network layer
- n is greater than or equal to 1 and less than or equal to N, where N is the total number of layers in the neural network.
- a neural network can be understood as a mapping from an input data set to an output data set.
- Neural networks are typically initialized randomly; the process of obtaining this mapping from random values w and b using existing data is called training the neural network.
- the training process can involve using a loss function to evaluate the output of the neural network.
- the error can be backpropagated, and the neural network parameters (including w and b) can be iteratively optimized using gradient descent until the loss function reaches its minimum, which is the "better point (e.g., the optimal point)" in Figure 2d.
- the neural network parameters corresponding to the "better point (e.g., the optimal point)" in Figure 2d can be used as the neural network parameters in the trained AI model information.
- the gradient descent process can be represented as:
- ⁇ represents the parameters to be optimized (including w and b)
- L is the loss function
- ⁇ is the learning rate, controlling the step size of gradient descent. This represents the differentiation operation. This indicates taking the derivative of ⁇ with respect to L.
- the backpropagation process can utilize the chain rule for partial derivatives.
- the gradient of the parameters in the previous layer can be recursively calculated from the gradient of the parameters in the next layer, and can be expressed as:
- w ⁇ sub> ij ⁇ /sub> is the weight connecting node j to node i
- s ⁇ sub>i ⁇ /sub> is the weighted sum of the inputs at node i.
- AI models are typically trained on one side of the network device or the terminal device, and then the trained model is sent to the other end for deployment.
- one-sided training may not be feasible.
- models trained on one side may suffer from poor generalization. Therefore, how to achieve two-sided training of models has become an urgent problem to be solved.
- this application proposes a communication method and related apparatus.
- the sending end sends first information to the receiving end.
- the first information includes one or more first data and one or more first training label data, wherein the first data includes data output from the intermediate layer of the first model, and the first training label data is used to update the first model.
- the sending end receives second information sent by the receiving end, the second information indicating model update data output in reverse from the intermediate layer of the first model.
- the sending end and receiving end can collaboratively train a neural network model, which can be flexibly applied to various different scenarios and tasks, improving compatibility.
- the first information sent by the sending end to the receiving end includes one or more first data and one or more first training label data; by sending rich training-related data to the receiving end, the generalization ability of the model is improved.
- FIG. 3 is a schematic diagram of a communication scenario according to an embodiment of this application.
- the communication scenario involved in this embodiment includes: a sending end and a receiving end, wherein the direction from the sending end to the receiving end is defined as forward, and the direction from the receiving end to the sending end is defined as reverse (or backward).
- the sending end sends forward interaction data to the receiving end, the forward interaction data including first information; the receiving end sends reverse interaction data (or backward interaction data) to the sending end, the reverse interaction data including second information.
- the sending end can be a terminal device
- the receiving end can be a network device, as shown in Figure 4, which is a schematic diagram of another communication scenario in an embodiment of this application.
- the terminal device as the sending end, sends forward interaction data to the network device, which is the receiving end, and the network device sends reverse interaction data to the terminal device.
- the sending end can be a network device, and the receiving end can be a terminal device.
- the sender can be a network device
- the receiver can be another network device.
- the sending end can be a terminal device, and the receiving end can be another terminal device.
- the communication method proposed in this application includes:
- the sending end sends configuration information to the receiving end, and/or the receiving end sends configuration information to the sending end.
- step S1 when the sending end and the receiving end need to collaboratively train the neural network model, the sending end can send configuration information to the receiving end.
- the neural network model to be trained is referred to as the first model, which may include one or more neural network models.
- the sending end can send configuration information to the receiving end, or the receiving end can send the configuration information to the sending end.
- both the sending and receiving ends can send configuration information to each other, ensuring that the configuration information at both ends remains consistent. Through these methods, it is ensured that the sending and receiving ends are training the same neural network model (i.e., the first model).
- the configuration information related to the first model may also change.
- either the sending or receiving end can send the changed configuration information to the other end to ensure consistency between the configuration information at both ends.
- This method ensures that the sending and receiving ends are training the same neural network model (i.e., the first model). For example, if the segmentation information (segmentation position and/or the number of segmentation positions) of the first model changes during training, the sending and receiving ends need to exchange configuration information (including the updated segmentation information) to ensure consistency between their first models.
- the execution order of step S1 and subsequent steps S2-S3 is not restricted here.
- the first model may include two parts: a transmitter model and a receiver model, wherein the transmitter trains the transmitter model in the first model, and the receiver trains the receiver model in the first model.
- this configuration information is used to configure the training task for training the first model, and it includes one or more of the following:
- Model information wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
- Segmentation information wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
- training information which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels.
- the identification information of the first model may include multiple identification information, each of which indicates a first model.
- the identification information of the first model included in the configuration information needs to be updated. For example, before the update, if the sending end and the receiving end train first model #1 and first model #2, then the identification information of the first model in the configuration information before the update includes: the identifier (ID) of first model #1 and the ID of first model #2. After the update, if the sending end and the receiving end train first model #3 and first model #4, then the identification information of the first model in the updated configuration information includes: the ID of first model #3 and the ID of first model #4.
- the training task information of the first model indicates the scenario (or environment) in which the first model is applicable, such as: reconstruction scenario, detection scenario, outdoor scenario, or indoor scenario.
- splitting position in this embodiment is used to indicate which intermediate layer in the first model outputs data as the first data, and the data output at that output position has been processed by that intermediate layer and other layers (e.g., other intermediate layers) before that intermediate layer.
- the number of segmentation positions of the first model in the segmentation information is 2.
- the position information of the segmentation positions in the first model in the segmentation information are: ReLU activation layer #1 and ReLU activation layer #4.
- the segmentation positions indicated by the segmentation information are: segmentation position #1 and segmentation position #2, as shown in Figures 10 and 11.
- Figure 10 is a schematic diagram of the segmentation position of the first model in an embodiment of this application
- Figure 11 is a schematic diagram of the segmentation position of the first model in another embodiment of this application.
- the first model includes multiple convolutional layers and multiple activation function layers.
- the weights (W) of the convolutional layer #1 are a matrix of size 192x192x1x7, and the bias (B) of the convolutional layer #1 is 192.
- Figures 10 and 11 show a partial structure of the first model, where the split position #1 is specifically the output of the activation function layer (ReLU) #1 in the first model, and the split position #2 is specifically the output of the activation function layer (ReLU) #4 in the first model.
- partial structure of the model shown in Figure 10 and the partial structure of the model shown in Figure 11 belong to the same first model. That is, one branch of the first model is shown in Figure 10, and another branch of the first model is shown in Figure 11.
- the configuration information may further include compression-related configuration information.
- the configuration information may also include: first compression configuration information, and/or second compression configuration information, wherein: the first compression configuration information indicates the compression method and compression parameters of the first and/or third information, and the second compression configuration information indicates the compression method and compression parameters of the second and/or fourth information.
- the sending end sends the first and/or third information to the receiving end, i.e., the forward interaction data sent by the sending end to the receiving end includes: the first and/or third information; the receiving end sends the second and/or fourth information to the sending end, i.e., the reverse interaction data sent by the receiving end to the sending end includes: the second and/or fourth information.
- the first compression configuration information indicates the compression method and compression parameters of the forward interaction data
- the second compression configuration information indicates the compression method and compression parameters of the reverse interaction data.
- the forward interaction data and the reverse interaction data use the same compression method and compression parameters.
- the first compression configuration information and the second compression configuration information are the same compression configuration information, which can also be called common compression configuration information.
- the forward and reverse interaction data use different compression methods and parameters.
- the configuration information sent from the sending end to the receiving end may include first compressed configuration information and second compressed configuration information; or, the configuration information sent from the receiving end to the sending end may include first compressed configuration information and second compressed configuration information; or, the configuration information sent from the sending end to the receiving end includes first compressed configuration information, and the configuration information sent from the receiving end to the sending end includes second compressed configuration information; or, the configuration information sent from the sending end to the receiving end includes second compressed configuration information, and the configuration information sent from the receiving end to the sending end includes first compressed configuration information.
- first compression configuration information and/or second compression configuration information may be carried in configuration information; or the first compression configuration information may be carried in first information and/or third information, and the second compression configuration information may be carried in second information and/or fourth information; or it may be carried in other independent information, and the embodiments of this application do not limit this.
- the embodiments of this application include, but are not limited to: no compression, fixed quantization, entropy coding, fixed quantization and entropy coding, dictionary compression, or dictionary compression and entropy coding, etc.
- embodiments of this application include, but are not limited to: quantization parameters, such as quantization boundaries or quantization bits, entropy coding parameters, or codebook dimensions, etc.
- the first compression configuration information and/or the second compression configuration information can be indicated using index information. Based on this index information, the corresponding compression configuration information can be determined from the compression configuration information table. The sending end and the receiving end configure this compression information table respectively.
- the compression configuration information table is shown in Table 2, for example.
- the codebook dimension in the compression parameters can also be indicated by index information.
- the corresponding dictionary compression parameters i.e., codebook dimensions
- the dictionary compression parameter table is shown in Table 3.
- the aforementioned compressed configuration information can be carried within the configuration information, or it can be sent together with the first, second, third, and/or fourth information.
- This application embodiment does not impose any limitations on this.
- the forward interaction data sent from the sending end to the receiving end includes the first and/or third information, and this forward interaction data also includes the first compressed configuration information.
- the reverse interaction data sent from the receiving end to the sending end includes the second and/or fourth information, and this reverse interaction data also includes the second compressed configuration information.
- the sending end sends the first information and/or the third information to the receiving end.
- step S2 after the sending end and receiving end configure the first model according to the configuration information, the sending end trains the first model and then sends the data output by the intermediate layer of the first model during the training process to the receiving end.
- the data output by the intermediate layer of the first model is referred to as the first data.
- the first information also includes first training label data, which is used to update the first model.
- the first training label data includes: the index information of the first training label in the training label set, or the first training label, wherein the training label set includes one or more training labels.
- This training label set is the training label set indicated by the training label set information in the configuration information.
- the sending end can determine the training label set based on the training label set information in the configuration information. Then, the sending end trains the first model based on the first training label in the training label set. The sending end uses the data output from the intermediate layer of the first model as the first data.
- the first training label data includes: the index information of the first training label in the training label set.
- the sending and receiving ends train by collecting training samples in real time. These training samples are called the first training samples.
- the training label corresponding to the first training sample is called the first training label.
- the first training label data includes: the first training label.
- the sending end can send one or more first data points to the receiving end.
- the segmentation information included in the configuration information indicates that the first model includes multiple segmentation positions
- the multiple feature data (or other data) output by the multiple segmentation positions are considered as one set of first data.
- the multiple data points included in a single set of first data can be ordered according to the hierarchical size of the segmentation position that outputs the first data.
- the sending end acquires multiple distinct first training samples. Then, the sending end trains a first model using these multiple first training samples, with each training iteration using a different first training sample. During the training process of the first model using multiple first training samples, multiple sets of first data are output.
- the sending end can acquire multiple first training samples by directly acquiring multiple different first training samples.
- the sending end can acquire multiple different first training samples through multiple acquisitions by a sensor.
- the sending end can acquire a common training sample and then perform different processing on that common training sample to obtain multiple different first training samples.
- different processing on the common training sample includes, but is not limited to: data cleaning, data standardization, data normalization, feature scaling, encoding, feature selection, feature extraction, data augmentation, or resampling.
- the sending end after obtaining a common training sample, performs different processing on the common training sample to obtain multiple first data.
- the sending end uses this first training sample to process the first model one or more times.
- This processing can be training processing.
- the sending end outputs one or more first data points.
- the processing method and/or processing parameters are inconsistent each time. For example, the initialization parameters of the first model are different in each processing step, other input information of the first model is different in each processing step, or the sampling operation of the first model is different in each processing step.
- the sending end After acquiring one or more first data points and one or more first training label data points at the sending end, the sending end sends first information to the receiving end.
- the first information includes the one or more first data points and one or more first training label data points.
- the one or more first data points and one or more first training label data points in the first information can be grouped in various different ways to improve generalization. The different grouping methods of the first information are described below.
- the first information includes one or more first data and one first training label data, wherein the first training label data includes a first training label (or index information indicating the first training label) corresponding to the one or more first data.
- Figure 6 is a schematic diagram of the structure of the first information in an embodiment of this application.
- the first information includes n first data and one first training label data, where n is a positive integer greater than or equal to 1.
- the first information includes one or more first data points and multiple first training label data points, wherein each first data point corresponds to one first training label data point.
- the first information includes n first data points and n first training label data points, where n is a positive integer greater than or equal to 1.
- Each first data point and its corresponding first training label data point are divided into a group of data, meaning the first information includes n groups of data, each group including one first data point and its corresponding first training label data point.
- the nth group of data includes first data point n and first training label data point n.
- the sending end may also inform the receiving end of the grouping method of the first information and the number of one or more first data points included in the first information.
- the first information may further include: grouping indication information, which indicates the grouping method of the one or more first data points included in the first information; and/or, group number information, which indicates the number of one or more first data points included in the first information.
- grouping indication information indicates whether the first information is grouped using grouping method A or grouping using grouping method B.
- the group number information indicates that the first information includes n groups of data (n first data points and n first training label data points, each group including one first data point and one first training label data point).
- the first information can be supplemented with indication information to indicate whether the first information adopts grouping method A or grouping method B. For example, when the indication information is 0, it indicates that the first information adopts grouping method A; when the indication information is 1, it indicates that the first information adopts grouping method B.
- the sending end may also send third information to the receiving end.
- the third information includes, but is not limited to: third data, input location information of the third data in the first model, dimension information of the third data, first type indication information, or identification information of the first model.
- the first type indication information is used to indicate the data type of the third data.
- the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
- the first type indication information is related to the channel between the sending end and the receiving end, and the first environment information is related to the environment in which the sending end and the receiving end are located.
- the channel information in the embodiments of this application includes, but is not limited to: channel capacity information, channel bandwidth information, channel noise information, channel modulation method, channel coding method, channel state information, or channel bit error rate, etc.
- the environmental information in the embodiments of this application includes, but is not limited to: terrain information, climate condition information, electromagnetic environment information, network topology information, network load information, signal quality information, or wireless spectrum information.
- the multimodal information in the embodiments of this application includes, but is not limited to, image information, point cloud information, text information, or audio information.
- the first type of indication information is shown in Table 4.
- the dimension information of the third data refers to the data dimension of the third data.
- the third data includes the first channel information, and the first channel information is a J*K dimension matrix
- the dimension information of the third data is J*K, where J is a positive integer and K is a positive integer.
- the identification information of the first model is used to indicate which models and/or the types of models included in the first model.
- the identification information of the first model is shown in Table 5, for example.
- the input position information for inputting third data in the first model is used to instruct the receiving end to input the third data at that input position in the first model after receiving the third information.
- Figure 12 is another structural schematic diagram of the first model in an embodiment of this application.
- the input position information for inputting third data in the first model instructs the input of third data at the output position of the activation function layer (ReLU) #6 of the first model.
- the third data also includes first channel information and first environment information, after receiving the third information (including the third data), the receiving end inputs the first channel information and first environment information included in the third data at the output position of the activation function layer (ReLU) #6 of the first model.
- first and third information mentioned above can be carried in the same message, that is, the sending end sends the first and third information at the same time; or the first and third information mentioned above can be carried in different messages, that is, the sending end sends the first and third information separately. This application embodiment does not limit this.
- the first and/or third information is carried in a radio resource control (RRC) message.
- RRC radio resource control
- the DataInfo field in RRC sequence format, indicates the specific type of the first and/or third information.
- the data type is implicitly indicated by the presence of an element representing the data type in the DataInfo field.
- the data type of the data included in the forward interaction data is indicated as follows: "DataInfo::SEQUENCE ⁇
- the data type of the data included in the forward interaction data is indicated as follows: "DataInfo::SEQUENCE ⁇
- This RRC message carries AI data, which specifically includes forward interaction data and model-related data.
- a DataType field indicates whether a certain data type exists. If it exists, the data type is "true”; otherwise, if it does not exist, the data type is "false”.
- the data type of the data included in the forward interaction data is indicated in the following way:
- the ⁇ includeForwardInterData ENUMERATED ⁇ true ⁇ indicates that the RRC message carries forward interactive data.
- the "forwardInterData” field indicates one or more of the following information: the identifier of the first model (modelIndicate), data configuration information (dataConfig), grouping indication information (groupMode), number of groups information (groupNum), feature data (featureData), or other data (otherData).
- the configuration information in step S1 can also be carried in an RRC message.
- the specific carrying method is similar to that of the first information and/or the third information carried in an RRC message, and will not be elaborated here.
- Implementation method B The first and/or third information is carried in a media access control (MAC) message.
- MAC media access control
- the data type carried by the MAC message is indicated in the MAC subheader using a Logical Channel ID (LCID) or an Extended Logical Channel ID (eLCID). Therefore, when first and/or third information is carried in a MAC message, the data type of the first and/or third information can be indicated by the LCID or eLCID in the MAC subheader.
- LCID Logical Channel ID
- eLCID Extended Logical Channel ID
- the receiving end receives a MAC message and finds that the value of the LCID field of the MAC message is 35, it determines that the MAC message carries the first and/or third information.
- LCID or eLCID is used to indicate that the MAC message carries AI data.
- AI data includes forward interaction data (first information and/or third information) and reverse interaction data (second information and/or fourth information). Therefore, in order to distinguish whether the AI data specifically carries forward interaction data or reverse interaction data, the type field of the MAC message is used to indicate this.
- both the type field and the payload mentioned above can be carried on the MAC Control Element (MAC CE).
- the payload can be used to carry AI data.
- Figure 16 is a schematic diagram of the structure of a MAC message carrying forward interaction data in an embodiment of this application.
- the LCID of the MAC message is 35, indicating that the MAC message carries AI data; the type of the MAC message is 0, indicating that the AI data carried by the MAC message specifically includes forward interaction data.
- Figure 17 is a schematic diagram of the structure of a MAC message carrying reverse interaction data in an embodiment of this application.
- the LCID of the MAC message is 35, indicating that the MAC message carries AI data; the type of the MAC message is 1, indicating that the AI data carried by the MAC message specifically includes reverse interaction data.
- the configuration information in step S1 can also be carried in a MAC message.
- the specific carrying method is similar to that of the first information and/or the third information carried in a MAC message, and will not be elaborated here.
- the forward interaction data sent from the sending end to the receiving end includes: first information and third information.
- the first information includes n first data points (first data 1 to first data n) and first training label data.
- the forward interaction data shown in Figure 13 may also include configuration information (see step S1) and/or compressed configuration information (e.g., first compressed configuration information).
- the forward interaction data sent from the sending end to the receiving end includes: first information and third information.
- the first information includes n first data items (first data 1 to first data n), n first training label data items (first training label data 1 to first training label data n), and n third information items (third information 1 to third information n).
- the first data items correspond to the first training label data and the third information items.
- the forward interaction data shown in Figure 14 may also include configuration information (see step S1) and/or compressed configuration information (e.g., first compressed configuration information).
- the first information may include only one such configuration information.
- the first information includes multiple configuration information.
- the receiving end sends the second and/or fourth information to the sending end.
- the receiving end receives forward interaction data (including first information and/or third information) from the sending end.
- the receiving end obtains the training loss function (loss) of the first model based on the forward interaction data and updates the decoding model (i.e., the receiving end model) in the first model.
- the receiving end obtains second data based on the first model.
- the second data includes model update data output from the reverse of the intermediate layers of the first model.
- the second data includes, but is not limited to, gradients.
- the gradient can be the gradient of the bias or the gradient of the weights, etc.
- the second information includes multiple second data, where each second data corresponds to a first data.
- Figure 8 is a schematic diagram of the structure of the second information in an embodiment of this application.
- the sending end sends the second information to the receiving end, which includes m second data, where m is a positive integer.
- the second information includes weighted values of multiple second-number data points (N), wherein the first information also includes the weights of the first data points.
- the receiving end determines multiple second-number data points based on one or more first data points. Then, the receiving end performs weighted processing on the multiple second-number data points according to the weights of the first data points corresponding to each second-number data point, to obtain a weighted value for the multiple second-number data points. For example, if the weights of each first data point are equal, then the second information includes the average value of the multiple second-number data points.
- Figure 9 is another structural schematic diagram of the second information in an embodiment of this application.
- the sending end sends the second information to the receiving end, which includes weighted values of m second-number data points (i.e., a total of m second-number data points from second-number data 1 to second-number data m), where m is a positive integer.
- the receiving end can also send a fourth information to the sending end.
- the fourth information includes, but is not limited to:
- the fourth data includes: second type indication information, termination interaction instruction of the first model, dimension information of the fourth data, or identification information of the first model.
- the second type indication information is used to indicate the data type of the fourth data.
- the termination interaction instruction of the first model is used to instruct the sending end to stop sending information related to the first model.
- the second channel information is related to the channel between the sending end and the receiving end.
- the second environment information is related to the environment in which the sending end and the receiving end are located.
- the fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiving end during the training of the first model, or performance information of the updated first model.
- the second channel information is similar to the first channel information in step S2 above, and will not be described in detail here.
- the second environmental information is similar to the first environmental information in step S2 above, and will not be described in detail here.
- the multimodal information is similar to the multimodal information in step S2 above, and will not be described in detail here.
- the second type of indication information, the dimension information of the fourth data, or the identification information of the first model are similar to the first type of indication information, the dimension information of the third data, or the identification information of the first model in the aforementioned step S2, and will not be elaborated here.
- the sending end can use this training residual information to train the first model.
- the sending end Upon receiving a termination interaction command from the first model, the sending end ceases exchanging data related to the first model with the receiving end, such as first information and/or third information.
- the sending end may also stop training the first model.
- the sending end can determine whether to continue exchanging data related to the first model, such as first information and/or third information, based on this performance information. If the sending end determines that the performance of the first model meets the requirements based on the performance information, it can stop exchanging data related to the first model with the receiving end. Optionally, the sending end can also stop training the first model.
- the performance information of the first model includes: test loss function, test accuracy, precision, and/or recall.
- the sending end sends the second information and the fourth information to the receiving end.
- the second information includes the weighted values of m second data (i.e., the weighted values of a total of m second data from second data 1 to second data m), where m is a positive integer. Since each of the m second data corresponds to one fourth information, the reverse interaction data also includes m fourth information (i.e., the m fourth information from fourth information 1 to fourth information m), where fourth information 1 corresponds to second data 1, and so on, with fourth information m corresponding to second data m.
- the weighted value of the m second data points can be the average of the m second data points.
- the reverse interaction data shown in Figure 15 may also include configuration information (see step S1) and/or compressed configuration information (e.g., second compressed configuration information).
- configuration information see step S1
- compressed configuration information e.g., second compressed configuration information
- the second information includes the weighted values of the second data 1 to the second data m and a fourth information corresponding to the m second data from the second data 1 to the second data m.
- the second information may include only one such configuration information.
- the second information includes multiple configuration information.
- steps S2 to S3 can be repeated until the performance of the first model meets the service requirements.
- the sending end can send the configuration information of the model to the receiving end (similar to step S1). Then, the sending end and the receiving end exchange the relevant data of the model, similar to steps S2 to S3 above, which will not be elaborated here.
- the sending end can send the updated configuration information to the receiving end (similar to step S1), or the receiving end can send the updated configuration information to the sending end (similar to step S1). Then, the sending end and the receiving end exchange the updated relevant data of the first model, similar to steps S2 to S3 above, which will not be elaborated here.
- the sending end sends first information to the receiving end.
- the first information includes one or more first data points and one or more first training label data points.
- the first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model.
- the sending end receives second information sent by the receiving end.
- the second information instructs the intermediate layers of the first model to output model update data in reverse.
- the sending end and receiving end can collaboratively train a neural network model, which can be flexibly applied to various different scenarios and tasks, improving compatibility.
- the first information sent by the sending end to the receiving end includes one or more first data points and one or more first training label data points.
- the generalization ability of the model is improved.
- the sending end and the receiving end can flexibly train for different training scenarios and tasks, improving compatibility.
- the sending end and the receiving end can support training multiple models within a single training cycle, improving training efficiency.
- the sending end and the receiving end can also adjust the trained model within a single training cycle, further improving training efficiency. Because the data exchanged between the sending and receiving ends supports multiple compression methods, the amount of data transmitted can be effectively reduced, thus improving communication efficiency.
- FIG 18 is a schematic diagram of the structure of a communication device 1800 in an embodiment of this application.
- This communication device 1800 can realize the functions of a transmitting end or a receiving end in the above method embodiments, and therefore can also achieve the beneficial effects of the above method embodiments.
- the communication device 1800 can be a transmitting end or a receiving end, or it can be an integrated circuit or component inside the transmitting end or receiving end, such as a chip, baseband chip, modem chip, SoC chip containing a modem core, system-in-package (SIP) chip, communication module, chip system, processor, etc.
- SIP system-in-package
- the transceiver unit 1802 may include a transmitting unit and a receiving unit, which are used to perform transmitting and receiving respectively.
- the device 1800 when the device 1800 is used to execute the method performed by the transmitting end or the receiving end in FIG5 and related embodiments, the device 1800 includes a processing unit 1801 and a transceiver unit 1802.
- the communication device 1800 is used at a transmitting end, and the communication device 1800 includes:
- the transceiver unit 1802 is used to send first information to the receiving end.
- the first information includes one or more first data and one or more first training label data.
- the first data includes data output from the intermediate layer of the first model.
- the first training label data is used to update the first model.
- the transceiver unit 1802 is also used to receive second information sent by the receiving end, the second information indicating the model update data output in reverse by the intermediate layer of the first model.
- the first training label data includes:
- the first training label or the index information of the first training label in the training label set, wherein the training label set includes one or more training labels.
- the first information further includes:
- Grouping indication information wherein the grouping indication information is used to indicate the grouping method of the plurality of first data included in the first information
- group number information which indicates the number of the plurality of first data included in the first information.
- each second data corresponding to one first data.
- the first information may also include the weight of the first data.
- the transceiver unit 1802 is further configured to acquire a first training sample, wherein the training label corresponding to the first training sample is the first training label.
- the processing unit 1801 is used to process the first model multiple times using the first training samples and output the plurality of first data, wherein the processing method and/or processing parameters are different each time.
- the transceiver unit 1802 is further configured to acquire multiple first training samples, wherein the training label corresponding to the first training sample is the first training label.
- the processing unit 1801 is further configured to process the first model once or multiple times using the plurality of first training samples, and output the plurality of first data, wherein the first training samples used in each processing are different.
- the transceiver unit 1802 is also used to acquire public training samples
- the processing unit 1801 is further configured to process the common training samples to generate the plurality of first training samples, wherein the plurality of first training samples are different from each other.
- the transceiver unit 1802 is further configured to send third information to the receiving end, the third information including any one or more of the following:
- the third data the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
- the transceiver unit 1802 is further configured to receive fourth information sent by the receiving end, the fourth information including any one or more of the following:
- the first model includes one or more segmentation locations
- the first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model.
- the transceiver unit 1802 is further configured to send configuration information to the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
- Model information wherein the model information includes any one or more of the following: the identification information of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated.
- Segmentation information wherein the segmentation information includes: the number of segmentation positions of the first model, and/or, the position information of the segmentation positions in the first model;
- training information includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
- the transceiver unit 1802 is also used to receive the configuration information sent by the receiving end.
- the configuration information further includes:
- First compression configuration information and/or, second compression configuration information, wherein:
- the first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
- the second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
- the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the communication device 1800 is applied at the receiving end, and the communication device 1800 includes:
- the transceiver unit 1802 is used to receive first information sent by the sender.
- the first information includes one or more first data and one or more first training label data.
- the first data includes data output from the intermediate layer of the first model.
- the first training label data is used to update the first model.
- the transceiver unit 1802 is also used to send second information to the sending end, the second information indicating the model update data output in reverse by the intermediate layer of the first model.
- the first training label data includes:
- the first training label or the index information of the first training label in the training label set, wherein the training label set includes one or more training labels.
- the first information further includes:
- Grouping indication information wherein the grouping indication information is used to indicate the grouping method of the plurality of first data included in the first information
- group number information which indicates the number of the plurality of first data included in the first information.
- the second information includes: the weighted value of the plurality of second data.
- the first information may also include the weight of the first data.
- the first information may also include the weight of the first data.
- the transceiver unit 1802 is further configured to receive third information transmitted by the transmitting end, the third information including any one or more of the following:
- the third data the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
- the transceiver unit 1802 is further configured to send fourth information to the sending end, the fourth information including any one or more of the following:
- the first model includes one or more segmentation locations
- the first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model;
- the second data includes: model update data output in reverse from one or more of the segmentation positions in the first model.
- the transceiver unit 1802 is further configured to receive configuration information from the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
- Model information wherein the model information includes any one or more of the following: the identification information of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated.
- Segmentation information wherein the segmentation information includes: the number of segmentation positions of the first model, and/or, the position information of the segmentation positions in the first model;
- training information includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
- the transceiver unit 1802 is also used to send the configuration information to the sending end.
- the configuration information further includes:
- First compression configuration information and/or, second compression configuration information, wherein:
- the first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
- the second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
- the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
- RRC Radio Resource Control
- MAC Media Access Control
- the function of the processing unit 1801 can be implemented by one or more processors.
- the processor may include a modem chip, or a SoC chip or SIP chip containing a modem core.
- the function of the transceiver unit 1802 can be implemented by transceiver circuitry.
- the communication device 1800 when the communication device 1800 is a circuit or chip responsible for communication functions in a terminal (or network device), such as a modem chip or a SoC chip or SIP chip containing a modem core, the function of the processing unit 1801 can be implemented by a circuit system in the aforementioned chip that includes one or more processors or processor cores.
- the function of the transceiver unit 1802 can be implemented by the interface circuit or data transceiver circuit on the aforementioned chip.
- the communication device 1900 includes a logic circuit 1901 and an input/output interface 1902.
- the communication device 1900 can be a chip or an integrated circuit.
- the transceiver unit 1802 can be a communication interface, which can be the input/output interface 1902 in Figure 19.
- the input/output interface 1902 can include an input interface and an output interface.
- the communication interface can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit.
- the logic circuit 1901 and the input/output interface 1902 can also perform other steps executed by the transmitting or receiving end in any embodiment and achieve corresponding beneficial effects, which will not be elaborated further here.
- the processing unit 1801 shown in FIG18 can be the logic circuit 1901 in FIG19.
- the logic circuit 1901 can be a processing device, the functions of which can be partially or entirely implemented in software.
- the processing apparatus may include a memory and a processor, wherein the memory is used to store a computer program, and the processor reads and executes the computer program stored in the memory to perform the corresponding processing and/or steps in any of the method embodiments.
- the processing device may consist of only a processor.
- a memory for storing computer programs is located outside the processing device, and the processor is connected to the memory via circuitry/wires to read and execute the computer programs stored in the memory.
- the memory and processor may be integrated together or physically independent of each other.
- the processing device may be one or more chips, or one or more integrated circuits.
- the processing device may be one or more field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), system-on-chips (SoCs), central processing units (CPUs), network processors (NPs), digital signal processors (DSPs), microcontroller units (MCUs), programmable logic controllers (PLDs), or other integrated chips, or any combination of the above chips or processors.
- FPGAs field-programmable gate arrays
- ASICs application-specific integrated circuits
- SoCs system-on-chips
- CPUs central processing units
- NPs network processors
- DSPs digital signal processors
- MCUs microcontroller units
- PLDs programmable logic controllers
- Figure 20 is a structural schematic diagram of the communication device 2000 involved in the above embodiments provided by the embodiments of this application.
- the communication device 2000 can be the communication device as the sending end (or receiving end) in the above embodiments.
- the example shown in Figure 20 is that the sending end is implemented by the sending end (or the component in the sending end), or the example shown in Figure 20 is that the receiving end is implemented by the receiving end (or the component in the receiving end).
- the present invention is a possible logical structure diagram of the communication device 2000, which may include, but is not limited to, at least one processor 2001 and a communication port 2002.
- the transceiver unit 1802 can be a communication interface, which can be the communication port 2002 in Figure 20.
- the communication port 2002 can include an input interface and an output interface.
- the communication port 2002 can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit.
- the device may also include at least one of a memory 2003 and a bus 2004.
- the at least one processor 2001 is used to control the operation of the communication device 2000.
- the processor 2001 can be a central processing unit, a general-purpose processor, a digital signal processor, an application-specific integrated circuit, a field-programmable gate array, or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. It can implement or execute the various exemplary logic blocks, modules, and circuits described in conjunction with the disclosure of this application.
- the processor can also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a digital signal processor and a microprocessor, etc.
- the communication device 2000 shown in Figure 20 can be used to implement the steps implemented by the sending end or the receiving end in the aforementioned method embodiments, and to achieve the corresponding technical effects of the sending end or the receiving end.
- the specific implementation of the communication device shown in Figure 20 can be referred to the description in the aforementioned method embodiments, and will not be repeated here.
- Figure 21 is a schematic diagram of the communication device 2100 involved in the above embodiments provided by the present application.
- the communication device 2100 can be a communication device serving as a receiving end (or a transmitting end) as described in the above embodiments.
- the example shown in Figure 21 illustrates a receiving end implemented through a receiving end (or a component within a receiving end), or a transmitting end implemented through a transmitting end (or a component within a transmitting end).
- the structure of this communication device can be referenced from the structure shown in Figure 21.
- the communication device 2100 includes at least one processor 2111 and at least one network interface 2114. Further optionally, the communication device also includes at least one memory 2112, at least one transceiver 2113, and one or more antennas 2115.
- the processor 2111, memory 2112, transceiver 2113, and network interface 2114 are connected, for example, via a bus. In this embodiment, the connection may include various interfaces, transmission lines, or buses, etc., and this embodiment is not limited thereto.
- the antenna 2115 is connected to the transceiver 2113.
- the network interface 2114 enables the communication device to communicate with other communication devices through a communication link.
- the network interface 2114 may include a network interface between the communication device and core network equipment, such as an S1 interface; the network interface may also include a network interface between the communication device and other communication devices (e.g., other receivers, other transmitters, or core network equipment), such as an X2 or Xn interface.
- core network equipment such as an S1 interface
- other communication devices e.g., other receivers, other transmitters, or core network equipment
- the transceiver unit 1802 can be a communication interface, which can be the network interface 2114 in Figure 21.
- the network interface 2114 can include an input interface and an output interface.
- the network interface 2114 can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit.
- the processor 2111 is primarily used to process communication protocols and communication data, control the entire communication device, execute software programs, and process data from the software programs, for example, to support the actions described in the embodiments of the communication device.
- the communication device may include a baseband processor and a central processing unit (CPU).
- the baseband processor is primarily used to process communication protocols and communication data, while the CPU is primarily used to control the entire communication device, execute software programs, and process data from the software programs.
- the processor 2111 in Figure 21 can integrate the functions of both a baseband processor and a CPU. Those skilled in the art will understand that the baseband processor and CPU can also be independent processors interconnected via technologies such as buses.
- the communication device may include multiple baseband processors to adapt to different network standards, and multiple CPUs to enhance its processing capabilities.
- the various components of the communication device can be connected via various buses.
- the baseband processor can also be described as a baseband processing circuit or a baseband processing chip.
- the CPU can also be described as a central processing circuit or a central processing chip.
- the function of processing communication protocols and communication data can be built into the processor or stored in memory as a software program, which is then executed by the processor to implement the baseband processing function.
- the memory is primarily used to store software programs and data.
- the memory 2112 can exist independently or be connected to the processor 2111.
- the memory 2112 can be integrated with the processor 2111, for example, integrated within a single chip.
- the memory 2112 can store program code that executes the technical solutions of the embodiments of this application, and its execution is controlled by the processor 2111.
- the various types of computer program code being executed can also be considered as drivers for the processor 2111.
- Figure 21 shows only one memory and one processor. In actual communication devices, there can be multiple processors and multiple memories. Memory can also be called storage medium or storage device, etc. Memory can be a storage element on the same chip as the processor, i.e., an on-chip storage element, or it can be a separate storage element; the embodiments of this application do not limit this.
- Transceiver 2113 can be used to support the reception or transmission of radio frequency (RF) signals between a communication device and a terminal.
- Transceiver 2113 can be connected to antenna 2115.
- Transceiver 2113 includes a transmitter Tx and a receiver Rx. Specifically, one or more antennas 2115 can receive RF signals.
- the receiver Rx of transceiver 2113 receives the RF signals from the antennas, converts the RF signals into digital baseband signals or digital intermediate frequency (IF) signals, and provides the digital baseband signals or IF signals to processor 2111 so that processor 2111 can perform further processing on the digital baseband signals or IF signals, such as demodulation and decoding.
- IF intermediate frequency
- the transmitter Tx in transceiver 2113 is also used to receive modulated digital baseband signals or IF signals from processor 2111, convert the modulated digital baseband signals or IF signals into RF signals, and transmit the RF signals through one or more antennas 2115.
- the receiver Rx can selectively perform one or more stages of downmixing and analog-to-digital conversion on the radio frequency signal to obtain a digital baseband signal or a digital intermediate frequency (IF) signal.
- IF digital intermediate frequency
- the order of these downmixing and IF conversion processes is adjustable.
- the transmitter Tx can selectively perform one or more stages of upmixing and digital-to-analog conversion on the modulated digital baseband signal or digital IF signal to obtain a radio frequency signal.
- the order of these upmixing and IF conversion processes is also adjustable.
- the digital baseband signal and the digital IF signal can be collectively referred to as digital signals.
- the transceiver 2113 can also be called a transceiver unit, transceiver, transceiver device, etc.
- the device in the transceiver unit that performs the receiving function can be regarded as the receiving unit
- the device in the transceiver unit that performs the transmitting function can be regarded as the transmitting unit. That is, the transceiver unit includes a receiving unit and a transmitting unit.
- the receiving unit can also be called a receiver, input port, receiving circuit, etc.
- the transmitting unit can be called a transmitter, transmitter, or transmitting circuit, etc.
- the communication device 2100 shown in Figure 21 can be used to implement the steps implemented by the receiving end or the sending end in the aforementioned method embodiments, and to achieve the corresponding technical effects of the receiving end or the sending end.
- the specific implementation of the communication device 2100 shown in Figure 21 can be referred to the description in the aforementioned method embodiments, and will not be repeated here.
- This application also provides a computer-readable storage medium for storing one or more computer-executable instructions.
- the processor When the computer-executable instructions are executed by a processor, the processor performs the method described in the possible implementations of the sending end, receiving end, or second communication device in the foregoing embodiments.
- This application also provides a computer program product (or computer program) that, when executed by the processor, executes the method described above for the possible implementation of the sending or receiving end.
- This application also provides a chip system including at least one processor for supporting a communication device in implementing the functions involved in the possible implementations of the communication device described above.
- the chip system further includes an interface circuit that provides program instructions and/or data to the at least one processor.
- the chip system may further include a memory for storing the program instructions and data necessary for the communication device.
- the chip system may be composed of chips or may include chips and other discrete devices, wherein the communication device may specifically be a transmitting end or a receiving end in the aforementioned method embodiments.
- This application also provides a communication system, the network system architecture of which includes the sending end or receiving end in any of the above embodiments.
- the communication system may also include other communication devices that communicate with the transmitting end or the receiving end.
- the disclosed systems, apparatuses, and methods can be implemented in other ways.
- the apparatus embodiments described above are merely illustrative; for instance, the division of units is only a logical functional division, and in actual implementation, there may be other division methods.
- multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
- the coupling or direct coupling or communication connection shown or discussed may be through some interfaces, indirect coupling or communication connection between devices or units, and may be electrical, mechanical, or other forms. Whether a function is implemented in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of this application.
- the units described as separate components may or may not be physically separate.
- the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.
- the functional units in the various embodiments of this application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit.
- the integrated unit can be implemented in hardware or as a software functional unit. If the integrated unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
- This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of this application.
- the aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, read-only memory (ROM), random access memory (RAM), magnetic disks, or optical disks.
Landscapes
- Mobile Radio Communication Systems (AREA)
Abstract
Description
本申请要求于2024年05月11日提交国家知识产权局、申请号为202410585456.2、申请名称为“一种通信方法以及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to Chinese Patent Application No. 202410585456.2, filed on May 11, 2024, entitled "A Communication Method and Related Device", the entire contents of which are incorporated herein by reference.
本申请涉及通信技术领域,尤其涉及一种通信方法以及相关装置。This application relates to the field of communication technology, and in particular to a communication method and related apparatus.
无线通信,可以是两个或两个以上的通信设备间不经由导体或缆线传播而进行的传输通讯。一般地,该两个或两个以上的通信设备包括网络设备和终端设备,或者,该两个或两个以上的通信设备包括不同的终端设备。Wireless communication can be a transmission communication between two or more communication devices that does not propagate through conductors or cables. Generally, the two or more communication devices include network devices and terminal devices, or the two or more communication devices include different terminal devices.
随着人工智能(artificial intelligence,AI)与通信技术的发展,适用于各种场景或者任务的AI模型(本申请实施例中也可以简称为模型)应运而生。目前,AI模型通常是在网络设备一侧或者终端设备一侧进行单侧训练,然后将训练好的模型发送给对端部署。With the development of artificial intelligence (AI) and communication technologies, AI models (which can also be simply referred to as models in this application embodiment) suitable for various scenarios or tasks have emerged. Currently, AI models are usually trained on one side of the network device or the terminal device, and then the trained model is sent to the other end for deployment.
然而,受限于终端设备的能力或者数据隐私等问题,单侧训练可能无法实现。此外,单侧训练得到的模型可能存在泛化性差等问题。因此,如何实现双侧训练模型成为目前亟待解决的问题。However, due to limitations in terminal device capabilities or data privacy issues, one-sided training may not be feasible. Furthermore, models trained on one side may suffer from poor generalization. Therefore, how to achieve two-sided training of models has become a pressing problem to be solved.
本申请实施例提供一种通信方法,该方法用于发送端与接收端协同训练神经网络模型,可以灵活适用于多种不同的场景和任务,提升了兼容性。This application provides a communication method for collaboratively training a neural network model between a sending end and a receiving end. This method is flexible and applicable to various scenarios and tasks, thus improving compatibility.
第一方面,本申请实施例提出一种通信方法,方法应用于发送端,该发送端可以是通信设备(如终端设备或网络设备),或者,该发送端可以是通信设备中的部分组件(例如处理器、芯片或芯片系统等),或者该发送端还可以是能实现全部或部分通信设备功能的逻辑模块或软件。方法包括:首先,发送端向接收端发送第一信息,第一信息包括一个或多个第一数据,以及,一个或多个第一训练标签数据,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型;其次,发送端接收接收端发送的第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。In a first aspect, embodiments of this application propose a communication method applied to a sending end. The sending end can be a communication device (such as a terminal device or network device), or it can be a component within the communication device (such as a processor, chip, or chip system), or it can be a logic module or software capable of implementing all or part of the functions of the communication device. The method includes: first, the sending end sending first information to a receiving end, the first information including one or more first data and one or more first training label data, the first data including data output from the intermediate layer of a first model, the first training label data being used to update the first model; second, the sending end receiving second information sent by the receiving end, the second information indicating model update data output in reverse by the intermediate layer of the first model.
本申请中,模型可以替换为其它术语,例如神经网络模型、人工智能(artificial intelligence,AI)模型、AI网络、AI神经网络模型、神经网络、机器学习模型或AI处理模型等。In this application, "model" may be replaced with other terms, such as neural network model, artificial intelligence (AI) model, AI network, AI neural network model, neural network, machine learning model, or AI processing model, etc.
应理解,本申请实施例中更新第一模型的处理包括但不限于:训练、迭代、优化、微调、调优,或者改进中的一项或多项。It should be understood that the processing of updating the first model in the embodiments of this application includes, but is not limited to, one or more of the following: training, iteration, optimization, fine-tuning, tuning, or improvement.
需要说明的是,本申请实施例中的第一模型可以包括一个或多个模型。第一模型的中间层也可以称为第一模型的隐藏层(hidden layers),指的是第一模型中连接输入层和输出层之间的一系列处理层。It should be noted that the first model in the embodiments of this application may include one or more models. The intermediate layers of the first model may also be referred to as the hidden layers of the first model, which refers to a series of processing layers connecting the input layer and the output layer in the first model.
本申请实施例中,发送端向接收端发送第一信息,第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,其中,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型。然后,发送端接收该接收端发送的第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。通过上述方法,实现发送端与接收端协同训练神经网络模型,可以灵活适用于多种不同的场景和任务,提升了兼容性。此外,发送端向接收端发送的第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,通过向接收端发送丰富的训练相关数据,提升了模型的泛化性。In this embodiment, the sending end sends first information to the receiving end. The first information includes one or more first data points and one or more first training label data points. The first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model. Then, the sending end receives second information sent by the receiving end. The second information instructs the intermediate layers of the first model to output model update data in reverse order. This method enables collaborative training of the neural network model between the sending end and the receiving end, allowing for flexible application to various scenarios and tasks and improving compatibility. Furthermore, the first information sent by the sending end to the receiving end includes one or more first data points and one or more first training label data points. By sending rich training-related data to the receiving end, the generalization ability of the model is improved.
一种可能的实现方式中,如果是一个第一训练标签得到的一个或多个第一数据,则第一信息包括一个第一训练标签。In one possible implementation, if one or more first data are obtained from a first training label, then the first information includes a first training label.
另一种可能的实现方式中,如果是多个第一训练标签得到一个或多个第一数据,则第一信息包括多个第一训练标签。In another possible implementation, if one or more first data are obtained from multiple first training labels, then the first information includes multiple first training labels.
可选地,发送端向接收端发送的第一信息,该发送的第一信息可以是经过数据压缩处理得到的第一信息。换言之,发送端向接收端压缩发送第一信息。Optionally, the first information sent by the sending end to the receiving end may be first information obtained after data compression processing. In other words, the sending end compresses and sends the first information to the receiving end.
可选地,接收端向发送端发送的第二信息,该发送的第二信息可以是经过数据压缩处理得到的第二信息。换言之,接收端向发送端压缩发送第二信息。Optionally, the second information sent from the receiving end to the sending end may be second information obtained through data compression. In other words, the receiving end compresses and sends the second information to the sending end.
可选地,发送端也可以直接发送该第一信息,接收端也可以直接发送该第二信息。Optionally, the sending end can also directly send the first information, and the receiving end can also directly send the second information.
在第一方面的一种可能的实现方式中,第一训练标签数据,包括:第一训练标签在训练标签集中的索引信息,或者,第一训练标签,训练标签集包括一个或多个训练标签。In one possible implementation of the first aspect, the first training label data includes: index information of the first training label in the training label set, or the first training label, and the training label set includes one or more training labels.
在上述技术方案中,第一训练标签可以通过多种方式实现,提升了方案的实现灵活性。In the above technical solution, the first training label can be implemented in multiple ways, which improves the flexibility of the solution.
在第一方面的一种可能的实现方式中,第一信息还包括:分组指示信息,分组指示信息用于指示第一信息包括的一个或多个第一数据的分组方式;和/或,组数信息,组数信息用于指示第一信息包括的一个或多个第一数据的数量。In one possible implementation of the first aspect, the first information further includes: grouping indication information, which indicates the grouping method of one or more first data included in the first information; and/or, group number information, which indicates the number of one or more first data included in the first information.
在上述技术方案中,第一信息可以通过多种分组方式实现,因此可以有效提升模型的泛化性。In the above technical solution, the first information can be realized through various grouping methods, thus effectively improving the generalization of the model.
在第一方面的一种可能的实现方式中,第二信息包括:多个第二数据,其中,每个第二数据对应于一个第一数据;或者,第二信息包括:多个第二数据的加权值,第一信息还包括第一数据的权重。In one possible implementation of the first aspect, the second information includes: a plurality of second data, wherein each second data corresponds to a first data; or, the second information includes: a weighted value of the plurality of second data, and the first information further includes the weight of the first data.
在上述技术方案中,第二信息可以通过多种方式携带第二数据,因此可以有效提升模型的泛化性。In the above technical solution, the second information can carry the second data in a variety of ways, thus effectively improving the generalization of the model.
在第一方面的一种可能的实现方式中,方法还包括:发送端获取第一训练样本,第一训练样本对应的训练标签为第一训练标签;发送端使用第一训练样本对第一模型进行多次处理,输出一个或多个第一数据,其中,每次处理的处理方式和/或处理参数不一致。In one possible implementation of the first aspect, the method further includes: the sending end acquiring a first training sample, the training label corresponding to the first training sample being a first training label; the sending end using the first training sample to process the first model multiple times, outputting one or more first data, wherein the processing method and/or processing parameters are different each time.
在第一方面的一种可能的实现方式中,方法还包括:In one possible implementation of the first aspect, the method further includes:
发送端获取多个第一训练样本,第一训练样本对应的训练标签为第一训练标签;The sending end acquires multiple first training samples, and the training label corresponding to the first training sample is the first training label.
发送端使用多个第一训练样本对第一模型分别进行一次或多次处理,输出一个或多个第一数据,其中,每次处理使用的第一训练样本互不相同。The sending end uses multiple first training samples to process the first model once or multiple times, and outputs one or more first data, wherein the first training samples used in each processing are different.
在第一方面的一种可能的实现方式中,发送端获取多个第一训练样本,包括:In one possible implementation of the first aspect, the sending end acquires multiple first training samples, including:
发送端获取公共训练样本;The sending end obtains public training samples;
发送端对公共训练样本进行处理,生成多个第一训练样本,多个第一训练样本互不相同。The sending end processes the common training samples to generate multiple first training samples, and the multiple first training samples are different from each other.
在上述技术方案中,发送端可以通过多种方式获取多个第一训练样本,提升了方案的实现灵活性。In the above technical solution, the sending end can obtain multiple first training samples in various ways, which improves the flexibility of the solution implementation.
在第一方面的一种可能的实现方式中,方法还包括:发送端向接收端发送第三信息,第三信息包括以下任意一项或多项信息:In one possible implementation of the first aspect, the method further includes: the sending end sending third information to the receiving end, the third information including any one or more of the following:
第三数据、第一模型中输入第三数据的输入位置信息、第三数据的维度信息、第一模型的标识信息,或者,第一类型指示信息,第一类型指示信息用于指示第三数据的数据类型,其中,第三数据包括以下任意一项或多项信息:第一信道信息、第一环境信息,或者,多模态信息。The third data, the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
可选地,发送端向接收端发送的第三信息,该发送的第三信息可以是经过数据压缩处理得到的第三信息。换言之,发送端向接收端压缩发送第三信息。Optionally, the third information sent by the sending end to the receiving end may be third information obtained through data compression. In other words, the sending end compresses and sends the third information to the receiving end.
可选地,发送端也可以直接发送该第三信息。Alternatively, the sending end can also directly send this third information.
一种示例中,第三信息包括:第三数据、第一模型中输入第三数据的输入位置信息、第三数据的维度信息、第一模型的标识信息,和,第一类型指示信息,其中,第三数据包括:第一信道信息、第一环境信息,和,多模态信息。In one example, the third information includes: third data, input location information of the third data in the first model, dimension information of the third data, identification information of the first model, and first type indication information, wherein the third data includes: first channel information, first environment information, and multimodal information.
在上述技术方案中,发送端除了向接收端发送第一信息以外,还可以发送其他信息,提升了模型的泛化性,提升了实现灵活性。此外,通过定义发送端与接收端之间交互数据的统一交互格式,使得上述发送端与接收端之间可以针对不同的训练场景和任务进行灵活地训练,提升了兼容性。In the above technical solution, in addition to sending the first information to the receiver, the sending end can also send other information, which improves the generalization ability of the model and enhances the flexibility of implementation. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
在第一方面的一种可能的实现方式中,方法还包括:In one possible implementation of the first aspect, the method further includes:
发送端接收接收端发送的第四信息,第四信息包括以下任意一项或多项信息:The sending end receives a fourth message sent by the receiving end, which includes any one or more of the following:
第四数据、第二类型指示信息、第一模型的终止交互指令、第四数据的维度信息,或者,第一模型的标识信息,第二类型指示信息用于指示第四数据的数据类型,第一模型的终止交互指令用于指示发送端停止发送第一模型相关的信息,其中,第四数据包括以下任意一项或多项信息:第二信道信息、第二环境信息、多模态信息、接收端在训练第一模型的过程中生成的训练残差信息,或者,更新后第一模型的性能信息。The fourth data, the second type indication information, the termination interaction instruction of the first model, the dimension information of the fourth data, or the identification information of the first model, the second type indication information is used to indicate the data type of the fourth data, and the termination interaction instruction of the first model is used to instruct the sender to stop sending information related to the first model. The fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiver during the training of the first model, or the performance information of the updated first model.
可选地,接收端向发送端发送的第四信息,该发送的第四信息可以是经过数据压缩处理得到的第四信息。换言之,接收端向发送端压缩发送第四信息。Optionally, the fourth information sent from the receiving end to the sending end may be fourth information obtained through data compression. In other words, the receiving end compresses and sends the fourth information to the sending end.
可选地,接收端也可以直接发送该第四信息。Alternatively, the receiving end can also directly send this fourth piece of information.
一种示例中,第四信息包括:第四数据、第二类型指示信息、第一模型的终止交互指令、第四数据的维度信息,和,第一模型的标识信息,其中,第四数据包括:第二信道信息、第二环境信息、多模态信息、接收端在训练第一模型的过程中生成的训练残差信息,和,更新后第一模型的性能信息。In one example, the fourth information includes: fourth data, second type indication information, termination interaction instruction of the first model, dimension information of the fourth data, and identification information of the first model, wherein the fourth data includes: second channel information, second environment information, multimodal information, training residual information generated by the receiver during the training of the first model, and performance information of the updated first model.
在上述技术方案中,接收端端除了向发送端发送第二信息以外,还可以发送其他信息,提升了模型的泛化性,提升了实现灵活性。此外,通过定义发送端与接收端之间交互数据的统一交互格式,使得上述发送端与接收端之间可以针对不同的训练场景和任务进行灵活地训练,提升了兼容性。In the above technical solution, the receiving end can send other information besides the second information to the sending end, improving the model's generalization ability and implementation flexibility. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
在第一方面的一种可能的实现方式中,第一模型包括一个或多个切分位置;第一数据包括:第一模型中一个或多个切分位置对应的中间层输出的数据;和/或,第二数据包括:第一模型中一个或多个切分位置反向输出的模型更新数据。In one possible implementation of the first aspect, the first model includes one or more splitting positions; the first data includes: data output by intermediate layers corresponding to one or more splitting positions in the first model; and/or, the second data includes: model update data output in reverse from one or more splitting positions in the first model.
在上述技术方案中,由于第一模型可以在多个切分位置输出特征数据或者更新数据,因此,丰富了发送端与接收端之间交互的数据量,提升了模型的泛化性。In the above technical solution, since the first model can output feature data or update data at multiple segmentation positions, it enriches the amount of data exchanged between the sender and receiver and improves the generalization of the model.
在第一方面的一种可能的实现方式中,方法还包括:发送端向接收端发送配置信息,配置信息用于配置训练第一模型的训练任务,配置信息包括以下任意一项或多项:In one possible implementation of the first aspect, the method further includes: the sending end sending configuration information to the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
模型信息,其中,模型信息包括以下任意一项或多项信息:第一模型的标识信息、第一模型的结构信息、第一模型的训练任务信息、第一模型的初始化参数,或者,第一模型的待更新参数;Model information, wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
切分信息,其中,切分信息包括:第一模型的切分位置数量,和/或,切分位置在第一模型的位置信息;Segmentation information, wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
或者,训练信息,其中,训练信息包括以下任意一项或多项信息:训练标签集信息、验证集信息、训练迭代次数、学习率,或者损失函数,训练标签集包括一个或多个训练标签;Alternatively, training information, which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
和/或,发送端接收接收端发送的配置信息。And/or, the sending end receives configuration information sent by the receiving end.
在上述技术方案中,发送端与接收端之间可以支持在一个训练周期内还可以调整训练的模型,提升了训练效率。In the above technical solution, the sending end and the receiving end can support the adjustment of the trained model within a training cycle, thereby improving training efficiency.
在第一方面的一种可能的实现方式中,配置信息还包括:第一压缩配置信息,和/或,第二压缩配置信息,其中:第一压缩配置信息用于指示第一信息和/或第三信息的压缩方式以及压缩参数,第二压缩配置信息用于指示第二信息和/或第四信息的压缩方式以及压缩参数。In one possible implementation of the first aspect, the configuration information further includes: first compression configuration information, and/or, second compression configuration information, wherein: the first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information, and the second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
在上述技术方案中,发送端与接收端之间交互的数据支持多种压缩方式,因此可以有效降低传输量,提升通信效率。In the above technical solution, the data exchanged between the sending end and the receiving end supports multiple compression methods, which can effectively reduce the amount of data transmitted and improve communication efficiency.
在第一方面的一种可能的实现方式中,第一信息承载于无线资源控制(radio resource control,RRC)消息,或者,媒体接入控制(media access control,MAC)消息;In one possible implementation of the first aspect, the first information is carried in a radio resource control (RRC) message or a media access control (MAC) message.
和/或,第二信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在第一方面的一种可能的实现方式中,第三信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation of the first aspect, the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message;
和/或,第四信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
第二方面,本申请实施例提出一种通信方法,方法应用于接收端,该接收端可以是通信设备(如终端设备或网络设备),或者,该接收端可以是通信设备中的部分组件(例如处理器、芯片或芯片系统等),或者该接收端还可以是能实现全部或部分通信设备功能的逻辑模块或软件。方法包括:接收端接收发送端发送的第一信息,第一信息包括一个或多个第一数据,以及,一个或多个第一训练标签数据,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型;接收端向发送端发送第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。Secondly, embodiments of this application propose a communication method applied to a receiving end. The receiving end can be a communication device (such as a terminal device or network device), or it can be a component within the communication device (such as a processor, chip, or chip system), or it can be a logic module or software capable of implementing all or part of the functions of the communication device. The method includes: the receiving end receiving first information sent by a sending end, the first information including one or more first data items and one or more first training label data items, the first data including data output from the intermediate layer of a first model, the first training label data being used to update the first model; the receiving end sending second information to the sending end, the second information indicating model update data output in reverse from the intermediate layer of the first model.
本申请中,模型可以替换为其它术语,例如神经网络模型、人工智能(artificial intelligence,AI)模型、AI网络、AI神经网络模型、神经网络、机器学习模型或AI处理模型等。In this application, "model" may be replaced with other terms, such as neural network model, artificial intelligence (AI) model, AI network, AI neural network model, neural network, machine learning model, or AI processing model, etc.
应理解,本申请实施例中更新第一模型的处理包括但不限于:训练、迭代、优化、微调、调优,或者改进中的一项或多项。It should be understood that the processing of updating the first model in the embodiments of this application includes, but is not limited to, one or more of the following: training, iteration, optimization, fine-tuning, tuning, or improvement.
需要说明的是,本申请实施例中的第一模型可以包括一个或多个模型。It should be noted that the first model in the embodiments of this application may include one or more models.
本申请实施例中,发送端向接收端发送第一信息,第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,其中,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型。然后,发送端接收该接收端发送的第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。通过上述方法,实现发送端与接收端协同训练神经网络模型,可以灵活适用于多种不同的场景和任务,提升了兼容性。此外,发送端向接收端发送的第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,通过向接收端发送丰富的训练相关数据,提升了模型的泛化性。In this embodiment, the sending end sends first information to the receiving end. The first information includes one or more first data points and one or more first training label data points. The first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model. Then, the sending end receives second information sent by the receiving end. The second information instructs the intermediate layers of the first model to output model update data in reverse order. This method enables collaborative training of the neural network model between the sending end and the receiving end, allowing for flexible application to various scenarios and tasks and improving compatibility. Furthermore, the first information sent by the sending end to the receiving end includes one or more first data points and one or more first training label data points. By sending rich training-related data to the receiving end, the generalization ability of the model is improved.
在第二方面的一种可能的实现方式中,第一训练标签数据,包括:第一训练标签在训练标签集中的索引信息,或者,第一训练标签,训练标签集包括一个或多个训练标签。In one possible implementation of the second aspect, the first training label data includes: index information of the first training label in the training label set, or the first training label, and the training label set includes one or more training labels.
在上述技术方案中,第一训练标签可以通过多种方式实现,提升了方案的实现灵活性。In the above technical solution, the first training label can be implemented in multiple ways, which improves the flexibility of the solution.
在第二方面的一种可能的实现方式中,第一信息还包括:分组指示信息,分组指示信息用于指示第一信息包括的一个或多个第一数据的分组方式;和/或,组数信息,组数信息用于指示第一信息包括的一个或多个第一数据的数量。In one possible implementation of the second aspect, the first information further includes: grouping indication information, which indicates the grouping method of one or more first data included in the first information; and/or, group number information, which indicates the number of one or more first data included in the first information.
在上述技术方案中,第一信息可以通过多种分组方式实现,因此可以有效提升模型的泛化性。In the above technical solution, the first information can be realized through various grouping methods, thus effectively improving the generalization of the model.
在第二方面的一种可能的实现方式中,第二信息包括:多个第二数据,其中,每个第二数据对应于一个第一数据;或者,第二信息包括:多个第二数据的加权值,第一信息还包括第一数据的权重。In one possible implementation of the second aspect, the second information includes: a plurality of second data, wherein each second data corresponds to a first data; or, the second information includes: a weighted value of a plurality of second data, and the first information further includes the weight of the first data.
在上述技术方案中,第二信息可以通过多种方式携带第二数据,因此可以有效提升模型的泛化性。In the above technical solution, the second information can carry the second data in a variety of ways, thus effectively improving the generalization of the model.
在第二方面的一种可能的实现方式中,方法还包括:接收端接收发送端发送的第三信息,第三信息包括以下任意一项或多项信息:In one possible implementation of the second aspect, the method further includes: the receiving end receiving third information sent by the sending end, the third information including any one or more of the following:
第三数据、所述第一模型中输入所述第三数据的输入位置信息、所述第三数据的维度信息、所述第一模型的标识信息,或者,第一类型指示信息,所述第一类型指示信息用于指示所述第三数据的数据类型,其中,所述第三数据包括以下任意一项或多项信息:第一信道信息、第一环境信息,或者,多模态信息。The third data, the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
在上述技术方案中,发送端除了向接收端发送第一信息以外,还可以发送其他信息,提升了模型的泛化性,提升了实现灵活性。此外,通过定义发送端与接收端之间交互数据的统一交互格式,使得上述发送端与接收端之间可以针对不同的训练场景和任务进行灵活地训练,提升了兼容性。In the above technical solution, in addition to sending the first information to the receiver, the sending end can also send other information, which improves the generalization ability of the model and enhances the flexibility of implementation. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
在第二方面的一种可能的实现方式中,方法还包括:In one possible implementation of the second aspect, the method further includes:
接收端向发送端发送第四信息,第四信息包括以下任意一项或多项信息:The receiving end sends a fourth piece of information to the sending end, which includes any one or more of the following:
第四数据、第二类型指示信息、所述第一模型的终止交互指令、所述第四数据的维度信息,或者,所述第一模型的标识信息,所述第二类型指示信息用于指示所述第四数据的数据类型,所述第一模型的终止交互指令用于指示所述发送端停止发送所述第一模型相关的信息,其中,所述第四数据包括以下任意一项或多项信息:第二信道信息、第二环境信息、多模态信息、所述接收端在训练所述第一模型的过程中生成的训练残差信息,或者,更新后所述第一模型的性能信息。The fourth data, the second type indication information, the termination interaction instruction of the first model, the dimension information of the fourth data, or the identification information of the first model, the second type indication information is used to indicate the data type of the fourth data, and the termination interaction instruction of the first model is used to instruct the sending end to stop sending information related to the first model, wherein the fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiving end during the training of the first model, or the performance information of the updated first model.
在上述技术方案中,接收端端除了向发送端发送第二信息以外,还可以发送其他信息,提升了模型的泛化性,提升了实现灵活性。此外,通过定义发送端与接收端之间交互数据的统一交互格式,使得上述发送端与接收端之间可以针对不同的训练场景和任务进行灵活地训练,提升了兼容性。In the above technical solution, the receiving end can send other information besides the second information to the sending end, improving the model's generalization ability and implementation flexibility. Furthermore, by defining a unified interaction format for the data exchanged between the sending and receiving ends, the two ends can be flexibly trained for different training scenarios and tasks, improving compatibility.
在第二方面的一种可能的实现方式中,第一模型包括一个或多个切分位置;第一数据包括:第一模型中一个或多个切分位置对应的中间层输出的数据;In one possible implementation of the second aspect, the first model includes one or more segmentation positions; the first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model;
和/或,第二数据包括:第一模型中一个或多个切分位置反向输出的模型更新数据。And/or, the second data includes: model update data output in reverse from one or more split positions in the first model.
在上述技术方案中,由于第一模型可以在多个切分位置输出特征数据或者更新数据,因此,丰富了发送端与接收端之间交互的数据量,提升了模型的泛化性。In the above technical solution, since the first model can output feature data or update data at multiple segmentation positions, it enriches the amount of data exchanged between the sender and receiver and improves the generalization of the model.
在第二方面的一种可能的实现方式中,方法还包括:接收端接收来自接收端的配置信息,配置信息用于配置训练第一模型的训练任务,配置信息包括以下任意一项或多项:In one possible implementation of the second aspect, the method further includes: receiving configuration information from the receiver, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
模型信息,其中,模型信息包括以下任意一项或多项信息:第一模型的标识信息、第一模型的结构信息、第一模型的训练任务信息、第一模型的初始化参数,或者,第一模型的待更新参数;Model information, wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
切分信息,其中,切分信息包括:第一模型的切分位置数量,和/或,切分位置在第一模型的位置信息;Segmentation information, wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
或者,训练信息,其中,训练信息包括以下任意一项或多项信息:训练标签集信息、验证集信息、训练迭代次数、学习率,或者损失函数,训练标签集包括一个或多个训练标签;Alternatively, training information, which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
和/或,接收端向发送端发送配置信息。And/or, the receiving end sends configuration information to the sending end.
在上述技术方案中,发送端与接收端之间可以支持在一个训练周期内还可以调整训练的模型,提升了训练效率。In the above technical solution, the sending end and the receiving end can support the adjustment of the trained model within a training cycle, thereby improving training efficiency.
在第二方面的一种可能的实现方式中,配置信息还包括:In one possible implementation of the second aspect, the configuration information also includes:
第一压缩配置信息,和/或,第二压缩配置信息,其中:First compression configuration information, and/or, second compression configuration information, wherein:
第一压缩配置信息用于指示第一信息和/或第三信息的压缩方式以及压缩参数,The first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
第二压缩配置信息用于指示第二信息和/或第四信息的压缩方式以及压缩参数。The second compression configuration information is used to indicate the compression method and compression parameters of the second and/or fourth information.
在上述技术方案中,发送端与接收端之间交互的数据支持多种压缩方式,因此可以有效降低传输量,提升通信效率。In the above technical solution, the data exchanged between the sending end and the receiving end supports multiple compression methods, which can effectively reduce the amount of data transmitted and improve communication efficiency.
在第二方面的一种可能的实现方式中,第一信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation of the second aspect, the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,第二信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在第二方面的一种可能的实现方式中,第三信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation of the second aspect, the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,第四信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
本申请第三方面提供了一种通信装置,该装置为发送端或者接收端,该装置包括收发单元和处理单元,该通信装置的组成模块还可以用于执行第一方面的各个可能实现方式中所执行的步骤,并实现相应的技术效果,具体均可以参阅第一方面,此处不再赘述。The third aspect of this application provides a communication device, which is a transmitting end or a receiving end. The device includes a transceiver unit and a processing unit. The constituent modules of the communication device can also be used to perform the steps performed in various possible implementations of the first aspect and achieve the corresponding technical effects. For details, please refer to the first aspect, which will not be repeated here.
本申请第四方面提供了一种通信装置,该装置为发送端或者接收端,该装置包括收发单元和处理单元;该通信装置的组成模块还可以用于执行第二方面的各个可能实现方式中所执行的步骤,并实现相应的技术效果,具体均可以参阅第二方面,此处不再赘述。The fourth aspect of this application provides a communication device, which is a transmitting end or a receiving end. The device includes a transceiver unit and a processing unit. The constituent modules of the communication device can also be used to perform the steps performed in various possible implementations of the second aspect and achieve the corresponding technical effects. For details, please refer to the second aspect, which will not be repeated here.
本申请第五方面提供了一种通信装置,包括至少一个处理器,所述至少一个处理器与存储器耦合;该存储器用于存储程序或指令;该至少一个处理器用于执行该程序或指令,以使该装置实现前述第一方面或第二方面中的任意一种可能的实现方式所述的方法。The fifth aspect of this application provides a communication device including at least one processor coupled to a memory; the memory is used to store a program or instructions; the at least one processor is used to execute the program or instructions to enable the device to implement the method described in any possible implementation of the first or second aspect.
本申请第六方面提供了一种通信装置,包括至少一个逻辑电路和输入输出接口;该逻辑电路用于执行如前述第一方面或第二方面中的任意一种可能的实现方式所述的方法。The sixth aspect of this application provides a communication device including at least one logic circuit and an input/output interface; the logic circuit is used to perform the method described in any of the possible implementations of the first or second aspect described above.
本申请第七方面提供了一种通信系统,该通信系统包括上述发送端或者接收端。The seventh aspect of this application provides a communication system that includes the aforementioned transmitting end or receiving end.
可选地,该通信系统还包括与该发送端或者接收端进行通信的其它通信装置。Optionally, the communication system may also include other communication devices that communicate with the transmitting end or the receiving end.
本申请第八方面提供一种计算机可读存储介质,该存储介质用于存储一个或多个计算机执行指令,当计算机执行指令被处理器执行时,该处理器执行如上述第一方面或第二方面的任意一种可能的实现方式所述的方法。An eighth aspect of this application provides a computer-readable storage medium for storing one or more computer-executable instructions that, when executed by a processor, perform the method as described in any possible implementation of the first or second aspect above.
本申请第九方面提供一种计算机程序产品(或称计算机程序),当计算机程序产品中的计算机程序被该处理器执行时,该处理器执行上述第一方面或第二方面的任意一种可能的实现方式所述的方法。The ninth aspect of this application provides a computer program product (or computer program) that, when executed by a processor, performs the method described in any possible implementation of the first or second aspect described above.
本申请第十方面提供了一种芯片或芯片系统,该芯片或芯片系统包括至少一个处理器,用于支持通信装置实现上述第一方面或第二方面的任意一种可能的实现方式所述的方法。The tenth aspect of this application provides a chip or chip system including at least one processor for supporting a communication device in implementing the method described in any possible implementation of the first or second aspect above.
在一种可能的设计中,该芯片或芯片系统还可以包括存储器,存储器,用于保存该通信装置必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包含芯片和其他分立器件。可选的,所述芯片系统还包括接口电路,所述接口电路为所述至少一个处理器提供程序指令和/或数据。In one possible design, the chip or chip system may further include a memory for storing program instructions and data necessary for the communication device. The chip system may be composed of chips or may include chips and other discrete devices. Optionally, the chip system may also include interface circuitry that provides program instructions and/or data to the at least one processor.
其中,第三方面至第十方面中任一种设计方式所带来的技术效果可参见上述第一方面或第二方面中不同设计方式所带来的技术效果,在此不再赘述。The technical effects of any of the design methods in aspects three through ten can be found in the technical effects of different design methods in aspects one or two above, and will not be repeated here.
图1a至图1c为本申请提供的通信系统的示意图;Figures 1a to 1c are schematic diagrams of the communication system provided in this application;
图2a至图2e为本申请涉及的AI处理过程的示意图;Figures 2a to 2e are schematic diagrams of the AI processing involved in this application;
图3为本申请实施例中一种通信场景的示意图;Figure 3 is a schematic diagram of a communication scenario in an embodiment of this application;
图4为本申请实施例中又一种通信场景的示意图;Figure 4 is a schematic diagram of another communication scenario in the embodiments of this application;
图5为本申请实施例中通信方法的一种实施例流程示意图;Figure 5 is a flowchart illustrating one embodiment of the communication method in this application.
图6为本申请实施例中第一信息的一种结构示意图;Figure 6 is a schematic diagram of the structure of the first information in an embodiment of this application;
图7为本申请实施例中第一信息的又一种结构示意图;Figure 7 is a schematic diagram of another structure of the first information in an embodiment of this application;
图8为本申请实施例中第二信息的一种结构示意图;Figure 8 is a schematic diagram of a structure of the second information in an embodiment of this application;
图9为本申请实施例中第二信息的另一种结构示意图;Figure 9 is a schematic diagram of another structure of the second information in an embodiment of this application;
图10为本申请实施例中一种第一模型的切分位置示意图;Figure 10 is a schematic diagram of the segmentation position of a first model in an embodiment of this application;
图11为本申请实施例中另一种第一模型的切分位置示意图;Figure 11 is a schematic diagram of the segmentation position of another first model in an embodiment of this application;
图12为本申请实施例中第一模型的又一种结构示意图;Figure 12 is a schematic diagram of another structure of the first model in the embodiments of this application;
图13为本申请实施例中前向交互数据的一种结构示意图;Figure 13 is a schematic diagram of a forward interaction data structure in an embodiment of this application;
图14为本申请实施例中前向交互数据的又一种结构示意图;Figure 14 is a schematic diagram of another structure of forward interaction data in an embodiment of this application;
图15为本申请实施例中第二信息的一种结构示意图;Figure 15 is a schematic diagram of a structure of the second information in an embodiment of this application;
图16为本申请实施例中MAC消息承载前向交互数据的结构示意图;Figure 16 is a schematic diagram of the structure of MAC message carrying forward interaction data in an embodiment of this application;
图17为本申请实施例中MAC消息承载反向交互数据的结构示意图;Figure 17 is a schematic diagram of the structure of MAC messages carrying reverse interaction data in an embodiment of this application;
图18为本申请实施例中一种通信装置1800的结构示意图;Figure 18 is a structural schematic diagram of a communication device 1800 according to an embodiment of this application;
图19为本申请提供的通信装置1900的另一种示意性结构图;Figure 19 is another schematic structural diagram of the communication device 1900 provided in this application;
图20为本申请的实施例提供的通信装置2000的结构示意图;Figure 20 is a schematic diagram of the structure of the communication device 2000 provided in an embodiment of this application;
图21为本申请的实施例提供的通信装置2100的结构示意图。Figure 21 is a schematic diagram of the structure of the communication device 2100 provided in an embodiment of this application.
首先,对本申请实施例中的部分用语进行解释说明,以便于本领域技术人员理解。First, some terms used in the embodiments of this application will be explained to facilitate understanding by those skilled in the art.
(1)终端设备:可以是能够接收网络设备调度和指示信息的无线终端设备,无线终端设备可以是指向用户提供语音和/或数据连通性的设备,或具有无线连接功能的手持式设备,或连接到无线调制解调器的其他处理设备。(1) Terminal device: can be a wireless terminal device that can receive network device scheduling and instruction information. The wireless terminal device can be a device that provides voice and/or data connectivity to the user, or a handheld device with wireless connection function, or other processing device connected to a wireless modem.
终端设备可以经无线接入网(radio access network,RAN)与一个或多个核心网或者互联网进行通信,终端设备可以是移动终端设备,如移动电话(或称为“蜂窝”电话,手机(mobile phone))、计算机和数据卡,例如,可以是便携式、袖珍式、手持式、计算机内置的或者车载的移动装置,它们与无线接入网交换语音和/或数据。例如,个人通信业务(personal communication service,PCS)电话、无绳电话、会话发起协议(session initiation protocol,SIP)话机、无线本地环路(wireless local loop,WLL)站、个人数字助理(personal digital assistant,PDA)、平板电脑(tablet或pad)、带无线收发功能的电脑等设备。无线终端设备也可以称为系统、订户单元(subscriber unit)、订户站(subscriber station),移动站(mobile station)、移动台(mobile station,MS)、远程站(remote station)、接入点(access point,AP)、远程终端设备(remote terminal)、接入终端设备(access terminal)、用户终端设备(user terminal)、用户代理(user agent)、用户站(subscriber station,SS)、用户端设备(customer premises equipment,CPE)、终端(terminal)、用户设备(user equipment,UE)、移动终端(mobile terminal,MT)等。Terminal devices can communicate with one or more core networks or the Internet via a radio access network (RAN). Terminal devices can be mobile terminal devices, such as mobile phones (or "cellular" phones), computers, and data cards. For example, they can be portable, pocket-sized, handheld, computer-embedded, or vehicle-mounted mobile devices that exchange voice and/or data with the RAN. Examples include personal communication service (PCS) phones, cordless phones, session initiation protocol (SIP) phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), tablets, and computers with wireless transceiver capabilities. Wireless terminal equipment can also be referred to as a system, subscriber unit, subscriber station, mobile station, mobile station (MS), remote station, access point (AP), remote terminal, access terminal, user terminal, user agent, subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.
作为示例而非限定,在本申请实施例中,该终端设备还可以是可穿戴设备。可穿戴设备也可以称为穿戴式智能设备或智能穿戴式设备等,是应用穿戴式技术对日常穿戴进行智能化设计、开发出可以穿戴的设备的总称,如眼镜、手套、手表、服饰及鞋等。可穿戴设备即直接穿在身上,或是整合到用户的衣服或配件的一种便携式设备。可穿戴设备不仅仅是一种硬件设备,更是通过软件支持以及数据交互、云端交互来实现强大的功能。广义穿戴式智能设备包括功能全、尺寸大、可不依赖智能手机实现完整或者部分的功能,例如:智能手表或智能眼镜等,以及只专注于某一类应用功能,需要和其它设备如智能手机配合使用,如各类进行体征监测的智能手环、智能头盔、智能首饰等。By way of example and not limitation, in this embodiment, the terminal device can also be a wearable device. Wearable devices, also known as wearable smart devices or smart wearable devices, are a general term for devices that utilize wearable technology to intelligently design and develop everyday wearables, such as glasses, gloves, watches, clothing, and shoes. Wearable devices are portable devices that are worn directly on the body or integrated into the user's clothing or accessories. Wearable devices are not merely hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction. Broadly speaking, wearable smart devices include those that are feature-rich, large in size, and can achieve complete or partial functions without relying on a smartphone, such as smartwatches or smart glasses, as well as those that focus on a specific type of application function and require the use of other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
终端还可以是无人机、机器人、设备到设备通信(device-to-device,D2D)中的终端、车到一切(vehicle to everything,V2X)中的终端、虚拟现实(virtual reality,VR)终端设备、增强现实(augmented reality,AR)终端设备、工业控制(industrial control)中的无线终端、无人驾驶(self driving)中的无线终端、远程医疗(remote medical)中的无线终端、智能电网(smart grid)中的无线终端、运输安全(transportation safety)中的无线终端、智慧城市(smart city)中的无线终端、智慧家庭(smart home)中的无线终端等。Terminals can also be drones, robots, devices in device-to-device (D2D) communication, vehicles to everything (V2X) communication, virtual reality (VR) terminal devices, augmented reality (AR) terminal devices, wireless terminals in industrial control, wireless terminals in self-driving, wireless terminals in remote medical care, wireless terminals in smart grids, wireless terminals in transportation safety, wireless terminals in smart cities, and wireless terminals in smart homes, etc.
此外,终端设备也可以是第五代(5th generation,5G)通信系统之后演进的通信系统(例如5G Advanced或第六代(6th generation,6G)通信系统等)中的终端设备或者未来演进的公共陆地移动网络(public land mobile network,PLMN)中的终端设备等。示例性的,5G Advanced或6G网络可以进一步扩展5G通信终端的形态和功能,6G终端包括但不限于车、蜂窝网络终端(融合卫星终端功能)、无人机、物联网(internet of things,IoT)设备。Furthermore, terminal devices can also be terminal devices in communication systems evolved from fifth-generation (5G) communication systems (such as 5G Advanced or sixth-generation (6G) communication systems), or terminal devices in future public land mobile networks (PLMNs). For example, 5G Advanced or 6G networks can further expand the form and function of 5G communication terminals; 6G terminals include, but are not limited to, vehicles, cellular network terminals (integrating satellite terminal functions), drones, and Internet of Things (IoT) devices.
在本申请实施例中,上述终端设备还可以获得网络设备提供的人工智能(artificial intelligence,AI)服务。可选地,终端设备还可以具有AI处理能力。In this embodiment, the terminal device can also obtain artificial intelligence (AI) services provided by the network device. Optionally, the terminal device can also have AI processing capabilities.
(2)网络设备:可以是无线网络中的设备,例如网络设备可以为将终端设备接入到无线网络的RAN节点(或设备),又可以称为基站。目前,一些RAN设备的举例为:基站(base station)、演进型基站(evolved NodeB,eNodeB)、5G通信系统中的基站gNB(gNodeB)、传输接收点(transmission reception point,TRP)、演进型节点B(evolved Node B,eNB)、无线网络控制器(radio network controller,RNC)、节点B(Node B,NB)、家庭基站(例如,home evolved Node B,或home Node B,HNB)、基带单元(base band unit,BBU),或无线保真(wireless fidelity,Wi-Fi)接入点(access point,AP)等。另外,在一种网络结构中,网络设备可以包括集中单元(centralized unit,CU)节点、或分布单元(distributed unit,DU)节点、或包括CU节点和DU节点的RAN设备。(2) Network equipment: This can be equipment in a wireless network. For example, network equipment can be a RAN node (or device) that connects terminal devices to the wireless network, and can also be called a base station. Currently, some examples of RAN equipment include: base station, evolved NodeB (eNodeB), gNB (gNodeB) in 5G communication systems, transmission reception point (TRP), evolved Node B (eNB), radio network controller (RNC), Node B (NB), home base station (e.g., home evolved Node B, or home Node B, HNB), base band unit (BBU), or wireless fidelity (Wi-Fi) access point (AP), etc. In addition, in a network architecture, network devices may include centralized unit (CU) nodes, distributed unit (DU) nodes, or RAN devices that include both CU and DU nodes.
可选的,RAN节点还可以是宏基站、微基站或室内站、中继节点或施主节点、或者是云无线接入网络(cloud radio access network,CRAN)场景下的无线控制器。RAN节点还可以是服务器,可穿戴设备,车辆或车载设备等。例如,车辆外联(vehicle to everything,V2X)技术中的接入网设备可以为路侧单元(road side unit,RSU)。Optionally, RAN nodes can also be macro base stations, micro base stations or indoor stations, relay nodes or donor nodes, or radio controllers in cloud radio access network (CRAN) scenarios. RAN nodes can also be servers, wearable devices, vehicles, or in-vehicle equipment. For example, the access network equipment in vehicle-to-everything (V2X) technology can be a roadside unit (RSU).
在另一种可能的场景中,由多个RAN节点协作协助终端实现无线接入,不同RAN节点分别实现基站的部分功能。例如,RAN节点可以是CU,DU,CU-控制面(control plane,CP),CU-用户面(user plane,UP),或者无线单元(radio unit,RU)等。CU和DU可以是单独设置,或者也可以包括在同一个网元中,例如基带单元(baseband unit,BBU)中。RU可以包括在射频设备或者射频单元中,例如包括在射频拉远单元(remote radio unit,RRU)、有源天线处理单元(active antenna unit,AAU)、射频头(radio head,RH)或远程射频头(remote radio head,RRH)中。In another possible scenario, multiple RAN nodes collaborate to assist the terminal in achieving wireless access, with different RAN nodes each implementing some of the base station's functions. For example, RAN nodes can be CUs, DUs, CUs (control plane, CP), CUs (user plane, UP), or radio units (RUs). CUs and DUs can be set up separately or included in the same network element, such as a baseband unit (BBU). RUs can be included in radio frequency equipment or radio frequency units, such as remote radio units (RRUs), active antenna units (AAUs), radio heads (RHs), or remote radio heads (RRHs).
在不同系统中,CU(或CU-CP和CU-UP)、DU或RU也可以有不同的名称,但是本领域的技术人员可以理解其含义。例如,在开放式接入网(open RAN,O-RAN或ORAN)系统中,CU也可以称为O-CU(开放式CU),DU也可以称为O-DU,CU-CP也可以称为O-CU-CP,CU-UP也可以称为O-CU-UP,RU也可以称为O-RU。为描述方便,本申请中以CU,CU-CP,CU-UP、DU和RU为例进行描述。本申请中的CU(或CU-CP、CU-UP)、DU和RU中的任一单元,可以是通过软件模块、硬件模块、或者软件模块与硬件模块结合来实现。In different systems, CU (or CU-CP and CU-UP), DU, or RU may have different names, but those skilled in the art will understand their meaning. For example, in an open access network (open RAN, O-RAN, or ORAN) system, CU can also be called O-CU (open CU), DU can also be called O-DU, CU-CP can also be called O-CU-CP, CU-UP can also be called O-CU-UP, and RU can also be called O-RU. For ease of description, this application uses CU, CU-CP, CU-UP, DU, and RU as examples. Any of the units among CU (or CU-CP, CU-UP), DU, and RU in this application can be implemented through software modules, hardware modules, or a combination of software modules and hardware modules.
接入网设备和终端设备之间的通信遵循一定的协议层结构。该协议层可以包括控制面协议层和用户面协议层。控制面协议层可以包括以下至少一项:无线资源控制(radio resource control,RRC)层、分组数据汇聚层协议(packet data convergence protocol,PDCP)层、无线链路控制(radio link control,RLC)层、媒体接入控制(media access control,MAC)层、或物理(physical,PHY)层等。用户面协议层可以包括以下至少一项:业务数据适配协议(service data adaptation protocol,SDAP)层、PDCP层、RLC层、MAC层、或物理层等。Communication between access network devices and terminal devices follows a specific protocol layer structure. This protocol layer may include a control plane protocol layer and a user plane protocol layer. The control plane protocol layer may include at least one of the following: radio resource control (RRC) layer, packet data convergence protocol (PDCP) layer, radio link control (RLC) layer, media access control (MAC) layer, or physical (PHY) layer, etc. The user plane protocol layer may include at least one of the following: service data adaptation protocol (SDAP) layer, PDCP layer, RLC layer, MAC layer, or physical layer, etc.
对于ORAN系统中的网元及其可实现的协议层功能对应关系,可参照下表1。The correspondence between network elements and their achievable protocol layer functions in the ORAN system can be found in Table 1 below.
表1
Table 1
网络设备可以是其它为终端设备提供无线通信功能的装置。本申请的实施例对网络设备所采用的具体技术和具体设备形态不做限定。为方便描述,本申请实施例并不限定。Network devices can be other devices that provide wireless communication functions for terminal devices. The embodiments of this application do not limit the specific technology or form of the network device. For ease of description, the embodiments of this application are not limited.
网络设备还可以包括核心网设备,核心网设备例如包括第四代(4th generation,4G)网络中的移动性管理实体(mobility management entity,MME),归属用户服务器(home subscriber server,HSS),服务网关(serving gateway,S-GW),策略和计费规则功能(policy and charging rules function,PCRF),公共数据网网关(public data network gateway,PDN gateway,P-GW);5G网络中的访问和移动管理功能(access and mobility management function,AMF)、用户面功能(user plane function,UPF)或会话管理功能(session management function,SMF)等网元。此外,该核心网设备还可以包括5G网络以及5G网络的下一代网络中的其他核心网设备。Network equipment may also include core network equipment, such as the Mobility Management Entity (MME), Home Subscriber Server (HSS), Serving Gateway (S-GW), Policy and Charging Rules Function (PCRF), and Public Data Network Gateway (PDN Gateway) in 4G networks; and access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in 5G networks. Furthermore, this core network equipment may also include other core network equipment in 5G networks and next-generation networks of 5G networks.
本申请实施例中,上述网络设备还可以具有AI能力的网络节点,可以为终端或其他网络设备提供AI服务,例如,可以为网络侧(接入网或核心网)的AI节点、算力节点、具有AI能力的RAN节点、具有AI能力的核心网网元等。In this embodiment of the application, the network device may also have network nodes with AI capabilities, which can provide AI services to terminals or other network devices. For example, it may be an AI node, computing node, RAN node with AI capabilities, or core network element with AI capabilities on the network side (access network or core network).
本申请实施例中,用于实现网络设备的功能的装置可以是网络设备,也可以是能够支持网络设备实现该功能的装置,例如芯片系统,该装置可以被安装在网络设备中。在本申请实施例提供的技术方案中,以用于实现网络设备的功能的装置是网络设备为例,描述本申请实施例提供的技术方案。In this application embodiment, the device for implementing the function of the network device can be the network device itself, or it can be a device capable of supporting the network device in implementing that function, such as a chip system, which can be installed in the network device. In the technical solutions provided in this application embodiment, the example of a network device being used to implement the function of the network device is used to describe the technical solutions provided in this application embodiment.
(3)配置与预配置:在本申请中,会同时用到配置与预配置。其中,配置是指网络设备/服务器通过消息或信令将一些参数的配置信息或参数的取值发送给终端,以便终端根据这些取值或信息来确定通信的参数或传输时的资源。预配置与配置类似,可以是网络设备/服务器预先与终端设备协商好的参数信息或参数值,也可以是标准协议规定的基站/网络设备或终端设备采用的参数信息或参数值,还可以是预先存储在基站/服务器或终端设备的参数信息或参数值。本申请对此不做限定。(3) Configuration and Pre-configuration: In this application, both configuration and pre-configuration are used. Configuration refers to the network device/server sending configuration information or parameter values to the terminal via messages or signaling, so that the terminal can determine communication parameters or resources for transmission based on these values or information. Pre-configuration is similar to configuration; it can be parameter information or parameter values pre-negotiated between the network device/server and the terminal device, parameter information or parameter values specified by standard protocols for use by the base station/network device or terminal device, or parameter information or parameter values pre-stored in the base station/server or terminal device. This application does not limit this.
进一步地,这些取值和参数,是可以变化或更新的。Furthermore, these values and parameters can be changed or updated.
(4)本申请实施例中的术语“系统”和“网络”可被互换使用。“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A、同时存在A和B、单独存在B的情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如“A,B和C中的至少一项”包括A,B,C,AB,AC,BC或ABC。以及,除非有特别说明,本申请实施例提及“第一”、“第二”等序数词是用于对多个对象进行区分,不用于限定多个对象的顺序、时序、优先级或者重要程度。(4) The terms "system" and "network" in the embodiments of this application can be used interchangeably. "Multiple" refers to two or more. "And/or" describes the relationship between related objects, indicating that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, or B exists alone, where A and B can be singular or plural. The character "/" generally indicates that the related objects before and after are in an "or" relationship. "At least one of the following" or similar expressions refer to any combination of these items, including any combination of single or plural items. For example, "at least one of A, B and C" includes A, B, C, AB, AC, BC or ABC. And, unless otherwise specified, the ordinal numbers such as "first" and "second" mentioned in the embodiments of this application are used to distinguish multiple objects and are not used to limit the order, sequence, priority or importance of multiple objects.
(5)本申请实施例中的“发送”和“接收”,表示信号传递的走向。例如,“向XX发送信息”可以理解为该信息的目的端是XX,可以包括通过空口直接发送,也包括其他单元或模块通过空口间接发送。“接收来自YY的信息”可以理解为该信息的源端是YY,可以包括通过空口直接从YY接收,也可以包括通过空口从其他单元或模块间接地从YY接收。“发送”也可以理解为芯片接口的“输出”,“接收”也可以理解为芯片接口的“输入”。(5) In the embodiments of this application, "send" and "receive" indicate the direction of signal transmission. For example, "send information to XX" can be understood as the destination of the information being XX, which may include sending directly through the air interface or sending indirectly through the air interface by other units or modules. "Receive information from YY" can be understood as the source of the information being YY, which may include receiving directly from YY through the air interface or receiving indirectly from YY through the air interface by other units or modules. "Send" can also be understood as the "output" of the chip interface, and "receive" can also be understood as the "input" of the chip interface.
换言之,发送和接收可以是在设备之间进行的,例如,网络设备和终端设备之间进行的,也可以是在设备内进行的,例如,通过总线、走线或接口在设备内的部件之间、模组之间、芯片之间、软件模块或者硬件模块之间发送或接收。In other words, sending and receiving can occur between devices, such as between network devices and terminal devices, or within a device, such as between components, modules, chips, software modules, or hardware modules within the device via buses, wiring, or interfaces.
可以理解的是,信息在信息发送的源端和目的端之间可能会被进行必要的处理,比如编码、调制等,但目的端可以理解来自源端的有效信息。本申请中类似的表述可以做相似的理解,不再赘述。It is understandable that information may undergo necessary processing, such as encoding and modulation, between the source and destination, but the destination can understand the valid information from the source. Similar statements in this application can be interpreted in a similar way and will not be elaborated further.
(6)在本申请实施例中,“指示”可以包括直接指示和间接指示,也可以包括显式指示和隐式指示。将某一信息(如下文的指示信息)所指示的信息称为待指示信息,则具体实现过程中,对待指示信息进行指示的方式有很多种,例如但不限于,可以直接指示待指示信息,如待指示信息本身或者该待指示信息的索引等。也可以通过指示其他信息来间接指示待指示信息,其中该其他信息与待指示信息之间存在关联关系;还可以仅仅指示待指示信息的一部分,而待指示信息的其他部分则是已知的或者提前约定的,例如可以借助预先约定(例如协议预定义)的各个信息的排列顺序来实现对特定信息的指示,从而在一定程度上降低指示开销。本申请对于指示的具体方式不作限定。可以理解的是,对于该指示信息的发送方来说,该指示信息可用于指示待指示信息,对于指示信息的接收方来说,该指示信息可用于确定待指示信息。(6) In the embodiments of this application, "instruction" may include direct instruction and indirect instruction, as well as explicit instruction and implicit instruction. The information indicated by a certain piece of information (hereinafter referred to as instruction information) is called the information to be instructed. In the specific implementation process, there are many ways to indicate the information to be instructed, such as, but not limited to, directly indicating the information to be instructed, such as the information to be instructed itself or its index. It can also indirectly indicate the information to be instructed by indicating other information, where there is an association between the other information and the information to be instructed; or it can only indicate a part of the information to be instructed, while the other parts of the information to be instructed are known or pre-agreed upon. For example, the instruction of specific information can be achieved by using a pre-agreed (e.g., protocol predefined) arrangement order of various information, thereby reducing the instruction overhead to a certain extent. This application does not limit the specific method of instruction. It is understood that for the sender of the instruction information, the instruction information can be used to indicate the information to be instructed, and for the receiver of the instruction information, the instruction information can be used to determine the information to be instructed.
本申请中,除特殊说明外,各个实施例之间相同或相似的部分可以互相参考。在本申请中各个实施例、以及各实施例中的各个方法/设计/实现方式中,如果没有特殊说明以及逻辑冲突,不同的实施例之间、以及各实施例中的各个方法/设计/实现方式之间的术语和/或描述具有一致性、且可以相互引用,不同的实施例、以及各实施例中的各个方法/设计/实现方式中的技术特征根据其内在的逻辑关系可以组合形成新的实施例、方法、或实现方式。以下的本申请实施方式并不构成对本申请保护范围的限定。In this application, unless otherwise specified, the same or similar parts between the various embodiments can be referred to each other. In the various embodiments of this application, and the various methods/designs/implementations within each embodiment, unless otherwise specified or logically conflicting, the terminology and/or descriptions between different embodiments and between the various methods/designs/implementations within each embodiment are consistent and can be mutually referenced. The technical features in different embodiments and the various methods/designs/implementations within each embodiment can be combined to form new embodiments, methods, or implementations based on their inherent logical relationships. The following embodiments of this application do not constitute a limitation on the scope of protection of this application.
本申请可以应用于长期演进(long term evolution,LTE)系统、新无线(new radio,NR)系统,或者是5G之后演进的通信系统(例如6G等)。其中,该通信系统中包括至少一个网络设备和/或至少一个终端设备。This application can be applied to long-term evolution (LTE) systems, new radio (NR) systems, or communication systems evolving after 5G (such as 6G). The communication system includes at least one network device and/or at least one terminal device.
请参阅图1a,为本申请中通信系统的一种示意图。图1a中,示例性的示出了一个网络设备和6个终端设备,6个终端设备分别为终端设备1、终端设备2、终端设备3、终端设备4、终端设备5以及终端设备6等。在图1a所示的示例中,是以终端设备1为智能茶杯,终端设备2为智能空调,终端设备3为智能加油机,终端设备4为交通工具,终端设备5为手机,终端设备6为打印机进行举例说明的。Please refer to Figure 1a, which is a schematic diagram of a communication system according to this application. Figure 1a exemplarily shows one network device and six terminal devices, namely terminal device 1, terminal device 2, terminal device 3, terminal device 4, terminal device 5, and terminal device 6. In the example shown in Figure 1a, terminal device 1 is a smart teacup, terminal device 2 is a smart air conditioner, terminal device 3 is a smart gas pump, terminal device 4 is a vehicle, terminal device 5 is a mobile phone, and terminal device 6 is a printer.
如图1a所示,AI配置信息的发送实体可以为网络设备。AI配置信息的接收实体可以为终端设备1-终端设备6,此时,网络设备和终端设备1-终端设备6组成一个通信系统,在该通信系统中,终端设备1-终端设备6可以发送数据给网络设备,网络设备需要接收终端设备1-终端设备6发送的数据。同时,网络设备可以向终端设备1-终端设备6发送配置信息。As shown in Figure 1a, the entity sending the AI configuration information can be a network device. The entity receiving the AI configuration information can be terminal devices 1-6. In this case, the network device and terminal devices 1-6 form a communication system. In this communication system, terminal devices 1-6 can send data to the network device, and the network device needs to receive the data sent by terminal devices 1-6. At the same time, the network device can send configuration information to terminal devices 1-6.
示例性的,在图1a中,终端设备4-终端设备6也可以组成一个通信系统。其中,终端设备5作为网络设备,即AI配置信息的发送实体;终端设备4和终端设备6作为终端设备,即AI配置信息的接收实体。例如车联网系统中,终端设备5分别向终端设备4和终端设备6发送AI配置信息,并且接收终端设备4和终端设备6发送的数据;相应的,终端设备4和终端设备6接收终端设备5发送的AI配置信息,并向终端设备5发送数据。For example, in Figure 1a, terminal devices 4 and 6 can also form a communication system. Terminal device 5 acts as a network device, i.e., the entity sending AI configuration information; terminal devices 4 and 6 act as terminal devices, i.e., the entities receiving AI configuration information. For instance, in a vehicle-to-everything (V2X) system, terminal device 5 sends AI configuration information to terminal devices 4 and 6 respectively, and receives data sent by terminal devices 4 and 6; correspondingly, terminal devices 4 and 6 receive the AI configuration information sent by terminal device 5 and send data back to terminal device 5.
以图1a所示通信系统为例,不同的设备之间(包括网络设备与网络设备之间,网络设备与终端设备之间,和/或,终端设备和终端设备之间)除了执行通信相关业务之外,还有可能执行AI相关业务。Taking the communication system shown in Figure 1a as an example, in addition to performing communication-related services, different devices (including network devices and network devices, network devices and terminal devices, and/or terminal devices and terminal devices) may also perform AI-related services.
如图1b所示,以网络设备为基站为例,基站可以与一个或多个终端设备之间可以执行通信相关业务和AI相关业务,不同终端设备之间也可以执行通信相关业务和AI相关业务。As shown in Figure 1b, taking a network device as a base station as an example, the base station can perform communication-related services and AI-related services with one or more terminal devices, and different terminal devices can also perform communication-related services and AI-related services.
如图1c所示,以终端设备包括电视和手机为例,电视和手机之间也可以执行通信相关业务和AI相关业务。As shown in Figure 1c, taking terminal devices including televisions and mobile phones as an example, communication-related services and AI-related services can also be performed between televisions and mobile phones.
本申请提供的技术方案可以应用于无线通信系统(例如图1a、图1b或图1c所示系统),例如本申请提供的通信系统中可以引入AI网元来实现部分或全部AI相关的操作。AI网元也可以称为AI节点、AI设备、AI实体、AI模块、AI模型、或AI单元等。AI网元可以是内置在通信系统的网元中。例如,AI网元可以是内置在:接入网设备、核心网设备、云服务器、或操作管理维护(operation,administration and maintenance,OAM)中的AI模块,用以实现AI相关的功能。OAM可以作为核心网设备的网管和/或作为接入网设备的网管。或者,AI网元也可以是通信系统中独立设置的网元。可选的,终端或终端内置的芯片中也可以包括AI实体,用于实现AI相关的功能。The technical solutions provided in this application can be applied to wireless communication systems (such as the systems shown in Figures 1a, 1b, or 1c). For example, AI network elements can be introduced into the communication system provided in this application to implement some or all AI-related operations. AI network elements can also be called AI nodes, AI devices, AI entities, AI modules, AI models, or AI units, etc. AI network elements can be built into network elements within the communication system. For example, an AI network element can be an AI module built into: access network equipment, core network equipment, cloud servers, or operation, administration, and maintenance (OAM) systems to implement AI-related functions. OAM can act as the network management system for core network equipment and/or access network equipment. Alternatively, AI network elements can also be independently configured network elements within the communication system. Optionally, the terminal or its built-in chip can also include an AI entity to implement AI-related functions.
下面将本申请中可能涉及到的AI进行简要介绍。The following is a brief introduction to the AI that may be involved in this application.
AI可以让机器具有人类的智能,例如可以让机器应用计算机的软硬件来模拟人类某些智能行为。为了实现人工智能,可以采用机器学习方法。机器学习方法中,机器利用训练数据学习(或训练)得到模型。该模型表征了从输入到输出之间的映射。学习得到的模型可以用于进行推理(或预测),即可以利用该模型预测出给定输入所对应的输出。其中,该输出还可以称为推理结果(或预测结果)。AI can endow machines with human-like intelligence, for example, allowing them to use computer hardware and software to simulate certain intelligent human behaviors. To achieve artificial intelligence, machine learning methods can be employed. In machine learning, machines learn (or train) a model using training data. This model represents the mapping between inputs and outputs. The learned model can be used for reasoning (or prediction), that is, it can be used to predict the output corresponding to a given input. This output can also be called the reasoning result (or prediction result).
机器学习可以包括监督学习、无监督学习、和强化学习。其中,无监督学习还可以称为非监督学习。Machine learning can include supervised learning, unsupervised learning, and reinforcement learning. Unsupervised learning can also be called learning without supervision.
监督学习依据已采集到的样本值和样本标签,利用机器学习算法学习样本值到样本标签的映射关系,并用AI模型来表达学到的映射关系。训练机器学习模型的过程就是学习这种映射关系的过程。在训练过程中,将样本值输入模型得到模型的预测值,通过计算模型的预测值与样本标签(理想值)之间的误差来优化模型参数。映射关系学习完成后,就可以利用学到的映射来预测新的样本标签。监督学习学到的映射关系可以包括线性映射或非线性映射。根据标签的类型可将学习的任务分为分类任务和回归任务。Supervised learning, based on collected sample values and labels, uses machine learning algorithms to learn the mapping relationship between sample values and labels, and then expresses this learned mapping relationship using an AI model. The process of training the machine learning model is the process of learning this mapping relationship. During training, sample values are input into the model to obtain the model's predicted values, and the model parameters are optimized by calculating the error between the model's predicted values and the sample labels (ideal values). After the mapping relationship is learned, it can be used to predict new sample labels. The mapping relationship learned in supervised learning can include linear or non-linear mappings. Based on the type of label, the learning task can be divided into classification tasks and regression tasks.
无监督学习依据采集到的样本值,利用算法自行发掘样本的内在模式。无监督学习中有一类算法将样本自身作为监督信号,即模型学习从样本到样本的映射关系,称为自监督学习。训练时,通过计算模型的预测值与样本本身之间的误差来优化模型参数。自监督学习可用于信号压缩及解压恢复的应用,常见的算法包括自编码器和对抗生成型网络等。Unsupervised learning relies on collected sample values to discover inherent patterns within the samples themselves. One type of unsupervised learning algorithm uses the samples themselves as supervisory signals, meaning the model learns the mapping relationship from sample to sample; this is called self-supervised learning. During training, model parameters are optimized by calculating the error between the model's predictions and the samples themselves. Self-supervised learning can be used for signal compression and decompression recovery applications; common algorithms include autoencoders and generative adversarial networks.
强化学习不同于监督学习,是一类通过与环境进行交互来学习解决问题的策略的算法。与监督、无监督学习不同,强化学习问题并没有明确的“正确的”动作标签数据,算法需要与环境进行交互,获取环境反馈的奖励信号,进而调整决策动作以获得更大的奖励信号数值。如下行功率控制中,强化学习模型根据无线网络反馈的系统总吞吐率,调整各个用户的下行发送功率,进而期望获得更高的系统吞吐率。强化学习的目标也是学习环境状态与较优(例如最优)决策动作之间的映射关系。但因为无法事先获得“正确动作”的标签,所以不能通过计算动作与“正确动作”之间的误差来优化网络。强化学习的训练是通过与环境的迭代交互而实现的。Reinforcement learning, unlike supervised learning, is a type of algorithm that learns problem-solving strategies through interaction with the environment. Unlike supervised and unsupervised learning, reinforcement learning problems do not have explicit "correct" action labels. The algorithm needs to interact with the environment to obtain reward signals from the environment, and then adjust its decision actions to obtain a larger reward signal value. For example, in downlink power control, the reinforcement learning model adjusts the downlink transmission power of each user based on the total system throughput feedback from the wireless network, aiming to achieve a higher system throughput. The goal of reinforcement learning is also to learn the mapping relationship between the environment state and a better (e.g., optimal) decision action. However, because the label of the "correct action" cannot be obtained in advance, the network cannot be optimized by calculating the error between the action and the "correct action." Reinforcement learning training is achieved through iterative interaction with the environment.
神经网络(neural network,NN)是机器学习技术中的一种具体的模型。根据通用近似定理,神经网络在理论上可以逼近任意连续函数,从而使得神经网络具备学习任意映射的能力。传统的通信系统需要借助丰富的专家知识来设计通信模块,而基于神经网络的深度学习通信系统可以从大量的数据集中自动发现隐含的模式结构,建立数据之间的映射关系,获得优于传统建模方法的性能。Neural networks (NNs) are a specific model in machine learning techniques. According to the general approximation theorem, neural networks can theoretically approximate any continuous function, thus enabling them to learn arbitrary mappings. Traditional communication systems rely on extensive expert knowledge to design communication modules, while deep learning communication systems based on neural networks can automatically discover hidden pattern structures from large datasets, establish mapping relationships between data, and achieve performance superior to traditional modeling methods.
神经网络的思想来源于大脑组织的神经元结构。例如,每个神经元都对其输入值进行加权求和运算,通过一个激活函数输出运算结果。The idea behind neural networks comes from the neuronal structure of the brain. For example, each neuron performs a weighted summation of its input values and outputs the result through an activation function.
如图2a所示,为神经元结构的一种示意图。假设神经元的输入为x=[x0,x1,…,xn],与各个输入对应的权值分别为w=[w,w1,…,wn],其中,n为正整数,wi和xi可以是小数、整数(例如0、正整数或负整数等)、或复数等各种可能的类型。wi作为xi的权值,用于对xi进行加权。根据权值对输入值进行加权求和的偏置例如为b。激活函数的形式可以有多种,假设一个神经元的激活函数为:y=f(z)=max(0,z),则该神经元的输出为: 再例如,一个神经元的激活函数为:y=f(z)=z,则该神经元的输出为:其中,b可以是小数、整数(例如0、正整数或负整数)、或复数等各种可能的类型。神经网络中不同神经元的激活函数可以相同或不同。Figure 2a shows a schematic diagram of a neuron structure. Assume the neuron's input is x = [ x0 , x1 , ..., xn ], and the corresponding weights for each input are w = [w, w1 , ..., wn ], where n is a positive integer, and w<sub>i</sub> and xi can be decimals, integers (e.g., 0, positive integers, or negative integers), or complex numbers, etc. w<sub>i</sub> is used as the weight for xi , and is used to weight xi . The bias for the weighted summation of the input values based on the weights is, for example, b. The activation function can take many forms. Assuming a neuron's activation function is y = f(z) = max(0, z), then the neuron's output is: For example, if the activation function of a neuron is y = f(z) = z, then the output of that neuron is: Here, b can be any possible type, such as a decimal, an integer (e.g., 0, a positive integer, or a negative integer), or a complex number. The activation functions of different neurons in a neural network can be the same or different.
此外,神经网络一般包括多个层,每层可包括一个或多个神经元。通过增加神经网络的深度和/或宽度,能够提高该神经网络的表达能力,为复杂系统提供更强大的信息提取和抽象建模能力。其中,神经网络的深度可以是指神经网络包括的层数,每层包括的神经元个数可以称为该层的宽度。在一种实现方式中,神经网络包括输入层和输出层。神经网络的输入层将接收到的输入信息经过神经元处理,将处理结果传递给输出层,由输出层得到神经网络的输出结果。在另一种实现方式中,神经网络包括输入层、隐藏层和输出层。神经网络的输入层将接收到的输入信息经过神经元处理,将处理结果传递给中间的隐藏层,隐藏层对接收的处理结果进行计算,得到计算结果,隐藏层将计算结果传递给输出层或者下一个相邻的隐藏层,最终由输出层得到神经网络的输出结果。其中,一个神经网络可以包括一个隐藏层,或者包括多个依次连接的隐藏层,不予限制。Furthermore, neural networks generally consist of multiple layers, each of which may include one or more neurons. Increasing the depth and/or width of a neural network can improve its expressive power, providing more powerful information extraction and abstract modeling capabilities for complex systems. The depth of a neural network can refer to the number of layers it includes, and the number of neurons in each layer can be called the width of that layer. In one implementation, a neural network includes an input layer and an output layer. The input layer processes the received input information through neurons and passes the processing result to the output layer, which then obtains the output of the neural network. In another implementation, a neural network includes an input layer, hidden layers, and an output layer. The input layer processes the received input information through neurons and passes the processing result to the hidden layer. The hidden layer calculates the received processing result and passes the calculation result to the output layer or the next adjacent hidden layer, ultimately obtaining the output of the neural network. A neural network may include one hidden layer or multiple sequentially connected hidden layers, without limitation.
神经网络例如为深度神经网络(deep neural network,DNN)。根据网络的构建方式,DNN可以包括前馈神经网络(feedforward neural network,FNN)、卷积神经网络(convolutional neural networks,CNN)和递归神经网络(recurrent neural network,RNN)。Neural networks, for example, are deep neural networks (DNNs). Depending on how the network is constructed, DNNs can include feedforward neural networks (FNNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs).
图2b为一种FNN网络示意图。FNN网络的特点为相邻层的神经元之间两两完全相连。该特点使得FNN通常需要大量的存储空间、导致较高的计算复杂度。Figure 2b is a schematic diagram of an FNN network. A characteristic of FNN networks is that neurons in adjacent layers are completely connected pairwise. This characteristic makes FNNs typically require a large amount of storage space, leading to high computational complexity.
CNN是一种专门来处理具有类似网格结构的数据的神经网络。例如,时间序列数据(例如时间轴离散采样)和图像数据(例如二维离散采样)都可以认为是类似网格结构的数据。CNN并不一次性利用全部的输入信息做运算,而是采用一个固定大小的窗截取部分信息做卷积运算,这就大大降低了模型参数的计算量。另外根据窗截取的信息类型的不同(如同一副图中的人和物为不同类型信息),每个窗可以采用不同的卷积核运算,这使得CNN能更好的提取输入数据的特征。CNNs are neural networks specifically designed to process data with a grid-like structure. For example, time-series data (e.g., discrete sampling along a time axis) and image data (e.g., two-dimensional discrete sampling) can both be considered grid-like data. CNNs do not use all the input information at once for computation; instead, they use a fixed-size window to extract a portion of the information for convolution operations, which significantly reduces the computational cost of model parameters. Furthermore, depending on the type of information extracted by the window (e.g., people and objects in an image represent different types of information), each window can use different convolution kernels, allowing CNNs to better extract features from the input data.
RNN是一类利用反馈时间序列信息的DNN网络。RNN的输入包括当前时刻的新的输入值和自身在前一时刻的输出值。RNN适合获取在时间上具有相关性的序列特征,特别适用于语音识别、信道编译码等应用。Recurrent Neural Networks (RNNs) are a type of distributed neural network (DNN) that utilizes feedback time-series information. The input to an RNN includes the current input value and its own output value from the previous time step. RNNs are well-suited for acquiring temporally correlated sequence features, and are particularly applicable to applications such as speech recognition and channel coding/decoding.
在上述机器学习的模型训练过程中,可以定义损失函数。损失函数描述了模型的输出值和理想目标值之间的差距或差异。损失函数可以通过多种形式体现,对于损失函数的具体形式不予限制。模型训练过程可以看作以下过程:通过调整模型的部分或全部参数,使得损失函数的值小于门限值或者满足目标需求。In the model training process described above for machine learning, a loss function can be defined. The loss function describes the difference or discrepancy between the model's output value and the ideal target value. The loss function can be expressed in various forms, and there are no restrictions on its specific form. The model training process can be viewed as follows: by adjusting some or all of the model's parameters, the value of the loss function is made to be less than a threshold value or to meet the target requirement.
模型还可以被称为AI模型、规则或者其他名称等。AI模型可以认为是实现AI功能的具体方法。AI模型表征了模型的输入和输出之间的映射关系或者函数。AI功能可以包括以下一项或多项:数据收集、模型训练(或模型学习)、模型信息发布、模型推断(或称为模型推理、推理、或预测等)、模型监控或模型校验、或推理结果发布等。AI功能还可以称为AI(相关的)操作、或AI相关的功能。A model can also be called an AI model, a rule, or other names. An AI model can be considered a specific method for implementing AI functions. An AI model represents the mapping relationship or function between the model's input and output. AI functions can include one or more of the following: data collection, model training (or model learning), model information dissemination, model inference (or model reasoning, inference, or prediction, etc.), model monitoring or model validation, or inference result publication, etc. AI functions can also be called AI (related) operations or AI-related functions.
下面将结合附图,对全连接神经网络的实现过程进行示例性描述。其中,全连接神经网络,又叫多层感知机(multilayer perceptron,MLP)。The implementation process of a fully connected neural network will be described below with reference to the accompanying drawings. A fully connected neural network is also called a multilayer perceptron (MLP).
如图2c所示,一个MLP包含一个输入层(左侧),一个输出层(右侧),及多个隐藏层(中间)。其中,MLP的每层包含若干个节点,称为神经元。其中,相邻两层的神经元间两两相连。As shown in Figure 2c, an MLP consists of an input layer (left side), an output layer (right side), and multiple hidden layers (middle). Each layer of an MLP contains several nodes, called neurons. Neurons in adjacent layers are connected pairwise.
可选的,考虑相邻两层的神经元,下一层的神经元的输出h为所有与之相连的上一层神经元x的加权和并经过激活函数,可以表示为:
h=f(wx+b)。Optionally, considering neurons in two adjacent layers, the output h of a neuron in the next layer is the weighted sum of all neurons x in the previous layer connected to it, after passing through an activation function, and can be expressed as:
h = f(wx + b).
其中,w为权重矩阵,b为偏置向量,f为激活函数。Where w is the weight matrix, b is the bias vector, and f is the activation function.
进一步可选的,神经网络的输出可以递归表达为:
y=fn(wnfn-1(…)+bn)。Alternatively, the output of the neural network can be recursively expressed as:
y=f n (w n f n-1 (…)+b n ).
其中,n是神经网络层的索引,n大于或等于1,且n小于或等于N,其中N为神经网络的总层数。Where n is the index of the neural network layer, n is greater than or equal to 1 and less than or equal to N, where N is the total number of layers in the neural network.
换言之,可以将神经网络理解为一个从输入数据集合到输出数据集合的映射关系。而通常神经网络都是随机初始化的,用已有数据从随机的w和b得到这个映射关系的过程被称为神经网络的训练。In other words, a neural network can be understood as a mapping from an input data set to an output data set. Neural networks are typically initialized randomly; the process of obtaining this mapping from random values w and b using existing data is called training the neural network.
可选的,训练的具体方式为采用损失函数(loss function)对神经网络的输出结果进行评价。Optionally, the training process can involve using a loss function to evaluate the output of the neural network.
如图2d所示,可以将误差反向传播,通过梯度下降的方法即能迭代优化神经网络参数(包括w和b),直到损失函数达到最小值,即图2d中的“较优点(例如最优点)”。可以理解的是,图2d中的“较优点(例如最优点)”对应的神经网络参数可以作为训练好的AI模型信息中的神经网络参数。As shown in Figure 2d, the error can be backpropagated, and the neural network parameters (including w and b) can be iteratively optimized using gradient descent until the loss function reaches its minimum, which is the "better point (e.g., the optimal point)" in Figure 2d. It can be understood that the neural network parameters corresponding to the "better point (e.g., the optimal point)" in Figure 2d can be used as the neural network parameters in the trained AI model information.
进一步可选的,梯度下降的过程可以表示为:
Alternatively, the gradient descent process can be represented as:
其中,θ为待优化参数(包括w和b),L为损失函数,η为学习率,控制梯度下降的步长,表示求导运算,表示对L求θ的导数。Where θ represents the parameters to be optimized (including w and b), L is the loss function, and η is the learning rate, controlling the step size of gradient descent. This represents the differentiation operation. This indicates taking the derivative of θ with respect to L.
进一步可选的,反向传播的过程利用到求偏导的链式法则。Alternatively, the backpropagation process can utilize the chain rule for partial derivatives.
如图2e所示,前一层参数的梯度可以由后一层参数的梯度递推计算得到,可以表达为:
As shown in Figure 2e, the gradient of the parameters in the previous layer can be recursively calculated from the gradient of the parameters in the next layer, and can be expressed as:
其中,wij为节点j连接节点i的权重,si为节点i上的输入加权和。Where w<sub> ij </sub> is the weight connecting node j to node i, and s <sub>i </sub> is the weighted sum of the inputs at node i.
在当前的通信系统中,AI模型通常是在网络设备一侧或者终端设备一侧进行单侧训练,然后将训练好的模型发送给对端部署。然而,受限于终端设备的能力或者数据隐私等问题,单侧训练可能无法实现。此外,单侧训练得到的模型可能存在泛化性差等问题。因此,如何实现双侧训练模型成为目前亟待解决的问题。In current communication systems, AI models are typically trained on one side of the network device or the terminal device, and then the trained model is sent to the other end for deployment. However, due to limitations in the capabilities of the terminal device or data privacy issues, one-sided training may not be feasible. Furthermore, models trained on one side may suffer from poor generalization. Therefore, how to achieve two-sided training of models has become an urgent problem to be solved.
基于此,本申请实施例提出一种通信方法以及相关装置。首先,发送端向接收端发送第一信息,第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,其中,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型。然后,发送端接收该接收端发送的第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。通过上述方法,实现发送端与接收端协同训练神经网络模型,可以灵活适用于多种不同的场景和任务,提升了兼容性。此外,发送端向接收端发送的第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,通过向接收端发送丰富的训练相关数据,提升了模型的泛化性。Based on this, this application proposes a communication method and related apparatus. First, the sending end sends first information to the receiving end. The first information includes one or more first data and one or more first training label data, wherein the first data includes data output from the intermediate layer of the first model, and the first training label data is used to update the first model. Then, the sending end receives second information sent by the receiving end, the second information indicating model update data output in reverse from the intermediate layer of the first model. Through the above method, the sending end and receiving end can collaboratively train a neural network model, which can be flexibly applied to various different scenarios and tasks, improving compatibility. Furthermore, the first information sent by the sending end to the receiving end includes one or more first data and one or more first training label data; by sending rich training-related data to the receiving end, the generalization ability of the model is improved.
接下来,介绍本申请实施例适用的通信场景。请参阅图3,图3为本申请实施例中一种通信场景的示意图。本申请实施例涉及的一种通信场景,包括:发送端和接收端,其中,将发送端向接收端的方向定义为前向,将接收端向发送端的方向定义为反向(或者后向)。发送端向接收端发送前向交互数据,该前向交互数据包括第一信息;接收端向发送端发送反向交互数据(或者后向交互数据),该反向交互数据包括第二信息。Next, the communication scenarios applicable to the embodiments of this application are described. Please refer to Figure 3, which is a schematic diagram of a communication scenario according to an embodiment of this application. The communication scenario involved in this embodiment includes: a sending end and a receiving end, wherein the direction from the sending end to the receiving end is defined as forward, and the direction from the receiving end to the sending end is defined as reverse (or backward). The sending end sends forward interaction data to the receiving end, the forward interaction data including first information; the receiving end sends reverse interaction data (or backward interaction data) to the sending end, the reverse interaction data including second information.
在一种可能的实现方式中,该发送端可以是终端设备,该接收端可以是网络设备,例如图4所示,图4为本申请实施例中又一种通信场景的示意图。在图4示意的通信场景中,终端设备作为发送端向作为接收端的网络设备发送前向交互数据,网络设备向终端设备发送反向交互数据。In one possible implementation, the sending end can be a terminal device, and the receiving end can be a network device, as shown in Figure 4, which is a schematic diagram of another communication scenario in an embodiment of this application. In the communication scenario illustrated in Figure 4, the terminal device, as the sending end, sends forward interaction data to the network device, which is the receiving end, and the network device sends reverse interaction data to the terminal device.
另一种可能的实现方式中,该发送端可以是网络设备,该接收端可以是终端设备。In another possible implementation, the sending end can be a network device, and the receiving end can be a terminal device.
在另一种可能的实现方式中,该发送端可以是网络设备,该接收端可以是另一台网络设备。In another possible implementation, the sender can be a network device, and the receiver can be another network device.
在另一种可能的实现方式中,该发送端可以是终端设备,该接收端可以是另一台终端设备。In another possible implementation, the sending end can be a terminal device, and the receiving end can be another terminal device.
结合上述通信场景,下面介绍本申请实施例提出的通信方法。请参阅图5,图5为本申请实施例中通信方法的一种实施例流程示意图。本申请实施例提出的一种通信方法,包括:Based on the above communication scenarios, the communication method proposed in this application is described below. Please refer to Figure 5, which is a schematic flowchart of an embodiment of the communication method in this application. The communication method proposed in this application includes:
S1、发送端向接收端发送配置信息,和/或,接收端向发送端发送配置信息。S1. The sending end sends configuration information to the receiving end, and/or the receiving end sends configuration information to the sending end.
步骤S1中,当发送端与接收端之间需要协同训练神经网络模型时,发送端可以向接收端发送配置信息。本申请实施例中,将待训练的神经网络模型称为第一模型,第一模型可以包括一个或多个神经网络模型。In step S1, when the sending end and the receiving end need to collaboratively train the neural network model, the sending end can send configuration information to the receiving end. In this embodiment, the neural network model to be trained is referred to as the first model, which may include one or more neural network models.
此外,当发送端与接收端之间需要协同训练第一模型时,可以是发送端向接收端发送配置信息,也可以是接收端向发送端发送该配置信息。还可以是发送端与接收端分别向对端发送配置信息,使得两端的配置信息保持一致。通过上述方法,确保发送端与接收端训练的是相同的神经网络模型(即第一模型)。Furthermore, when the sending and receiving ends need to collaboratively train the first model, the sending end can send configuration information to the receiving end, or the receiving end can send the configuration information to the sending end. Alternatively, both the sending and receiving ends can send configuration information to each other, ensuring that the configuration information at both ends remains consistent. Through these methods, it is ensured that the sending and receiving ends are training the same neural network model (i.e., the first model).
可选地,在发送端与接收端训练第一模型的过程中,第一模型相关的配置信息也可以发生变化。此时,发送端或者接收端可以向对端发送变化后的配置信息,使得两端的配置信息保持一致。通过上述方法,确保发送端与接收端训练的是相同的神经网络模型(即第一模型)。例如,当训练过程中第一模型的切分信息(切分位置和/或切分位置数量)发生变化,则发送端与接收端之间需要交互配置信息(包括更新后的切分信息),使得发送端与接收端的第一模型保持一致。换言之,步骤S1与后续步骤S2~步骤S3的执行顺序此处不作限制。Optionally, during the training of the first model at the sending and receiving ends, the configuration information related to the first model may also change. In this case, either the sending or receiving end can send the changed configuration information to the other end to ensure consistency between the configuration information at both ends. This method ensures that the sending and receiving ends are training the same neural network model (i.e., the first model). For example, if the segmentation information (segmentation position and/or the number of segmentation positions) of the first model changes during training, the sending and receiving ends need to exchange configuration information (including the updated segmentation information) to ensure consistency between their first models. In other words, the execution order of step S1 and subsequent steps S2-S3 is not restricted here.
可选地,第一模型可以包括两部分:发送端模型和接收端模型,其中,发送端训练第一模型中的发送端模型,接收端训练第一模型中的接收端模型。Optionally, the first model may include two parts: a transmitter model and a receiver model, wherein the transmitter trains the transmitter model in the first model, and the receiver trains the receiver model in the first model.
下面介绍该配置信息。具体的,该配置信息用于配置训练第一模型的训练任务,该配置信息包括以下任意一项或多项信息:The following describes this configuration information. Specifically, this configuration information is used to configure the training task for training the first model, and it includes one or more of the following:
模型信息,其中,模型信息包括以下任意一项或多项信息:第一模型的标识信息、第一模型的结构信息、第一模型的训练任务信息、第一模型的初始化参数,或者,第一模型的待更新参数;Model information, wherein the model information includes any one or more of the following: the identifier of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated;
切分信息,其中,切分信息包括:第一模型的切分位置数量,和/或,切分位置在第一模型的位置信息;Segmentation information, wherein the segmentation information includes: the number of segmentation positions in the first model, and/or, the position information of the segmentation positions in the first model;
或者,训练信息,其中,训练信息包括以下任意一项或多项信息:训练标签集信息、验证集信息、训练迭代次数、学习率,或者损失函数,训练标签集包括一个或多个训练标签。Alternatively, training information, which includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels.
示例性地,当发送端与接收端之间并行训练多个第一模型时,第一模型的标识信息可以包括多个标识信息,每个标识信息用于指示一个第一模型。当发送端与接收端之间并行训练的第一模型发生变化时,配置信息包括的第一模型的标识信息需要更新。例如,更新前,发送端与接收端之间训练第一模型#1和第一模型#2,则更新前的配置信息中第一模型的标识信息包括:第一模型#1的标识(ID)和第一模型#2的ID。更新后,发送端与接收端之间训练第一模型#3和第一模型#4,则更新后的配置信息中第一模型的标识信息包括:第一模型#3的ID和第一模型#4的ID。For example, when multiple first models are trained in parallel between the sending end and the receiving end, the identification information of the first model may include multiple identification information, each of which indicates a first model. When the first model trained in parallel between the sending end and the receiving end changes, the identification information of the first model included in the configuration information needs to be updated. For example, before the update, if the sending end and the receiving end train first model #1 and first model #2, then the identification information of the first model in the configuration information before the update includes: the identifier (ID) of first model #1 and the ID of first model #2. After the update, if the sending end and the receiving end train first model #3 and first model #4, then the identification information of the first model in the updated configuration information includes: the ID of first model #3 and the ID of first model #4.
示例性地,该第一模型的训练任务信息指示该第一模型所适用的场景(或者环境),例如:重构场景、检测场景、室外场景或者室内场景等。For example, the training task information of the first model indicates the scenario (or environment) in which the first model is applicable, such as: reconstruction scenario, detection scenario, outdoor scenario, or indoor scenario.
需要说明的是,本申请实施例中的切分位置用于指示第一模型中哪个中间层的输出位置所输出的数据作为第一数据,该输出位置所输出的数据经过了该中间层以及该中间层之前的其他层(例如其他中间层)的处理。It should be noted that the splitting position in this embodiment is used to indicate which intermediate layer in the first model outputs data as the first data, and the data output at that output position has been processed by that intermediate layer and other layers (e.g., other intermediate layers) before that intermediate layer.
示例性地,切分信息中第一模型的切分位置数量为2,该切分信息中切分位置在第一模型的位置信息分别是:激活函数层(Relu)#1,和,激活函数层(Relu)#4。则该切分信息所指示的切分位置:切分位置#1和切分位置#2,如图10和图11所示。图10为本申请实施例中一种第一模型的切分位置示意图,图11为本申请实施例中另一种第一模型的切分位置示意图。以第一模型是卷积神经网络(CNN)为例,第一模型包括多个卷积层和多个激活函数层,其中,以卷积层#1为例,该卷积层#1的权重(weights,W)是尺寸为192x192x1x7的矩阵,该卷积层#1的偏置(Biases,B)是192。图10~图11所示的是第一模型的部分结构,其中,切分位置#1具体是第一模型中激活函数层(Relu)#1的输出,切分位置#2具体是第一模型中激活函数层(Relu)#4的输出。For example, the number of segmentation positions of the first model in the segmentation information is 2. The position information of the segmentation positions in the first model in the segmentation information are: ReLU activation layer #1 and ReLU activation layer #4. Then the segmentation positions indicated by the segmentation information are: segmentation position #1 and segmentation position #2, as shown in Figures 10 and 11. Figure 10 is a schematic diagram of the segmentation position of the first model in an embodiment of this application, and Figure 11 is a schematic diagram of the segmentation position of the first model in another embodiment of this application. Taking the first model as a convolutional neural network (CNN) as an example, the first model includes multiple convolutional layers and multiple activation function layers. Taking convolutional layer #1 as an example, the weights (W) of the convolutional layer #1 are a matrix of size 192x192x1x7, and the bias (B) of the convolutional layer #1 is 192. Figures 10 and 11 show a partial structure of the first model, where the split position #1 is specifically the output of the activation function layer (ReLU) #1 in the first model, and the split position #2 is specifically the output of the activation function layer (ReLU) #4 in the first model.
需要说明的是,图10示意的模型部分结构与图11示意的模型部分结构属于同一个第一模型。即,第一模型包括的一个分支如图10所示,第一模型包括的另一个分支如图11所示。It should be noted that the partial structure of the model shown in Figure 10 and the partial structure of the model shown in Figure 11 belong to the same first model. That is, one branch of the first model is shown in Figure 10, and another branch of the first model is shown in Figure 11.
可选地,若第一信息和/或第二信息(或者第三信息和/或第四信息)是经过压缩处理后的数据,则配置信息中还可以包括压缩处理相关的配置信息。具体如下,配置信息还包括:第一压缩配置信息,和/或,第二压缩配置信息,其中:第一压缩配置信息用于指示第一信息和/或第三信息的压缩方式以及压缩参数,第二压缩配置信息用于指示第二信息和/或第四信息的压缩方式以及压缩参数。发送端向接收端发送第一信息和/或第三信息,即发送端向接收端发送的前向交互数据包括:第一信息和/或第三信息;接收端向发送端发送第二信息和/或第四信息,即接收端向发送端发送的反向交互数据包括:第二信息和/或第四信息。换言之,第一压缩配置信息用于指示前向交互数据的压缩方式以及压缩参数,第二压缩配置信息用于指示反向交互数据的压缩方式以及压缩参数。Optionally, if the first and/or second (or third and/or fourth) information is compressed data, the configuration information may further include compression-related configuration information. Specifically, the configuration information may also include: first compression configuration information, and/or second compression configuration information, wherein: the first compression configuration information indicates the compression method and compression parameters of the first and/or third information, and the second compression configuration information indicates the compression method and compression parameters of the second and/or fourth information. The sending end sends the first and/or third information to the receiving end, i.e., the forward interaction data sent by the sending end to the receiving end includes: the first and/or third information; the receiving end sends the second and/or fourth information to the sending end, i.e., the reverse interaction data sent by the receiving end to the sending end includes: the second and/or fourth information. In other words, the first compression configuration information indicates the compression method and compression parameters of the forward interaction data, and the second compression configuration information indicates the compression method and compression parameters of the reverse interaction data.
一种可能的实现方式中,前向交互数据与反向交互数据采用相同的压缩方式以及压缩参数,则第一压缩配置信息和第二压缩配置信息是相同的压缩配置信息,该压缩配置信息也可以称为公共压缩配置信息。In one possible implementation, the forward interaction data and the reverse interaction data use the same compression method and compression parameters. In this case, the first compression configuration information and the second compression configuration information are the same compression configuration information, which can also be called common compression configuration information.
另一种可能的实现方式中,前向交互数据与反向交互数据采用不同的压缩方式以及压缩参数,则发送端向接收端发送的配置信息可以包括第一压缩配置信息,和,第二压缩配置信息;或者,接收端向发送端发送的配置信息可以包括第一压缩配置信息,和,第二压缩配置信息;或者,发送端向接收端发送的配置信息包括第一压缩配置信息,接收端向发送端发送的配置信息包括第二压缩配置信息;或者,发送端向接收端发送的配置信息包括第二压缩配置信息,接收端向发送端发送的配置信息包括第一压缩配置信息。In another possible implementation, the forward and reverse interaction data use different compression methods and parameters. In this case, the configuration information sent from the sending end to the receiving end may include first compressed configuration information and second compressed configuration information; or, the configuration information sent from the receiving end to the sending end may include first compressed configuration information and second compressed configuration information; or, the configuration information sent from the sending end to the receiving end includes first compressed configuration information, and the configuration information sent from the receiving end to the sending end includes second compressed configuration information; or, the configuration information sent from the sending end to the receiving end includes second compressed configuration information, and the configuration information sent from the receiving end to the sending end includes first compressed configuration information.
可选地,上述第一压缩配置信息和/或第二压缩配置信息,可以承载于配置信息中;也可以第一压缩配置信息承载于第一信息和/或第三信息,第二压缩配置信息承载于第二信息和/或第四信息;还可以承载于独立的其他信息中,本申请实施例对此不作限制。Optionally, the aforementioned first compression configuration information and/or second compression configuration information may be carried in configuration information; or the first compression configuration information may be carried in first information and/or third information, and the second compression configuration information may be carried in second information and/or fourth information; or it may be carried in other independent information, and the embodiments of this application do not limit this.
关于压缩方式,本申请实施例包括但不限于:不压缩、固定量化、熵编码、固定量化和熵编码、字典压缩,或者,字典压缩和熵编码等。Regarding compression methods, the embodiments of this application include, but are not limited to: no compression, fixed quantization, entropy coding, fixed quantization and entropy coding, dictionary compression, or dictionary compression and entropy coding, etc.
关于压缩参数,本申请实施例包括但不限于:量化参数,该量化参数例如量化边界或者量化比特、熵编码参数,或者,码本维度等。Regarding compression parameters, embodiments of this application include, but are not limited to: quantization parameters, such as quantization boundaries or quantization bits, entropy coding parameters, or codebook dimensions, etc.
示例性地,第一压缩配置信息和/或第二压缩配置信息可以采用索引信息的方式进行指示,根据该索引信息,可以从压缩配置信息表格中确定对应的压缩配置信息。发送端与接收端分别配置该压缩信息表格。该压缩配置信息表格例如表2所示。For example, the first compression configuration information and/or the second compression configuration information can be indicated using index information. Based on this index information, the corresponding compression configuration information can be determined from the compression configuration information table. The sending end and the receiving end configure this compression information table respectively. The compression configuration information table is shown in Table 2, for example.
表2
Table 2
可选地,压缩参数中的码本维度也可以通过索引信息的方式进行指示,根据该索引信息,可以从字典压缩参数表格中确定对应的字典压缩参数(即码本维度)。该字典压缩参数表格如表3所示。Optionally, the codebook dimension in the compression parameters can also be indicated by index information. Based on this index information, the corresponding dictionary compression parameters (i.e., codebook dimensions) can be determined from the dictionary compression parameter table. The dictionary compression parameter table is shown in Table 3.
表3
Table 3
需要说明的是,上述压缩配置信息可以承载于配置信息,也可以随着第一信息、第二信息、第三信息和/或第四信息一并发送,本申请实施例对此不作限制。例如,发送端向接收端发送的前向交互数据包括第一信息和/或第三信息,该前向交互数据还包括第一压缩配置信息。又例如,接收端向发送端发送的反向交互数据包括第二信息和/或第四信息,该反向交互数据还包括第二压缩配置信息。It should be noted that the aforementioned compressed configuration information can be carried within the configuration information, or it can be sent together with the first, second, third, and/or fourth information. This application embodiment does not impose any limitations on this. For example, the forward interaction data sent from the sending end to the receiving end includes the first and/or third information, and this forward interaction data also includes the first compressed configuration information. As another example, the reverse interaction data sent from the receiving end to the sending end includes the second and/or fourth information, and this reverse interaction data also includes the second compressed configuration information.
需要说明的是,上述配置信息所包括的各项信息,可以分开发送也可以一并发送,本申请实施例对此不作限制。It should be noted that the various information items included in the above configuration information can be sent separately or together, and this application embodiment does not impose any restrictions on this.
S2、发送端向接收端发送第一信息和/或第三信息。S2. The sending end sends the first information and/or the third information to the receiving end.
步骤S2中,当发送端与接收端之间根据配置信息配置好第一模型之后,发送端对第一模型进行训练,然后将训练第一模型的过程中第一模型的中间层输出的数据发送给接收端。本申请实施例中,将第一模型的中间层输出的数据称为第一数据。除了第一数据之外,第一信息还包括第一训练标签数据,第一训练标签数据用于更新第一模型。具体的,第一训练标签数据包括:第一训练标签在训练标签集中的索引信息,或者,第一训练标签,其中,训练标签集包括一个或多个训练标签(Training Labels)。该训练标签集即配置信息中训练标签集信息指示的训练标签集。In step S2, after the sending end and receiving end configure the first model according to the configuration information, the sending end trains the first model and then sends the data output by the intermediate layer of the first model during the training process to the receiving end. In this embodiment, the data output by the intermediate layer of the first model is referred to as the first data. In addition to the first data, the first information also includes first training label data, which is used to update the first model. Specifically, the first training label data includes: the index information of the first training label in the training label set, or the first training label, wherein the training label set includes one or more training labels. This training label set is the training label set indicated by the training label set information in the configuration information.
一种可能的实现方式中,发送端可以根据配置信息中的训练标签集信息,确定训练标签集。然后,发送端根据训练标签集中的第一训练标签对第一模型进行训练。发送端将第一模型的中间层输出的数据作为第一数据。在这种可能的实现方式中,第一训练标签数据包括:第一训练标签在训练标签集中的索引信息。In one possible implementation, the sending end can determine the training label set based on the training label set information in the configuration information. Then, the sending end trains the first model based on the first training label in the training label set. The sending end uses the data output from the intermediate layer of the first model as the first data. In this possible implementation, the first training label data includes: the index information of the first training label in the training label set.
另一种可能的实现方式中,当配置信息中不包括训练标签集信息时,发送端与接收端之间通过实时采集训练样本进行训练,该训练样本称为第一训练样本。第一训练样本对应的训练标签为第一训练标签。在这种可能的实现方式中,第一训练标签数据包括:第一训练标签。In another possible implementation, when the configuration information does not include training label set information, the sending and receiving ends train by collecting training samples in real time. These training samples are called the first training samples. The training label corresponding to the first training sample is called the first training label. In this possible implementation, the first training label data includes: the first training label.
在第一模型的训练过程中,为了提升训练效率,发送端可以向接收端发送一个或多个第一数据。一种可能的实现方式中,当配置信息包括的切分信息指示第一模型包括多个切分位置,则多个切分位置输出的多个特征数据(或者其他数据)作为一个第一数据。在第一信息中,一个第一数据所包括的多个数据可以按照输出该第一数据的切分位置的层级大小进行排序。During the training of the first model, to improve training efficiency, the sending end can send one or more first data points to the receiving end. In one possible implementation, when the segmentation information included in the configuration information indicates that the first model includes multiple segmentation positions, the multiple feature data (or other data) output by the multiple segmentation positions are considered as one set of first data. In the first information, the multiple data points included in a single set of first data can be ordered according to the hierarchical size of the segmentation position that outputs the first data.
另一种可能的实现方式中,发送端获取多个第一训练样本,该多个第一训练样本互不相同。然后,发送端使用多个第一训练样本对第一模型进行训练,每次训练使用不同的第一训练样本。在使用多个第一训练样本对第一模型的训练过程中,输出多个第一数据。In another possible implementation, the sending end acquires multiple distinct first training samples. Then, the sending end trains a first model using these multiple first training samples, with each training iteration using a different first training sample. During the training process of the first model using multiple first training samples, multiple sets of first data are output.
可选地,发送端获取多个第一训练样本可以是直接获取多个不同的第一训练样本,例如,发送端通过传感器多次采集得到多个不同的第一训练样本;发送端也可以获取一个公共训练样本之后,对该公共训练样本进行不同的处理得到多个不同的第一训练样本。具体的,对公共训练样本进行不同的处理包括但不限于:数据清洗、数据标准化、数据归一化、特征缩放、编码、特征选择、特征提取、数据增强,或者重采样处理等。Optionally, the sending end can acquire multiple first training samples by directly acquiring multiple different first training samples. For example, the sending end can acquire multiple different first training samples through multiple acquisitions by a sensor. Alternatively, the sending end can acquire a common training sample and then perform different processing on that common training sample to obtain multiple different first training samples. Specifically, different processing on the common training sample includes, but is not limited to: data cleaning, data standardization, data normalization, feature scaling, encoding, feature selection, feature extraction, data augmentation, or resampling.
可选地,发送端获取一个公共训练样本之后,对该公共训练样本进行不同处理得到多个第一数据。Optionally, after obtaining a common training sample, the sending end performs different processing on the common training sample to obtain multiple first data.
另一种可能的实现方式中,发送端获取一个第一训练样本后,发送端使用该第一训练样本对第一模型分别进行一次或多次处理,该处理可以是训练处理。发送端在使用同一个第一训练样本对第一模型进行多次处理的过程中,输出一个或多个第一数据。该多次处理中,每次处理的处理方式和/或处理参数不一致,例如:在每次处理中第一模型的初始化参数不同、在每次处理中第一模型的其他输入信息不同,或者,在每次处理中第一模型的采样操作不同。In another possible implementation, after acquiring a first training sample, the sending end uses this first training sample to process the first model one or more times. This processing can be training processing. During the multiple processing steps using the same first training sample, the sending end outputs one or more first data points. In these multiple processing steps, the processing method and/or processing parameters are inconsistent each time. For example, the initialization parameters of the first model are different in each processing step, other input information of the first model is different in each processing step, or the sampling operation of the first model is different in each processing step.
在发送端获取一个或多个第一数据,以及一个或多个第一训练标签数据之后,发送端向接收端发送第一信息,第一信息包括该一个或多个第一数据以及一个或多个第一训练标签数据。第一信息中,一个或多个第一数据与一个或多个第一训练标签数据可以采用多种不同的分组方式,以提升泛化性。下面分别介绍第一信息的不同分组方式。After acquiring one or more first data points and one or more first training label data points at the sending end, the sending end sends first information to the receiving end. The first information includes the one or more first data points and one or more first training label data points. The one or more first data points and one or more first training label data points in the first information can be grouped in various different ways to improve generalization. The different grouping methods of the first information are described below.
分组方式A中,第一信息包括一个或多个第一数据以及一个第一训练标签数据,其中,该第一训练标签数据包括的第一训练标签(或者指示了第一训练标签的索引信息)对应于该一个或多个第一数据。例如图6所示,图6为本申请实施例中第一信息的一种结构示意图。图6中,第一信息包括n个第一数据以及一个第一训练标签数据,n为大于或等于1的正整数。In grouping method A, the first information includes one or more first data and one first training label data, wherein the first training label data includes a first training label (or index information indicating the first training label) corresponding to the one or more first data. For example, as shown in Figure 6, Figure 6 is a schematic diagram of the structure of the first information in an embodiment of this application. In Figure 6, the first information includes n first data and one first training label data, where n is a positive integer greater than or equal to 1.
分组方式B中,第一信息包括一个或多个第一数据以及多个第一训练标签数据,其中,每个第一数据对应于一个第一训练标签数据。例如图7所示,图7为本申请实施例中第一信息的又一种结构示意图。图7中,第一信息包括n个第一数据以及n个第一训练标签数据,n为大于或等于1的正整数。其中,每个第一数据以及对应的第一训练标签数据划分为一组数据,即该第一信息包括n组数据,每组数据包括一个第一数据以及对应的第一训练标签数据。例如,第n组数据包括第一数据n和第一训练标签数据n。In grouping method B, the first information includes one or more first data points and multiple first training label data points, wherein each first data point corresponds to one first training label data point. For example, as shown in Figure 7, which is another structural diagram of the first information in an embodiment of this application, the first information includes n first data points and n first training label data points, where n is a positive integer greater than or equal to 1. Each first data point and its corresponding first training label data point are divided into a group of data, meaning the first information includes n groups of data, each group including one first data point and its corresponding first training label data point. For example, the nth group of data includes first data point n and first training label data point n.
可选地,发送端还可以告知接收端该第一信息的分组方式以及该第一信息包括的一个或多个第一数据的数量。具体的,第一信息还可以包括:分组指示信息,分组指示信息用于指示第一信息包括的一个或多个第一数据的分组方式;和/或,组数信息,组数信息用于指示第一信息包括的一个或多个第一数据的数量。例如,分组指示信息用于指示第一信息是分组方式A或者是分组方式B。又例如,组数信息指示第一信息包括n组数据(n个第一数据和n个第一训练标签数据,每组数据包括一个第一数据和一个第一训练标签数据)。Optionally, the sending end may also inform the receiving end of the grouping method of the first information and the number of one or more first data points included in the first information. Specifically, the first information may further include: grouping indication information, which indicates the grouping method of the one or more first data points included in the first information; and/or, group number information, which indicates the number of one or more first data points included in the first information. For example, the grouping indication information indicates whether the first information is grouped using grouping method A or grouping using grouping method B. As another example, the group number information indicates that the first information includes n groups of data (n first data points and n first training label data points, each group including one first data point and one first training label data point).
示例性地,第一信息可以增加指示信息用于指示该第一信息是采用分组方式A还是采用分组方式B,例如该指示信息为0时,指示该第一信息采用分组方式A;该指示信息为1时,指示该第一信息采用分组方式B。For example, the first information can be supplemented with indication information to indicate whether the first information adopts grouping method A or grouping method B. For example, when the indication information is 0, it indicates that the first information adopts grouping method A; when the indication information is 1, it indicates that the first information adopts grouping method B.
在步骤S2中,发送端向接收端发送第一信息之外,发送端还可以向接收端发送第三信息。具体的,第三信息包括但不限于:第三数据、第一模型中输入第三数据的输入位置信息、第三数据的维度信息,第一类型指示信息,或者,第一模型的标识信息,第一类型指示信息用于指示第三数据的数据类型,其中,第三数据包括以下任意一项或多项信息:第一信道信息、第一环境信息,或者,多模态信息,第一类型指示信息,第一信道信息为发送端与接收端之间的信道的相关信息,第一环境信息为发送端和接收端所处的环境的相关信息。In step S2, in addition to sending the first information to the receiving end, the sending end may also send third information to the receiving end. Specifically, the third information includes, but is not limited to: third data, input location information of the third data in the first model, dimension information of the third data, first type indication information, or identification information of the first model. The first type indication information is used to indicate the data type of the third data. The third data includes any one or more of the following information: first channel information, first environment information, or multimodal information. The first type indication information is related to the channel between the sending end and the receiving end, and the first environment information is related to the environment in which the sending end and the receiving end are located.
示例性地,本申请实施例中的信道信息包括但不限于:信道容量信息、信道带宽信息、信道噪声信息、信道的调制方式、信道的编码方式、信道状态信息,或者信道的误码率等。For example, the channel information in the embodiments of this application includes, but is not limited to: channel capacity information, channel bandwidth information, channel noise information, channel modulation method, channel coding method, channel state information, or channel bit error rate, etc.
示例性地,本申请实施例中的环境信息包括但不限于:地形信息、气候条件信息、电磁环境信息、网络拓扑信息、网络负载信息、信号质量信息、或者无线频谱信息等。For example, the environmental information in the embodiments of this application includes, but is not limited to: terrain information, climate condition information, electromagnetic environment information, network topology information, network load information, signal quality information, or wireless spectrum information.
示例性地,本申请实施例中的多模态信息包括但不限于:图像信息、点云信息、文本信息或者音频信息等。For example, the multimodal information in the embodiments of this application includes, but is not limited to, image information, point cloud information, text information, or audio information.
示例性地,第一类型指示信息如表4所示。For example, the first type of indication information is shown in Table 4.
表4
Table 4
示例性地,第三数据的维度信息指的是第三数据的数据维度。例如,当第三数据包括第一信道信息时,第一信道信息是J*K的维度矩阵,则第三数据的维度信息是J*K,其中,J为正整数,K为正整数。For example, the dimension information of the third data refers to the data dimension of the third data. For instance, when the third data includes the first channel information, and the first channel information is a J*K dimension matrix, then the dimension information of the third data is J*K, where J is a positive integer and K is a positive integer.
示例性地,第一模型的标识信息用于指示第一模型包括哪些模型和/或模型的类型。第一模型的标识信息例如表5所示。For example, the identification information of the first model is used to indicate which models and/or the types of models included in the first model. The identification information of the first model is shown in Table 5, for example.
表5
Table 5
示例性地,第一模型中输入第三数据的输入位置信息用于指示接收端收到第三信息之后,在第一模型的该输入位置将第三数据输入。为了便于理解,请参阅图12,图12为本申请实施例中第一模型的又一种结构示意图。第一模型中输入第三数据的输入位置信息指示在第一模型的激活函数层(Relu)#6的输出位置输入第三数据。当第三数据还包括第一信道信息和第一环境信息时,接收端收到第三信息(包括第三数据)后,在第一模型的激活函数层(Relu)#6的输出位置输入第三数据包括的第一信道信息和第一环境信息。For example, the input position information for inputting third data in the first model is used to instruct the receiving end to input the third data at that input position in the first model after receiving the third information. For ease of understanding, please refer to Figure 12, which is another structural schematic diagram of the first model in an embodiment of this application. The input position information for inputting third data in the first model instructs the input of third data at the output position of the activation function layer (ReLU) #6 of the first model. When the third data also includes first channel information and first environment information, after receiving the third information (including the third data), the receiving end inputs the first channel information and first environment information included in the third data at the output position of the activation function layer (ReLU) #6 of the first model.
需要说明的是,上述第一信息和第三信息可以承载于同一条消息中,即发送端同时发送第一信息和第三信息;上述第一信息和第三信息也可以承载于不同的消息中,即发送端分别发送第一信息和第三信息,本申请实施例对此不作限制。It should be noted that the first and third information mentioned above can be carried in the same message, that is, the sending end sends the first and third information at the same time; or the first and third information mentioned above can be carried in different messages, that is, the sending end sends the first and third information separately. This application embodiment does not limit this.
下面介绍第一信息和/或第三信息的承载方式:The following describes how the first and/or third information is carried:
实现方式A:第一信息和/或第三信息承载于无线资源控制(radio resource control,RRC)消息。具体如下:采用RRC序列(SEQUENCE)格式的数据信息(DataInfo)字段指示第一信息和/或第三信息的具体类型。Implementation Method A: The first and/or third information is carried in a radio resource control (RRC) message. Specifically, the DataInfo field, in RRC sequence format, indicates the specific type of the first and/or third information.
一种可能的实现方式中,通过DataInfo字段中数据类型的元素是否存在隐式指示数据类型。In one possible implementation, the data type is implicitly indicated by the presence of an element representing the data type in the DataInfo field.
一种示例中,通过如下方式指示前向交互数据所包括的数据的数据类型:“DataInfo::SEQUENCE{In one example, the data type of the data included in the forward interaction data is indicated as follows: "DataInfo::SEQUENCE{
forward-inter-Data ForwardInterData OPTIONAL,forward-inter-Data ForwardInterData OPTIONAL,
model-Data ModelData OPTIONAL,model-Data ModelData OPTIONAL,
}”,其中,“ForwardInterData”指示该RRC消息承载前向交互数据,“ModelData”指示该RRC消息承载模型相关的数据。}”, where “ForwardInterData” indicates that the RRC message carries forward interaction data, and “ModelData” indicates that the RRC message carries model-related data.
另一种示例中,通过如下方式指示前向交互数据所包括的数据的数据类型:“DataInfo::SEQUENCE{In another example, the data type of the data included in the forward interaction data is indicated as follows: "DataInfo::SEQUENCE{
ai-DataInfo AIDataInfo OPTIONAL,ai-DataInfo AIDataInfo OPTIONAL,
sensing-DataInfo SensingDataInfo OPTIONAL,sensing-DataInfo SensingDataInfo OPTIONAL,
}}
AIDataInfo::SEQUENCE{AIDataInfo::SEQUENCE{
Forward-inter-Data ForwardInterData OPTIONAL,Forward-inter-Data ForwardInterData OPTIONAL,
Model-Data ModelData OPTIONAL,Model-Data ModelData OPTIONAL,
}”,具体含义是:该RRC消息承载的是AI数据(AIData),在该AI数据中具体包括前向交互数据和模型相关的数据。The specific meaning of "}" is: This RRC message carries AI data, which specifically includes forward interaction data and model-related data.
另一种可能的实现方式中,通过数据类型(DataType)字段指示某种数据的数据类型是否存在。如果存在则该数据的数据类型为“ture”;反之,如果不存在,则该数据的数据类型为“fales”。In another possible implementation, a DataType field indicates whether a certain data type exists. If it exists, the data type is "true"; otherwise, if it does not exist, the data type is "false".
一种示例中,通过如下方式指示前向交互数据所包括的数据的数据类型:In one example, the data type of the data included in the forward interaction data is indicated in the following way:
“DataType::=SEQUENCE{“DataType::=SEQUENCE{
includeForwardInterData ENUMERATED{true}OPTIONALincludeForwardInterData ENUMERATED{true}OPTIONAL
}}
DataInfo::=SEQUENCE{DataInfo::=SEQUENCE{
Forward-inter-Data ForwardInterDataForward-inter-Data ForwardInterData
}”,其中,“includeForwardInterData ENUMERATED{true}”指示该RRC消息承载了前向交互数据。The `includeForwardInterData ENUMERATED{true}` indicates that the RRC message carries forward interactive data.
又一种示例中,在“forwardInterData”字段中指示以下一项或多项信息:第一模型的标识信息(modelIndicate)、数据配置信息(dataConfig)、分组指示信息(groupMode)、组数信息(groupNum)、特征数据(featureData),或者,其他数据(otherData)。在“dataConfig”字段中指示以下一项或多项信息:其他数据的数据类型(otherDataType)、其他数据的位置信息(otherDataPosition)、其他数据的维度信息(otherDataDim),或者,压缩配置信息(configCompress)。具体如下:“ForwrardInterData::=SEQUENCE{In another example, the "forwardInterData" field indicates one or more of the following information: the identifier of the first model (modelIndicate), data configuration information (dataConfig), grouping indication information (groupMode), number of groups information (groupNum), feature data (featureData), or other data (otherData). The "dataConfig" field indicates one or more of the following information: the data type of other data (otherDataType), the location information of other data (otherDataPosition), the dimension information of other data (otherDataDim), or compression configuration information (configCompress). Specifically: "ForwardInterData::=SEQUENCE{
modelIndicate OCTET STRING,OPTIONAL,modelIndicate OCTET STRING,OPTIONAL,
dataConfig ConfigModel,OPTIONAL,dataConfig ConfigModel,OPTIONAL,
groupMode OCTET STRING,OPTIONAL,groupMode OCTET STRING,OPTIONAL,
groupNum OCTET STRING,OPTIONAL,groupNum OCTET STRING,OPTIONAL,
featureData OCTET STRING,OPTIONAL,featureData OCTET STRING,OPTIONAL,
trainingLabel OCTET STRING,OPTIONAL,trainingLabel OCTET STRING,OPTIONAL,
otherData OCTET STRING,OPTIONALotherData OCTET STRING,OPTIONAL
}}
ConfigModel::=SEQUENCE{ConfigModel::=SEQUENCE{
otherDataType OCTET STRING OPTIONAL,otherDataType OCTET STRING OPTIONAL,
otherDataPosition OCTET STRING OPTIONAL,otherDataPosition OCTET STRING OPTIONAL,
otherDataDim OCTET STRING OPTIONAL,otherDataDim OCTET STRING OPTIONAL,
configCompress OCTET STRING OPTIONALconfigCompress OCTET STRING OPTIONAL
}”。}".
可选地,步骤S1的配置信息也可以承载于RRC消息中,具体的承载方式与第一信息和/或第三信息承载于RRC消息的方式类似,此处不作赘述。Optionally, the configuration information in step S1 can also be carried in an RRC message. The specific carrying method is similar to that of the first information and/or the third information carried in an RRC message, and will not be elaborated here.
实现方式B:第一信息和/或第三信息承载于媒体接入控制(media access control,MAC)消息。Implementation method B: The first and/or third information is carried in a media access control (MAC) message.
一种可能的实现方式中,在MAC消息的MAC子头(subheader),通过逻辑信道标识(Logical Channel ID,LCID)或者扩展逻辑信道标识(extended Logical Channel ID,eLCID)指示该MAC消息承载的数据类型。因此,当第一信息和/或第三信息承载于MAC消息时,可以通过MAC消息的MAC子头的LCID或者eLCID指示第一信息和/或第三数据的数据类型。In one possible implementation, the data type carried by the MAC message is indicated in the MAC subheader using a Logical Channel ID (LCID) or an Extended Logical Channel ID (eLCID). Therefore, when first and/or third information is carried in a MAC message, the data type of the first and/or third information can be indicated by the LCID or eLCID in the MAC subheader.
例如,当前,LCID的取值从0~34已使用,因此,可以使用LCID=35作为第一信息和/或第三信息(即前向交互数据)的数据类型。当接收端接收到MAC消息之后,发现该MAC消息的LCID字段的值为35时,确定该MAC消息承载第一信息和/或第三信息。此外,还可以使用LCID=36作为第二信息和/或第四信息(即反向交互数据)的数据类型。For example, since the values of LCID from 0 to 34 are currently in use, LCID = 35 can be used as the data type for the first and/or third information (i.e., forward interaction data). When the receiving end receives a MAC message and finds that the value of the LCID field of the MAC message is 35, it determines that the MAC message carries the first and/or third information. Furthermore, LCID = 36 can also be used as the data type for the second and/or fourth information (i.e., reverse interaction data).
另一种可能的实现方式中,使用LCID或者eLCID指示MAC消息承载AI数据,例如LCID=35指示该MAC消息承载的数据类型为AI数据。AI数据包括前向交互数据(第一信息和/或第三信息)以及反向交互数据(第二信息和/或第四信息),因此为了区分AI数据中具体承载的是前向交互数据还是反向交互数据,通过MAC消息的类型(type)字段指示。In another possible implementation, LCID or eLCID is used to indicate that the MAC message carries AI data. For example, LCID=35 indicates that the data type carried by the MAC message is AI data. AI data includes forward interaction data (first information and/or third information) and reverse interaction data (second information and/or fourth information). Therefore, in order to distinguish whether the AI data specifically carries forward interaction data or reverse interaction data, the type field of the MAC message is used to indicate this.
需要说明的是,上述类型(type)字段和有效载荷(payload)都可以承载于MAC控制元素(MAC Control Element,MAC CE)上,其中,有效载荷可用于承载AI数据。It should be noted that both the type field and the payload mentioned above can be carried on the MAC Control Element (MAC CE). The payload can be used to carry AI data.
一种示例,请参阅图16,图16为本申请实施例中MAC消息承载前向交互数据的结构示意图。图16中,该MAC消息的LCID=35,指示该MAC消息承载AI数据;该MAC消息的type=0,指示该MAC消息所承载的AI数据具体包括前向交互数据。As an example, please refer to Figure 16, which is a schematic diagram of the structure of a MAC message carrying forward interaction data in an embodiment of this application. In Figure 16, the LCID of the MAC message is 35, indicating that the MAC message carries AI data; the type of the MAC message is 0, indicating that the AI data carried by the MAC message specifically includes forward interaction data.
另一种示例,请参阅图17,图17为本申请实施例中MAC消息承载反向交互数据的结构示意图。图17中,该MAC消息的LCID=35,指示该MAC消息承载AI数据;该MAC消息的type=1,指示该MAC消息所承载的AI数据具体包括反向交互数据。Another example is shown in Figure 17, which is a schematic diagram of the structure of a MAC message carrying reverse interaction data in an embodiment of this application. In Figure 17, the LCID of the MAC message is 35, indicating that the MAC message carries AI data; the type of the MAC message is 1, indicating that the AI data carried by the MAC message specifically includes reverse interaction data.
可选地,步骤S1的配置信息也可以承载于MAC消息中,具体的承载方式与第一信息和/或第三信息承载于MAC消息的方式类似,此处不作赘述。Optionally, the configuration information in step S1 can also be carried in a MAC message. The specific carrying method is similar to that of the first information and/or the third information carried in a MAC message, and will not be elaborated here.
需要说明的是,上述第一信息或者第三信息所包括的各项信息,可以分开发送也可以一并发送,本申请实施例对此不作限制。It should be noted that the information included in the first or third information mentioned above can be sent separately or together, and this application embodiment does not impose any restrictions on this.
例如,图13所示,图13为本申请实施例中前向交互数据的一种结构示意图。发送端向接收端发送的前向交互数据包括:第一信息和第三信息,其中,第一信息包括第一数据1~第一数据n一共n个第一数据以及第一训练标签数据。可选地,图13所示的前向交互数据还可以包括配置信息(请参阅步骤S1)和/或压缩配置信息(例如第一压缩配置信息)。For example, as shown in Figure 13, which is a schematic diagram of a forward interaction data structure in an embodiment of this application, the forward interaction data sent from the sending end to the receiving end includes: first information and third information. The first information includes n first data points (first data 1 to first data n) and first training label data. Optionally, the forward interaction data shown in Figure 13 may also include configuration information (see step S1) and/or compressed configuration information (e.g., first compressed configuration information).
又例如,图14所示,图14为本申请实施例中前向交互数据的又一种结构示意图。发送端向接收端发送的前向交互数据包括:第一信息和第三信息,其中,第一信息包括第一数据1~第一数据n一共n个第一数据、第一训练标签数据1~第一训练标签数据n一共n个第一训练标签数据,以及,第三信息1~第三信息n一共n个第三信息,其中,第一数据与第一训练标签数据和第三信息具有对应关系。可选地,图14所示的前向交互数据还可以包括配置信息(请参阅步骤S1)和/或压缩配置信息(例如第一压缩配置信息)。For example, as shown in Figure 14, which is another structural diagram of forward interaction data in an embodiment of this application, the forward interaction data sent from the sending end to the receiving end includes: first information and third information. The first information includes n first data items (first data 1 to first data n), n first training label data items (first training label data 1 to first training label data n), and n third information items (third information 1 to third information n). The first data items correspond to the first training label data and the third information items. Optionally, the forward interaction data shown in Figure 14 may also include configuration information (see step S1) and/or compressed configuration information (e.g., first compressed configuration information).
可以理解的是,当第一信息包括的一个或多个第一数据共用一个配置信息(例如一个或多个第一数据包括同一个模型输出的数据)时,则第一信息可以只包括一个该配置信息。当第一信息包括的一个或多个第一数据对应不同的配置信息时,则第一信息包括多个配置信息。It is understandable that when one or more pieces of first data included in the first information share a single configuration information (e.g., one or more pieces of first data include data output from the same model), then the first information may include only one such configuration information. When one or more pieces of first data included in the first information correspond to different configuration information, then the first information includes multiple configuration information.
S3、接收端向发送端发送第二信息和/或第四信息。S3. The receiving end sends the second and/or fourth information to the sending end.
步骤S2之后,接收端接收到来自发送端的前向交互数据(包括第一信息和/或第三信息),则接收端根据前向交互数据得到第一模型的训练损失函数(loss)并更新第一模型中的解码模型(即接收端模型)。然后,接收端在反向更新过程中,根据第一模型得到第二数据,第二数据包括第一模型的中间层反向输出的模型更新数据。After step S2, the receiving end receives forward interaction data (including first information and/or third information) from the sending end. The receiving end then obtains the training loss function (loss) of the first model based on the forward interaction data and updates the decoding model (i.e., the receiving end model) in the first model. Then, during the reverse update process, the receiving end obtains second data based on the first model. The second data includes model update data output from the reverse of the intermediate layers of the first model.
示例性地,第二数据包括但不限于:梯度。具体的,该梯度可以是偏置的梯度或者权重的梯度等。For example, the second data includes, but is not limited to, gradients. Specifically, the gradient can be the gradient of the bias or the gradient of the weights, etc.
一种可能的实现方式中,第二信息包括多个第二数据,其中,每个第二数据对应于一个第一数据。例如,图8所示,图8为本申请实施例中第二信息的一种结构示意图。发送端向接收端发送第二信息,该第二信息包括m个第二数据,m为正整数。In one possible implementation, the second information includes multiple second data, where each second data corresponds to a first data. For example, as shown in Figure 8, which is a schematic diagram of the structure of the second information in an embodiment of this application. The sending end sends the second information to the receiving end, which includes m second data, where m is a positive integer.
另一种可能的实现方式中,第二信息包括多个第N二数据的加权值,其中,在第一信息中还包括第一数据的权重。接收端根据一个或多个第一数据确定多个第二数据。然后,接收端根据每个第二数据对应的第一数据的权重,对多个第二数据进行加权处理,得到多个第二数据的加权值。例如,每个第一数据的权重相等,则第二信息包括多个第二数据的平均值。例如,图9所示,图9为本申请实施例中第二信息的另一种结构示意图。发送端向接收端发送第二信息,该第二信息包括m个第二数据的加权值(即第二数据1~第二数据m一共m个第二数据),m为正整数。In another possible implementation, the second information includes weighted values of multiple second-number data points (N), wherein the first information also includes the weights of the first data points. The receiving end determines multiple second-number data points based on one or more first data points. Then, the receiving end performs weighted processing on the multiple second-number data points according to the weights of the first data points corresponding to each second-number data point, to obtain a weighted value for the multiple second-number data points. For example, if the weights of each first data point are equal, then the second information includes the average value of the multiple second-number data points. For example, as shown in Figure 9, Figure 9 is another structural schematic diagram of the second information in an embodiment of this application. The sending end sends the second information to the receiving end, which includes weighted values of m second-number data points (i.e., a total of m second-number data points from second-number data 1 to second-number data m), where m is a positive integer.
在步骤S3中,接收端向发送端发送第二信息之外,接收端还可以向发送端发送第四信息。具体的,第四信息包括但不限于:In step S3, in addition to sending the second information to the sending end, the receiving end can also send a fourth information to the sending end. Specifically, the fourth information includes, but is not limited to:
第四数据、第二类型指示信息、所述第一模型的终止交互指令、所述第四数据的维度信息,或者,所述第一模型的标识信息,所述第二类型指示信息用于指示所述第四数据的数据类型,所述第一模型的终止交互指令用于指示所述发送端停止发送所述第一模型相关的信息,第二信道信息为发送端与接收端之间的信道的相关信息,第二环境信息为发送端和接收端所处的环境的相关信息,其中,所述第四数据包括以下任意一项或多项信息:第二信道信息、第二环境信息、多模态信息、所述接收端在训练所述第一模型的过程中生成的训练残差信息,或者,更新后所述第一模型的性能信息。The fourth data includes: second type indication information, termination interaction instruction of the first model, dimension information of the fourth data, or identification information of the first model. The second type indication information is used to indicate the data type of the fourth data. The termination interaction instruction of the first model is used to instruct the sending end to stop sending information related to the first model. The second channel information is related to the channel between the sending end and the receiving end. The second environment information is related to the environment in which the sending end and the receiving end are located. The fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiving end during the training of the first model, or performance information of the updated first model.
示例性地,第二信道信息与前述步骤S2中的第一信道信息类似,此处不作赘述。For example, the second channel information is similar to the first channel information in step S2 above, and will not be described in detail here.
示例性地,第二环境信息与前述步骤S2中的第一环境信息类似,此处不作赘述。For example, the second environmental information is similar to the first environmental information in step S2 above, and will not be described in detail here.
示例性地,多模态信息与前述步骤S2中的多模态信息类似,此处不作赘述。For example, the multimodal information is similar to the multimodal information in step S2 above, and will not be described in detail here.
示例性地,第二类型指示信息、第四数据的维度信息,或者,第一模型的标识信息与前述步骤S2中的第一类型指示信息、第三数据的维度信息,或者,第一模型的标识信息类似,此处不作赘述。For example, the second type of indication information, the dimension information of the fourth data, or the identification information of the first model are similar to the first type of indication information, the dimension information of the third data, or the identification information of the first model in the aforementioned step S2, and will not be elaborated here.
当发送端接收到接收端在训练第一模型的过程中生成的训练残差信息之后,发送端可以使用该训练残差信息用于训练第一模型。Once the sending end receives the training residual information generated by the receiving end during the training of the first model, the sending end can use this training residual information to train the first model.
当发送端接收到第一模型的终止交互指令之后,发送端停止与接收端之间交互第一模型相关的数据,例如第一信息和/或第三信息。可选地,发送端还可以停止训练第一模型。Upon receiving a termination interaction command from the first model, the sending end ceases exchanging data related to the first model with the receiving end, such as first information and/or third information. Optionally, the sending end may also stop training the first model.
当发送端接收到第一模型的性能信息之后,发送端可以根据该性能信息确定是否继续交互第一模型相关的数据,例如第一信息和/或第三信息。如果发送端根据该性能信息确定第一模型的性能已满足要求,则发送端可以停止与接收端之间交互第一模型相关的数据。可选地,发送端还可以停止训练第一模型。例如,第一模型的性能信息包括:测试损失函数(test loss)、测试准确率(test accuracy)、精确度(Precision)和/或召回率(Recall)等。After receiving the performance information of the first model, the sending end can determine whether to continue exchanging data related to the first model, such as first information and/or third information, based on this performance information. If the sending end determines that the performance of the first model meets the requirements based on the performance information, it can stop exchanging data related to the first model with the receiving end. Optionally, the sending end can also stop training the first model. For example, the performance information of the first model includes: test loss function, test accuracy, precision, and/or recall.
需要说明的是,上述第二信息或者第四信息所包括的各项信息,可以分开发送也可以一并发送,本申请实施例对此不作限制。It should be noted that the various pieces of information included in the second or fourth information mentioned above can be sent separately or together, and this application embodiment does not impose any restrictions on this.
例如,图15所示,图15为本申请实施例中第二信息的一种结构示意图。发送端向接收端发送第二信息和第四信息,其中,该第二信息包括m个第二数据的加权值(即第二数据1~第二数据m一共m个第二数据的加权值),m为正整数。由于m个第二数据中每个第二数据对应一个第四信息,因此该反向交互数据还包括m个第四信息(即第四信息1~第四信息m一共m个第四信息),其中,第四信息1对应第二数据1,以此类推,第四信息m对应第二数据m。For example, as shown in Figure 15, which is a structural schematic diagram of the second information in an embodiment of this application, the sending end sends the second information and the fourth information to the receiving end. The second information includes the weighted values of m second data (i.e., the weighted values of a total of m second data from second data 1 to second data m), where m is a positive integer. Since each of the m second data corresponds to one fourth information, the reverse interaction data also includes m fourth information (i.e., the m fourth information from fourth information 1 to fourth information m), where fourth information 1 corresponds to second data 1, and so on, with fourth information m corresponding to second data m.
可选地,m个第二数据的加权值可以是m个第二数据的平均值。Optionally, the weighted value of the m second data points can be the average of the m second data points.
可选地,图15所示的反向交互数据还可以包括配置信息(请参阅步骤S1)和/或压缩配置信息(例如第二压缩配置信息)。Optionally, the reverse interaction data shown in Figure 15 may also include configuration information (see step S1) and/or compressed configuration information (e.g., second compressed configuration information).
在另一种可能的实现方式中,第二信息包括第二数据1~第二数据m的加权值以及一个第四信息,该第四信息对应于第二数据1~第二数据m一共m个第二数据。In another possible implementation, the second information includes the weighted values of the second data 1 to the second data m and a fourth information corresponding to the m second data from the second data 1 to the second data m.
可以理解的是,当第二信息包括的多个第二数据共用一个配置信息(例如多个第二数据包括同一个模型反向输出的模型更新数据)时,则第二信息可以只包括一个该配置信息。当第二信息包括的多个第二数据对应不同的配置信息时,则第二信息包括多个配置信息。It is understandable that when multiple pieces of second data included in the second information share a single configuration information (e.g., multiple pieces of second data include model update data output from the same model), then the second information may include only one such configuration information. When multiple pieces of second data included in the second information correspond to different configuration information, then the second information includes multiple configuration information.
可选地,在步骤S3之后,如果发送端或者接收端确定还需要继续更新第一模型,则可以重复执行步骤S2~步骤S3,直到第一模型的性能满足业务需求。Optionally, after step S3, if the sending end or the receiving end determines that it is necessary to continue updating the first model, steps S2 to S3 can be repeated until the performance of the first model meets the service requirements.
可选地,在步骤S1~步骤S3的执行过程中,如果发送端确定需要训练其他的模型,则发送端可以向接收端下发该模型的配置信息(类似步骤S1)。然后,发送端与接收端之间交互该模型的相关数据,类似上述步骤S2~步骤S3,此处不作赘述。Optionally, during the execution of steps S1 to S3, if the sending end determines that other models need to be trained, the sending end can send the configuration information of the model to the receiving end (similar to step S1). Then, the sending end and the receiving end exchange the relevant data of the model, similar to steps S2 to S3 above, which will not be elaborated here.
可选地,在步骤S1~步骤S3的执行过程中,如果发送端或者接收端需要修改第一模型的配置信息,则发送端可以向接收端发送更新后的配置信息(类似步骤S1),或者接收端向发送端发送更新后的配置信息(类似步骤S1)。然后,发送端与接收端之间交互更新后的第一模型的相关数据,类似上述步骤S2~步骤S3,此处不作赘述。Optionally, during the execution of steps S1 to S3, if the sending end or the receiving end needs to modify the configuration information of the first model, the sending end can send the updated configuration information to the receiving end (similar to step S1), or the receiving end can send the updated configuration information to the sending end (similar to step S1). Then, the sending end and the receiving end exchange the updated relevant data of the first model, similar to steps S2 to S3 above, which will not be elaborated here.
本申请实施例中,发送端向接收端发送第一信息,第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,其中,第一数据包括第一模型的中间层输出的数据,第一训练标签数据用于更新第一模型。然后,发送端接收该接收端发送的第二信息,第二信息指示第一模型的中间层反向输出的模型更新数据。通过上述方法,实现发送端与接收端协同训练神经网络模型,可以灵活适用于多种不同的场景和任务,提升了兼容性。此外,发送端向接收端发送的第一信息包括一个或多个第一数据以及一个或多个第一训练标签数据,通过向接收端发送丰富的训练相关数据,提升了模型的泛化性。此外,通过定义发送端与接收端之间交互数据的统一交互格式,使得上述发送端与接收端之间可以针对不同的训练场景和任务进行灵活地训练,提升了兼容性。此外,发送端与接收端之间可以支持在一个训练周期内训练多个模型,提升了训练效率。发送端与接收端之间可以支持在一个训练周期内还可以调整训练的模型,提升了训练效率。由于发送端与接收端之间交互的数据支持多种压缩方式,因此可以有效降低传输量,提升通信效率。In this embodiment, the sending end sends first information to the receiving end. The first information includes one or more first data points and one or more first training label data points. The first data points include data output from the intermediate layers of the first model, and the first training label data is used to update the first model. Then, the sending end receives second information sent by the receiving end. The second information instructs the intermediate layers of the first model to output model update data in reverse. Through this method, the sending end and receiving end can collaboratively train a neural network model, which can be flexibly applied to various different scenarios and tasks, improving compatibility. Furthermore, the first information sent by the sending end to the receiving end includes one or more first data points and one or more first training label data points. By sending rich training-related data to the receiving end, the generalization ability of the model is improved. In addition, by defining a unified interaction format for the data exchanged between the sending end and the receiving end, the sending end and the receiving end can flexibly train for different training scenarios and tasks, improving compatibility. Furthermore, the sending end and the receiving end can support training multiple models within a single training cycle, improving training efficiency. The sending end and the receiving end can also adjust the trained model within a single training cycle, further improving training efficiency. Because the data exchanged between the sending and receiving ends supports multiple compression methods, the amount of data transmitted can be effectively reduced, thus improving communication efficiency.
请参阅图18,图18为本申请实施例中一种通信装置1800的结构示意图,该通信装置1800可以实现上述方法实施例中发送端或者接收端的功能,因此也能实现上述方法实施例所具备的有益效果。在本申请实施例中,该通信装置1800可以是发送端或者接收端,也可以是发送端或者接收端内部的集成电路或者元件等,例如芯片、基带芯片、modem芯片、包含modem核的SoC芯片、系统级封装(systemin package,SIP)芯片、通信模组、芯片系统、处理器等。Please refer to Figure 18, which is a schematic diagram of the structure of a communication device 1800 in an embodiment of this application. This communication device 1800 can realize the functions of a transmitting end or a receiving end in the above method embodiments, and therefore can also achieve the beneficial effects of the above method embodiments. In this embodiment, the communication device 1800 can be a transmitting end or a receiving end, or it can be an integrated circuit or component inside the transmitting end or receiving end, such as a chip, baseband chip, modem chip, SoC chip containing a modem core, system-in-package (SIP) chip, communication module, chip system, processor, etc.
需要说明的是,收发单元1802可以包括发送单元和接收单元,分别用于执行发送和接收。It should be noted that the transceiver unit 1802 may include a transmitting unit and a receiving unit, which are used to perform transmitting and receiving respectively.
一种可能的实现方式中,当该装置1800为用于执行图5及相关实施例中发送端或者接收端所执行的方法时,该装置1800包括处理单元1801和收发单元1802;In one possible implementation, when the device 1800 is used to execute the method performed by the transmitting end or the receiving end in FIG5 and related embodiments, the device 1800 includes a processing unit 1801 and a transceiver unit 1802.
一种示例中,该通信装置1800应用于发送端,该通信装置1800包括:In one example, the communication device 1800 is used at a transmitting end, and the communication device 1800 includes:
收发单元1802,用于向接收端发送第一信息,所述第一信息包括一个或多个第一数据,以及,一个或多个第一训练标签数据,所述第一数据包括第一模型的中间层输出的数据,所述第一训练标签数据用于更新所述第一模型;The transceiver unit 1802 is used to send first information to the receiving end. The first information includes one or more first data and one or more first training label data. The first data includes data output from the intermediate layer of the first model. The first training label data is used to update the first model.
收发单元1802,还用于接收所述接收端发送的第二信息,所述第二信息指示所述第一模型的中间层反向输出的模型更新数据。The transceiver unit 1802 is also used to receive second information sent by the receiving end, the second information indicating the model update data output in reverse by the intermediate layer of the first model.
在一种可能的实现方式中,所述第一训练标签数据,包括:In one possible implementation, the first training label data includes:
第一训练标签,或者,所述第一训练标签在训练标签集中的索引信息,所述训练标签集包括一个或多个训练标签。The first training label, or the index information of the first training label in the training label set, wherein the training label set includes one or more training labels.
在一种可能的实现方式中,所述第一信息还包括:In one possible implementation, the first information further includes:
分组指示信息,所述分组指示信息用于指示所述第一信息包括的所述多个所述第一数据的分组方式;Grouping indication information, wherein the grouping indication information is used to indicate the grouping method of the plurality of first data included in the first information;
和/或,组数信息,所述组数信息用于指示所述第一信息包括的所述多个所述第一数据的数量。And/or, group number information, which indicates the number of the plurality of first data included in the first information.
在一种可能的实现方式中,多个第二数据的加权值,每个所述第二数据对应于一个所述第一数据。In one possible implementation, there is a weighted sum of multiple second data, each second data corresponding to one first data.
在一种可能的实现方式中,所述第一信息还包括所述第一数据的权重。In one possible implementation, the first information may also include the weight of the first data.
在一种可能的实现方式中,收发单元1802,还用于获取第一训练样本,所述第一训练样本对应的训练标签为所述第一训练标签;In one possible implementation, the transceiver unit 1802 is further configured to acquire a first training sample, wherein the training label corresponding to the first training sample is the first training label.
处理单元1801,用于使用所述第一训练样本对所述第一模型进行多次处理,输出所述多个所述第一数据,其中,每次所述处理的处理方式和/或处理参数不一致。The processing unit 1801 is used to process the first model multiple times using the first training samples and output the plurality of first data, wherein the processing method and/or processing parameters are different each time.
在一种可能的实现方式中,收发单元1802,还用于获取多个第一训练样本,所述第一训练样本对应的训练标签为所述第一训练标签;In one possible implementation, the transceiver unit 1802 is further configured to acquire multiple first training samples, wherein the training label corresponding to the first training sample is the first training label.
处理单元1801,还用于使用所述多个所述第一训练样本对所述第一模型分别进行一次或多次处理,输出所述多个所述第一数据,其中,每次所述处理使用的所述第一训练样本互不相同。The processing unit 1801 is further configured to process the first model once or multiple times using the plurality of first training samples, and output the plurality of first data, wherein the first training samples used in each processing are different.
在一种可能的实现方式中,收发单元1802,还用于获取公共训练样本;In one possible implementation, the transceiver unit 1802 is also used to acquire public training samples;
处理单元1801,还用于对所述公共训练样本进行处理,生成所述多个所述第一训练样本,所述多个所述第一训练样本互不相同。The processing unit 1801 is further configured to process the common training samples to generate the plurality of first training samples, wherein the plurality of first training samples are different from each other.
在一种可能的实现方式中,收发单元1802,还用于向所述接收端发送第三信息,所述第三信息包括以下任意一项或多项信息:In one possible implementation, the transceiver unit 1802 is further configured to send third information to the receiving end, the third information including any one or more of the following:
第三数据、所述第一模型中输入所述第三数据的输入位置信息、所述第三数据的维度信息、所述第一模型的标识信息,或者,第一类型指示信息,所述第一类型指示信息用于指示所述第三数据的数据类型,其中,所述第三数据包括以下任意一项或多项信息:第一信道信息、第一环境信息,或者,多模态信息。The third data, the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
在一种可能的实现方式中,收发单元1802,还用于接收所述接收端发送的第四信息,所述第四信息包括以下任意一项或多项信息:In one possible implementation, the transceiver unit 1802 is further configured to receive fourth information sent by the receiving end, the fourth information including any one or more of the following:
第四数据、第二类型指示信息、所述第一模型的终止交互指令、所述第四数据的维度信息,或者,所述第一模型的标识信息,所述第二类型指示信息用于指示所述第四数据的数据类型,所述第一模型的终止交互指令用于指示所述发送端停止发送所述第一模型相关的信息,其中,所述第四数据包括以下任意一项或多项信息:第二信道信息、第二环境信息、多模态信息、所述接收端在训练所述第一模型的过程中生成的训练残差信息,或者,更新后所述第一模型的性能信息。The fourth data, the second type indication information, the termination interaction instruction of the first model, the dimension information of the fourth data, or the identification information of the first model, the second type indication information is used to indicate the data type of the fourth data, and the termination interaction instruction of the first model is used to instruct the sending end to stop sending information related to the first model, wherein the fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiving end during the training of the first model, or the performance information of the updated first model.
在一种可能的实现方式中,所述第一模型包括一个或多个切分位置;In one possible implementation, the first model includes one or more segmentation locations;
所述第一数据包括:所述第一模型中所述一个或多个切分位置对应的中间层输出的数据。The first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model.
在一种可能的实现方式中,收发单元1802,还用于向所述接收端发送配置信息,所述配置信息用于配置训练所述第一模型的训练任务,所述配置信息包括以下任意一项或多项:In one possible implementation, the transceiver unit 1802 is further configured to send configuration information to the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
模型信息,其中,所述模型信息包括以下任意一项或多项信息:所述第一模型的标识信息、所述第一模型的结构信息、所述第一模型的训练任务信息、所述第一模型的初始化参数,或者,所述第一模型的待更新参数;Model information, wherein the model information includes any one or more of the following: the identification information of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated.
切分信息,其中,所述切分信息包括:所述第一模型的切分位置数量,和/或,所述切分位置在所述第一模型的位置信息;Segmentation information, wherein the segmentation information includes: the number of segmentation positions of the first model, and/or, the position information of the segmentation positions in the first model;
或者,训练信息,其中,所述训练信息包括以下任意一项或多项信息:训练标签集信息、验证集信息、训练迭代次数、学习率,或者损失函数,所述训练标签集包括一个或多个训练标签;Alternatively, training information, wherein the training information includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
和/或,收发单元1802,还用于接收所述接收端发送的所述配置信息。And/or, the transceiver unit 1802 is also used to receive the configuration information sent by the receiving end.
在一种可能的实现方式中,所述配置信息还包括:In one possible implementation, the configuration information further includes:
第一压缩配置信息,和/或,第二压缩配置信息,其中:First compression configuration information, and/or, second compression configuration information, wherein:
所述第一压缩配置信息用于指示所述第一信息和/或所述第三信息的压缩方式以及压缩参数,The first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
所述第二压缩配置信息用于指示所述第二信息和/或所述第四信息的压缩方式以及压缩参数。The second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
在一种可能的实现方式中,所述第一信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation, the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,所述第二信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在一种可能的实现方式中,所述第三信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation, the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,所述第四信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在另一种示例中,该通信装置1800应用于接收端,该通信装置1800包括:In another example, the communication device 1800 is applied at the receiving end, and the communication device 1800 includes:
收发单元1802,用于接收发送端发送的第一信息,所述第一信息包括一个或多个第一数据,以及,一个或多个第一训练标签数据,所述第一数据包括第一模型的中间层输出的数据,所述第一训练标签数据用于更新所述第一模型;The transceiver unit 1802 is used to receive first information sent by the sender. The first information includes one or more first data and one or more first training label data. The first data includes data output from the intermediate layer of the first model. The first training label data is used to update the first model.
收发单元1802,还用于向所述发送端发送第二信息,所述第二信息指示所述第一模型的中间层反向输出的模型更新数据。The transceiver unit 1802 is also used to send second information to the sending end, the second information indicating the model update data output in reverse by the intermediate layer of the first model.
在一种可能的实现方式中,所述第一训练标签数据,包括:In one possible implementation, the first training label data includes:
第一训练标签,或者,所述第一训练标签在训练标签集中的索引信息,所述训练标签集包括一个或多个训练标签。The first training label, or the index information of the first training label in the training label set, wherein the training label set includes one or more training labels.
在一种可能的实现方式中,所述第一信息还包括:In one possible implementation, the first information further includes:
分组指示信息,所述分组指示信息用于指示所述第一信息包括的所述多个所述第一数据的分组方式;Grouping indication information, wherein the grouping indication information is used to indicate the grouping method of the plurality of first data included in the first information;
和/或,组数信息,所述组数信息用于指示所述第一信息包括的所述多个所述第一数据的数量。And/or, group number information, which indicates the number of the plurality of first data included in the first information.
在一种可能的实现方式中,所述第二信息包括:所述多个第二数据的加权值。In one possible implementation, the second information includes: the weighted value of the plurality of second data.
在一种可能的实现方式中,所述第一信息还包括所述第一数据的权重。In one possible implementation, the first information may also include the weight of the first data.
在一种可能的实现方式中,所述第一信息还包括所述第一数据的权重。In one possible implementation, the first information may also include the weight of the first data.
在一种可能的实现方式中,收发单元1802,还用于接收所述发送端发送的第三信息,所述第三信息包括以下任意一项或多项信息:In one possible implementation, the transceiver unit 1802 is further configured to receive third information transmitted by the transmitting end, the third information including any one or more of the following:
第三数据、所述第一模型中输入所述第三数据的输入位置信息、所述第三数据的维度信息、所述第一模型的标识信息,或者,第一类型指示信息,所述第一类型指示信息用于指示所述第三数据的数据类型,其中,所述第三数据包括以下任意一项或多项信息:第一信道信息、第一环境信息,或者,多模态信息。The third data, the input location information of the third data in the first model, the dimension information of the third data, the identification information of the first model, or the first type indication information, wherein the first type indication information is used to indicate the data type of the third data, wherein the third data includes any one or more of the following information: first channel information, first environment information, or multimodal information.
在一种可能的实现方式中,收发单元1802,还用于向所述发送端发送第四信息,所述第四信息包括以下任意一项或多项信息:In one possible implementation, the transceiver unit 1802 is further configured to send fourth information to the sending end, the fourth information including any one or more of the following:
第四数据、第二类型指示信息、所述第一模型的终止交互指令、所述第四数据的维度信息,或者,所述第一模型的标识信息,所述第二类型指示信息用于指示所述第四数据的数据类型,所述第一模型的终止交互指令用于指示所述发送端停止发送所述第一模型相关的信息,其中,所述第四数据包括以下任意一项或多项信息:第二信道信息、第二环境信息、多模态信息、所述接收端在训练所述第一模型的过程中生成的训练残差信息,或者,更新后所述第一模型的性能信息。The fourth data, the second type indication information, the termination interaction instruction of the first model, the dimension information of the fourth data, or the identification information of the first model, the second type indication information is used to indicate the data type of the fourth data, and the termination interaction instruction of the first model is used to instruct the sending end to stop sending information related to the first model, wherein the fourth data includes any one or more of the following information: second channel information, second environment information, multimodal information, training residual information generated by the receiving end during the training of the first model, or the performance information of the updated first model.
在一种可能的实现方式中,所述第一模型包括一个或多个切分位置;In one possible implementation, the first model includes one or more segmentation locations;
所述第一数据包括:所述第一模型中所述一个或多个切分位置对应的中间层输出的数据;The first data includes: data output from intermediate layers corresponding to one or more segmentation positions in the first model;
和/或,所述第二数据包括:所述第一模型中所述一个或多个切分位置反向输出的模型更新数据。And/or, the second data includes: model update data output in reverse from one or more of the segmentation positions in the first model.
在一种可能的实现方式中,收发单元1802,还用于接收来自所述接收端的配置信息,所述配置信息用于配置训练所述第一模型的训练任务,所述配置信息包括以下任意一项或多项:In one possible implementation, the transceiver unit 1802 is further configured to receive configuration information from the receiving end, the configuration information being used to configure the training task for training the first model, the configuration information including any one or more of the following:
模型信息,其中,所述模型信息包括以下任意一项或多项信息:所述第一模型的标识信息、所述第一模型的结构信息、所述第一模型的训练任务信息、所述第一模型的初始化参数,或者,所述第一模型的待更新参数;Model information, wherein the model information includes any one or more of the following: the identification information of the first model, the structural information of the first model, the training task information of the first model, the initialization parameters of the first model, or the parameters of the first model to be updated.
切分信息,其中,所述切分信息包括:所述第一模型的切分位置数量,和/或,所述切分位置在所述第一模型的位置信息;Segmentation information, wherein the segmentation information includes: the number of segmentation positions of the first model, and/or, the position information of the segmentation positions in the first model;
或者,训练信息,其中,所述训练信息包括以下任意一项或多项信息:训练标签集信息、验证集信息、训练迭代次数、学习率,或者损失函数,所述训练标签集包括一个或多个训练标签;Alternatively, training information, wherein the training information includes any one or more of the following: training label set information, validation set information, number of training iterations, learning rate, or loss function, wherein the training label set includes one or more training labels;
和/或,收发单元1802,还用于向所述发送端发送所述配置信息。And/or, the transceiver unit 1802 is also used to send the configuration information to the sending end.
在一种可能的实现方式中,所述配置信息还包括:In one possible implementation, the configuration information further includes:
第一压缩配置信息,和/或,第二压缩配置信息,其中:First compression configuration information, and/or, second compression configuration information, wherein:
所述第一压缩配置信息用于指示所述第一信息和/或所述第三信息的压缩方式以及压缩参数,The first compression configuration information is used to indicate the compression method and compression parameters of the first information and/or the third information.
所述第二压缩配置信息用于指示所述第二信息和/或所述第四信息的压缩方式以及压缩参数。The second compression configuration information is used to indicate the compression method and compression parameters of the second information and/or the fourth information.
在一种可能的实现方式中,所述第一信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation, the first information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,所述第二信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the second information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在一种可能的实现方式中,所述第三信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息;In one possible implementation, the third information is carried in a Radio Resource Control (RRC) message or a Media Access Control (MAC) message.
和/或,所述第四信息承载于无线资源控制RRC消息,或者,媒体访问控制MAC消息。And/or, the fourth information is carried in a Radio Resource Control (RRC) message, or a Media Access Control (MAC) message.
在一种可能的设计中,当该通信装置1800是终端设备或终端(或者网络设备)中的通信模组时,该处理单元1801的功能可以由一个或多个处理器实现。具体的该处理器可以包括modem芯片,或包含modem核的SoC芯片或SIP芯片。收发单元1802的功能可以由收发机电路来实现。In one possible design, when the communication device 1800 is a terminal device or a communication module within a terminal (or network device), the function of the processing unit 1801 can be implemented by one or more processors. Specifically, the processor may include a modem chip, or a SoC chip or SIP chip containing a modem core. The function of the transceiver unit 1802 can be implemented by transceiver circuitry.
在一种可能的设计中,当该通信装置1800是终端(或者网络设备)中负责通信功能的电路或芯片,如modem芯片或包含modem核的SoC芯片或SIP芯片时,该处理单元1801的功能可以由上述芯片中包括一个或多个处理器或处理器核的电路系统来实现。收发单元1802功能可以由上述芯片上的接口电路或数据收发电路来实现。In one possible design, when the communication device 1800 is a circuit or chip responsible for communication functions in a terminal (or network device), such as a modem chip or a SoC chip or SIP chip containing a modem core, the function of the processing unit 1801 can be implemented by a circuit system in the aforementioned chip that includes one or more processors or processor cores. The function of the transceiver unit 1802 can be implemented by the interface circuit or data transceiver circuit on the aforementioned chip.
需要说明的是,上述通信装置1800的单元的信息执行过程等内容,具体可参见本申请前述所示的方法实施例中的叙述,此处不再赘述。It should be noted that the information execution process of the unit of the above-mentioned communication device 1800 can be specifically described in the method embodiment shown above in this application, and will not be repeated here.
请参阅图19,图19为本申请提供的通信装置1900的另一种示意性结构图,通信装置1900包括逻辑电路1901和输入输出接口1902。其中,通信装置1900可以为芯片或集成电路。Please refer to Figure 19, which is another schematic structural diagram of the communication device 1900 provided in this application. The communication device 1900 includes a logic circuit 1901 and an input/output interface 1902. The communication device 1900 can be a chip or an integrated circuit.
其中,图18所示收发单元1802可以为通信接口,该通信接口可以是图19中的输入输出接口1902,该输入输出接口1902可以包括输入接口和输出接口。或者,该通信接口也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。其中,逻辑电路1901和输入输出接口1902还可以执行任一实施例中发送端或者接收端执行的其他步骤并实现对应的有益效果,此处不再赘述。In Figure 18, the transceiver unit 1802 can be a communication interface, which can be the input/output interface 1902 in Figure 19. The input/output interface 1902 can include an input interface and an output interface. Alternatively, the communication interface can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit. The logic circuit 1901 and the input/output interface 1902 can also perform other steps executed by the transmitting or receiving end in any embodiment and achieve corresponding beneficial effects, which will not be elaborated further here.
在一种可能的实现方式中,图18所示处理单元1801可以为图19中的逻辑电路1901。In one possible implementation, the processing unit 1801 shown in FIG18 can be the logic circuit 1901 in FIG19.
可选的,逻辑电路1901可以是一个处理装置,处理装置的功能可以部分或全部通过软件实现。其中,处理装置的功能可以部分或全部通过软件实现。Optionally, the logic circuit 1901 can be a processing device, the functions of which can be partially or entirely implemented in software.
可选的,处理装置可以包括存储器和处理器,其中,存储器用于存储计算机程序,处理器读取并执行存储器中存储的计算机程序,以执行任意一个方法实施例中的相应处理和/或步骤。Optionally, the processing apparatus may include a memory and a processor, wherein the memory is used to store a computer program, and the processor reads and executes the computer program stored in the memory to perform the corresponding processing and/or steps in any of the method embodiments.
可选地,处理装置可以仅包括处理器。用于存储计算机程序的存储器位于处理装置之外,处理器通过电路/电线与存储器连接,以读取并执行存储器中存储的计算机程序。其中,存储器和处理器可以集成在一起,或者也可以是物理上互相独立的。Optionally, the processing device may consist of only a processor. A memory for storing computer programs is located outside the processing device, and the processor is connected to the memory via circuitry/wires to read and execute the computer programs stored in the memory. The memory and processor may be integrated together or physically independent of each other.
可选地,该处理装置可以是一个或多个芯片,或一个或多个集成电路。例如,处理装置可以是一个或多个现场可编程门阵列(field-programmable gate array,FPGA)、专用集成芯片(application specific integrated circuit,ASIC)、系统芯片(system on chip,SoC)、中央处理器(central processor unit,CPU)、网络处理器(network processor,NP)、数字信号处理电路(digital signal processor,DSP)、微控制器(micro controller unit,MCU),可编程逻辑控制器(programmable logic device,PLD)或其它集成芯片,或者上述芯片或者处理器的任意组合等。Optionally, the processing device may be one or more chips, or one or more integrated circuits. For example, the processing device may be one or more field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), system-on-chips (SoCs), central processing units (CPUs), network processors (NPs), digital signal processors (DSPs), microcontroller units (MCUs), programmable logic controllers (PLDs), or other integrated chips, or any combination of the above chips or processors.
请参阅图20,图20为本申请的实施例提供的上述实施例中所涉及的通信装置2000的结构示意图,该通信装置2000具体可以为上述实施例中的作为发送端(或者接收端)的通信装置,图20所示示例为发送端通过发送端(或者发送端中的部件)实现,或者图20所示示例为接收端通过接收端(或者接收端中的部件)实现。Please refer to Figure 20. Figure 20 is a structural schematic diagram of the communication device 2000 involved in the above embodiments provided by the embodiments of this application. Specifically, the communication device 2000 can be the communication device as the sending end (or receiving end) in the above embodiments. The example shown in Figure 20 is that the sending end is implemented by the sending end (or the component in the sending end), or the example shown in Figure 20 is that the receiving end is implemented by the receiving end (or the component in the receiving end).
其中,该通信装置2000的一种可能的逻辑结构示意图,该通信装置2000可以包括但不限于至少一个处理器2001以及通信端口2002。The present invention is a possible logical structure diagram of the communication device 2000, which may include, but is not limited to, at least one processor 2001 and a communication port 2002.
其中,图18所示收发单元1802可以为通信接口,该通信接口可以是图20中的通信端口2002,该通信端口2002可以包括输入接口和输出接口。或者,该通信端口2002也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。In Figure 18, the transceiver unit 1802 can be a communication interface, which can be the communication port 2002 in Figure 20. The communication port 2002 can include an input interface and an output interface. Alternatively, the communication port 2002 can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit.
进一步可选的,该装置还可以包括存储器2003、总线2004中的至少一个,在本申请的实施例中,该至少一个处理器2001用于对通信装置2000的动作进行控制处理。Further optionally, the device may also include at least one of a memory 2003 and a bus 2004. In embodiments of this application, the at least one processor 2001 is used to control the operation of the communication device 2000.
此外,处理器2001可以是中央处理器单元,通用处理器,数字信号处理器,专用集成电路,现场可编程门阵列或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。该处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,数字信号处理器和微处理器的组合等等。所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Furthermore, the processor 2001 can be a central processing unit, a general-purpose processor, a digital signal processor, an application-specific integrated circuit, a field-programmable gate array, or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. It can implement or execute the various exemplary logic blocks, modules, and circuits described in conjunction with the disclosure of this application. The processor can also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a digital signal processor and a microprocessor, etc. Those skilled in the art will clearly understand that, for the sake of convenience and brevity, the specific working processes of the systems, devices, and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be repeated here.
需要说明的是,图20所示通信装置2000具体可以用于实现前述方法实施例中发送端或者接收端所实现的步骤,并实现发送端或者接收端对应的技术效果,图20所示通信装置的具体实现方式,均可以参考前述方法实施例中的叙述,此处不再一一赘述。It should be noted that the communication device 2000 shown in Figure 20 can be used to implement the steps implemented by the sending end or the receiving end in the aforementioned method embodiments, and to achieve the corresponding technical effects of the sending end or the receiving end. The specific implementation of the communication device shown in Figure 20 can be referred to the description in the aforementioned method embodiments, and will not be repeated here.
又一种示例,请参阅图21,图21为本申请的实施例提供的上述实施例中所涉及的通信装置2100的结构示意图,该通信装置2100具体可以为上述实施例中的作为接收端(或者发送端)的通信装置,图21所示示例为接收端通过接收端(或者接收端中的部件)实现,或者,图21所示示例为发送端通过发送端(或者发送端中的部件)实现。其中,该通信装置的结构可以参考图21所示的结构。Another example is shown in Figure 21, which is a schematic diagram of the communication device 2100 involved in the above embodiments provided by the present application. Specifically, the communication device 2100 can be a communication device serving as a receiving end (or a transmitting end) as described in the above embodiments. The example shown in Figure 21 illustrates a receiving end implemented through a receiving end (or a component within a receiving end), or a transmitting end implemented through a transmitting end (or a component within a transmitting end). The structure of this communication device can be referenced from the structure shown in Figure 21.
通信装置2100包括至少一个处理器2111以及至少一个网络接口2114。进一步可选的,该通信装置还包括至少一个存储器2112、至少一个收发器2113和一个或多个天线2115。处理器2111、存储器2112、收发器2113和网络接口2114相连,例如通过总线相连,在本申请实施例中,该连接可包括各类接口、传输线或总线等,本实施例对此不做限定。天线2115与收发器2113相连。网络接口2114用于使得通信装置通过通信链路,与其它通信设备通信。例如网络接口2114可以包括通信装置与核心网设备之间的网络接口,例如S1接口,网络接口可以包括通信装置和其他通信装置(例如其他接收端,其他发送端或者核心网设备)之间的网络接口,例如X2或者Xn接口。The communication device 2100 includes at least one processor 2111 and at least one network interface 2114. Further optionally, the communication device also includes at least one memory 2112, at least one transceiver 2113, and one or more antennas 2115. The processor 2111, memory 2112, transceiver 2113, and network interface 2114 are connected, for example, via a bus. In this embodiment, the connection may include various interfaces, transmission lines, or buses, etc., and this embodiment is not limited thereto. The antenna 2115 is connected to the transceiver 2113. The network interface 2114 enables the communication device to communicate with other communication devices through a communication link. For example, the network interface 2114 may include a network interface between the communication device and core network equipment, such as an S1 interface; the network interface may also include a network interface between the communication device and other communication devices (e.g., other receivers, other transmitters, or core network equipment), such as an X2 or Xn interface.
其中,图18所示收发单元1802可以为通信接口,该通信接口可以是图21中的网络接口2114,该网络接口2114可以包括输入接口和输出接口。或者,该网络接口2114也可以是收发电路,该收发电路可以包括输入接口电路和输出接口电路。In Figure 18, the transceiver unit 1802 can be a communication interface, which can be the network interface 2114 in Figure 21. The network interface 2114 can include an input interface and an output interface. Alternatively, the network interface 2114 can also be a transceiver circuit, which can include an input interface circuit and an output interface circuit.
处理器2111主要用于对通信协议以及通信数据进行处理,以及对整个通信装置进行控制,执行软件程序,处理软件程序的数据,例如用于支持通信装置执行实施例中所描述的动作。通信装置可以包括基带处理器和中央处理器,基带处理器主要用于对通信协议以及通信数据进行处理,中央处理器主要用于对整个通信装置进行控制,执行软件程序,处理软件程序的数据。图21中的处理器2111可以集成基带处理器和中央处理器的功能,本领域技术人员可以理解,基带处理器和中央处理器也可以是各自独立的处理器,通过总线等技术互联。本领域技术人员可以理解,通信装置可以包括多个基带处理器以适应不同的网络制式,通信装置可以包括多个中央处理器以增强其处理能力,通信装置的各个部件可以通过各种总线连接。该基带处理器也可以表述为基带处理电路或者基带处理芯片。该中央处理器也可以表述为中央处理电路或者中央处理芯片。对通信协议以及通信数据进行处理的功能可以内置在处理器中,也可以以软件程序的形式存储在存储器中,由处理器执行软件程序以实现基带处理功能。The processor 2111 is primarily used to process communication protocols and communication data, control the entire communication device, execute software programs, and process data from the software programs, for example, to support the actions described in the embodiments of the communication device. The communication device may include a baseband processor and a central processing unit (CPU). The baseband processor is primarily used to process communication protocols and communication data, while the CPU is primarily used to control the entire communication device, execute software programs, and process data from the software programs. The processor 2111 in Figure 21 can integrate the functions of both a baseband processor and a CPU. Those skilled in the art will understand that the baseband processor and CPU can also be independent processors interconnected via technologies such as buses. Those skilled in the art will understand that the communication device may include multiple baseband processors to adapt to different network standards, and multiple CPUs to enhance its processing capabilities. The various components of the communication device can be connected via various buses. The baseband processor can also be described as a baseband processing circuit or a baseband processing chip. The CPU can also be described as a central processing circuit or a central processing chip. The function of processing communication protocols and communication data can be built into the processor or stored in memory as a software program, which is then executed by the processor to implement the baseband processing function.
存储器主要用于存储软件程序和数据。存储器2112可以是独立存在,与处理器2111相连。可选的,存储器2112可以和处理器2111集成在一起,例如集成在一个芯片之内。其中,存储器2112能够存储执行本申请实施例的技术方案的程序代码,并由处理器2111来控制执行,被执行的各类计算机程序代码也可被视为是处理器2111的驱动程序。The memory is primarily used to store software programs and data. The memory 2112 can exist independently or be connected to the processor 2111. Optionally, the memory 2112 can be integrated with the processor 2111, for example, integrated within a single chip. The memory 2112 can store program code that executes the technical solutions of the embodiments of this application, and its execution is controlled by the processor 2111. The various types of computer program code being executed can also be considered as drivers for the processor 2111.
图21仅示出了一个存储器和一个处理器。在实际的通信装置中,可以存在多个处理器和多个存储器。存储器也可以称为存储介质或者存储设备等。存储器可以为与处理器处于同一芯片上的存储元件,即片内存储元件,或者为独立的存储元件,本申请实施例对此不做限定。Figure 21 shows only one memory and one processor. In actual communication devices, there can be multiple processors and multiple memories. Memory can also be called storage medium or storage device, etc. Memory can be a storage element on the same chip as the processor, i.e., an on-chip storage element, or it can be a separate storage element; the embodiments of this application do not limit this.
收发器2113可以用于支持通信装置与终端之间射频信号的接收或者发送,收发器2113可以与天线2115相连。收发器2113包括发射机Tx和接收机Rx。具体地,一个或多个天线2115可以接收射频信号,该收发器2113的接收机Rx用于从天线接收该射频信号,并将射频信号转换为数字基带信号或数字中频信号,并将该数字基带信号或数字中频信号提供给该处理器2111,以便处理器2111对该数字基带信号或数字中频信号做进一步的处理,例如解调处理和译码处理。此外,收发器2113中的发射机Tx还用于从处理器2111接收经过调制的数字基带信号或数字中频信号,并将该经过调制的数字基带信号或数字中频信号转换为射频信号,并通过一个或多个天线2115发送该射频信号。具体地,接收机Rx可以选择性地对射频信号进行一级或多级下混频处理和模数转换处理以得到数字基带信号或数字中频信号,该下混频处理和模数转换处理的先后顺序是可调整的。发射机Tx可以选择性地对经过调制的数字基带信号或数字中频信号时进行一级或多级上混频处理和数模转换处理以得到射频信号,该上混频处理和数模转换处理的先后顺序是可调整的。数字基带信号和数字中频信号可以统称为数字信号。Transceiver 2113 can be used to support the reception or transmission of radio frequency (RF) signals between a communication device and a terminal. Transceiver 2113 can be connected to antenna 2115. Transceiver 2113 includes a transmitter Tx and a receiver Rx. Specifically, one or more antennas 2115 can receive RF signals. The receiver Rx of transceiver 2113 receives the RF signals from the antennas, converts the RF signals into digital baseband signals or digital intermediate frequency (IF) signals, and provides the digital baseband signals or IF signals to processor 2111 so that processor 2111 can perform further processing on the digital baseband signals or IF signals, such as demodulation and decoding. Furthermore, the transmitter Tx in transceiver 2113 is also used to receive modulated digital baseband signals or IF signals from processor 2111, convert the modulated digital baseband signals or IF signals into RF signals, and transmit the RF signals through one or more antennas 2115. Specifically, the receiver Rx can selectively perform one or more stages of downmixing and analog-to-digital conversion on the radio frequency signal to obtain a digital baseband signal or a digital intermediate frequency (IF) signal. The order of these downmixing and IF conversion processes is adjustable. The transmitter Tx can selectively perform one or more stages of upmixing and digital-to-analog conversion on the modulated digital baseband signal or digital IF signal to obtain a radio frequency signal. The order of these upmixing and IF conversion processes is also adjustable. The digital baseband signal and the digital IF signal can be collectively referred to as digital signals.
收发器2113也可以称为收发单元、收发机、收发装置等。可选的,可以将收发单元中用于实现接收功能的器件视为接收单元,将收发单元中用于实现发送功能的器件视为发送单元,即收发单元包括接收单元和发送单元,接收单元也可以称为接收机、输入口、接收电路等,发送单元可以称为发射机、发射器或者发射电路等。The transceiver 2113 can also be called a transceiver unit, transceiver, transceiver device, etc. Optionally, the device in the transceiver unit that performs the receiving function can be regarded as the receiving unit, and the device in the transceiver unit that performs the transmitting function can be regarded as the transmitting unit. That is, the transceiver unit includes a receiving unit and a transmitting unit. The receiving unit can also be called a receiver, input port, receiving circuit, etc., and the transmitting unit can be called a transmitter, transmitter, or transmitting circuit, etc.
需要说明的是,图21所示通信装置2100具体可以用于实现前述方法实施例中接收端或者发送端所实现的步骤,并实现接收端或者发送端对应的技术效果,图21所示通信装置2100的具体实现方式,均可以参考前述方法实施例中的叙述,此处不再一一赘述。It should be noted that the communication device 2100 shown in Figure 21 can be used to implement the steps implemented by the receiving end or the sending end in the aforementioned method embodiments, and to achieve the corresponding technical effects of the receiving end or the sending end. The specific implementation of the communication device 2100 shown in Figure 21 can be referred to the description in the aforementioned method embodiments, and will not be repeated here.
本申请实施例还提供一种计算机可读存储介质,该存储介质用于存储一个或多个计算机执行指令,当计算机执行指令被处理器执行时,该处理器执行如前述实施例中发送端或者接收端或第二通信装置可能的实现方式所述的方法。This application also provides a computer-readable storage medium for storing one or more computer-executable instructions. When the computer-executable instructions are executed by a processor, the processor performs the method described in the possible implementations of the sending end, receiving end, or second communication device in the foregoing embodiments.
本申请实施例还提供一种计算机程序产品(或称计算机程序),当计算机程序产品被该处理器执行时,该处理器执行上述发送端或者接收端可能实现方式的方法。This application also provides a computer program product (or computer program) that, when executed by the processor, executes the method described above for the possible implementation of the sending or receiving end.
本申请实施例还提供了一种芯片系统,该芯片系统包括至少一个处理器,用于支持通信装置实现上述通信装置可能的实现方式中所涉及的功能。可选的,所述芯片系统还包括接口电路,所述接口电路为所述至少一个处理器提供程序指令和/或数据。在一种可能的设计中,该芯片系统还可以包括存储器,存储器,用于保存该通信装置必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包含芯片和其他分立器件,其中,该通信装置具体可以为前述方法实施例中发送端或者接收端。This application also provides a chip system including at least one processor for supporting a communication device in implementing the functions involved in the possible implementations of the communication device described above. Optionally, the chip system further includes an interface circuit that provides program instructions and/or data to the at least one processor. In one possible design, the chip system may further include a memory for storing the program instructions and data necessary for the communication device. The chip system may be composed of chips or may include chips and other discrete devices, wherein the communication device may specifically be a transmitting end or a receiving end in the aforementioned method embodiments.
本申请实施例还提供了一种通信系统,该网络系统架构包括上述任一实施例中的发送端或者接收端。This application also provides a communication system, the network system architecture of which includes the sending end or receiving end in any of the above embodiments.
可选地,该通信系统还包括与该发送端或者接收端进行通信的其它通信装置。Optionally, the communication system may also include other communication devices that communicate with the transmitting end or the receiving end.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。某个功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。本领域技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。In the embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods can be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; for instance, the division of units is only a logical functional division, and in actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. Furthermore, the coupling or direct coupling or communication connection shown or discussed may be through some interfaces, indirect coupling or communication connection between devices or units, and may be electrical, mechanical, or other forms. Whether a function is implemented in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of this application.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separate. The components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。Furthermore, the functional units in the various embodiments of this application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit. The integrated unit can be implemented in hardware or as a software functional unit. If the integrated unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application, in essence, or the part that contributes, or all or part of the technical solution, can be embodied in the form of a software product. This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of this application. The aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, read-only memory (ROM), random access memory (RAM), magnetic disks, or optical disks.
Claims (27)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410585456.2 | 2024-05-11 | ||
| CN202410585456.2A CN120935033A (en) | 2024-05-11 | 2024-05-11 | A communication method and related apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025237119A1 true WO2025237119A1 (en) | 2025-11-20 |
Family
ID=97591638
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2025/093003 Pending WO2025237119A1 (en) | 2024-05-11 | 2025-05-07 | Communication method and related apparatus |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN120935033A (en) |
| WO (1) | WO2025237119A1 (en) |
-
2024
- 2024-05-11 CN CN202410585456.2A patent/CN120935033A/en active Pending
-
2025
- 2025-05-07 WO PCT/CN2025/093003 patent/WO2025237119A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| CN120935033A (en) | 2025-11-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2025237119A1 (en) | Communication method and related apparatus | |
| WO2025227699A1 (en) | Communication method and related apparatus | |
| WO2025167443A1 (en) | Communication method and related device | |
| WO2025227700A1 (en) | Communication method and related apparatus | |
| WO2025227698A1 (en) | Communication method and related apparatus | |
| WO2025179919A1 (en) | Communication method and related apparatus | |
| WO2025190244A1 (en) | Communication method and related apparatus | |
| WO2025189831A1 (en) | Communication method and related apparatus | |
| WO2025175756A1 (en) | Communication method and related device | |
| WO2025189860A1 (en) | Communication method and related apparatus | |
| WO2025019990A1 (en) | Communication method and related device | |
| WO2025107835A1 (en) | Communication method and related device | |
| WO2025189861A1 (en) | Communication method and related apparatus | |
| WO2025140282A1 (en) | Communication method and related device | |
| WO2025118759A1 (en) | Communication method and related device | |
| WO2025190246A1 (en) | Communication method and related apparatus | |
| WO2025118980A1 (en) | Communication method and related device | |
| WO2025139534A1 (en) | Communication method and related device | |
| WO2025179920A1 (en) | Communication method and related apparatus | |
| WO2025190252A1 (en) | Communication method and related apparatus | |
| WO2025107649A1 (en) | Communication method and related device | |
| WO2025103115A1 (en) | Communication method and related device | |
| WO2025092160A1 (en) | Communication method and related device | |
| WO2025019989A1 (en) | Communication method and related device | |
| WO2025025193A1 (en) | Communication method and related device |