WO2025140663A1 - Model data acquisition method, apparatus and system - Google Patents
Model data acquisition method, apparatus and system Download PDFInfo
- Publication number
- WO2025140663A1 WO2025140663A1 PCT/CN2024/143477 CN2024143477W WO2025140663A1 WO 2025140663 A1 WO2025140663 A1 WO 2025140663A1 CN 2024143477 W CN2024143477 W CN 2024143477W WO 2025140663 A1 WO2025140663 A1 WO 2025140663A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- node
- data
- original data
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/098—Distributed learning, e.g. federated learning
Definitions
- the present application relates to the field of artificial intelligence (AI), and in particular to a model data acquisition method, device and system.
- AI artificial intelligence
- model training or fine-tuning requires sufficient training data for the corresponding scenarios to ensure the convergence performance of the model, but the measured data obtained on the device side is limited, and it is difficult to ensure model performance only by training with data collected by the device; on the other hand, model training or fine-tuning requires data features that are rich enough to ensure the generalization and robustness of the model, but the measured data obtained on the device side is limited, resulting in limited diversity of data features.
- the central node receives and merges the original local data of multiple devices as a training set to increase the amount of training data and data diversity of the training set. Then, the central node uses the merged training set to perform model training or model update.
- the trained model is deployed on the device side for model inference. Among them, the central node only uses the original local data of multiple devices as the input of the generation model to obtain synthetic data, and the quality of the synthetic data is not high.
- the present application discloses a model data acquisition method, device and system, which can improve the quality of synthetic data.
- the first node or a circuit used for the first node such as a chip, generates a plurality of synthetic data based on the acquired original data and characteristic information of the original data.
- the first node further sends second information to the second node.
- the second information indicates a processing method for the second node to obtain the first information.
- the processing method for the second node to obtain the first information includes a method for acquiring feature information of the original data.
- the first node determines the number of similar synthetic data to be generated according to the priority or probability of each original data, so that the distribution of the synthetic data is consistent with expectations.
- the processing method for the second node to obtain the first information also includes a method for obtaining the data label of the original data.
- the second node Based on the method for obtaining the data label of the original data, the second node processes the original data to obtain the data label of the original data.
- the first node further sends third information to the second node, where the third information indicates a reporting configuration of the first information.
- the first node sends the third information to the second node so that the second node reports based on the reporting configuration of the first information indicated in the third information, so that the data provider and the synthesized data end separated from each other can align their understanding of the data and improve the quality of the synthesized data.
- the reporting configuration includes at least one of the following: the type of the first information, and the data volume of the first information.
- the first node sends a reporting configuration of the type of first information and/or a reporting configuration of the data amount of the first information to the second node, so that the second node sends the corresponding type of first information or the corresponding data amount of first information to the first node, which helps to improve the quality of the synthesized data.
- the first node further sends fourth information to the second node.
- the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
- the joint processing may be, for example, weighted summing of feature information of multiple original data of the same group.
- the first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
- the first information further includes the following type of information: group information of the original data.
- the second node reports the group information of the original data to the first node, and then the first node can jointly process the feature information of multiple original data in the same group.
- the first node further sends at least one subset of the plurality of synthesized data to the second node.
- the first node further receives fifth information from the second node.
- the fifth information indicates an evaluation result of at least one synthesized data in the at least one subset.
- the evaluation result indicates at least one synthesized data that needs to be eliminated in the at least one subset, or indicates at least one synthesized data that does not need to be eliminated in the at least one subset.
- the evaluation result is used to filter the plurality of synthetic data to obtain at least one filtered synthetic data, wherein the at least one filtered synthetic data is used for processing the model.
- the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
- an embodiment of the present application provides a model data acquisition method, which is executed by a first node or a circuit for the first node.
- the method includes: the first node receives sixth information from the second node.
- the sixth information includes the following types of information: original data.
- the first node generates multiple synthetic data based on the sixth information and the characteristic information of the original data.
- the multiple synthetic data are used for model processing. The following description is based on the first node as the execution subject. It can be understood that the first node can also be replaced by a circuit for the first node, such as a chip.
- the first node further receives eleventh information from the third node, where the eleventh information includes the following type of information: characteristic information of the original data.
- the first node further receives ninth information from the second node, the ninth information indicating a method for the first node to obtain the characteristic information of the original data. Further, the first node processes the original data based on the ninth information to obtain the characteristic information of the original data.
- the first node further receives ninth information from the third node, the ninth information indicating a method for the first node to obtain the characteristic information of the original data. Further, the first node processes the original data based on the ninth information to obtain the characteristic information of the original data.
- the characteristic information of the original data is determined based on a method for acquiring the characteristic information of the original data, and the method for acquiring the characteristic information of the original data is determined by the first node.
- the sixth information further includes at least one of the following types of information: the priority of the original data, or a data label of the original data.
- the first node determines the number of similar synthetic data to be generated according to the priority or probability of each original data, so that the distribution of the synthetic data is consistent with expectations.
- the first node also sends eighth information to the second node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
- the second node Based on the method for obtaining the data label of the original data, the second node processes the original data to obtain the data label of the original data.
- the first node further sends seventh information to the second node, where the seventh information indicates a reporting configuration of the sixth information.
- the first node sends the seventh information to the second node so that the second node reports based on the reporting configuration of the sixth information indicated in the seventh information.
- the data provider and the synthesized data end separated from each other can align their understanding of the data, thereby improving the quality of the synthesized data.
- the reporting configuration includes at least one of the following: the type of the sixth information, or the data volume of the sixth information.
- the first node sends a reporting configuration of the type of sixth information and/or a reporting configuration of the data amount of the sixth information to the second node, so that the second node sends the corresponding type of sixth information, or the corresponding data amount of sixth information to the first node, which helps to improve the quality of the synthesized data.
- the first node further sends fourth information to the second node.
- the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
- the joint processing may be, for example, weighted summing of feature information of multiple original data of the same group.
- the first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
- the sixth information further includes the following type of information: group information of the original data.
- the second node reports the group information of the original data to the first node, and then the first node can jointly process the feature information of multiple original data in the same group.
- the first node further sends at least one subset of the plurality of synthesized data to the second node.
- the first node further receives fifth information from the second node.
- the fifth information indicates an evaluation result of at least one synthesized data in the at least one subset.
- the evaluation result indicates at least one synthesized data that needs to be eliminated in the at least one subset, or indicates at least one synthesized data that does not need to be eliminated in the at least one subset.
- the evaluation result is used to filter the plurality of synthetic data to obtain at least one filtered synthetic data, wherein the at least one filtered synthetic data is used for processing the model.
- the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
- an embodiment of the present application provides a model data acquisition method, which is executed by a second node or a circuit for a second node.
- the method includes: sending first information to a first node.
- the first information includes the following types of information: original data and characteristic information of the original data.
- the following description is based on the second node as the execution subject. It can be understood that the second node can also be replaced by a circuit for the second node, such as a chip.
- the second node sends original data and characteristic information of the original data to the first node, and the original data and the characteristic information of the original data are used to generate multiple synthetic data.
- synthetic data that meets diversity requirements and has the same data distribution as the measured data can be generated, which can improve the quality of the synthetic data and thus help improve the performance of model training or updating.
- the second node further receives second information from the first node.
- the second information indicates a processing method by which the second node obtains the first information.
- the processing method by which the second node obtains the first information includes a method for obtaining characteristic information of the original data. Further, the second node obtains the first information based on the second information.
- the first information further includes at least one of the following types of information: a data label of the original data, or a priority of the original data.
- the processing method also includes a method for acquiring data labels of the original data.
- the second node further receives third information from the first node, where the third information indicates a reporting configuration of the first information.
- the reporting configuration includes at least one of the following: the type of the first information, and the data volume of the first information.
- the second node further receives fourth information from the first node, where the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
- the first information further includes the following type of information: group information of the original data.
- the second node further receives at least one subset of the multiple synthetic data from the first node.
- the second node further sends fifth information to the first node, where the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset.
- the evaluation result indicates at least one synthetic data that needs to be eliminated from the at least one subset, or indicates at least one synthetic data that does not need to be eliminated from the at least one subset.
- the evaluation result is determined based on a distance between at least one synthetic data in the at least one subset and the local data.
- the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
- an embodiment of the present application provides a model data acquisition method, which is executed by a second node or a circuit for a second node.
- the method includes: the second node sends sixth information to the first node.
- the sixth information includes the following types of information: original data.
- the second node also receives eighth information from the first node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
- the second node further receives at least one subset of the multiple synthetic data from the first node.
- the second node further sends fifth information to the first node, where the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset.
- the evaluation result indicates at least one synthetic data that needs to be eliminated from the at least one subset, or indicates at least one synthetic data that does not need to be eliminated from the at least one subset.
- the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
- an embodiment of the present application provides a model data acquisition method, which is performed by a third node or a circuit for a third node.
- the method includes: sending eleventh information to a first node, the eleventh information including the following types of information: characteristic information of the original data.
- the first node obtains synthetic data based on the characteristic information of the original data, that is, the characteristic information of the original data is used to generate the synthetic data.
- an embodiment of the present application provides a model data acquisition method, which is executed by a third node or a circuit for a third node.
- the method includes: sending ninth information to a first node, wherein the ninth information indicates a method for the first node to obtain characteristic information of the original data.
- the first node processes the original data based on the ninth information to obtain characteristic information of the original data, that is, the method for obtaining characteristic information of the original data is used to obtain characteristic information of the original data.
- the present application provides a model data acquisition device, which may include a transceiver module and a processing module, as follows:
- a transceiver module configured to receive first information from a second node, wherein the first information includes the following types of information: original data and characteristic information of the original data;
- a processing module is used to generate a plurality of synthetic data based on the first information, and the plurality of synthetic data are used for processing the model.
- the transceiver module is further used to send second information to the second node, where the second information indicates a processing method for the second node to obtain the first information, and the processing method for the second node to obtain the first information includes a method for obtaining characteristic information of the original data.
- the first information further includes the following types of information: a data label of the original data and/or a priority of the original data.
- the processing method for the second node to obtain the first information also includes a method for obtaining the data label of the original data.
- the transceiver module is further configured to send third information to the second node, where the third information indicates a reporting configuration of the first information.
- the reporting configuration includes at least one of the following: a type of the first information, and a data volume of the first information.
- the transceiver module is further configured to send fourth information to the second node, where the fourth information indicates that the device has the ability to jointly process feature information of multiple original data of the same group.
- the first information further includes the following type of information: group information of the original data.
- the transceiver module is further configured to send at least one subset of the plurality of synthesized data to the second node;
- the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
- the transceiver module is used to receive sixth information from the second node, where the sixth information includes the following types of information: original data.
- the first information further includes the following types of information: a data label of the original data and/or a priority of the original data.
- the transceiver module is further configured to receive at least one subset of the plurality of synthesized data from the first node;
- the transceiver module is used to send sixth information to the first node.
- the sixth information includes the following types of information: original data.
- the transceiver module is used to send eleventh information to the first node, where the eleventh information includes the following types of information: characteristic information of original data.
- the present application provides a computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement a method provided in any possible implementation of the first aspect, or a method provided in any possible implementation of the second aspect, or a method provided in any possible implementation of the third aspect.
- FIG6 is a schematic diagram of a flow chart of a method for acquiring model data provided in an embodiment of the present application.
- FIG11 is a schematic diagram of the structure of a model data acquisition device provided in an embodiment of the present application.
- FIG. 12 is a schematic diagram of the structure of another model data acquisition device provided in an embodiment of the present application.
- the present disclosure relates to at least one (item) as follows, indicating one (item) or more (items). More than one (item) refers to two (items) or more than two (items).
- "And/or" describes the association relationship of associated objects, indicating that three relationships may exist.
- a and/or B can represent: A exists alone, A and B exist at the same time, and B exists alone.
- the character "/” generally indicates that the objects associated before and after are in an "or” relationship.
- first, second, etc. may be used to describe each object in the present disclosure, these objects should not be limited to these terms. These terms are only used to distinguish each object from each other.
- the communication system can be a fifth generation (5G) or new radio (NR) system, a long term evolution (LTE) system, an LTE frequency division duplex (FDD) system, an LTE time division duplex (TDD) system, a wireless local area network (WLAN) system, a satellite communication system, a future communication system such as a sixth generation (6G) mobile communication system, or a fusion system of multiple systems.
- 5G fifth generation
- NR new radio
- LTE long term evolution
- FDD LTE frequency division duplex
- TDD LTE time division duplex
- WLAN wireless local area network
- 6G sixth generation
- the technical solution provided by the present application can also be applied to device to device (D2D) communication, vehicle to everything (V2X) communication, machine to machine (M2M) communication, machine type communication (MTC), and Internet of things (IoT) communication system or other communication systems.
- V2X vehicle to everything
- M2M machine to machine
- MTC machine type communication
- IoT Internet of things
- a device in a communication system can send a signal to another device or receive a signal from another device.
- the signal may include information, signaling, or data, etc.
- the device may also be replaced by an entity, a network entity, a network element, a communication device, a communication module, a node, a communication node, etc.
- the present disclosure is described by taking the device as an example.
- the communication system may include at least one terminal device and at least one access network device.
- the access network device may send a downlink signal to the terminal device, and/or the terminal device may send an uplink signal to the access network device.
- multiple terminal devices may also send signals to each other, that is, the signal sending device and the signal receiving device may both be terminal devices.
- the wireless communication system may include multiple network devices (also referred to as access network devices) at the same time, or may include multiple communication devices at the same time.
- a network device may serve one or more communication devices at the same time.
- a communication device may also access one or more network devices at the same time.
- the embodiment of the present application does not limit the number of communication devices and network devices included in the wireless communication system.
- the network device can be an entity on the network side for transmitting or receiving signals.
- the network device can be an access device for the communication device to access the wireless communication system by wireless means, such as the network device can be a base station.
- the base station can broadly cover the following various names, or be replaced with the following names, such as: Node B (NodeB), evolved NodeB (eNB), next generation NodeB (gNB), access network equipment in open radio access network (O-RAN), relay station, access point, transmission point (transmitting and receiving point, TRP), transmitting point (transmitting point, TP), master station (Master eNodeB, MeNB), secondary eNodeB (SeNB), multi-standard radio (Multi-standard radio o, MSR) node, home base station, network controller, access node, wireless node, access point (AP), transmission node, transceiver node, baseband unit (BBU), remote radio unit (RRU), active antenna unit (AAU), remote radio head (RRH), central unit (CU), distributed
- the base station can be a macro base station, a micro base station, a relay node, a donor node or the like, or a combination thereof.
- the network device can also refer to a communication module, a modem or a chip used to be set in the aforementioned device or apparatus.
- the network device may also be a mobile switching center and a device that performs base station functions in device-to-device (D2D), vehicle-to-everything (V2X), and machine-to-machine (M2M) communications, a network-side device in a 6G network, and a device that performs base station functions in future communication systems.
- the network device may support networks with the same or different access technologies. The embodiments of the present application do not limit the specific technology and specific device form used by the network device.
- the network equipment may be fixed or mobile.
- base stations 110a, 110b are stationary and are responsible for wireless transmission and reception in one or more cells from the communication device 120.
- the helicopter or drone 120i shown in FIG. 1 may be configured to act as a mobile base station, and one or more cells may move according to the location of the mobile base station 120i.
- the helicopter or drone (120i) may be configured to act as a communication device that communicates with the base station 110b.
- the communication device used to implement the above access network function can be an access network device, or a network device with some functions of accessing the network, or a device capable of supporting the implementation of the access network function, such as a chip system, a hardware circuit, a software module, or a hardware circuit plus a software module, which can be installed in the access network device or used in combination with the access network device.
- the communication device used to implement the access network device function is an access network device for description as an example.
- the communication device can be an entity on the user side for receiving or transmitting signals, such as a mobile phone.
- the communication device can be used to connect people, objects and machines.
- the communication device can communicate with one or more core networks through a network device.
- the communication device includes a handheld device with a wireless connection function, other processing devices connected to a wireless modem, or a vehicle-mounted device.
- the communication device can be a portable, pocket-sized, handheld, computer-built-in or vehicle-mounted mobile device.
- the communication device 120 can be widely used in various scenarios, such as cellular communication, device-to-device D2D, vehicle-to-all V2X, peer-to-peer (P2P), machine-to-machine M2M, machine type communication MTC, Internet of Things IoT, virtual reality (VR), augmented reality (AR), industrial control, autonomous driving, telemedicine, smart grid, smart furniture, smart office, smart wear, smart transportation, smart city, drone, robot, remote sensing, passive sensing, positioning, navigation and tracking, autonomous delivery and mobility, etc.
- cellular communication device-to-device D2D, vehicle-to-all V2X, peer-to-peer (P2P), machine-to-machine M2M, machine type communication MTC, Internet of Things IoT, virtual reality (VR), augmented reality (AR), industrial control, autonomous driving, telemedicine, smart grid, smart furniture, smart office, smart wear, smart transportation, smart city, drone, robot, remote sensing, passive sensing, positioning, navigation and tracking, autonomous delivery and mobility
- Some examples of communication devices 120 are: 3GPP standard user equipment (UE), fixed equipment, mobile devices, handheld devices, wearable devices, cellular phones, smart phones, Session Initialization Protocol (SIP) phones, laptops, personal computers, smart books, vehicles, satellites, Global Positioning System (GPS) devices, target tracking devices, drones, helicopters, aircraft, ships, remote control devices, smart home devices, industrial equipment, personal communication service (PCS) phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), etc.
- the communication device 120 may be a wireless device in the above-mentioned scenarios or a device used to be set in a wireless device, for example, a communication module, a modem or a chip in the above-mentioned device.
- the communication device may also be referred to as a terminal, a terminal device, a user equipment UE, a mobile station (MS), a mobile terminal (MT), or the like.
- the communication device may also be referred to as a terminal, a terminal device, a user equipment UE, a mobile station (MS), a mobile terminal (MT), or the like.
- the communication device may also be referred to as a communication device in a future wireless communication system.
- the communication device can be used in a dedicated network device or a general device. The embodiments of the present application do not limit the specific technology and specific device form used by the communication device.
- the communication device used to implement the functions of the communication device may be a terminal device, or a terminal device having some of the functions of the above communication device, or a device capable of supporting the implementation of the functions of the above communication device, such as a chip system, which may be installed in the terminal device or used in combination with the terminal device.
- the chip system may be composed of a chip, or may include a chip and other discrete devices.
- the communication device is described as a terminal device or UE as an example.
- a wireless communication system is usually composed of cells, and a base station provides management of the cell.
- the base station provides communication services to multiple mobile stations (MS) in the cell.
- the base station includes a baseband unit (BBU) and a remote radio unit (RRU).
- BBU baseband unit
- RRU remote radio unit
- the BBU and RRU can be placed in different places, for example: the RRU is remote and placed in an area with high traffic volume, and the BBU is placed in a central computer room.
- the BBU and RRU can also be placed in the same computer room.
- the BBU and RRU can also be different components under one rack.
- a cell can correspond to one carrier or component carrier.
- the present disclosure can be applied between a network device and a communication device, between a network device and a network device, or between a communication device and a communication device, that is, between a primary device and a secondary device.
- the primary device can be a network device or a communication device.
- the secondary device can be another network device or a communication device.
- the secondary device can be another communication device.
- AI nodes may also be introduced into the network.
- the method provided by the present invention can be used for communication between access network equipment and terminal equipment, and can also be used for communication between other communication equipment, such as communication between macro base stations and micro base stations in a wireless backhaul link, and communication between two terminal devices in a side link (SL), without limitation.
- other communication equipment such as communication between macro base stations and micro base stations in a wireless backhaul link, and communication between two terminal devices in a side link (SL), without limitation.
- SL side link
- An AI model is an algorithm or computer program that can implement AI functions.
- the AI model represents the mapping relationship between the input and output of the model.
- the type of AI model can be a neural network, linear regression model, decision tree model, support vector machine (SVM), Bayesian network, Q learning model or other machine learning (ML) model.
- An autoencoder is a neural network for unsupervised learning. Its characteristic is that it uses input data as labels, so an autoencoder can also be understood as a neural network for self-supervised learning.
- An autoencoder can be used for data compression and recovery. For example, the encoder in an autoencoder can compress (encode) data A to obtain data B; the decoder in an autoencoder can decompress (decode) data B to recover data A. Alternatively, it can be understood that the decoder is the inverse operation of the encoder.
- the AI model in the embodiment of the present application may be a single-end model, which may be deployed on a terminal device or a network device.
- ground truth usually refers to data that is believed to be accurate or real.
- the training data set is used for training the AI model.
- the training data set may include the input of the AI model, or the input and target output of the AI model.
- the training data set includes one or more training data.
- the training data may include training samples input to the AI model, or may include the target output of the AI model.
- the target output may also be referred to as a label, sample label, or label sample.
- the label is the true value.
- training data sets can include simulation data collected through simulation platforms, experimental data collected in experimental scenarios, or measured data collected in actual communication networks. Due to differences in the geographical environment and channel conditions in which the data is generated, such as indoor and outdoor conditions, mobile speeds, frequency bands, or antenna configurations, the collected data can be classified when acquiring the data. For example, data with the same channel propagation environment and antenna configuration can be grouped together.
- Model training is essentially learning some of its features from the training data.
- AI models such as neural network models
- the AI model is a neural network, and adjusting the model parameters of the neural network includes adjusting at least one of the following parameters: the number of layers, width, weights of neurons, or parameters in the activation function of the neurons of the neural network.
- Inference data can be used as input to the trained AI model for inference of the AI model.
- the inference data is input into the AI model, and the corresponding output is the inference result.
- the design of AI models mainly includes data collection (such as collecting training data and/or inference data), model training, and model inference. It can also include the application of inference results.
- FIG5 shows an AI application framework
- the data source is used to provide training data sets and inference data.
- the AI model is obtained by analyzing or training the training data provided by the data source. Among them, the AI model represents the mapping relationship between the input and output of the model. Learning the AI model through the model training node is equivalent to using the training data to learn the mapping relationship between the input and output of the model.
- the AI model trained in the model training link is used to perform inference based on the inference data provided by the data source to obtain the inference result.
- This link can also be understood as: inputting the inference data into the AI model, and obtaining the output through the AI model, which is the inference result.
- the inference result can indicate: the configuration parameters used (executed) by the execution object, and/or the operation performed by the execution object.
- the inference result is published in the inference result application link.
- the inference result can be uniformly planned by the execution (actor) entity, for example, the execution entity can send the inference result to one or more execution objects (for example, network devices or terminal devices, etc.) for execution.
- the execution entity can also feedback the performance of the model to the data source to facilitate the subsequent implementation of the model update training.
- RI is used to indicate the number of downlink transmission layers recommended by the terminal device
- CQI is used to indicate the modulation and coding mode supported by the current channel conditions determined by the terminal device
- PMI is used to indicate the precoding recommended by the terminal device.
- the number of precoding layers indicated by PMI corresponds to RI.
- the introduction of AI technology into wireless communication networks has resulted in a CSI feedback method based on the AI model.
- the terminal device uses the AI model to compress and feedback the CSI
- the network device uses the AI model to restore the compressed CSI.
- a sequence (such as a bit sequence) is transmitted, and the overhead is lower than that of traditional CSI feedback.
- the encoder in FIG. 4 may be a CSI generator, and the decoder may be a CSI reconstructor.
- the encoder may be deployed in a terminal device, and the decoder may be deployed in a network device.
- the terminal device may generate CSI feedback information z from the CSI original information V through the encoder.
- the terminal device reports a CSI report, which may include the CSI feedback information z.
- the network device may reconstruct the CSI information through the decoder, that is, obtain CSI recovery information V'.
- the training data used to train the AI model includes training samples and sample labels.
- the training samples are channel information determined by the terminal device, and the sample labels are real channel information, i.e., true CSI.
- the training data may only include training samples, or the training samples are sample labels.
- the true CSI may be a high-precision CSI.
- network element A sends information A to network element B
- network element B in each embodiment of the present application can be understood as the destination end of the information A or the intermediate network element in the transmission path between the destination end and the network element B, which may include directly or indirectly sending information to network element B.
- Network element B receives information A from network element A can be understood as the source end of the information A or the intermediate network element in the transmission path between the source end and the network element A, which may include directly or indirectly receiving information from network element A.
- the information may be processed as necessary between the source end and the destination end of the information transmission, such as format changes, etc., but the destination end can understand the valid information from the source end. Similar expressions in the present application can be understood similarly and will not be repeated here.
- FIG. 6 it is a flow chart of a model data acquisition method provided by an embodiment of the present application.
- the method can be applied to the aforementioned model data acquisition system, such as the model data acquisition system shown in Figure 1.
- the example shown in Figure 6 is an example of the execution subject of the interaction diagram using the first node (data synthesis node, or it can be called a central node, a first entity, etc.) and the second node (data providing node, or it can be called a node to be served, a second entity, etc.) as an example.
- It can further include model processing (such as model training or updating, etc.) nodes and model use (such as model reasoning, etc.) nodes.
- the model processing node can be the first node, or it can be other nodes.
- the model use node can be the second node, or it can be other nodes, etc.
- the model data acquisition method shown in Figure 6 may include steps 601-602. Steps 601-602 are as follows:
- a second node sends first information to a first node, where the first information includes the following types of information: original data and characteristic information of the original data.
- the first node receives the first information.
- the first node may be a data synthesis node, or may be referred to as a data generation node, a central node, a first entity, etc.
- the second node may be a data providing node, or may be referred to as a node to be served, a second entity, etc.
- the first node is an access network device, and the second node is a terminal device.
- Data labeling (or data annotation) is part of the preprocessing phase when developing a machine learning (ML) model. It is responsible for identifying raw data (such as images, text files, videos), and then one or more data labels can be added to the raw data to specify the context of the model and help the machine learning model make accurate predictions. Data labeling supports a variety of different machine learning and deep learning use cases, including computer vision and natural language processing (NLP).
- NLP natural language processing
- the priority of the original data can be taken from the predefined P level.
- the meaning of each priority level can be predefined.
- the generation model For example, for the original data with priority level P1, the generation model generates N1 synthetic data similar to it, and for the original data with priority level P2, the generation model generates N2 synthetic data similar to it.
- the number of synthetic data generated by the generation model that is similar to the original data with priority level P1 is x1
- the number of synthetic data generated by the generation model that is similar to the original data with priority level P2 is x2
- the ratio of x1 to x2 is equal to the ratio of N1 to N2, where N1 and N2 are predefined values.
- the way in which the second node obtains the first information also includes a way to obtain the data label of the original data.
- the second information indicates the way or standard by which the second node determines the data label of the data.
- E1 indicates that the data label is determined according to sparsity (for example, according to a threshold of the sparsity of the channel, the channel label is determined to be LOS or NLOS); or, E2: determines the data label according to the delay distribution, etc.
- E4 or indicates that the data label is the RI of the channel, etc.
- the second information may include an acquisition method identifier (for example, E1/E2/E3/E4).
- the method may further include step 605, wherein the first node further sends fourth information to the second node, wherein the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
- the joint processing may, for example, be weighted summation of feature information of multiple original data of the same group.
- the first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
- data generation is performed with a small amount of data, which can simultaneously meet the requirements of high diversity of the synthesized data and close feature distribution of the synthesized data to the feature distribution of the original data.
- the original data is the channel information
- the characteristic information of the original data may include channel angle power spectrum characteristics, channel delay spectrum characteristics, etc. Then the generation model can know that the generated channel should be similar to the original channel information in angle and delay power spectrum distribution, while other characteristic values, such as the randomness of Doppler frequency deviation, can be higher.
- weighted summing of feature information of multiple original data of the same group can improve the diversity of synthesized data.
- the original data is channel information
- the feature information of the original data may include channel angle power spectrum features, channel delay spectrum features, etc.
- the above-mentioned multiple synthetic data can be used as the input of the model, and the input can be used for model processing, such as model training, model updating, model reasoning or model performance monitoring.
- the model training, updating, etc. can be performed at the first node, or the synthesized data can be transmitted to the model processing node, etc., for model training, updating, etc.
- the model can be inferred, or the model can be transmitted to the model using node for model inference. For example, it can be transmitted to the second node for model inference.
- the first node further sends at least a subset of the plurality of synthesized data to the second node.
- At least one subset of the multiple synthetic data may be, for example, a subset of the synthetic data obtained by randomly sampling multiple samples from the multiple synthetic data.
- multiple samples are randomly sampled from the synthetic data obtained according to at least one of the multiple data labels to obtain at least one subset of the synthetic data.
- the data label of the original data has three possible values (for example, data label a, data label b and data label c), one subset of the synthetic data is a subset obtained by randomly sampling multiple samples from the synthetic data obtained by inputting data label a as a condition, another subset of the synthetic data is a subset obtained by randomly sampling multiple samples from the synthetic data obtained by inputting data label b (or data label c) as a condition, and so on.
- a subset of the synthetic data may be obtained by randomly sampling multiple samples S1 from the synthetic data obtained by inputting data label a as a condition, randomly sampling multiple samples S2 from the synthetic data obtained by inputting data label b as a condition, and randomly sampling multiple samples S3 from the synthetic data obtained by inputting data label c as a condition, and then obtaining one of the subsets based on the multiple samples S1, multiple samples S2, and multiple samples S3, and obtaining another subset by repeating the sampling method again.
- the subset may also be obtained in other ways, and this solution does not limit this.
- the second node sends fifth information to the first node based on the received at least one subset.
- the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset.
- the evaluation result indicates at least one synthetic data that needs to be eliminated in the at least one subset, or indicates at least one synthetic data that does not need to be eliminated in the at least one subset. Accordingly, the first node receives the fifth information.
- the synthetic data to be eliminated may be, for example, synthetic data having a different characteristic distribution from the original data of the second node.
- the second node obtains the evaluation result by determining whether the received synthetic data is similar to the actual measured data. For example, the second node calculates the distance between a synthetic data in at least one subset and its local data. If there is a local data such that the distance between the synthetic data and the local data satisfies the first condition, the synthetic data is determined to have the same characteristic distribution as the original data, i.e., it is not eliminated; if the distance between the synthetic data and all local data does not meet the first condition, the synthetic data is determined to have a different characteristic distribution from the original data, i.e., it needs to be eliminated.
- the first condition may be a certain specific range, or a certain threshold, etc.
- the calculation of the distance between the synthetic data and the local data may be to use the norm of the difference between the synthetic data and the local data as the distance between the synthetic data and the local data.
- the data feature information of the synthetic data and the local data is extracted respectively, and the norm of the difference between the data feature information of the synthetic data and the local data is used as the distance between the synthetic data and the local data.
- the first node filters the multiple synthetic data based on the received evaluation result, and thereby obtains at least one synthetic data after filtering.
- the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the plurality of synthesized data.
- the first synthesized data to be eliminated is the synthesized data to be eliminated determined according to the fifth information.
- the first node calculates the distance between the synthetic data to be removed and other synthetic data in the first node. If there is a synthetic data such that the distance between the synthetic data and the synthetic data to be removed is not greater than a certain threshold, it is determined that the synthetic data is also removed.
- the at least one synthetic data after screening is used as the input of the model.
- step 601 For the introduction of this part, please refer to the description of step 601, which will not be repeated here.
- the second node is a third-party device that performs the aforementioned second node-related actions.
- the above step 601 is performed by a third-party device.
- the first node is a network device.
- the above steps 601 and 602 are both performed by the network device.
- the second node is a terminal device.
- the above step 601 is performed by the terminal device.
- the first node includes a network device and a third-party device.
- the above step 602 can be performed by a third-party device, such as an OTT, or a cloud server, and the above step 601 can be performed by the network device.
- the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above step 601.
- the second information indicates the method for obtaining the characteristic information of the original data as an example.
- the second information can also be sent by the third node to the second node (the embodiment shown in Figure 7). That is to say, the third node indicates the method for obtaining the characteristic information of the original data to the second node, so that the second node processes based on the method for obtaining the characteristic information of the original data to obtain the characteristic information of the original data.
- the third node can be a model training node or a model use node, and can also be called a third entity, such as a model training entity or a model use entity.
- the first node may process the original data to obtain the characteristic information of the original data (the embodiment shown in FIG8 and the embodiment shown in FIG9 ) or the third node may process the original data to obtain the characteristic information of the original data (the embodiment shown in FIG10 ).
- FIG. 7 it is a flow chart of another model data acquisition method provided by an embodiment of the present application.
- the example shown in Figure 7 is illustrated by taking the first node (data synthesis node, or it can be called a central node, a first entity, etc.), the second node (data providing node, or it can be called a node to be served, a second entity, etc.) and the third node (model training node or model use node, or it can be called a third entity, such as a model training entity or a model use entity) as an example of the execution subject of the interaction diagram.
- the difference between this example and the example shown in Figure 6 is that the second information is sent by the third node to the second node, rather than by the first node to the second node.
- the model data acquisition method shown in Figure 7 may include steps 701-705. Steps 701-705 are as follows:
- a first node sends third information to a second node, where the third information indicates a reporting configuration of the first information.
- the second node receives the third information.
- the second node processes the original data based on the method for obtaining the characteristic information of the original data from the third node to obtain the characteristic information of the original data. Furthermore, the second node sends the original data and the characteristic information of the original data to the first node. The first node generates multiple synthetic data based on the original data from the second node and the characteristic information of the original data.
- the model training node or the model use node determines the method for obtaining the characteristic information of the original data (or the method for obtaining the data label of the original data) according to the model requirements, which can improve the quality of the synthetic data, thereby helping to improve the performance of model training or updating.
- the second node sends sixth information to the first node, where the sixth information includes the following type of information: original data.
- the first node receives the sixth information.
- the third node sends the eighth information to the second node, the eighth information indicating the processing method of the second node to obtain the sixth information, and the processing method of the second node to obtain the sixth information includes the method of obtaining the data label of the original data.
- This solution does not limit this.
- the second node is a third-party device that performs the aforementioned actions related to the second node.
- the above steps 1001-1003 are performed by a third-party device.
- the third node is a third-party device that performs the aforementioned actions related to the third node.
- the above steps 1003-1006 are performed by a third-party device.
- the first node is a network device.
- the above steps 1001, 1002, 1005-1007 are all performed by the network device.
- the second node is a terminal device.
- the above steps 1001-1003 are performed by the terminal device.
- the third node is an AI node.
- the above steps 1003-1006 are performed by the AI node.
- the first node includes a network device and a third-party device.
- the above step 1007 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 1001, 1002, 1005, and 1006 can be performed by the network device.
- the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 1001, 1002, 1005, and 1006.
- the third node includes a third-party device, and a network device or a terminal device.
- the above step 1004 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 1003, 1005, and 1006 can be performed by a network device or a terminal device.
- the network device or the terminal device can also communicate with the third-party device to transmit the content transmitted in the above steps 1003, 1005, and 1006.
- the second node and the third node in the embodiment of the present application may be located in the same node or in two different nodes, and this solution does not impose any restrictions on this.
- the division of multiple units or modules is only a logical division according to function, and is not used as a limitation on the specific structure of the device.
- some functional modules may be subdivided into more small functional modules, and some functional modules may also be combined into one functional module, but no matter whether these functional modules are subdivided or combined, the general process performed by the device is the same.
- some devices contain a receiving unit and a sending unit.
- the sending unit and the receiving unit can also be integrated into a communication unit, which can implement the functions implemented by the receiving unit and the sending unit.
- each unit corresponds to its own program code (or program instruction), and when the program code corresponding to each of these units is run on the processor, the unit is controlled by the processing unit to execute the corresponding process to implement the corresponding function.
- a model data acquisition device including a module (or means) for implementing each step performed by the first node in any of the above methods.
- a model data acquisition device including a module (or means) for implementing each step performed by the second node in any of the above methods.
- a model data acquisition device is provided, including a module (or means) for implementing each step performed by the third node in any of the above methods.
- the memory 1203 may be located in the one or more processors, or located outside the one or more processors, or may include a storage part located in the one or more processors and a storage part located outside the one or more processors.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
Description
本申请要求在2023年12月29日提交中国国家知识产权局、申请号为202311871100.7的中国专利申请的优先权,发明名称为“模型数据获取方法及装置、系统”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application filed with the State Intellectual Property Office of China on December 29, 2023, with application number 202311871100.7, and priority to the Chinese patent application with the invention name “Model Data Acquisition Method, Device and System”, all contents of which are incorporated by reference in this application.
本申请涉及人工智能(Artificial Intelligence,AI)领域,尤其涉及一种模型数据获取方法及装置、系统。The present application relates to the field of artificial intelligence (AI), and in particular to a model data acquisition method, device and system.
随着人工智能研究的发展,神经网络的应用场景不断扩展,包括智慧医疗、智能网络等。因此,如何有效进行特定场景的神经网络的训练成为重要的研究方向。神经网络的训练中三个重要的因素分别为数据、神经网络与计算平台。随着图形处理器(Graphic Processing Unit,GPU)部署的普及,具备神经网络计算平台的设备的占比逐渐增加,因此面向工作者的专有神经网络训练需求增加。With the development of artificial intelligence research, the application scenarios of neural networks continue to expand, including smart medical care, smart networks, etc. Therefore, how to effectively train neural networks for specific scenarios has become an important research direction. The three important factors in neural network training are data, neural networks, and computing platforms. With the popularization of graphics processing units (GPUs), the proportion of devices equipped with neural network computing platforms has gradually increased, so the demand for proprietary neural network training for workers has increased.
训练数据的质量是影响模型训练和微调性能的重要因素。而面向分立设备,例如终端侧/特定基站侧,部署的专用模型/特定场景专用模型的训练或微调,训练数据的获取有很高的挑战性。一方面是因为模型训练或微调需要充分多对应场景的训练数据,以确保模型的收敛性能,但是设备端获取的实测数据有限,仅利用设备采集的数据训练难以确保模型性能;另一方面是因为模型训练或微调需要数据特征足够丰富,以确保模型的泛化性和鲁棒性,但是设备端获取的实测数据有限,导致数据特征多样性有限。The quality of training data is an important factor affecting the performance of model training and fine-tuning. However, for the training or fine-tuning of dedicated models/dedicated models for specific scenarios deployed on discrete devices, such as the terminal side/specific base station side, it is very challenging to obtain training data. On the one hand, model training or fine-tuning requires sufficient training data for the corresponding scenarios to ensure the convergence performance of the model, but the measured data obtained on the device side is limited, and it is difficult to ensure model performance only by training with data collected by the device; on the other hand, model training or fine-tuning requires data features that are rich enough to ensure the generalization and robustness of the model, but the measured data obtained on the device side is limited, resulting in limited diversity of data features.
现有技术中,中心节点接收并合并多个设备的原始本地数据作为训练集,以提高训练集的训练数据量与数据多样性。进而,中心节点以合并的训练集进行模型训练或模型更新。训练完成的模型部署于设备端进行模型推断。其中,中心节点仅根据多个设备的原始本地数据作为生成模型的输入来得到合成数据,合成数据质量不高。In the prior art, the central node receives and merges the original local data of multiple devices as a training set to increase the amount of training data and data diversity of the training set. Then, the central node uses the merged training set to perform model training or model update. The trained model is deployed on the device side for model inference. Among them, the central node only uses the original local data of multiple devices as the input of the generation model to obtain synthetic data, and the quality of the synthetic data is not high.
本申请公开了一种模型数据获取方法及装置、系统,可以提高合成数据的质量。The present application discloses a model data acquisition method, device and system, which can improve the quality of synthetic data.
第一方面,本申请实施例提供一种模型数据获取方法,由第一节点或用于第一节点的电路执行。该方法包括:接收来自第二节点的第一信息。该第一信息包括以下类型的信息:原始数据和所述原始数据的特征信息。以及基于所述第一信息,生成多个合成数据。该多个合成数据用于模型的处理。In a first aspect, an embodiment of the present application provides a model data acquisition method, which is performed by a first node or a circuit for a first node. The method includes: receiving first information from a second node. The first information includes the following types of information: original data and feature information of the original data. And based on the first information, generating multiple synthetic data. The multiple synthetic data are used for model processing.
本申请实施例,第一节点或用于第一节点的电路,如芯片,基于获取的原始数据和原始数据的特征信息,生成多个合成数据。In an embodiment of the present application, the first node or a circuit used for the first node, such as a chip, generates a plurality of synthetic data based on the acquired original data and characteristic information of the original data.
基于原始数据和原始数据的特征信息可以生成满足多样性要求且数据分布与实测数据分布相同的合成数据,提高了合成数据的质量,进而有助于提高模型训练或更新的性能。以下以第一节点作为执行主体进行描述,可以理解的是,第一节点也可以替换为用于第一节点的电路,如芯片。Based on the original data and the feature information of the original data, synthetic data that meets the diversity requirements and has the same data distribution as the measured data distribution can be generated, thereby improving the quality of the synthetic data, and thus helping to improve the performance of model training or updating. The following description is based on the first node as the execution subject. It can be understood that the first node can also be replaced by a circuit for the first node, such as a chip.
在一种可能的实现方式中,第一节点还向第二节点发送第二信息。该第二信息指示所述第二节点得到所述第一信息的处理方式。其中,所述第二节点得到所述第一信息的处理方式包括所述原始数据的特征信息的获取方式。In a possible implementation, the first node further sends second information to the second node. The second information indicates a processing method for the second node to obtain the first information. The processing method for the second node to obtain the first information includes a method for acquiring feature information of the original data.
基于该第二信息的指示,以便第二节点对原始数据进行处理,得到原始数据的特征信息。Based on the instruction of the second information, the second node processes the original data to obtain characteristic information of the original data.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息中的至少一项:所述原始数据的优先级、所述原始数据的数据标签。In a possible implementation manner, the first information further includes at least one of the following types of information: a priority of the original data and a data label of the original data.
通过上报原始数据的优先级,以便第一节点根据每个原始数据的优先级或者概率大小,确定要生成的与之相似的合成数据的数量等,以使得合成数据的分布与预期一致。By reporting the priority of the original data, the first node determines the number of similar synthetic data to be generated according to the priority or probability of each original data, so that the distribution of the synthetic data is consistent with expectations.
在一种可能的实现方式中,所述第二节点得到所述第一信息的处理方式还包括所述原始数据的数据标签的获取方式。In a possible implementation manner, the processing method for the second node to obtain the first information also includes a method for obtaining the data label of the original data.
基于该原始数据的数据标签的获取方式,以便第二节点对原始数据进行处理,得到原始数据的数据标签。Based on the method for obtaining the data label of the original data, the second node processes the original data to obtain the data label of the original data.
在一种可能的实现方式中,第一节点还向所述第二节点发送第三信息,所述第三信息指示所述第一信息的上报配置。In a possible implementation manner, the first node further sends third information to the second node, where the third information indicates a reporting configuration of the first information.
第一节点通过向第二节点发送该第三信息,以便第二节点基于第三信息中指示的第一信息的上报配置进行上报。这样使得相互分离的数据提供端与合成数据端可以对齐对数据的理解,提高合成数据的质量。The first node sends the third information to the second node so that the second node reports based on the reporting configuration of the first information indicated in the third information, so that the data provider and the synthesized data end separated from each other can align their understanding of the data and improve the quality of the synthesized data.
可选的,所述上报配置包括以下至少一项:所述第一信息的类型,所述第一信息的数据量。Optionally, the reporting configuration includes at least one of the following: the type of the first information, and the data volume of the first information.
第一节点通过向第二节点发送第一信息的类型的上报配置和/或第一信息的数据量的上报配置,以便第二节点向第一节点发送对应的第一信息的类型,或者对应数据量的第一信息,有助于提高合成数据的质量。The first node sends a reporting configuration of the type of first information and/or a reporting configuration of the data amount of the first information to the second node, so that the second node sends the corresponding type of first information or the corresponding data amount of first information to the first node, which helps to improve the quality of the synthesized data.
在一种可能的实现方式中,第一节点还向所述第二节点发送第四信息。所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。该联合处理,例如可以是将相同分组的多个原始数据的特征信息加权求和等。In a possible implementation, the first node further sends fourth information to the second node. The fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group. The joint processing may be, for example, weighted summing of feature information of multiple original data of the same group.
第一节点通过向第二节点发送该第四信息,以便第二节点可以将可进行联合处理的原始数据划分为同一组,而将不可进行联合处理的原始数据不划分为同一组,从而可以避免将不同组的原始数据做融合,而生成不符合预期的合成数据。The first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the first information further includes the following type of information: group information of the original data.
第二节点向第一节点上报原始数据的组信息,进而第一节点可将相同分组的多个原始数据的特征信息进行联合处理。The second node reports the group information of the original data to the first node, and then the first node can jointly process the feature information of multiple original data in the same group.
在一种可能的实现方式中,第一节点还向所述第二节点发送所述多个合成数据中的至少一个子集。第一节点还接收来自所述第二节点的第五信息。所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the first node further sends at least one subset of the plurality of synthesized data to the second node. The first node further receives fifth information from the second node. The fifth information indicates an evaluation result of at least one synthesized data in the at least one subset. The evaluation result indicates at least one synthesized data that needs to be eliminated in the at least one subset, or indicates at least one synthesized data that does not need to be eliminated in the at least one subset.
可选的,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据。其中,所述筛选后的至少一个合成数据用于所述模型的处理。Optionally, the evaluation result is used to filter the plurality of synthetic data to obtain at least one filtered synthetic data, wherein the at least one filtered synthetic data is used for processing the model.
在一种可能的实现方式中,所述筛选后的至少一个合成数据是基于待剔除的第一合成数据与所述多个合成数据中的其他合成数据之间的距离确定的,所述待剔除的第一合成数据为基于所述第五信息确定的需要剔除的合成数据。In a possible implementation, the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
通过合成数据的评估过程对合成数据进行筛选,可以有效提高合成数据的质量。By screening the synthetic data through the synthetic data evaluation process, the quality of the synthetic data can be effectively improved.
或者,本申请实施例提供一种模型数据获取方法,由第一节点或用于第一节点的电路执行。该方法包括:第一节点接收来自第二节点的第六信息。所述第六信息包括以下类型的信息:原始数据。以及第一节点基于所述第六信息和原始数据的特征信息,生成多个合成数据。该多个合成数据用于模型的处理。以下以第一节点作为执行主体进行描述,可以理解的是,第一节点也可以替换为用于第一节点的电路,如芯片。Alternatively, an embodiment of the present application provides a model data acquisition method, which is executed by a first node or a circuit for the first node. The method includes: the first node receives sixth information from the second node. The sixth information includes the following types of information: original data. And the first node generates multiple synthetic data based on the sixth information and the characteristic information of the original data. The multiple synthetic data are used for model processing. The following description is based on the first node as the execution subject. It can be understood that the first node can also be replaced by a circuit for the first node, such as a chip.
在第一种可能的实现方式中,第一节点还接收来自第三节点的第十一信息,所述第十一信息包括以下类型的信息:所述原始数据的特征信息。In a first possible implementation manner, the first node further receives eleventh information from the third node, where the eleventh information includes the following type of information: characteristic information of the original data.
在第二种可能的实现方式中,第一节点还接收来自所述第二节点的第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。进而,第一节点基于所述第九信息对所述原始数据进行处理得到所述原始数据的特征信息。In a second possible implementation, the first node further receives ninth information from the second node, the ninth information indicating a method for the first node to obtain the characteristic information of the original data. Further, the first node processes the original data based on the ninth information to obtain the characteristic information of the original data.
在第三种可能的实现方式中,第一节点还接收来自第三节点的第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。进而,第一节点基于所述第九信息对所述原始数据进行处理得到所述原始数据的特征信息。In a third possible implementation, the first node further receives ninth information from the third node, the ninth information indicating a method for the first node to obtain the characteristic information of the original data. Further, the first node processes the original data based on the ninth information to obtain the characteristic information of the original data.
在第四种可能的实现方式中,所述原始数据的特征信息是基于原始数据的特征信息的获取方式确定的,所述原始数据的特征信息的获取方式是所述第一节点确定的。In a fourth possible implementation manner, the characteristic information of the original data is determined based on a method for acquiring the characteristic information of the original data, and the method for acquiring the characteristic information of the original data is determined by the first node.
在一种可能的实现方式中,所述第六信息还包括以下类型的信息中的至少一项:所述原始数据的优先级、或,所述原始数据的数据标签。In a possible implementation manner, the sixth information further includes at least one of the following types of information: the priority of the original data, or a data label of the original data.
通过上报原始数据的优先级,以便第一节点根据每个原始数据的优先级或者概率大小,确定要生成的与之相似的合成数据的数量等,以使得合成数据的分布与预期一致。By reporting the priority of the original data, the first node determines the number of similar synthetic data to be generated according to the priority or probability of each original data, so that the distribution of the synthetic data is consistent with expectations.
在一种可能的实现方式中,第一节点还向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。In a possible implementation, the first node also sends eighth information to the second node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
基于该原始数据的数据标签的获取方式,以便第二节点对原始数据进行处理,得到原始数据的数据标签。Based on the method for obtaining the data label of the original data, the second node processes the original data to obtain the data label of the original data.
在一种可能的实现方式中,第一节点还向所述第二节点发送第七信息,该第七信息指示第六信息的上报配置。In a possible implementation manner, the first node further sends seventh information to the second node, where the seventh information indicates a reporting configuration of the sixth information.
第一节点通过向第二节点发送该第七信息,以便第二节点基于第七信息中指示的第六信息的上报配置进行上报。这样使得相互分离的数据提供端与合成数据端可以对齐对数据的理解,提高合成数据的质量。The first node sends the seventh information to the second node so that the second node reports based on the reporting configuration of the sixth information indicated in the seventh information. In this way, the data provider and the synthesized data end separated from each other can align their understanding of the data, thereby improving the quality of the synthesized data.
可选的,所述上报配置包括以下至少一项:所述第六信息的类型,或,所述第六信息的数据量。Optionally, the reporting configuration includes at least one of the following: the type of the sixth information, or the data volume of the sixth information.
第一节点通过向第二节点发送第六信息的类型的上报配置和/或第六信息的数据量的上报配置,以便第二节点向第一节点发送对应的第六信息的类型,或者对应数据量的第六信息,有助于提高合成数据的质量。The first node sends a reporting configuration of the type of sixth information and/or a reporting configuration of the data amount of the sixth information to the second node, so that the second node sends the corresponding type of sixth information, or the corresponding data amount of sixth information to the first node, which helps to improve the quality of the synthesized data.
在一种可能的实现方式中,第一节点还向所述第二节点发送第四信息。所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。该联合处理,例如可以是将相同分组的多个原始数据的特征信息加权求和等。In a possible implementation, the first node further sends fourth information to the second node. The fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group. The joint processing may be, for example, weighted summing of feature information of multiple original data of the same group.
第一节点通过向第二节点发送该第四信息,以便第二节点可以将可进行联合处理的原始数据划分为同一组,而将不可进行联合处理的原始数据不划分为同一组,从而可以避免将不同组的原始数据做融合,而生成不符合预期的合成数据。The first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
在一种可能的实现方式中,所述第六信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the sixth information further includes the following type of information: group information of the original data.
第二节点向第一节点上报原始数据的组信息,进而第一节点可将相同分组的多个原始数据的特征信息进行联合处理。The second node reports the group information of the original data to the first node, and then the first node can jointly process the feature information of multiple original data in the same group.
在一种可能的实现方式中,第一节点还向所述第二节点发送所述多个合成数据中的至少一个子集。第一节点还接收来自所述第二节点的第五信息。所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the first node further sends at least one subset of the plurality of synthesized data to the second node. The first node further receives fifth information from the second node. The fifth information indicates an evaluation result of at least one synthesized data in the at least one subset. The evaluation result indicates at least one synthesized data that needs to be eliminated in the at least one subset, or indicates at least one synthesized data that does not need to be eliminated in the at least one subset.
可选的,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据。其中,所述筛选后的至少一个合成数据用于所述模型的处理。Optionally, the evaluation result is used to filter the plurality of synthetic data to obtain at least one filtered synthetic data, wherein the at least one filtered synthetic data is used for processing the model.
在一种可能的实现方式中,所述筛选后的至少一个合成数据是基于待剔除的第一合成数据与所述多个合成数据中的其他合成数据之间的距离确定的,所述待剔除的第一合成数据为基于所述第五信息确定的需要剔除的合成数据。In a possible implementation, the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
通过合成数据的评估过程对合成数据进行筛选,可以有效提高合成数据的质量。By screening the synthetic data through the synthetic data evaluation process, the quality of the synthetic data can be effectively improved.
第二方面,本申请实施例提供一种模型数据获取方法,由第二节点或用于第二节点的电路执行。该方法包括:向第一节点发送第一信息。该第一信息包括以下类型的信息:原始数据和所述原始数据的特征信息。以下以第二节点作为执行主体进行描述,可以理解的是,第二节点也可以替换为用于第二节点的电路,如芯片。In a second aspect, an embodiment of the present application provides a model data acquisition method, which is executed by a second node or a circuit for a second node. The method includes: sending first information to a first node. The first information includes the following types of information: original data and characteristic information of the original data. The following description is based on the second node as the execution subject. It can be understood that the second node can also be replaced by a circuit for the second node, such as a chip.
本申请实施例,第二节点向第一节点发送原始数据和所述原始数据的特征信息,所述原始数据和所述原始数据的特征信息用于多个合成数据的生成。In an embodiment of the present application, the second node sends original data and characteristic information of the original data to the first node, and the original data and the characteristic information of the original data are used to generate multiple synthetic data.
基于原始数据和原始数据的特征信息可以生成满足多样性要求且数据分布与实测数据分布相同的合成数据,可以提高合成数据的质量,进而有助于提高模型训练或更新的性能。Based on the original data and its feature information, synthetic data that meets diversity requirements and has the same data distribution as the measured data can be generated, which can improve the quality of the synthetic data and thus help improve the performance of model training or updating.
在一种可能的实现方式中,第二节点还接收来自所述第一节点的第二信息。该第二信息指示所述第二节点得到所述第一信息的处理方式。示例性的,所述第二节点得到所述第一信息的处理方式包括所述原始数据的特征信息的获取方式。进而,第二节点基于所述第二信息得到所述第一信息。In a possible implementation, the second node further receives second information from the first node. The second information indicates a processing method by which the second node obtains the first information. Exemplarily, the processing method by which the second node obtains the first information includes a method for obtaining characteristic information of the original data. Further, the second node obtains the first information based on the second information.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息中的至少一项:所述原始数据的数据标签、或,所述原始数据的优先级。In a possible implementation manner, the first information further includes at least one of the following types of information: a data label of the original data, or a priority of the original data.
在一种可能的实现方式中,所述处理方式还包括所述原始数据的数据标签的获取方式。In a possible implementation manner, the processing method also includes a method for acquiring data labels of the original data.
在一种可能的实现方式中,第二节点还接收来自所述第一节点的第三信息,所述第三信息指示所述第一信息的上报配置。In a possible implementation manner, the second node further receives third information from the first node, where the third information indicates a reporting configuration of the first information.
可选的,所述上报配置包括以下至少一项:所述第一信息的类型,所述第一信息的数据量。Optionally, the reporting configuration includes at least one of the following: the type of the first information, and the data volume of the first information.
在一种可能的实现方式中,第二节点还接收来自所述第一节点的第四信息,所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。In a possible implementation, the second node further receives fourth information from the first node, where the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the first information further includes the following type of information: group information of the original data.
在一种可能的实现方式中,第二节点还接收来自所述第一节点的多个合成数据中的至少一个子集。第二节点还向所述第一节点发送第五信息,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the second node further receives at least one subset of the multiple synthetic data from the first node. The second node further sends fifth information to the first node, where the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset. The evaluation result indicates at least one synthetic data that needs to be eliminated from the at least one subset, or indicates at least one synthetic data that does not need to be eliminated from the at least one subset.
在一种可能的实现方式中,所述评估结果是基于所述至少一个子集中的至少一个合成数据与本地数据之间的距离确定的。In a possible implementation manner, the evaluation result is determined based on a distance between at least one synthetic data in the at least one subset and the local data.
在一种可能的实现方式中,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据;其中,所述筛选后的至少一个合成数据用于模型的处理。In a possible implementation, the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
或者,本申请实施例提供一种模型数据获取方法,由第二节点或用于第二节点的电路执行。该方法包括:第二节点向第一节点发送第六信息。所述第六信息包括以下类型的信息:原始数据。Alternatively, an embodiment of the present application provides a model data acquisition method, which is executed by a second node or a circuit for a second node. The method includes: the second node sends sixth information to the first node. The sixth information includes the following types of information: original data.
其中,第二节点还向第一节点发送第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。The second node also sends ninth information to the first node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data.
在一种可能的实现方式中,第二节点还接收来自第一节点的第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。In a possible implementation, the second node also receives eighth information from the first node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
在一种可能的实现方式中,第二节点还接收来自第一节点的第七信息,该第七信息指示第六信息的上报配置。In a possible implementation manner, the second node further receives seventh information from the first node, where the seventh information indicates a reporting configuration of the sixth information.
在一种可能的实现方式中,第二节点还接收来自第一节点的第四信息。所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。In a possible implementation manner, the second node further receives fourth information from the first node, wherein the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
在一种可能的实现方式中,第二节点还接收来自所述第一节点的多个合成数据中的至少一个子集。第二节点还向所述第一节点发送第五信息,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the second node further receives at least one subset of the multiple synthetic data from the first node. The second node further sends fifth information to the first node, where the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset. The evaluation result indicates at least one synthetic data that needs to be eliminated from the at least one subset, or indicates at least one synthetic data that does not need to be eliminated from the at least one subset.
在一种可能的实现方式中,所述评估结果是基于所述至少一个子集中的至少一个合成数据与本地数据之间的距离确定的。In a possible implementation manner, the evaluation result is determined based on a distance between at least one synthetic data in the at least one subset and the local data.
在一种可能的实现方式中,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据;其中,所述筛选后的至少一个合成数据用于模型的处理。In a possible implementation, the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
第三方面,本申请实施例提供一种模型数据获取方法,由第三节点或用于第三节点的电路执行。该方法包括:向第一节点发送第十一信息,所述第十一信息包括以下类型的信息:原始数据的特征信息。这样以便第一节点基于该原始数据的特征信息得到合成数据,也即,该原始数据的特征信息用于合成数据的生成。In a third aspect, an embodiment of the present application provides a model data acquisition method, which is performed by a third node or a circuit for a third node. The method includes: sending eleventh information to a first node, the eleventh information including the following types of information: characteristic information of the original data. In this way, the first node obtains synthetic data based on the characteristic information of the original data, that is, the characteristic information of the original data is used to generate the synthetic data.
或者,本申请实施例提供一种模型数据获取方法,由第三节点或用于第三节点的电路执行。该方法包括:向第一节点发送第九信息,所述第九信息指示第一节点得到所述原始数据的特征信息的获取方式。这样以便第一节点基于所述第九信息对所述原始数据进行处理得到所述原始数据的特征信息,也即,该原始数据的特征信息的获取方式用于原始数据的特征信息的获取。Alternatively, an embodiment of the present application provides a model data acquisition method, which is executed by a third node or a circuit for a third node. The method includes: sending ninth information to a first node, wherein the ninth information indicates a method for the first node to obtain characteristic information of the original data. In this way, the first node processes the original data based on the ninth information to obtain characteristic information of the original data, that is, the method for obtaining characteristic information of the original data is used to obtain characteristic information of the original data.
第四方面,本申请提供了一种模型数据获取装置,该装置可包括收发模块和处理模块,具体如下:In a fourth aspect, the present application provides a model data acquisition device, which may include a transceiver module and a processing module, as follows:
收发模块,用于接收来自第二节点的第一信息,所述第一信息包括以下类型的信息:原始数据和所述原始数据的特征信息;A transceiver module, configured to receive first information from a second node, wherein the first information includes the following types of information: original data and characteristic information of the original data;
处理模块,用于基于所述第一信息,生成多个合成数据,所述多个合成数据用于模型的处理。A processing module is used to generate a plurality of synthetic data based on the first information, and the plurality of synthetic data are used for processing the model.
在一种可能的实现方式中,所述收发模块,还用于向所述第二节点发送第二信息,所述第二信息指示所述第二节点得到所述第一信息的处理方式,所述第二节点得到所述第一信息的处理方式包括所述原始数据的特征信息的获取方式。In a possible implementation, the transceiver module is further used to send second information to the second node, where the second information indicates a processing method for the second node to obtain the first information, and the processing method for the second node to obtain the first information includes a method for obtaining characteristic information of the original data.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的数据标签和/或所述原始数据的优先级。In a possible implementation manner, the first information further includes the following types of information: a data label of the original data and/or a priority of the original data.
在一种可能的实现方式中,所述第二节点得到所述第一信息的处理方式还包括所述原始数据的数据标签的获取方式。In a possible implementation manner, the processing method for the second node to obtain the first information also includes a method for obtaining the data label of the original data.
在一种可能的实现方式中,所述收发模块,还用于向所述第二节点发送第三信息,所述第三信息指示所述第一信息的上报配置。In a possible implementation manner, the transceiver module is further configured to send third information to the second node, where the third information indicates a reporting configuration of the first information.
在一种可能的实现方式中,所述上报配置包括以下至少一项:所述第一信息的类型,所述第一信息的数据量。In a possible implementation manner, the reporting configuration includes at least one of the following: a type of the first information, and a data volume of the first information.
在一种可能的实现方式中,所述收发模块,还用于向所述第二节点发送第四信息,所述第四信息指示所述装置具有将相同分组的多个原始数据的特征信息进行联合处理的能力。In a possible implementation, the transceiver module is further configured to send fourth information to the second node, where the fourth information indicates that the device has the ability to jointly process feature information of multiple original data of the same group.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the first information further includes the following type of information: group information of the original data.
在一种可能的实现方式中,所述收发模块,还用于向所述第二节点发送所述多个合成数据中的至少一个子集;In a possible implementation manner, the transceiver module is further configured to send at least one subset of the plurality of synthesized data to the second node;
接收来自所述第二节点的第五信息,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果,所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。Receive fifth information from the second node, wherein the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset, wherein the evaluation result indicates at least one synthetic data that needs to be eliminated in the at least one subset, or indicates at least one synthetic data that does not need to be eliminated in the at least one subset.
在一种可能的实现方式中,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据;其中,所述筛选后的至少一个合成数据用于所述模型的处理。In a possible implementation, the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for processing the model.
在一种可能的实现方式中,所述筛选后的至少一个合成数据是基于待剔除的第一合成数据与所述多个合成数据中的其他合成数据之间的距离确定的,所述待剔除的第一合成数据为基于所述第五信息确定的需要剔除的合成数据。In a possible implementation, the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
或者,本申请提供了一种模型数据获取装置,该装置可包括收发模块和处理模块,具体如下:Alternatively, the present application provides a model data acquisition device, which may include a transceiver module and a processing module, as follows:
收发模块,用于接收来自第二节点的第六信息,所述第六信息包括以下类型的信息:原始数据。The transceiver module is used to receive sixth information from the second node, where the sixth information includes the following types of information: original data.
处理模块,用于基于所述第六信息和原始数据的特征信息,生成多个合成数据。该多个合成数据用于模型的处理。The processing module is used to generate a plurality of synthetic data based on the sixth information and the characteristic information of the original data. The plurality of synthetic data are used for processing the model.
在第一种可能的实现方式中,收发模块,还用于接收来自第三节点的第十一信息,所述第十一信息包括以下类型的信息:所述原始数据的特征信息。In a first possible implementation manner, the transceiver module is further configured to receive eleventh information from the third node, where the eleventh information includes the following type of information: characteristic information of the original data.
在第二种可能的实现方式中,收发模块,还用于接收来自所述第二节点的第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。In a second possible implementation manner, the transceiver module is further configured to receive ninth information from the second node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data.
在第三种可能的实现方式中,收发模块,还用于接收来自第三节点的第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。In a third possible implementation manner, the transceiver module is further used to receive ninth information from the third node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data.
在第四种可能的实现方式中,所述原始数据的特征信息是基于原始数据的特征信息的获取方式确定的,所述原始数据的特征信息的获取方式是所述第一节点确定的。In a fourth possible implementation manner, the characteristic information of the original data is determined based on a method for acquiring the characteristic information of the original data, and the method for acquiring the characteristic information of the original data is determined by the first node.
在一种可能的实现方式中,收发模块,还用于向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。In a possible implementation, the transceiver module is also used to send eighth information to the second node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
在一种可能的实现方式中,收发模块,还用于向所述第二节点发送第七信息,该第七信息指示第六信息的上报配置。In a possible implementation manner, the transceiver module is further configured to send seventh information to the second node, where the seventh information indicates a reporting configuration of the sixth information.
可选的,所述上报配置包括以下至少一项:所述第六信息的类型,所述第六信息的数据量。Optionally, the reporting configuration includes at least one of the following: the type of the sixth information, and the data volume of the sixth information.
在一种可能的实现方式中,收发模块,还用于向所述第二节点发送第四信息。所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。该联合处理,例如可以是将相同分组的多个原始数据的特征信息加权求和等。In a possible implementation, the transceiver module is further configured to send fourth information to the second node. The fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group. The joint processing may be, for example, weighted summing of feature information of multiple original data of the same group.
在一种可能的实现方式中,所述第六信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the sixth information further includes the following type of information: group information of the original data.
在一种可能的实现方式中,收发模块,还用于向所述第二节点发送所述多个合成数据中的至少一个子集。第一节点还接收来自所述第二节点的第五信息。所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the transceiver module is further configured to send at least one subset of the plurality of synthesized data to the second node. The first node further receives fifth information from the second node. The fifth information indicates an evaluation result of at least one synthesized data in the at least one subset. The evaluation result indicates at least one synthesized data that needs to be eliminated in the at least one subset, or indicates at least one synthesized data that does not need to be eliminated in the at least one subset.
可选的,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据。其中,所述筛选后的至少一个合成数据用于所述模型的处理。Optionally, the evaluation result is used to filter the plurality of synthetic data to obtain at least one filtered synthetic data, wherein the at least one filtered synthetic data is used for processing the model.
在一种可能的实现方式中,所述筛选后的至少一个合成数据是基于待剔除的第一合成数据与所述多个合成数据中的其他合成数据之间的距离确定的,所述待剔除的第一合成数据为基于所述第五信息确定的需要剔除的合成数据。In a possible implementation, the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the multiple synthesized data, and the first synthesized data to be eliminated is the synthesized data that needs to be eliminated determined based on the fifth information.
第五方面,本申请提供了一种模型数据获取装置,该装置可包括收发模块,具体如下:In a fifth aspect, the present application provides a model data acquisition device, which may include a transceiver module, specifically as follows:
收发模块,用于向第一节点发送第一信息,所述第一信息包括以下类型的信息:原始数据和所述原始数据的特征信息。The transceiver module is used to send first information to the first node, where the first information includes the following types of information: original data and characteristic information of the original data.
在一种可能的实现方式中,所述收发模块,还用于接收来自所述第一节点的第二信息,所述第二信息指示所述装置得到所述第一信息的处理方式,所述处理方式包括所述原始数据的特征信息的获取方式;In a possible implementation, the transceiver module is further configured to receive second information from the first node, where the second information indicates a processing method by which the device obtains the first information, and the processing method includes a method for acquiring characteristic information of the original data;
还包括处理模块,用于基于所述第二信息得到所述第一信息。The method further includes a processing module, configured to obtain the first information based on the second information.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的数据标签和/或所述原始数据的优先级。In a possible implementation manner, the first information further includes the following types of information: a data label of the original data and/or a priority of the original data.
在一种可能的实现方式中,所述处理方式还包括所述原始数据的数据标签的获取方式。In a possible implementation manner, the processing method also includes a method for acquiring data labels of the original data.
在一种可能的实现方式中,所述收发模块,还用于接收来自所述第一节点的第三信息,所述第三信息指示所述第一信息的上报配置。In a possible implementation manner, the transceiver module is further configured to receive third information from the first node, where the third information indicates a reporting configuration of the first information.
在一种可能的实现方式中,所述上报配置包括以下至少一项:所述第一信息的类型,所述第一信息的数据量。In a possible implementation manner, the reporting configuration includes at least one of the following: a type of the first information, and a data volume of the first information.
在一种可能的实现方式中,所述收发模块,还用于接收来自所述第一节点的第四信息,所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。In a possible implementation, the transceiver module is further configured to receive fourth information from the first node, where the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
在一种可能的实现方式中,所述第一信息还包括以下类型的信息:所述原始数据的组信息。In a possible implementation manner, the first information further includes the following type of information: group information of the original data.
在一种可能的实现方式中,所述收发模块,还用于接收来自所述第一节点的多个合成数据中的至少一个子集;In a possible implementation, the transceiver module is further configured to receive at least one subset of the plurality of synthesized data from the first node;
向所述第一节点发送第五信息,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果,所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。Send fifth information to the first node, wherein the fifth information indicates an evaluation result of at least one synthetic data in the at least one subset, and the evaluation result indicates at least one synthetic data that needs to be eliminated in the at least one subset, or indicates at least one synthetic data that does not need to be eliminated in the at least one subset.
在一种可能的实现方式中,所述评估结果是基于所述至少一个子集中至少一个合成数据与本地数据之间的距离确定的。In a possible implementation manner, the evaluation result is determined based on a distance between at least one synthetic data in the at least one subset and the local data.
在一种可能的实现方式中,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据;其中,所述筛选后的至少一个合成数据用于模型的处理。In a possible implementation, the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
或者,本申请提供了一种模型数据获取装置,该装置可包括收发模块,具体如下:Alternatively, the present application provides a model data acquisition device, which may include a transceiver module, specifically as follows:
收发模块,用于向第一节点发送第六信息。所述第六信息包括以下类型的信息:原始数据。The transceiver module is used to send sixth information to the first node. The sixth information includes the following types of information: original data.
其中,该收发模块,还用于向第一节点发送第九信息,所述第九信息指示所述第一节点得到所述原始数据的特征信息的获取方式。The transceiver module is further used to send ninth information to the first node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data.
在一种可能的实现方式中,收发模块,用于接收来自第一节点的第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。In a possible implementation, the transceiver module is used to receive eighth information from the first node, where the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining a data label of the original data.
在一种可能的实现方式中,收发模块,用于接收来自第一节点的第七信息,该第七信息指示第六信息的上报配置。In a possible implementation manner, the transceiver module is configured to receive seventh information from the first node, where the seventh information indicates a reporting configuration of the sixth information.
在一种可能的实现方式中,收发模块,用于接收来自第一节点的第四信息。所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。In a possible implementation, the transceiver module is configured to receive fourth information from the first node. The fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group.
在一种可能的实现方式中,收发模块,用于接收来自所述第一节点的多个合成数据中的至少一个子集。第二节点还向所述第一节点发送第五信息,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。In a possible implementation, the transceiver module is configured to receive at least one subset of the plurality of synthesized data from the first node. The second node further sends fifth information to the first node, wherein the fifth information indicates an evaluation result of at least one synthesized data in the at least one subset. The evaluation result indicates at least one synthesized data that needs to be eliminated from the at least one subset, or indicates at least one synthesized data that does not need to be eliminated from the at least one subset.
在一种可能的实现方式中,所述评估结果是基于所述至少一个子集中的至少一个合成数据与本地数据之间的距离确定的。In a possible implementation manner, the evaluation result is determined based on a distance between at least one synthetic data in the at least one subset and the local data.
在一种可能的实现方式中,所述评估结果用于对所述多个合成数据进行筛选,得到筛选后的至少一个合成数据;其中,所述筛选后的至少一个合成数据用于模型的处理。In a possible implementation, the evaluation result is used to filter the multiple synthetic data to obtain at least one filtered synthetic data; wherein the at least one filtered synthetic data is used for model processing.
第六方面,本申请提供了一种模型数据获取装置,该装置可包括收发模块,其中:In a sixth aspect, the present application provides a model data acquisition device, which may include a transceiver module, wherein:
收发模块,用于向第一节点发送第十一信息,所述第十一信息包括以下类型的信息:原始数据的特征信息。The transceiver module is used to send eleventh information to the first node, where the eleventh information includes the following types of information: characteristic information of original data.
或者,本申请提供了一种模型数据获取装置,该装置可包括收发模块,其中:Alternatively, the present application provides a model data acquisition device, which may include a transceiver module, wherein:
收发模块,用于向第一节点发送第九信息,所述第九信息指示第一节点得到所述原始数据的特征信息的获取方式。The transceiver module is used to send ninth information to the first node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data.
第七方面,本申请提供了一种模型数据获取装置,包括处理器和存储器;其中,所述存储器用于存储程序代码,所述处理器用于调用所述程序代码,以执行如第一方面任一种可能的实施方式提供的方法。In a seventh aspect, the present application provides a model data acquisition device, comprising a processor and a memory; wherein the memory is used to store program code, and the processor is used to call the program code to execute a method provided in any possible implementation manner of the first aspect.
第八方面,本申请提供了一种模型数据获取装置,包括处理电路和存储器;其中,所述存储器用于存储程序代码,所述处理电路用于调用所述程序代码,以执行如第二方面任一种可能的实施方式提供的方法。In an eighth aspect, the present application provides a model data acquisition device, comprising a processing circuit and a memory; wherein the memory is used to store program code, and the processing circuit is used to call the program code to execute a method provided in any possible implementation manner of the second aspect.
第九方面,本申请提供了一种模型数据获取装置,包括处理电路和存储器;其中,所述存储器用于存储程序代码,所述处理电路用于调用所述程序代码,以执行如第三方面任一种可能的实施方式提供的方法。In a ninth aspect, the present application provides a model data acquisition device, comprising a processing circuit and a memory; wherein the memory is used to store program code, and the processing circuit is used to call the program code to execute a method provided in any possible implementation manner of the third aspect.
第十方面,本申请提供了一种模型数据获取系统,包括如第四方面任一种可能的实施方式提供的装置,以及如第五方面任一种可能的实施方式提供的装置;或者包括如第七方面任一种可能的实施方式提供的装置,以及如第八方面任一种可能的实施方式提供的装置;或者包括如第四方面任一种可能的实施方式提供的装置,以及如第五方面任一种可能的实施方式提供的装置,以及如第六方面任一种可能的实施方式提供的装置;或者包括如第七方面任一种可能的实施方式提供的装置,以及如第八方面任一种可能的实施方式提供的装置,以及如第九方面任一种可能的实施方式提供的装置。In a tenth aspect, the present application provides a model data acquisition system, comprising an apparatus provided in any possible implementation manner of the fourth aspect, and an apparatus provided in any possible implementation manner of the fifth aspect; or comprising an apparatus provided in any possible implementation manner of the seventh aspect, and an apparatus provided in any possible implementation manner of the eighth aspect; or comprising an apparatus provided in any possible implementation manner of the fourth aspect, and an apparatus provided in any possible implementation manner of the fifth aspect, and an apparatus provided in any possible implementation manner of the sixth aspect; or comprising an apparatus provided in any possible implementation manner of the seventh aspect, and an apparatus provided in any possible implementation manner of the eighth aspect, and an apparatus provided in any possible implementation manner of the ninth aspect.
第十一方面,本申请提供了一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现如第一方面任一种可能的实施方式提供的方法,或者如第二方面任一种可能的实施方式提供的方法,或者如第三方面任一种可能的实施方式提供的方法。In the eleventh aspect, the present application provides a computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement a method provided in any possible implementation of the first aspect, or a method provided in any possible implementation of the second aspect, or a method provided in any possible implementation of the third aspect.
第十二方面,本申请提供了一种计算机程序产品,其特征在于,当计算机程序产品在计算机上运行时,使得所述计算机执行如第一方面任一种可能的实施方式提供的方法,或者如第二方面任一种可能的实施方式提供的方法,或者如第三方面任一种可能的实施方式提供的方法。In the twelfth aspect, the present application provides a computer program product, characterized in that when the computer program product runs on a computer, the computer executes a method provided in any possible implementation of the first aspect, or a method provided in any possible implementation of the second aspect, or a method provided in any possible implementation of the third aspect.
可以理解地,上述提供的第四方面所述的装置至第九方面所述的装置、第十方面所述的系统、第十一方面所述的计算机可读存储介质或者第十二方面所述的计算机程序产品均用于执行第一方面中任一所提供的方法或第二方面中任一所提供的方法、或第三方面中任一所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。It can be understood that the apparatus described in the fourth aspect to the apparatus described in the ninth aspect, the system described in the tenth aspect, the computer-readable storage medium described in the eleventh aspect, or the computer program product described in the twelfth aspect are all used to execute any method provided in the first aspect, any method provided in the second aspect, or any method provided in the third aspect. Therefore, the beneficial effects that can be achieved can refer to the beneficial effects in the corresponding method, which will not be repeated here.
下面对本申请实施例用到的附图进行介绍。The following is an introduction to the drawings used in the embodiments of the present application.
图1是本申请实施例提供的无线通信系统的一简化示意图;FIG1 is a simplified schematic diagram of a wireless communication system provided by an embodiment of the present application;
图2a是本申请实施例提供的一种通信系统的示意图;FIG2a is a schematic diagram of a communication system provided by an embodiment of the present application;
图2b是本申请实施例提供的另一种通信系统的示意图;FIG2b is a schematic diagram of another communication system provided in an embodiment of the present application;
图3a是本申请实施例提供的通信系统中的一种可能的应用框架示意图;FIG3a is a schematic diagram of a possible application framework in a communication system provided in an embodiment of the present application;
图3b是本申请实施例提供的通信系统中的另一种可能的应用框架示意图;FIG3b is a schematic diagram of another possible application framework in the communication system provided in an embodiment of the present application;
图4是本申请实施例提供的一种编码器和解码器的示意图;FIG4 is a schematic diagram of an encoder and a decoder provided in an embodiment of the present application;
图5是本申请实施例提供的一种AI应用框架示意图;FIG5 is a schematic diagram of an AI application framework provided in an embodiment of the present application;
图6是本申请实施例提供的一种模型数据获取方法的流程示意图;FIG6 is a schematic diagram of a flow chart of a method for acquiring model data provided in an embodiment of the present application;
图7是本申请实施例提供的另一种模型数据获取方法的流程示意图;FIG7 is a flow chart of another method for acquiring model data provided in an embodiment of the present application;
图8是本申请实施例提供的又一种模型数据获取方法的流程示意图;FIG8 is a flow chart of another model data acquisition method provided in an embodiment of the present application;
图9是本申请实施例提供的又一种模型数据获取方法的流程示意图;FIG9 is a flow chart of another model data acquisition method provided in an embodiment of the present application;
图10是本申请实施例提供的又一种模型数据获取方法的流程示意图;FIG10 is a flow chart of another model data acquisition method provided in an embodiment of the present application;
图11是本申请实施例提供的一种模型数据获取装置的结构示意图;FIG11 is a schematic diagram of the structure of a model data acquisition device provided in an embodiment of the present application;
图12是本申请实施例提供的另一种模型数据获取装置的结构示意图。FIG. 12 is a schematic diagram of the structure of another model data acquisition device provided in an embodiment of the present application.
为了使本公开的目的、技术方案和优点更加清楚,下面将结合附图对本公开作进一步地详细描述。In order to make the objectives, technical solutions and advantages of the present disclosure more clear, the present disclosure will be further described in detail below with reference to the accompanying drawings.
本公开如下涉及的至少一个(项),指示一个(项)或多个(项)。多个(项),是指两个(项)或两个(项)以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。另外,应当理解,尽管在本公开中可能采用术语第一、第二等来描述各对象、但这些对象不应限于这些术语。这些术语仅用来将各对象彼此区分开。The present disclosure relates to at least one (item) as follows, indicating one (item) or more (items). More than one (item) refers to two (items) or more than two (items). "And/or" describes the association relationship of associated objects, indicating that three relationships may exist. For example, A and/or B can represent: A exists alone, A and B exist at the same time, and B exists alone. The character "/" generally indicates that the objects associated before and after are in an "or" relationship. In addition, it should be understood that although the terms first, second, etc. may be used to describe each object in the present disclosure, these objects should not be limited to these terms. These terms are only used to distinguish each object from each other.
本公开如下描述中所提到的术语“包括”和“具有”以及它们的任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括其他没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。需要说明的是,本公开中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本公开中被描述为“示例性的”或者“例如”的任何方法或设计方案不应被解释为比其它方法或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念。The terms "including" and "having" and any variations thereof mentioned in the following description of the present disclosure are intended to cover non-exclusive inclusions. For example, a process, method, system, product or device that includes a series of steps or units is not limited to the listed steps or units, but optionally includes other steps or units that are not listed, or optionally includes other steps or units that are inherent to these processes, methods, products or devices. It should be noted that in the present disclosure, words such as "exemplary" or "for example" are used to indicate examples, illustrations or explanations. Any method or design described in the present disclosure as "exemplary" or "for example" should not be interpreted as being more preferred or more advantageous than other methods or designs. Specifically, the use of words such as "exemplary" or "for example" is intended to present related concepts in a concrete way.
本公开提供的技术可以应用于各种通信系统,例如,该通信系统可以是第五代(5th generation,5G)或新无线(new radio,NR)系统、长期演进(long term evolution,LTE)系统、LTE频分双工(frequency division duplex,FDD)系统、LTE时分双工(time division duplex,TDD)系统、无线局域网(wireless local area network,WLAN)系统、卫星通信系统、未来的通信系统,如第六代(6th generation,6G)移动通信系统,或者多种系统的融合系统等。本申请提供的技术方案还可以应用于设备到设备(device to device,D2D)通信,车到万物(vehicle-to-everything,V2X)通信,机器到机器(machine to machine,M2M)通信,机器类型通信(machine type communication,MTC),以及物联网(internet of things,IoT)通信系统或者其他通信系统。The technology provided by the present disclosure can be applied to various communication systems, for example, the communication system can be a fifth generation (5G) or new radio (NR) system, a long term evolution (LTE) system, an LTE frequency division duplex (FDD) system, an LTE time division duplex (TDD) system, a wireless local area network (WLAN) system, a satellite communication system, a future communication system such as a sixth generation (6G) mobile communication system, or a fusion system of multiple systems. The technical solution provided by the present application can also be applied to device to device (D2D) communication, vehicle to everything (V2X) communication, machine to machine (M2M) communication, machine type communication (MTC), and Internet of things (IoT) communication system or other communication systems.
通信系统中的一个设备可以向另一个设备发送信号或从另一个设备接收信号。其中信号可以包括信息、信令或者数据等。其中,设备也可以被替换为实体、网络实体、网元、通信设备、通信模块、节点、通信节点等等,本公开中以设备为例进行描述。例如,通信系统可以包括至少一个终端设备和至少一个接入网设备。接入网设备可以向终端设备发送下行信号,和/或终端设备可以向接入网设备发送上行信号此外可以理解的是,若通信系统中包括多个终端设备,多个终端设备之间也可以互发信号,即信号的发送设备和信号的接收设备均可以是终端设备。A device in a communication system can send a signal to another device or receive a signal from another device. The signal may include information, signaling, or data, etc. The device may also be replaced by an entity, a network entity, a network element, a communication device, a communication module, a node, a communication node, etc. The present disclosure is described by taking the device as an example. For example, the communication system may include at least one terminal device and at least one access network device. The access network device may send a downlink signal to the terminal device, and/or the terminal device may send an uplink signal to the access network device. In addition, it can be understood that if the communication system includes multiple terminal devices, multiple terminal devices may also send signals to each other, that is, the signal sending device and the signal receiving device may both be terminal devices.
本申请实施例提供的信息生成方法可以应用于5G、6G、卫星通信等无线通信系统中。参见图1,图1是本申请实施例提供的无线通信系统的一简化示意图。如图1所示,该无线通信系统包括无线接入网100。无线接入网100可以是下一代(例如6G或更高版本)无线接入网,或传统(例如5G、4G、3G或2G)无线接入网。一个或多个通信设备(120a-120j,统称为120)可以相互连接或连接到无线接入网100中的一个或多个网络设备(110a、110b,统称为110)。可选的,图1只是示意图,该无线通信系统中还可以包括其它设备,如还可以包括核心网设备、无线中继设备和/或无线回传设备等,在图1中未画出。The information generation method provided in the embodiment of the present application can be applied to wireless communication systems such as 5G, 6G, and satellite communication. Referring to FIG1, FIG1 is a simplified schematic diagram of a wireless communication system provided in an embodiment of the present application. As shown in FIG1, the wireless communication system includes a wireless access network 100. The wireless access network 100 may be a next generation (e.g., 6G or higher) wireless access network, or a traditional (e.g., 5G, 4G, 3G, or 2G) wireless access network. One or more communication devices (120a-120j, collectively referred to as 120) may be interconnected or connected to one or more network devices (110a, 110b, collectively referred to as 110) in the wireless access network 100. Optionally, FIG1 is only a schematic diagram, and other devices may also be included in the wireless communication system, such as core network devices, wireless relay devices, and/or wireless backhaul devices, which are not shown in FIG1.
可选的,在实际应用中,该无线通信系统可以同时包括多个网络设备(也称为接入网设备),也可以同时包括多个通信设备。一个网络设备可以同时服务于一个或多个通信设备。一个通信设备也可以同时接入一个或多个网络设备。本申请实施例对该无线通信系统中包括的通信设备和网络设备的数量不做限定。Optionally, in practical applications, the wireless communication system may include multiple network devices (also referred to as access network devices) at the same time, or may include multiple communication devices at the same time. A network device may serve one or more communication devices at the same time. A communication device may also access one or more network devices at the same time. The embodiment of the present application does not limit the number of communication devices and network devices included in the wireless communication system.
其中,网络设备可以是网络侧的一种用于发射或接收信号的实体。网络设备可以为通信设备通过无线方式接入到该无线通信系统中的接入设备,如网络设备可以是基站。基站可以广义的覆盖如下中的各种名称,或与如下名称进行替换,比如:节点B(NodeB)、演进型基站(evolved NodeB,eNB)、下一代基站(next generation NodeB,gNB)、开放无线接入网(open radio access network,O-RAN)中的接入网设备、中继站、接入点、传输点(transmitting and receiving point,TRP)、发射点(transmitting point,TP)、主站(Master eNodeB,MeNB)、辅站(Secondary eNodeB,SeNB)、多制式无线(Multi standard radio,MSR)节点、家庭基站、网络控制器、接入节点、无线节点、接入点(AP)、传输节点、收发节点、基带单元(baseband unit,BBU)、射频拉远单元(remote radio unit,RRU)、有源天线单元(Active antenna unit,AAU)、射频头(Remote Radio Head,RRH)、中心单元(central unit,CU)、分布单元(distributed unit,DU)、无线单元(radio unit,RU)、集中单元控制面(CU control plane,CU-CP)节点、集中单元用户面(CU user plane,CU-UP)节点、定位节点等。基站可以是宏基站、微基站、中继节点、施主节点或类似物,或其组合。网络设备还可以指用于设置于前述设备或装置内的通信模块、调制解调器或芯片。网络设备还可以是移动交换中心以及设备到设备(Device-to-Device,D2D)、车辆外联(vehicle-to-everything,V2X)、机器到机器(machine-to-machine,M2M)通信中承担基站功能的设备、6G网络中的网络侧设备、未来的通信系统中承担基站功能的设备等。网络设备可以支持相同或不同接入技术的网络。本申请的实施例对网络设备所采用的具体技术和具体设备形态不做限定。Among them, the network device can be an entity on the network side for transmitting or receiving signals. The network device can be an access device for the communication device to access the wireless communication system by wireless means, such as the network device can be a base station. The base station can broadly cover the following various names, or be replaced with the following names, such as: Node B (NodeB), evolved NodeB (eNB), next generation NodeB (gNB), access network equipment in open radio access network (O-RAN), relay station, access point, transmission point (transmitting and receiving point, TRP), transmitting point (transmitting point, TP), master station (Master eNodeB, MeNB), secondary eNodeB (SeNB), multi-standard radio (Multi-standard radio o, MSR) node, home base station, network controller, access node, wireless node, access point (AP), transmission node, transceiver node, baseband unit (BBU), remote radio unit (RRU), active antenna unit (AAU), remote radio head (RRH), central unit (CU), distributed unit (DU), radio unit (RU), centralized unit control plane (CU-CP) node, centralized unit user plane (CU-UP) node, positioning node, etc. The base station can be a macro base station, a micro base station, a relay node, a donor node or the like, or a combination thereof. The network device can also refer to a communication module, a modem or a chip used to be set in the aforementioned device or apparatus. The network device may also be a mobile switching center and a device that performs base station functions in device-to-device (D2D), vehicle-to-everything (V2X), and machine-to-machine (M2M) communications, a network-side device in a 6G network, and a device that performs base station functions in future communication systems. The network device may support networks with the same or different access technologies. The embodiments of the present application do not limit the specific technology and specific device form used by the network device.
网络设备可以是固定的,也可以是移动的。例如,基站110a、110b是静止的,并负责来自通信设备120的一个或多个小区中的无线传输和接收。图1中示出的直升机或无人机120i可以被配置成充当移动基站,并且一个或多个小区可以根据移动基站120i的位置移动。在其他示例中,直升机或无人机(120i)可以被配置成用作与基站110b通信的通信设备。The network equipment may be fixed or mobile. For example, base stations 110a, 110b are stationary and are responsible for wireless transmission and reception in one or more cells from the communication device 120. The helicopter or drone 120i shown in FIG. 1 may be configured to act as a mobile base station, and one or more cells may move according to the location of the mobile base station 120i. In other examples, the helicopter or drone (120i) may be configured to act as a communication device that communicates with the base station 110b.
本公开中,用于实现如上接入网络功能的通信装置可以是接入网设备,也可以是具有接入网络的部分功能的网络设备,也可以是能够支持实现接入网络功能的装置,例如芯片系统,硬件电路、软件模块、或硬件电路加软件模块,该装置可以被安装在接入网设备中或者和接入网设备匹配使用。本公开的方法中,以用于实现接入网设备功能的通信装置是接入网设备为例进行描述。In the present disclosure, the communication device used to implement the above access network function can be an access network device, or a network device with some functions of accessing the network, or a device capable of supporting the implementation of the access network function, such as a chip system, a hardware circuit, a software module, or a hardware circuit plus a software module, which can be installed in the access network device or used in combination with the access network device. In the method of the present disclosure, the communication device used to implement the access network device function is an access network device for description as an example.
通信设备可以是用户侧的一种用于接收或发射信号的实体,如手机。通信设备可以用于连接人、物和机器。通信设备可通过网络设备与一个或多个核心网进行通信。通信设备包括具有无线连接功能的手持式设备、连接到无线调制解调器的其他处理设备或车载设备等。通信设备可以是便携式、袖珍式、手持式、计算机内置的或者车载的移动装置。通信设备120可以广泛应用于各种场景,例如蜂窝通信、设备到设备D2D、车到所有V2X、端到端(peer-to-peer,P2P)、机器到机器M2M、机器类型通信MTC、物联网IoT、虚拟现实(virtual reality,VR)、增强现实(augmented reality,AR)、工业控制、自动驾驶、远程医疗、智能电网、智能家具、智能办公、智能穿戴、智能交通、智慧城市、无人机、机器人、遥感、被动传感、定位、导航与跟踪、自主交付与移动等。通信设备120的一些举例为:3GPP标准的用户设备(user equipment,UE)、固定设备、移动设备、手持设备、可穿戴设备、蜂窝电话、智能电话、会话发起协议(Session initialization Protocol,SIP)电话、笔记本电脑、个人计算机、智能书、车辆、卫星、全球定位系统(GPS)设备、目标跟踪设备、无人机、直升机、飞行器、船只、遥控设备、智能家居设备、工业设备、个人通信业务(personal communication service,PCS)电话、无线本地环路(wireless local loop,WLL)站、个人数字助理(personal digital assistant,PDA)、无线网络摄像头、平板电脑、掌上电脑、移动互联网设备(mobile internet device,MID)、可穿戴设备如智能手表、虚拟现实VR设备、增强现实AR设备、工业控制(industrial control)中的无线终端、车联网系统中的终端、无人驾驶(self driving)中的无线终端、智能电网(smart grid)中的无线终端、运输安全(transportation safety)中的无线终端、智慧城市(smart city)中的无线终端如智能加油器,高铁上的终端设备以及智慧家庭(smart home)中的无线终端,如智能音响、智能咖啡机、智能打印机等。通信设备120可以为以上各种场景中的无线设备或用于设置于无线设备的装置,例如,上述设备中的通信模块、调制解调器或芯片等。通信设备也可以称为终端、终端设备、用户设备UE、移动台(mobile station,MS)、移动终端(mobile terminal,MT)等。通信设备还可以是未来的无线通信系统中的通信设备。通信设备可以用于专用网设备或者通用设备中。本申请的实施例对通信设备所采用的具体技术和具体设备形态不做限定。The communication device can be an entity on the user side for receiving or transmitting signals, such as a mobile phone. The communication device can be used to connect people, objects and machines. The communication device can communicate with one or more core networks through a network device. The communication device includes a handheld device with a wireless connection function, other processing devices connected to a wireless modem, or a vehicle-mounted device. The communication device can be a portable, pocket-sized, handheld, computer-built-in or vehicle-mounted mobile device. The communication device 120 can be widely used in various scenarios, such as cellular communication, device-to-device D2D, vehicle-to-all V2X, peer-to-peer (P2P), machine-to-machine M2M, machine type communication MTC, Internet of Things IoT, virtual reality (VR), augmented reality (AR), industrial control, autonomous driving, telemedicine, smart grid, smart furniture, smart office, smart wear, smart transportation, smart city, drone, robot, remote sensing, passive sensing, positioning, navigation and tracking, autonomous delivery and mobility, etc. Some examples of communication devices 120 are: 3GPP standard user equipment (UE), fixed equipment, mobile devices, handheld devices, wearable devices, cellular phones, smart phones, Session Initialization Protocol (SIP) phones, laptops, personal computers, smart books, vehicles, satellites, Global Positioning System (GPS) devices, target tracking devices, drones, helicopters, aircraft, ships, remote control devices, smart home devices, industrial equipment, personal communication service (PCS) phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), etc. The communication device 120 may be a wireless device in the above-mentioned scenarios or a device used to be set in a wireless device, for example, a communication module, a modem or a chip in the above-mentioned device. The communication device may also be referred to as a terminal, a terminal device, a user equipment UE, a mobile station (MS), a mobile terminal (MT), or the like. The communication device may also be referred to as a terminal, a terminal device, a user equipment UE, a mobile station (MS), a mobile terminal (MT), or the like. The communication device may also be referred to as a communication device in a future wireless communication system. The communication device can be used in a dedicated network device or a general device. The embodiments of the present application do not limit the specific technology and specific device form used by the communication device.
可选的,通信设备可以用于充当基站。例如,UE可以充当调度实体,其在V2X、D2D或P2P等中的UE之间提供侧行链路信号。如图1所示,蜂窝电话120a和汽车120b利用侧行链路信号彼此通信。蜂窝电话120a和智能家居设备120e之间通信,而无需通过基站110b中继通信信号。Optionally, the communication device can be used to act as a base station. For example, the UE can act as a scheduling entity that provides sidelink signals between UEs in V2X, D2D, or P2P, etc. As shown in Figure 1, the cellular phone 120a and the car 120b communicate with each other using sidelink signals. The cellular phone 120a and the smart home device 120e communicate without relaying the communication signal through the base station 110b.
本公开中,用于实现通信设备功能的通信装置可以是终端设备,也可以是具有以上通信设备的部分功能的终端设备,也可以是能够支持实现以上通信设备的功能的装置,例如芯片系统,该装置可以被安装在终端设备中或者和终端设备匹配使用。本公开中,芯片系统可以由芯片构成,也可以包括芯片和其他分立器件。本公开提供的技术方案中,以通信装置是终端设备或UE为例进行描述。In the present disclosure, the communication device used to implement the functions of the communication device may be a terminal device, or a terminal device having some of the functions of the above communication device, or a device capable of supporting the implementation of the functions of the above communication device, such as a chip system, which may be installed in the terminal device or used in combination with the terminal device. In the present disclosure, the chip system may be composed of a chip, or may include a chip and other discrete devices. In the technical solution provided in the present disclosure, the communication device is described as a terminal device or UE as an example.
可选的,无线通信系统通常由小区组成,基站提供小区的管理,基站向小区中多个移动台(mobile station,MS)提供通信服务。其中基站包含基带单元(baseband unit,BBU)和远端射频单元(remote radio unit,RRU)。BBU和RRU可以放置在不同的地方,例如:RRU拉远,放置于高话务量的区域,BBU放置于中心机房。BBU和RRU也可以放置在同一机房。BBU和RRU也可以为一个机架下的不同部件。可选的,一个小区可以对应于一个载波或成员载波。Optionally, a wireless communication system is usually composed of cells, and a base station provides management of the cell. The base station provides communication services to multiple mobile stations (MS) in the cell. The base station includes a baseband unit (BBU) and a remote radio unit (RRU). The BBU and RRU can be placed in different places, for example: the RRU is remote and placed in an area with high traffic volume, and the BBU is placed in a central computer room. The BBU and RRU can also be placed in the same computer room. The BBU and RRU can also be different components under one rack. Optionally, a cell can correspond to one carrier or component carrier.
可以理解的是,本公开可以应用在网络设备和通信设备之间,网络设备和网络设备之间,或,通信设备和通信设备之间,也即,主设备和次设备之间,主设备可以为网络设备或通信设备,主设备为网络设备时,次设备可以为另一网络设备或通信设备,主设备为通信设备时,次设备可以为另一通信设备。It can be understood that the present disclosure can be applied between a network device and a communication device, between a network device and a network device, or between a communication device and a communication device, that is, between a primary device and a secondary device. The primary device can be a network device or a communication device. When the primary device is a network device, the secondary device can be another network device or a communication device. When the primary device is a communication device, the secondary device can be another communication device.
以下以主设备为网络设备,如,接入网设备,次设备为通信设备,如终端设备,为例进行方案的描述。其中,下行对应的通信方向为主设备向次设备的发送,上行对应的通信方向为次设备向主设备的发送。The following describes the scheme by taking the master device as a network device, such as an access network device, and the slave device as a communication device, such as a terminal device, as an example. The communication direction corresponding to the downlink is the transmission from the master device to the slave device, and the communication direction corresponding to the uplink is the transmission from the slave device to the master device.
接入网设备和终端设备之间的协议层结构Protocol layer structure between access network equipment and terminal equipment
接入网设备和终端设备之间的通信遵循一定的协议层结构。该协议层结构可以包括控制面协议层结构和用户面协议层结构。例如,控制面协议层结构可以包括无线资源控制(radio resource control,RRC)层、分组数据汇聚层协议(packet data convergence protocol,PDCP)层、无线链路控制(radio link control,RLC)层、媒体接入控制(medium access control,MAC)层和物理层等协议层的功能。例如,用户面协议层结构可以包括PDCP层、RLC层、MAC层和物理层等协议层的功能,在一种可能的实现中,PDCP层之上还可以包括业务数据适配协议(service data adaptation protocol,SDAP)层。The communication between the access network equipment and the terminal equipment follows a certain protocol layer structure. The protocol layer structure may include a control plane protocol layer structure and a user plane protocol layer structure. For example, the control plane protocol layer structure may include the functions of the radio resource control (RRC) layer, the packet data convergence protocol (PDCP) layer, the radio link control (RLC) layer, the medium access control (MAC) layer and the physical layer. For example, the user plane protocol layer structure may include the functions of the PDCP layer, the RLC layer, the MAC layer and the physical layer. In a possible implementation, the service data adaptation protocol (SDAP) layer may also be included above the PDCP layer.
可选的,接入网设备和终端之间的协议层结构还可以包括人工智能(artificial intelligence,AI)层,用于传输AI功能相关的数据。Optionally, the protocol layer structure between the access network device and the terminal may also include an artificial intelligence (AI) layer for transmitting data related to AI functions.
以接入网设备和终端设备之间的数据传输为例,数据传输需要经过用户面协议层,比如经过SDAP层、PDCP层、RLC层、MAC层、物理层。其中,SDAP层、PDCP层、RLC层、MAC层和物理层也可以统称为接入层。根据数据的传输方向分为发送或接收,上述每层又分为发送部分和接收部分。以下行数据传输为例,PDCP层自上层取得数据后,将数据传送到RLC层与MAC层,再由MAC层生成传输块,然后通过物理层进行无线传输。数据在各个层中进行相对应的封装。例如,某一层从该层的上层收到的数据视为该层的服务数据单元(service data unit,SDU),经过该层封装后成为协议数据单元(protocol data unit,PDU),再传递给下一个层。Taking data transmission between access network equipment and terminal equipment as an example, data transmission needs to pass through the user plane protocol layer, such as the SDAP layer, PDCP layer, RLC layer, MAC layer, and physical layer. Among them, the SDAP layer, PDCP layer, RLC layer, MAC layer, and physical layer can also be collectively referred to as the access layer. According to the transmission direction of data, it is divided into sending or receiving, and each of the above layers is divided into a sending part and a receiving part. Taking downlink data transmission as an example, after the PDCP layer obtains data from the upper layer, it transmits the data to the RLC layer and the MAC layer, and then the MAC layer generates a transmission block, which is then wirelessly transmitted through the physical layer. Data is encapsulated accordingly in each layer. For example, the data received by a layer from the upper layer of the layer is regarded as the service data unit (SDU) of the layer, which becomes a protocol data unit (PDU) after being encapsulated by the layer, and then passed to the next layer.
示例性的,终端设备还可以具有应用层和非接入层。其中,应用层可以用于向终端设备中所安装的应用程序提供服务,比如,终端设备接收到的下行数据可以由物理层依次传输到应用层,进而由应用层提供给应用程序;又比如,应用层可以获取应用程序产生的数据,并将数据依次传输到物理层,发送给其它通信装置。非接入层可以用于转发用户数据,比如将从应用层接收到的上行数据转发给SDAP层或者将从SDAP层接收到的下行数据转发给应用层。Exemplarily, the terminal device may also have an application layer and a non-access layer. The application layer may be used to provide services to applications installed in the terminal device. For example, downlink data received by the terminal device may be sequentially transmitted from the physical layer to the application layer, and then provided to the application by the application layer; for another example, the application layer may obtain data generated by the application, and sequentially transmit the data to the physical layer and send it to other communication devices. The non-access layer may be used to forward user data, such as forwarding uplink data received from the application layer to the SDAP layer or forwarding downlink data received from the SDAP layer to the application layer.
接入网设备的结构Structure of access network equipment
接入网设备可以包括集中式单元(central unit,CU)和分布式单元(distributed unit,DU)。多个DU可以由一个CU集中控制。作为示例,CU和DU之间的接口可以称为F1接口。其中,控制面(control panel,CP)接口可以为F1-C,用户面(user panel,UP)接口可以为F1-U。CU和DU可以根据无线网络的协议层划分:比如,PDCP层及以上协议层的功能设置在CU,PDCP层以下协议层(例如RLC层和MAC层等)的功能设置在DU;又比如,PDCP层以上协议层的功能设置在CU,PDCP层及以下协议层的功能设置在DU。The access network equipment may include a centralized unit (CU) and a distributed unit (DU). Multiple DUs may be centrally controlled by one CU. As an example, the interface between the CU and the DU may be referred to as an F1 interface. Among them, the control plane (CP) interface may be F1-C, and the user plane (UP) interface may be F1-U. The CU and the DU may be divided according to the protocol layers of the wireless network: for example, the functions of the PDCP layer and above are set in the CU, and the functions of the protocol layers below the PDCP layer (such as the RLC layer and the MAC layer, etc.) are set in the DU; for another example, the functions of the protocol layers above the PDCP layer are set in the CU, and the functions of the protocol layers below the PDCP layer are set in the DU.
可以理解的是,上述对CU和DU的处理功能按照协议层的划分仅仅是一种举例,也可以按照其他的方式进行划分,例如可以将CU或者DU划分为具有更多协议层的功能,又例如将CU或DU还可以划分为具有协议层的部分处理功能。在一种设计中,将RLC层的部分功能和RLC层以上的协议层的功能设置在CU,将RLC层的剩余功能和RLC层以下的协议层的功能设置在DU。在另一种设计中,还可以按照业务类型或者其他系统需求对CU或者DU的功能进行划分,例如按时延划分,将处理时间需要满足时延要求的功能设置在DU,不需要满足该时延要求的功能设置在CU。在另一种设计中,CU也可以具有核心网的一个或多个功能。示例性的,CU可以设置在网络侧方便集中管理。在另一种设计中,将DU的RU拉远设置。其中,RU具有射频功能。It is understandable that the above-mentioned division of the processing functions of CU and DU according to the protocol layer is only an example, and it can also be divided in other ways, for example, CU or DU can be divided into functions with more protocol layers, and for example, CU or DU can also be divided into partial processing functions with protocol layers. In one design, some functions of the RLC layer and the functions of the protocol layers above the RLC layer are set in the CU, and the remaining functions of the RLC layer and the functions of the protocol layers below the RLC layer are set in the DU. In another design, the functions of the CU or DU can also be divided according to the service type or other system requirements, for example, divided by latency, and the functions whose processing time needs to meet the latency requirements are set in the DU, and the functions that do not need to meet the latency requirements are set in the CU. In another design, the CU can also have one or more functions of the core network. Exemplarily, the CU can be set on the network side to facilitate centralized management. In another design, the RU of the DU is set remotely. Among them, the RU has a radio frequency function.
可选的,DU和RU可以在物理层(physical layer,PHY)进行划分。例如,DU可以实现PHY层中的高层功能,RU可以实现PHY层中的低层功能。其中,用于发送时,PHY层的功能可以包括添加循环冗余校验(cyclic redundancy check,CRC)码、信道编码、速率匹配、加扰、调制、层映射、预编码、资源映射、物理天线映射、和/或射频发送功能。用于接收时,PHY层的功能可以包括CRC、信道解码、解速率匹配、解扰、解调、解层映射、信道检测、资源解映射、物理天线解映射、和/或射频接收功能。其中,PHY层中的高层功能可以包括PHY层的一部分功能,例如该部分功能更加靠近MAC层,PHY层中的低层功能可以包括PHY层的另一部分功能,例如该部分功能更加靠近射频功能。例如,PHY层中的高层功能可以包括添加CRC码、信道编码、速率匹配、加扰、调制、和层映射,PHY层中的低层功能可以包括预编码、资源映射、物理天线映射、和射频发送功能;或者,PHY层中的高层功能可以包括添加CRC码、信道编码、速率匹配、加扰、调制、层映射和预编码,PHY层中的低层功能可以包括资源映射、物理天线映射、和射频发送功能。Optionally, DU and RU can be divided at the physical layer (physical layer, PHY). For example, DU can implement high-level functions in the PHY layer, and RU can implement low-level functions in the PHY layer. Wherein, when used for transmission, the functions of the PHY layer may include adding cyclic redundancy check (CRC) code, channel coding, rate matching, scrambling, modulation, layer mapping, precoding, resource mapping, physical antenna mapping, and/or RF transmission functions. When used for reception, the functions of the PHY layer may include CRC, channel decoding, rate matching, descrambling, demodulation, layer mapping, channel detection, resource demapping, physical antenna demapping, and/or RF reception functions. Wherein, the high-level functions in the PHY layer may include a part of the functions of the PHY layer, such as the part of the functions closer to the MAC layer, and the low-level functions in the PHY layer may include another part of the functions of the PHY layer, such as the part of the functions closer to the RF functions. For example, the high-level functions in the PHY layer may include adding CRC code, channel coding, rate matching, scrambling, modulation, and layer mapping, and the low-level functions in the PHY layer may include precoding, resource mapping, physical antenna mapping, and RF transmission functions; or, the high-level functions in the PHY layer may include adding CRC code, channel coding, rate matching, scrambling, modulation, layer mapping and precoding, and the low-level functions in the PHY layer may include resource mapping, physical antenna mapping, and RF transmission functions.
示例性的,CU的功能可以由一个实体来实现,或者也可以由不同的实体来实现。例如,可以对CU的功能进行进一步划分,即将控制面和用户面分离并通过不同实体来实现,分别为控制面CU实体(即CU-CP实体)和用户面CU实体(即CU-UP实体)。该CU-CP实体和CU-UP实体可以与DU相耦合,共同完成接入网设备的功能。Exemplarily, the functions of CU can be implemented by one entity, or can be implemented by different entities. For example, the functions of CU can be further divided, that is, the control plane and the user plane are separated and implemented by different entities, namely the control plane CU entity (i.e., CU-CP entity) and the user plane CU entity (i.e., CU-UP entity). The CU-CP entity and the CU-UP entity can be coupled with the DU to jointly complete the functions of the access network device.
上述架构中,CU产生的信令可以通过DU发送给终端设备,或者终端设备产生的信令可以通过DU发送给CU。例如,RRC或PDCP层的信令最终会处理为物理层的信令发送给终端设备,或者,由接收到的物理层的信令转变而来。在这种架构下,该RRC或PDCP层的信令,即可以认为是通过DU发送的,或者,通过DU和RU发送的。In the above architecture, the signaling generated by the CU can be sent to the terminal device through the DU, or the signaling generated by the terminal device can be sent to the CU through the DU. For example, the signaling of the RRC or PDCP layer will eventually be processed into the signaling of the physical layer and sent to the terminal device, or converted from the received signaling of the physical layer. In this architecture, the signaling of the RRC or PDCP layer can be considered to be sent through the DU, or through the DU and the RU.
可选的,上述DU、CU、CU-CP、CU-UP和RU中的任一个可以是软件模块、硬件结构、或者软件模块+硬件结构,不予限制。其中,不同实体的存在形式可以是不同的,不予限制。例如DU、CU、CU-CP、CU-UP是软件模块,RU是硬件结构。这些模块及其执行的方法也在本公开的保护范围内。Optionally, any of the above DU, CU, CU-CP, CU-UP and RU can be a software module, a hardware structure, or a software module + hardware structure, without limitation. Among them, the existence forms of different entities can be different, without limitation. For example, DU, CU, CU-CP, CU-UP are software modules, and RU is a hardware structure. These modules and their execution methods are also within the scope of protection of the present disclosure.
接入网设备可以支持一种或多种类型的前传接口,不同前传接口,分别对应具有不同功能的DU和RU。若DU和RU之间的前传接口为通用公共无线电接口(common public radio interface,CPRI),DU被配置用于实现基带功能中的一项或多项,RU被配置用于实现射频功能中的一项或多项。若DU和RU之间的前传接口为另一种接口,其相对于CPRI,将下行和/或上行的部分基带功能,比如,针对下行,预编码(precoding),数字波束赋形(beamforming,BF),或快速傅立叶反变换(inverse fast Fourier transform,IFFT)/添加循环前缀(cyclic prefix,CP)中的一项或多项,从DU中移至RU中实现,针对上行,数字波束赋形(beamforming,BF),或快速傅立叶变换(fast Fourier transform,FFT)/去除循环前缀(cyclic prefix,CP)中的一项或多项,从DU中移至RU中实现。在一种可能的实现方式中,该接口可以为增强型通用公共无线电接口(enhanced common public radio interface,eCPRI)。在eCPRI架构下,DU和RU之间的切分方式不同,对应不同类型(category,Cat)的eCPRI,比如eCPRI Cat A,B,C,D,E,F。The access network equipment may support one or more types of fronthaul interfaces, and different fronthaul interfaces correspond to DUs and RUs with different functions. If the fronthaul interface between the DU and the RU is a common public radio interface (CPRI), the DU is configured to implement one or more of the baseband functions, and the RU is configured to implement one or more of the radio frequency functions. If the fronthaul interface between the DU and the RU is another interface, relative to the CPRI, part of the baseband functions of the downlink and/or uplink, such as, for the downlink, one or more of precoding, digital beamforming (BF), or inverse fast Fourier transform (IFFT)/adding cyclic prefix (CP), are moved from the DU to the RU for implementation, and for the uplink, one or more of digital beamforming (BF), or fast Fourier transform (FFT)/removing cyclic prefix (CP), are moved from the DU to the RU for implementation. In a possible implementation, the interface may be an enhanced common public radio interface (eCPRI). Under the eCPRI architecture, the division between DU and RU is different, corresponding to different types (category, Cat) of eCPRI, such as eCPRI Cat A, B, C, D, E, and F.
以eCPRI Cat A为例,对于下行传输,以层映射为切分,DU被配置用于实现层映射及之前的一项或多项功能(即编码、速率匹配,加扰,调制,层映射中的一项或多项),而层映射之后的其他功能(例如,资源元素(resource element,RE)映射,数字波束赋形(beamforming,BF),或快速傅立叶反变换(inverse fast Fourier transform,IFFT)/添加循环前缀(cyclic prefix,CP)中的一项或多项)移至RU中实现。对于上行传输,以解RE映射为切分,DU被配置用于实现解映射及之前的一项或多项功能(即解码,解速率匹配,解扰,解调,离散傅里叶逆变换(inverse discrete Fourier transform,IDFT),信道均衡,解RE映射中的一项或多项功能),而解映射之后的其他功能(例如,数字BF或快速傅里叶变换(fast Fourier transform,FFT)/去CP中的一项或多项)移至RU中实现。可以理解的是,关于各种类型的eCPRI所对应的DU和RU的功能描述,可以参考eCPRI协议,在此不予赘述。Taking eCPRI Cat A as an example, for downlink transmission, based on layer mapping, DU is configured to implement layer mapping and one or more functions before it (i.e., one or more of coding, rate matching, scrambling, modulation, and layer mapping), while other functions after layer mapping (for example, one or more of resource element (RE) mapping, digital beamforming (BF), or inverse fast Fourier transform (IFFT)/adding cyclic prefix (CP)) are moved to RU for implementation. For uplink transmission, with RE demapping as the division, DU is configured to implement demapping and one or more functions before it (i.e. decoding, rate matching, descrambling, demodulation, inverse discrete Fourier transform (IDFT), channel equalization, one or more functions of RE demapping), while other functions after demapping (e.g., one or more of digital BF or fast Fourier transform (FFT)/CP removal) are moved to RU for implementation. It is understandable that the functional description of DU and RU corresponding to various types of eCPRI can be referred to the eCPRI protocol, which will not be repeated here.
一种可能的设计中,BBU中用于实现基带功能的处理单元称为基带高层(base band high,BBH)单元,RRU/AAU/RRH中用于实现基带功能的处理单元称为基带低层(base band low,BBL)单元。In one possible design, the processing unit for implementing the baseband function in the BBU is called a baseband high layer (BBH) unit, and the processing unit for implementing the baseband function in the RRU/AAU/RRH is called a baseband low layer (BBL) unit.
在不同系统中,CU(或CU-CP和CU-UP)、DU或RU也可以有不同的名称,但是本领域的技术人员可以理解其含义。例如,在开放无线接入网络(open RAN,ORAN)系统中,CU也可以称为O-CU(开放式CU),DU也可以称为O-DU,CU-CP也可以称为O-CU-CP,CU-UP也可以称为O-CU-UP,RU也可以称为O-RU。本申请中的CU(或CU-CP、CU-UP)、DU和RU中的任一单元,可以是通过软件模块、硬件模块、或者软件模块与硬件模块结合来实现。In different systems, CU (or CU-CP and CU-UP), DU or RU may also have different names, but those skilled in the art can understand their meanings. For example, in an open radio access network (open RAN, ORAN) system, CU may also be called O-CU (open CU), DU may also be called O-DU, CU-CP may also be called O-CU-CP, CU-UP may also be called O-CU-UP, and RU may also be called O-RU. Any unit in the CU (or CU-CP, CU-UP), DU and RU in this application may be implemented by a software module, a hardware module, or a combination of a software module and a hardware module.
本申请实施例中,用于实现网络设备的功能的装置可以是网络设备;也可以是能够支持网络设备实现该功能的装置,例如芯片系统、硬件电路、软件模块、或硬件电路加软件模块。该装置可以被安装在网络设备中或者和网络设备匹配使用。在本申请实施例中仅以用于实现网络设备的功能的装置为网络设备为例进行说明,不对本申请实施例的方案构成限定。In the embodiments of the present application, the device for realizing the function of the network device may be a network device; or it may be a device capable of supporting the network device to realize the function, such as a chip system, a hardware circuit, a software module, or a hardware circuit plus a software module. The device may be installed in the network device or used in combination with the network device. In the embodiments of the present application, only the device for realizing the function of the network device is a network device as an example for explanation, and the scheme of the embodiments of the present application is not limited.
网络设备和/或终端设备可以部署在陆地上,包括室内或室外、手持或车载;也可以部署在水面上;还可以部署在空中的飞机、气球和卫星上。本申请实施例中对网络设备和终端设备所处的场景不做限定。此外,终端设备和网络设备可以是硬件设备,也可以是在专用硬件上运行的软件功能,通用硬件上运行的软件功能,比如,是平台(例如,云平台)上实例化的虚拟化功能,又或者,是包括专用或通用硬件设备和软件功能的实体,本申请对于终端设备和网络设备的具体形态不作限定。The network device and/or terminal device can be deployed on land, including indoors or outdoors, handheld or vehicle-mounted; it can also be deployed on the water surface; it can also be deployed on aircraft, balloons and satellites in the air. The scenarios in which the network device and the terminal device are located are not limited in the embodiments of the present application. In addition, the terminal device and the network device can be hardware devices, or they can be software functions running on dedicated hardware, software functions running on general-purpose hardware, such as virtualization functions instantiated on a platform (e.g., a cloud platform), or entities including dedicated or general-purpose hardware devices and software functions. The present application does not limit the specific forms of the terminal device and the network device.
应理解,图1所示的通信系统中各个设备的数量、类型仅作为示意,本公开并不限于此,实际应用中在通信系统中还可以包括更多的终端设备、更多的接入网设备,还可以包括其它网元,例如可以包括核心网设备,和/或用于实现人工智能功能的网元。其中,用于实现人工智能功能的网元可以为RAN智能控制器(RAN intelligent controller,RIC)。图2a是适用于本申请实施例的一种通信系统的示意图。如图2a所示,通信系统200可以包括至少一个网络设备,例如图2a所示的网络设备210;通信系统200还可以包括至少一个终端设备,例如图2a所示的终端设备220和终端设备230。网络设备210与终端设备(如终端设备220和终端设备230)可通过无线链路通信。该通信系统中的各通信设备之间,例如,网络设备210与终端设备220之间,可通过多天线技术通信。It should be understood that the number and type of each device in the communication system shown in FIG1 are only for illustration, and the present disclosure is not limited thereto. In actual applications, the communication system may also include more terminal devices, more access network devices, and other network elements, such as core network devices, and/or network elements for implementing artificial intelligence functions. Among them, the network element for implementing the artificial intelligence function may be a RAN intelligent controller (RAN intelligent controller, RIC). FIG2a is a schematic diagram of a communication system applicable to an embodiment of the present application. As shown in FIG2a, the communication system 200 may include at least one network device, such as the network device 210 shown in FIG2a; the communication system 200 may also include at least one terminal device, such as the terminal device 220 and the terminal device 230 shown in FIG2a. The network device 210 and the terminal device (such as the terminal device 220 and the terminal device 230) may communicate via a wireless link. The communication devices in the communication system, for example, the network device 210 and the terminal device 220, may communicate via a multi-antenna technology.
图2b是适用于本申请实施例的另一种通信系统的示意图。相较于图2a所示的通信系统200而言,图2b所示的通信系统300还包括AI网元240。AI网元240用于执行AI相关的操作,例如,构建训练数据集或训练AI模型等。FIG2b is a schematic diagram of another communication system applicable to an embodiment of the present application. Compared with the communication system 200 shown in FIG2a, the communication system 300 shown in FIG2b also includes an AI network element 240. The AI network element 240 is used to perform AI-related operations, such as building a training data set or training an AI model.
在一种可能的实现方式中,网络设备210可以将与AI模型的训练相关的数据发送给AI网元240,由AI网元240构建训练数据集,并训练AI模型。例如,与AI模型的训练相关的数据可以包括终端设备上报的数据。AI网元240可以将AI模型相关的操作的结果发送至网络设备210,并通过网络设备210转发至终端设备。例如,AI模型相关的操作的结果可以包括以下至少一项:已完成训练的AI模型、模型的评估结果或测试结果等。示例性地,已完成训练的AI模型的一部分可以部署于网络设备210上,另一部分部署于终端设备上。可替换地,已完成训练的AI模型可以部署于网络设备210上。或者,已完成训练的AI模型可以部署于终端设备上。In one possible implementation, the network device 210 may send data related to the training of the AI model to the AI network element 240, which constructs a training data set and trains the AI model. For example, the data related to the training of the AI model may include data reported by the terminal device. The AI network element 240 may send the results of the operations related to the AI model to the network device 210, and forward them to the terminal device through the network device 210. For example, the results of the operations related to the AI model may include at least one of the following: an AI model that has completed training, an evaluation result or a test result of the model, etc. Exemplarily, a part of the AI model that has completed training may be deployed on the network device 210, and another part may be deployed on the terminal device. Alternatively, the AI model that has completed training may be deployed on the network device 210. Alternatively, the AI model that has completed training may be deployed on the terminal device.
应理解,图2b仅以AI网元240与网络设备210直接相连为例进行说明,在其他场景中,AI网元240也可以与终端设备相连。或者,AI网元240可以同时与网络设备210和终端设备相连。或者,AI网元240还可以通过第三方网元(也称第三方设备或第三方实体)与网络设备210相连。本申请实施例对AI网元与其他网元的连接关系不做限定。It should be understood that FIG. 2b only illustrates the example of the AI network element 240 being directly connected to the network device 210. In other scenarios, the AI network element 240 may also be connected to the terminal device. Alternatively, the AI network element 240 may be connected to the network device 210 and the terminal device at the same time. Alternatively, the AI network element 240 may also be connected to the network device 210 through a third-party network element (also referred to as a third-party device or a third-party entity). The embodiment of the present application does not limit the connection relationship between the AI network element and other network elements.
AI网元240也可以作为一个模块设置于网络设备和/或终端设备中,例如,设置于图2a所示的网络设备210或终端设备中。The AI network element 240 may also be provided as a module in a network device and/or a terminal device, for example, in the network device 210 or the terminal device shown in FIG. 2a .
需要说明的是,图2a和图2b仅为便于理解而示例的简化示意图,例如,通信系统中还可以包括其它设备,如还可以包括无线中继设备和/或无线回传设备等,图2a和图2b中未予以画出。在实际应用中,该通信系统可以包括多个网络设备,也可以包括多个终端设备。本申请实施例对通信系统中包括的网络设备和终端设备的数量不做限定。It should be noted that FIG. 2a and FIG. 2b are simplified schematic diagrams for ease of understanding. For example, the communication system may also include other devices, such as wireless relay devices and/or wireless backhaul devices, which are not shown in FIG. 2a and FIG. 2b. In practical applications, the communication system may include multiple network devices and may also include multiple terminal devices. The embodiment of the present application does not limit the number of network devices and terminal devices included in the communication system.
为了在无线网络中支持AI技术,网络中还可能引入AI节点。In order to support AI technology in wireless networks, AI nodes may also be introduced into the network.
可选地,AI节点可以部署于该通信系统中的如下位置中的一项或多项:接入网络设备、终端设备、或核心网设备等,或者,AI节点也可单独部署,例如,部署于上述任一项设备之外的位置,比如,过顶(over the top,OTT)系统的主机或云端服务器中。AI节点可以与通信系统中的其它设备通信,其它设备例如可以为以下中的一项或多项:网络设备,终端设备,或,核心网的网元等。Optionally, the AI node can be deployed in one or more of the following locations in the communication system: access network equipment, terminal equipment, or core network equipment, etc., or the AI node can also be deployed separately, for example, deployed in a location other than any of the above devices, such as a host or cloud server in an over-the-top (OTT) system. The AI node can communicate with other devices in the communication system, and the other devices can be, for example, one or more of the following: network equipment, terminal equipment, or network elements of the core network, etc.
可以理解,本申请对于AI节点的数量不予限制。例如,当有多个AI节点时,多个AI节点可以基于功能进行划分,如不同的AI节点负责不同的功能。It is understood that the present application does not limit the number of AI nodes. For example, when there are multiple AI nodes, the multiple AI nodes can be divided based on functions, such as different AI nodes are responsible for different functions.
还可以理解,AI节点可以是各自独立的设备,也可以集成于同一设备中实现不同的功能,或者可以是硬件设备中的网络元件,也可以是在专用硬件上运行的软件功能,或者是平台(例如,云平台)上实例化的虚拟化功能,本申请对于上述AI节点的具体形态不作限定。It can also be understood that AI nodes can be independent devices, or they can be integrated into the same device to implement different functions, or they can be network elements in hardware devices, or they can be software functions running on dedicated hardware, or they can be virtualized functions instantiated on a platform (for example, a cloud platform). This application does not limit the specific form of the above-mentioned AI nodes.
AI节点可以为AI网元或AI模块。An AI node can be an AI network element or an AI module.
图3a为通信系统中的一种可能的应用框架示意图。如图3a所示,通信系统中网元之间通过接口(例如NG,Xn),或空口相连。这些网元节点,例如核心网设备、接入网(radio access network,RAN)节点、终端或OAM中的一个或多个设备中设置有一个或多个AI模块(为清楚起见,图3a中仅示出1个)。所述接入网节点可以作为单独的RAN节点,也可以包括多个RAN节点,例如,包括CU和DU。所述CU和、或DU也可以设置一个或多个AI模块。可选的,CU还可以被拆分为CU-CP和CU-UP。CU-CP和/或CU-UP中设置有一个或多个AI模型。Figure 3a is a schematic diagram of a possible application framework in a communication system. As shown in Figure 3a, network elements in the communication system are connected through interfaces (e.g., NG, Xn) or air interfaces. One or more AI modules are provided in one or more devices of these network element nodes, such as core network equipment, access network (radio access network, RAN) nodes, terminals or OAM (for clarity, only one is shown in Figure 3a). The access network node can be a separate RAN node, or it can include multiple RAN nodes, for example, including CU and DU. The CU and/or DU can also be provided with one or more AI modules. Optionally, the CU can also be split into CU-CP and CU-UP. One or more AI models are provided in the CU-CP and/or CU-UP.
所述AI模块用以实现相应的AI功能。不同网元中部署的AI模块可以相同或不同。AI模块的模型根据不同的参数配置,AI模块可以实现不同的功能。AI模块的模型可以是基于以下一项或多项参数配置的:结构参数(例如神经网络层数、神经网络宽度、层间的连接关系、神经元的权值、神经元的激活函数、或激活函数中的偏置中的至少一项)、输入参数(例如输入参数的类型和/或输入参数的维度)、或输出参数(例如输出参数的类型和/或输出参数的维度)。其中,激活函数中的偏置还可以称为神经网络的偏置。The AI module is used to implement the corresponding AI function. The AI modules deployed in different network elements may be the same or different. The model of the AI module can implement different functions according to different parameter configurations. The model of the AI module can be configured based on one or more of the following parameters: structural parameters (such as the number of neural network layers, the width of the neural network, the connection relationship between layers, the weight of the neuron, the activation function of the neuron, or at least one of the biases in the activation function), input parameters (such as the type of input parameters and/or the dimension of input parameters), or output parameters (such as the type of output parameters and/or the dimension of output parameters). Among them, the bias in the activation function can also be called the bias of the neural network.
一个AI模块可以具有一个或多个模型。一个模型可以推理得到一个输出,该输出包括一个参数或者多个参数。不同模型的学习过程、训练过程、或推理过程可以部署在不同的节点或设备中,或者可以部署在相同的节点或设备中。An AI module can have one or more models. A model can be inferred to obtain an output, which includes one parameter or multiple parameters. The learning process, training process, or inference process of different models can be deployed in different nodes or devices, or can be deployed in the same node or device.
图3b为通信系统中的另一种可能的应用框架示意图。如图3b所示,通信系统中包括RAN智能控制器(RAN intelligent controller,RIC)。例如,所述RIC可以是图3a中的AI模块,用于实现AI相关的功能。所述RIC包括近实时RIC(near-real time RIC,near-RT RIC),和非实时RIC(non-real time RIC,Non-RT RIC)。其中,非实时RIC主要处理非实时的信息,比如,对时延不敏感的数据,该数据的时延可以为秒级。实时RIC主要处理近实时的信息,比如,对时延相对敏感的数据,该数据的时延为数十毫秒级。FIG3b is a schematic diagram of another possible application framework in a communication system. As shown in FIG3b, the communication system includes a RAN intelligent controller (RIC). For example, the RIC may be the AI module in FIG3a, which is used to implement AI-related functions. The RIC includes a near-real-time RIC (near-real-time RIC, near-RT RIC) and a non-real-time RIC (non-real-time RIC, Non-RT RIC). Among them, the non-real-time RIC mainly processes non-real-time information, such as data that is not sensitive to delay, and the delay of the data may be in the order of seconds. The real-time RIC mainly processes near-real-time information, such as data that is relatively sensitive to delay, and the delay of the data is in the order of tens of milliseconds.
所述近实时RIC用于进行模型训练和推理。例如,用于训练AI模型,利用该AI模型进行推理。近实时RIC可以从RAN节点(例如CU、CU-CP、CU-UP、DU和/或RU)和/或终端获得网络侧和/或终端侧的信息。该信息可以作为训练数据或者推理数据。可选的,近实时RIC可以将推理结果递交给RAN节点和/或终端。可选的,CU和DU之间,和/或DU和RU之间可以交互推理结果。例如近实时RIC将推理结果递交给DU,DU将其发给RU。The near real-time RIC is used for model training and reasoning. For example, it is used to train an AI model and use the AI model for reasoning. The near real-time RIC can obtain information on the network side and/or the terminal side from a RAN node (e.g., CU, CU-CP, CU-UP, DU, and/or RU) and/or a terminal. This information can be used as training data or reasoning data. Optionally, the near real-time RIC can submit the reasoning results to the RAN node and/or the terminal. Optionally, the reasoning results can be exchanged between the CU and the DU, and/or between the DU and the RU. For example, the near real-time RIC submits the reasoning results to the DU, and the DU sends it to the RU.
所述非实时RIC也用于进行模型训练和推理。例如,用于训练AI模型,利用该模型进行推理。非实时RIC可以从RAN节点(例如CU、CU-CP、CU-UP、DU和/或RU)和/或终端获得网络侧和/或终端侧的信息。该信息可以作为训练数据或者推理数据,推理结果可以被递交给RAN节点和/或终端。可选的,CU和DU之间,和/或DU和RU之间可以交互推理结果,例如非实时RIC将推理结果递交给DU,由DU将其发给RU。The non-real-time RIC is also used for model training and reasoning. For example, it is used to train an AI model and use the model for reasoning. The non-real-time RIC can obtain information on the network side and/or the terminal side from a RAN node (such as a CU, CU-CP, CU-UP, DU and/or RU) and/or a terminal. The information can be used as training data or reasoning data, and the reasoning results can be submitted to the RAN node and/or the terminal. Optionally, the reasoning results can be exchanged between the CU and the DU, and/or between the DU and the RU. For example, the non-real-time RIC submits the reasoning results to the DU, and the DU sends it to the RU.
所述近实时RIC,非实时RIC也可以分别作为一个网元单独设置。可选的,所述近实时RIC,非实时RIC也可以作为其他设备的一部分,例如,近实时RIC设置在RAN节点中(例如,CU,DU中),而非实时RIC设置在OAM中、服务器(如云服务器)中、核心网设备、或者其他网络设备中。The near real-time RIC and the non-real-time RIC may also be separately set as a network element. Optionally, the near real-time RIC and the non-real-time RIC may also be part of other devices, for example, the near real-time RIC is set in a RAN node (for example, in a CU or DU), and the non-real-time RIC is set in an OAM, a server (such as a cloud server), a core network device, or other network devices.
可以理解的是,终端设备、接入网设备、核心网设备、或用于实现人工智能功能的网元中的一项或多项所实现的全部或部分功能均可以进行虚拟化,也即,通过专有处理器或通用处理器中的一项或多项和相应的软件模块来实现。其中,终端设备和接入网设备因涉及空口传输的接口,该接口的收发功能可由硬件来实现。核心网设备,如操作维护管理(operation administration and maintenance,OAM)网元,均可虚拟化。可选的,虚拟化后的终端设备、接入网设备、核心网设备、或用于实现人工智能功能的网元中的一项或多项功能可以由云端设备来实现,比如过顶(over the top,OTT)系统中的云端设备来实现。It is understandable that all or part of the functions implemented by one or more of the terminal equipment, access network equipment, core network equipment, or network elements used to implement artificial intelligence functions can be virtualized, that is, implemented by one or more of the proprietary processors or general-purpose processors and corresponding software modules. Among them, the terminal equipment and access network equipment involve interfaces for air interface transmission, and the transceiver functions of the interfaces can be implemented by hardware. Core network equipment, such as operation administration and maintenance (OAM) network elements, can be virtualized. Optionally, one or more functions of the virtualized terminal equipment, access network equipment, core network equipment, or network elements used to implement artificial intelligence functions can be implemented by cloud devices, such as cloud devices in over the top (OTT) systems.
本公开提供的方法可以用于接入网设备和终端设备之间的通信,也可以用于其他通信设备之间的通信,例如无线回传链路中宏基站和微基站之间的通信,又如边链路(sidelink,SL)中两个终端设备之间的通信等,不予限制。The method provided by the present invention can be used for communication between access network equipment and terminal equipment, and can also be used for communication between other communication equipment, such as communication between macro base stations and micro base stations in a wireless backhaul link, and communication between two terminal devices in a side link (SL), without limitation.
为了便于理解本申请实施例的方案,下面对本申请实施例可能涉及的术语进行解释。In order to facilitate understanding of the solutions of the embodiments of the present application, the terms that may be involved in the embodiments of the present application are explained below.
(1)AI模型:(1) AI Model:
AI模型为能实现AI功能的算法或者计算机程序,AI模型表征了模型的输入和输出之间的映射关系。AI模型的类型可以是神经网络、线性回归模型、决策树模型、支持向量机(support vector machine,SVM)、贝叶斯网络、Q学习模型或者其他机器学习(machine learning,ML)模型。An AI model is an algorithm or computer program that can implement AI functions. The AI model represents the mapping relationship between the input and output of the model. The type of AI model can be a neural network, linear regression model, decision tree model, support vector machine (SVM), Bayesian network, Q learning model or other machine learning (ML) model.
(2)双端模型:(2) Two-terminal model:
双端模型也可以称为双边模型、协作模型、对偶模型或双端(two-side)模型等。双端模型指的是由多个子模型组合在一起构成的一个模型。构成该模型的多个子模型需要相互匹配。该多个子模型可以部署于不同的节点中。The two-end model can also be called a bilateral model, a collaborative model, a dual model, or a two-side model. The two-end model refers to a model composed of multiple sub-models. The multiple sub-models that constitute the model need to match each other. The multiple sub-models can be deployed in different nodes.
一种可能的设计中,本申请实施例涉及用于压缩CSI的编码器和用于恢复压缩CSI的解码器。编码器与解码器匹配使用,可以理解编码器和解码器为配套的AI模型。一个编码器可以包括一个或多个AI模型,该编码器匹配的解码器中也包括一个或多个AI模型,匹配使用的编码器和解码器中包括的AI模型数量相同且一一对应。In one possible design, an embodiment of the present application relates to an encoder for compressing CSI and a decoder for recovering compressed CSI. The encoder and the decoder are used in matching manner, and it can be understood that the encoder and the decoder are matching AI models. An encoder may include one or more AI models, and the decoder matched by the encoder also includes one or more AI models. The number of AI models included in the matching encoder and decoder is the same and corresponds one to one.
一种可能的设计中,一套匹配使用的编码器(encoder)和解码器(decoder)可以具体为同一个自编码器(auto-encoders,AE)中的两个部分,例如,如图4所示。编码器和解码器分别部署于不同的节点的AE模型是一种典型的双边模型。AE模型的编码器和解码器通常是共同训练的编码器与解码器匹配使用。编码器对输入V进行处理,以得到处理后的结果z,解码器能够将编码器的输出z再解码为期望的输出V’。In one possible design, a set of matched encoders and decoders can be specifically two parts of the same auto-encoder (AE), for example, as shown in Figure 4. The AE model in which the encoder and decoder are deployed on different nodes is a typical bilateral model. The encoder and decoder of the AE model are usually a jointly trained encoder and decoder used in matching. The encoder processes the input V to obtain the processed result z, and the decoder can decode the encoder output z into the desired output V'.
自编码器是一种无监督学习的神经网络,它的特点是将输入数据作为标签,因此自编码器也可以理解为自监督学习的神经网络。自编码器可以用于数据的压缩和恢复。示例性地,自编码器中的编码器可以对数据A进行压缩(编码)处理,得到数据B;自编码器中的解码器可以对数据B进行解压缩(解码)处理,恢复出数据A。或者可以理解为,解码器是编码器的逆操作。An autoencoder is a neural network for unsupervised learning. Its characteristic is that it uses input data as labels, so an autoencoder can also be understood as a neural network for self-supervised learning. An autoencoder can be used for data compression and recovery. For example, the encoder in an autoencoder can compress (encode) data A to obtain data B; the decoder in an autoencoder can decompress (decode) data B to recover data A. Alternatively, it can be understood that the decoder is the inverse operation of the encoder.
示例性地,本申请实施例中的AI模型可以包括编码器和解码器。编码器与解码器匹配使用,可以理解编码器和解码器为配套的AI模型。编码器和解码器可以分别部署于终端设备和网络设备。Exemplarily, the AI model in the embodiment of the present application may include an encoder and a decoder. The encoder and the decoder are used in combination, and it can be understood that the encoder and the decoder are matching AI models. The encoder and the decoder can be deployed on the terminal device and the network device respectively.
可替换地,本申请实施例中的AI模型可以为单端模型,该AI模型可以部署于终端设备或网络设备。Alternatively, the AI model in the embodiment of the present application may be a single-end model, which may be deployed on a terminal device or a network device.
(3)神经网络(neural network,NN):(3) Neural network (NN):
神经网络是AI或机器学习的一种具体实现形式。根据通用近似定理,神经网络理论上可以逼近任意连续函数,从而使得神经网络具备学习任意映射的能力。Neural networks are a specific implementation of AI or machine learning. According to the universal approximation theorem, neural networks can theoretically approximate any continuous function, thus enabling neural networks to learn any mapping.
神经网络可以是由神经单元组成的,神经单元可以是指以xs和截距1为输入的运算单元。神经网络是将许多个上述单一的神经单元联结在一起形成的网络,即一个神经单元的输出可以是另一个神经单元的输入。每个神经单元的输入可以与前一层的局部接受域相连,来提取局部接受域的特征,局部接受域可以是由若干个神经单元组成的区域。A neural network can be composed of neural units, and a neural unit can refer to an operation unit with xs and intercept 1 as input. A neural network is a network formed by connecting many of the above single neural units together, that is, the output of one neural unit can be the input of another neural unit. The input of each neural unit can be connected to the local receptive field of the previous layer to extract the features of the local receptive field, and the local receptive field can be an area composed of several neural units.
以AI模型的类型为神经网络为例,本公开涉及的AI模型可以为深度神经网络(deep neural network,DNN)。根据网络的构建方式,DNN可以包括前馈神经网络(feedforward neural network,FNN)、卷积神经网络(convolutional neural networks,CNN)和递归神经网络(recurrent neural network,RNN)等。Taking the AI model type as a neural network as an example, the AI model involved in the present disclosure may be a deep neural network (DNN). Depending on the network construction method, DNN may include a feedforward neural network (FNN), a convolutional neural network (CNN), and a recurrent neural network (RNN).
(4)训练数据集和推理数据:(4) Training data set and inference data:
在机器学习领域,真值(ground truth)通常指的是被认为是准确的数据或真实的数据。In the field of machine learning, ground truth usually refers to data that is believed to be accurate or real.
训练数据集用于AI模型的训练,训练数据集可以包括AI模型的输入,或者包括AI模型的输入和目标输出。其中,训练数据集包括一个或多个训练数据,训练数据可以包括输入至AI模型的训练样本,也可以包括AI模型的目标输出。其中,目标输出也可以被称为标签、样本标签或标签样本。标签即为真值。The training data set is used for training the AI model. The training data set may include the input of the AI model, or the input and target output of the AI model. The training data set includes one or more training data. The training data may include training samples input to the AI model, or may include the target output of the AI model. The target output may also be referred to as a label, sample label, or label sample. The label is the true value.
在通信领域,训练数据集可以包括通过仿真平台收集的仿真数据,也可以包括实验场景收集的实验数据,或者,也可以包括在实际的通信网络中收集的实测数据。由于数据产生的地理环境和信道条件存在差异,例如,室内、室外、移动速度、频段或天线配置等存在差异,在获取数据时,可以对收集到数据进行分类。例如,将信道传播环境以及天线配置相同的数据归为一类。In the field of communications, training data sets can include simulation data collected through simulation platforms, experimental data collected in experimental scenarios, or measured data collected in actual communication networks. Due to differences in the geographical environment and channel conditions in which the data is generated, such as indoor and outdoor conditions, mobile speeds, frequency bands, or antenna configurations, the collected data can be classified when acquiring the data. For example, data with the same channel propagation environment and antenna configuration can be grouped together.
模型训练本质上就是从训练数据中学习它的某些特征,在训练AI模型(如神经网络模型)的过程中,因为希望AI模型的输出尽可能的接近真正想要预测的值,所以可以通过比较当前网络的预测值和真正想要的目标值,再根据两者之间的差异情况来更新每一层AI模型的权重向量(当然,在第一次更新之前通常会有初始化的过程,即为AI模型中的各层预先配置参数),比如,如果网络的预测值高了,就调整权重向量让它预测低一些,不断的调整,直到AI模型能够预测出真正想要的目标值或与真正想要的目标值非常接近的值。因此,就需要预先定义“如何比较预测值和目标值之间的差异”,这便是损失函数(loss function)或目标函数(objective function),它们是用于衡量预测值和目标值的差异的重要方程。其中,以损失函数举例,损失函数的输出值(loss)越高表示差异越大,那么AI模型的训练就变成了尽可能缩小这个loss的过程,使得损失函数的取值小于门限,或者使得损失函数的取值满足目标需求的过程。例如,AI模型为神经网络,调整神经网络的模型参数包括调整如下参数中的至少一种:神经网络的层数、宽度、神经元的权值、或神经元的激活函数中的参数。Model training is essentially learning some of its features from the training data. In the process of training AI models (such as neural network models), because we hope that the output of the AI model is as close as possible to the value we really want to predict, we can compare the current network's predicted value with the target value we really want, and then update the weight vector of each layer of the AI model according to the difference between the two (of course, there is usually an initialization process before the first update, that is, pre-configuring parameters for each layer in the AI model). For example, if the network's predicted value is high, adjust the weight vector to make it predict lower, and keep adjusting until the AI model can predict the target value we really want or a value very close to the target value we really want. Therefore, it is necessary to predefine "how to compare the difference between the predicted value and the target value", which is the loss function or objective function, which are important equations used to measure the difference between the predicted value and the target value. Among them, taking the loss function as an example, the higher the output value (loss) of the loss function, the greater the difference, so the training of the AI model becomes a process of minimizing this loss as much as possible, so that the value of the loss function is less than the threshold, or the value of the loss function meets the target requirements. For example, the AI model is a neural network, and adjusting the model parameters of the neural network includes adjusting at least one of the following parameters: the number of layers, width, weights of neurons, or parameters in the activation function of the neurons of the neural network.
推理数据可以作为已完成训练的AI模型的输入,用于AI模型的推理。在模型推理过程中,将推理数据输入AI模型,可以得到对应的输出即为推理结果。Inference data can be used as input to the trained AI model for inference of the AI model. During the model inference process, the inference data is input into the AI model, and the corresponding output is the inference result.
(5)AI模型的设计:(5) Design of AI model:
AI模型的设计主要包括数据收集环节(例如收集训练数据和/或推理数据)、模型训练环节以及模型推理环节。进一步地还可以包括推理结果应用环节。The design of AI models mainly includes data collection (such as collecting training data and/or inference data), model training, and model inference. It can also include the application of inference results.
图5示出了一种AI应用框架。FIG5 shows an AI application framework.
在前述数据收集环节中,数据源(data source)用于提供训练数据集和推理数据。在模型训练环节中,通过对数据源提供的训练数据(training data)进行分析或训练,得到AI模型。其中,AI模型表征了模型的输入和输出之间的映射关系。通过模型训练节点学习得到AI模型,相当于利用训练数据学习得到模型的输入和输出之间的映射关系。在模型推理环节中,使用经由模型训练环节训练后的AI模型,基于数据源提供的推理数据进行推理,得到推理结果。该环节还可以理解为:将推理数据输入到AI模型,通过AI模型得到输出,该输出即为推理结果。该推理结果可以指示:由执行对象使用(执行)的配置参数、和/或由执行对象执行的操作。在推理结果应用环节中进行推理结果的发布,例如推理结果可以由执行(actor)实体统一规划,例如执行实体可以发送推理结果给一个或多个执行对象(例如,网络设备或终端设备等)去执行。又如执行实体还可以反馈模型的性能给数据源,便于后续实施模型的更新训练。In the aforementioned data collection link, the data source is used to provide training data sets and inference data. In the model training link, the AI model is obtained by analyzing or training the training data provided by the data source. Among them, the AI model represents the mapping relationship between the input and output of the model. Learning the AI model through the model training node is equivalent to using the training data to learn the mapping relationship between the input and output of the model. In the model inference link, the AI model trained in the model training link is used to perform inference based on the inference data provided by the data source to obtain the inference result. This link can also be understood as: inputting the inference data into the AI model, and obtaining the output through the AI model, which is the inference result. The inference result can indicate: the configuration parameters used (executed) by the execution object, and/or the operation performed by the execution object. The inference result is published in the inference result application link. For example, the inference result can be uniformly planned by the execution (actor) entity, for example, the execution entity can send the inference result to one or more execution objects (for example, network devices or terminal devices, etc.) for execution. For example, the execution entity can also feedback the performance of the model to the data source to facilitate the subsequent implementation of the model update training.
可以理解的是,在通信系统中可以包括具备人工智能功能的网元。上述AI模型设计相关的环节可以由一个或多个具备人工智能功能的网元执行。一种可能的设计中,可以在通信系统中已有网元内配置AI功能(如AI模块或者AI实体)来实现AI相关的操作,例如AI模型的训练和/或推理。例如该已有网元可以是网络设备或终端设备等。或者另一种可能的设计中,也可以在通信系统中引入独立的网元来执行AI相关的操作,如训练AI模型。该独立的网元可以称为AI网元或者AI节点或者AI实体等,本申请实施例对此名称不进行限制。示例性地,该AI网元可以和通信系统中的网络设备之间直接连接,也可以通过第三方网元和网络设备实现间接连接。其中,第三方网元可以是认证管理功能(authentication management function,AMF)网元、用户面功能(user plane function,UPF)网元等核心网网元、操作维护管理(operation administration and maintenance,OAM)、服务器(如云服务器),过顶(over the top,OTT)设备或者其他网元,不予限制。示例性地,该独立的AI网元或AI实体或AI节点可以部署于网络设备侧,终端设备侧,或,核心网侧中的一项或多项。可选的,其可以部署于服务器,如云端服务器,或OTT设备,或其他设备上。示例性地,如图2b所示的通信系统中引入了AI网元240。可以理解的是,前述AI模块,AI实体,AI网元,或AI节点都可以用于执行AI功能中的一项或多项,其中AI功能可以包括:AI模型的处理,如AI模型的训练和/或更新,AI模型的监控,AI模型的管理,如AI模型的注册和/或去注册,或AI模型的应用推理。It is understandable that a network element with artificial intelligence function may be included in the communication system. The above-mentioned links related to the AI model design can be performed by one or more network elements with artificial intelligence function. In one possible design, an AI function (such as an AI module or an AI entity) can be configured in an existing network element in the communication system to implement AI-related operations, such as training and/or reasoning of an AI model. For example, the existing network element may be a network device or a terminal device. Or in another possible design, an independent network element may be introduced into the communication system to perform AI-related operations, such as training an AI model. The independent network element may be referred to as an AI network element, an AI node, an AI entity, etc., and the embodiments of the present application do not limit this name. Exemplarily, the AI network element may be directly connected to the network device in the communication system, or it may be indirectly connected through a third-party network element and a network device. Among them, the third-party network element can be a core network element such as an authentication management function (AMF) network element, a user plane function (UPF) network element, an operation administration and maintenance (OAM), a server (such as a cloud server), an over the top (OTT) device or other network element, without limitation. Exemplarily, the independent AI network element or AI entity or AI node can be deployed on one or more of the network device side, the terminal device side, or the core network side. Optionally, it can be deployed on a server, such as a cloud server, or an OTT device, or other device. Exemplarily, an AI network element 240 is introduced in the communication system shown in Figure 2b. It will be understood that the aforementioned AI modules, AI entities, AI network elements, or AI nodes can be used to perform one or more of the AI functions, where the AI functions may include: processing of AI models, such as training and/or updating of AI models, monitoring of AI models, management of AI models, such as registration and/or deregistration of AI models, or application reasoning of AI models.
不同模型的训练过程可以部署在不同的设备或节点中,也可以部署在相同的设备或节点中。不同模型的推理过程可以部署在不同的设备或节点中,也可以部署在相同的设备或节点中。以终端设备完成模型训练环节为例,终端设备可以训练配套的编码器和解码器之后,将其中解码器的模型参数发送给网络设备。以网络设备完成模型训练环节为例,网络设备在训练配套的编码器和解码器之后,可以将其中编码器的模型参数指示给终端设备。以独立的AI网元完成模型训练环节为例,AI网元可以训练配套的编码器和解码器之后,将其中编码器的模型参数发送给终端设备,将解码器的模型参数发送给网络设备。进而在终端设备中进行编码器对应的模型推理环节,以及在网络设备中进行解码器对应的模型推理环节。The training process of different models can be deployed in different devices or nodes, or in the same device or node. The reasoning process of different models can be deployed in different devices or nodes, or in the same device or node. Taking the completion of the model training link of the terminal device as an example, the terminal device can train the matching encoder and decoder, and then send the model parameters of the decoder to the network device. Taking the completion of the model training link of the network device as an example, after the network device trains the matching encoder and decoder, it can indicate the model parameters of the encoder to the terminal device. Taking the completion of the model training link of an independent AI network element as an example, the AI network element can train the matching encoder and decoder, and then send the model parameters of the encoder to the terminal device, and send the model parameters of the decoder to the network device. Then, the model reasoning link corresponding to the encoder is performed in the terminal device, and the model reasoning link corresponding to the decoder is performed in the network device.
其中,模型参数可以包括如下的一种或多种模型的结构参数(例如模型的层数、和/或权值等)、模型的输入参数(如输入维度、输入端口数)、或模型的输出参数(如输出维度、输出端口数)。可以理解,输入维度可以指的是一个输入数据的大小,例如输入数据为一个序列时,该序列对应的输入维度可以指示该序列的长度。输入端口数可以指的是输入数据的数量。类似地,输出维度可以指的是一个输出数据的大小,例如输出数据为一个序列时,该序列对应的输出维度可以指示该序列的长度。输出端口数可以指的是输出数据的数量。Among them, the model parameters may include one or more of the following structural parameters of the model (such as the number of layers of the model, and/or weights, etc.), the input parameters of the model (such as input dimension, number of input ports), or the output parameters of the model (such as output dimension, number of output ports). It can be understood that the input dimension may refer to the size of an input data. For example, when the input data is a sequence, the input dimension corresponding to the sequence may indicate the length of the sequence. The number of input ports may refer to the number of input data. Similarly, the output dimension may refer to the size of an output data. For example, when the output data is a sequence, the output dimension corresponding to the sequence may indicate the length of the sequence. The number of output ports may refer to the number of output data.
(6)信道信息:(6) Channel information:
在通信系统(例如,LTE通信系统或NR通信系统等)中,网络设备需要基于信道信息决定调度终端设备的下行数据信道的资源、MCS以及预编码等配置。可以理解,信道信息也可以被称为信道状态信息(channel state information,CSI)或信道环境信息,是一种能够反映信道特征、信道质量的信息。在本申请中,CSI的含义相较于传统方案中的CSI的含义更广,并不局限于信道质量指示(channel quality indication,CQI)、预编码矩阵指示(precoding matrix indicator,PMI)、秩指示(rank indicator,RI)、或,CSI-RS资源指示(CSI-RS resource indicator,CRI),其还可以为信道响应信息(如信道响应矩阵),信道响应对应的权值信息,参考信号接收功率(reference signal receiving power,RSRP)或信号与干扰加噪声比(signal to interference plus noise ratio,SINR)等中的一种或多项。In a communication system (e.g., an LTE communication system or an NR communication system, etc.), a network device needs to determine the resource, MCS, and precoding configuration of a downlink data channel of a scheduling terminal device based on channel information. It can be understood that channel information can also be referred to as channel state information (CSI) or channel environment information, which is information that can reflect channel characteristics and channel quality. In this application, the meaning of CSI is broader than that of CSI in traditional schemes, and is not limited to channel quality indication (CQI), precoding matrix indication (PMI), rank indication (RI), or CSI-RS resource indication (CRI), which can also be channel response information (such as channel response matrix), weight information corresponding to channel response, reference signal receiving power (RSRP) or signal to interference plus noise ratio (SINR), etc. One or more of the above.
CSI测量指的是接收端根据发送端发送的参考信号求解信道信息,即利用信道估计方法估计出信道信息。示例性地,参考信号可以包括信道信息参考信号(channel state information reference signal,CSI-RS)、同步信号/广播信道块(synchronizing signal/physical broadcast channel block,SSB)、信道探测参考信号(sounding reference signal,SRS)或解调参考信号(demodulation reference signal,DMRS)等中的一项或多项。CSI-RS、SSB以及DMRS等可以用于测量下行CSI。SRS和DMRS等可以用于测量上行CSI。CSI measurement refers to the receiver solving the channel information based on the reference signal sent by the transmitter, that is, estimating the channel information using the channel estimation method. Exemplarily, the reference signal may include one or more of the channel state information reference signal (CSI-RS), synchronization signal/physical broadcast channel block (SSB), sounding reference signal (SRS) or demodulation reference signal (DMRS). CSI-RS, SSB and DMRS can be used to measure downlink CSI. SRS and DMRS can be used to measure uplink CSI.
以FDD通信场景为例,在FDD通信场景中,由于上下行信道不具备互易性或者说无法保证上下行信道的互易性,网络设备通常会向终端设备下行参考信号,终端设备根据接收到的下行参考信号进行信道测量、干扰测量估计下行CSI。终端设备根据协议预定义的方式或网络设备配置的方式生成CSI报告,并反馈给网络设备,以使其获取下行CSI。Taking the FDD communication scenario as an example, in the FDD communication scenario, since the uplink and downlink channels are not reciprocal or cannot be guaranteed, the network equipment usually sends a downlink reference signal to the terminal device, and the terminal device performs channel measurement and interference measurement based on the received downlink reference signal to estimate the downlink CSI. The terminal device generates a CSI report according to the protocol predefined method or the network device configuration method, and feeds it back to the network device so that it can obtain the downlink CSI.
示例性地,CSI可以包括以下至少一项:信道质量指示(channel quality indication,CQI)、预编码矩阵指示(precoding matrix indicator,PMI)、秩指示(rank indicator,RI)、CSI-RS资源指示(CSI-RS resource indicator,CRI)、层指示(layer indicator,LI),参考信号接收功率(reference signal receiving power,RSRP)或信号与干扰加噪声比(signal to interference plus noise ratio,SINR)等。信号与干扰加噪声比也可以称为信干噪比。Exemplarily, CSI may include at least one of the following: channel quality indication (CQI), precoding matrix indicator (PMI), rank indicator (RI), CSI-RS resource indicator (CRI), layer indicator (LI), reference signal receiving power (RSRP) or signal to interference plus noise ratio (SINR), etc. The signal to interference plus noise ratio may also be called signal to interference plus noise ratio.
其中,RI用于指示终端设备建议的下行传输的层数,CQI用于指示终端设备判断的当前信道条件所能支持的调制编码方式,PMI用于指示终端设备建议的预编码。PMI所指示的预编码的层数与RI对应。Among them, RI is used to indicate the number of downlink transmission layers recommended by the terminal device, CQI is used to indicate the modulation and coding mode supported by the current channel conditions determined by the terminal device, and PMI is used to indicate the precoding recommended by the terminal device. The number of precoding layers indicated by PMI corresponds to RI.
应理解,上述CSI报告所指示的RI、CQI和PMI等仅为终端设备的建议值,网络设备可以按照该CSI报告所指示的信息中的部分或全部进行下行传输。或者,网络设备也可以不按照该CSI报告所指示的信息进行下行传输。It should be understood that the RI, CQI and PMI indicated in the above CSI report are only recommended values for the terminal device, and the network device may perform downlink transmission according to part or all of the information indicated in the CSI report. Alternatively, the network device may not perform downlink transmission according to the information indicated in the CSI report.
将AI技术引入无线通信网络中,产生了一种基于AI模型的CSI反馈方式。终端设备利用AI模型对CSI进行压缩反馈,网络设备利用AI模型对压缩的CSI进行恢复。在基于AI的CSI反馈中传输的是一个序列(如比特序列),开销相较于传统CSI反馈CSI的开销低。The introduction of AI technology into wireless communication networks has resulted in a CSI feedback method based on the AI model. The terminal device uses the AI model to compress and feedback the CSI, and the network device uses the AI model to restore the compressed CSI. In the AI-based CSI feedback, a sequence (such as a bit sequence) is transmitted, and the overhead is lower than that of traditional CSI feedback.
以图4为例,图4中的编码器可以为CSI生成器,解码器可以为CSI重构器。编码器可以部署于终端设备中,解码器可以部署于网络设备中。终端设备可以将CSI原始信息V通过编码器生成CSI反馈信息z。终端设备上报CSI报告,该CSI报告可以包括CSI反馈信息z。网络设备可以通过解码器重构CSI信息,即得到CSI恢复信息V’。Taking FIG. 4 as an example, the encoder in FIG. 4 may be a CSI generator, and the decoder may be a CSI reconstructor. The encoder may be deployed in a terminal device, and the decoder may be deployed in a network device. The terminal device may generate CSI feedback information z from the CSI original information V through the encoder. The terminal device reports a CSI report, which may include the CSI feedback information z. The network device may reconstruct the CSI information through the decoder, that is, obtain CSI recovery information V'.
CSI原始信息V可以是终端设备通过CSI测量得到的。例如,该CSI原始信息V可以包括下行信道的信道响应或下行信道的特征向量矩阵(由特征向量构成的矩阵)。编码器对下行信道的特征向量矩阵进行处理,以得到CSI反馈信息z。换言之,将相关方案中根据码本对特征矩阵进行压缩和/或量化操作替换为由编码器对特征矩阵进行处理的操作,以得到CSI反馈信息z。终端设备上报该CSI反馈信息z。网络设备通过解码器对CSI反馈信息z进行处理以得到CSI恢复信息V’。The CSI original information V may be obtained by the terminal device through CSI measurement. For example, the CSI original information V may include the channel response of the downlink channel or the eigenvector matrix (a matrix composed of eigenvectors) of the downlink channel. The encoder processes the eigenvector matrix of the downlink channel to obtain CSI feedback information z. In other words, the compression and/or quantization operation of the characteristic matrix according to the codebook in the related scheme is replaced by the operation of processing the characteristic matrix by the encoder to obtain CSI feedback information z. The terminal device reports the CSI feedback information z. The network device processes the CSI feedback information z through a decoder to obtain CSI recovery information V'.
下面进一步对本申请实施例中的AI模型的训练过程以及推理过程进行示例性说明。The following further illustrates the training process and reasoning process of the AI model in the embodiment of the present application.
用于训练AI模型的训练数据包括训练样本和样本标签。示例性地,训练样本为终端设备确定的信道信息,样本标签为真实的信道信息,即真值CSI。对于编码器和解码器属于同一自编码器的情况,训练数据可以仅包括训练样本,或者说训练样本就是样本标签。The training data used to train the AI model includes training samples and sample labels. For example, the training samples are channel information determined by the terminal device, and the sample labels are real channel information, i.e., true CSI. In the case where the encoder and decoder belong to the same autoencoder, the training data may only include training samples, or the training samples are sample labels.
在无线通信领域,真值CSI可以为高精度的CSI。In the field of wireless communications, the true CSI may be a high-precision CSI.
具体训练过程如下:模型训练节点使用编码器处理信道信息,即训练样本,以得到信道反馈信息,如CSI反馈信息,并使用解码器处理反馈信息,得到恢复的信道信息,即信道恢复信息,如CSI恢复信息。进而计算信道恢复信息与对应的样本标签之间的差异,即损失函数的取值,根据损失函数的取值更新编码器和解码器的参数,使得恢复的信道信息与对应的样本标签之间的差异最小化,即最小化损失函数。示例性地,损失函数可以是最小均方误差(mean square error,MSE)或者余弦相似度。重复上述操作,即可得到满足目标需求的编码器和解码器。上述模型训练节点可以是终端设备、网络设备或者通信系统中其他具备AI功能的网元。The specific training process is as follows: the model training node uses an encoder to process channel information, that is, training samples, to obtain channel feedback information, such as CSI feedback information, and uses a decoder to process the feedback information to obtain recovered channel information, that is, channel recovery information, such as CSI recovery information. Then, the difference between the channel recovery information and the corresponding sample label is calculated, that is, the value of the loss function, and the parameters of the encoder and decoder are updated according to the value of the loss function, so that the difference between the recovered channel information and the corresponding sample label is minimized, that is, the loss function is minimized. Exemplarily, the loss function can be the minimum mean square error (MSE) or cosine similarity. Repeat the above operations to obtain an encoder and decoder that meet the target requirements. The above model training node can be a terminal device, a network device, or other network elements with AI functions in a communication system.
应理解,以上仅以AI模型用于CSI压缩为例进行说明,在CSI反馈中,AI模型还可以用于其他场景。例如,AI模型可以用于CSI预测,即基于一个或多个历史时刻测量的信道信息预测未来一个或多个时刻的信道信息。本申请实施例对CSI反馈场景中,AI模型的具体用途不做限定。It should be understood that the above only uses the AI model for CSI compression as an example. In CSI feedback, the AI model can also be used in other scenarios. For example, the AI model can be used for CSI prediction, that is, predicting the channel information at one or more future moments based on the channel information measured at one or more historical moments. The embodiment of the present application does not limit the specific use of the AI model in the CSI feedback scenario.
应理解,本申请中,指示包括直接指示(也称为显式指示)和隐式指示。其中,直接指示信息A,是指包括该信息A;隐式指示信息A,是指通过信息A和信息B的对应关系以及直接指示信息B,来指示信息A。其中,信息A和信息B的对应关系可以是预定义的,预存储的,预烧制的,或者,预先配置的。It should be understood that in the present application, indication includes direct indication (also called explicit indication) and implicit indication. Wherein, direct indication of information A means including the information A; implicit indication of information A means indicating information A through the correspondence between information A and information B and direct indication of information B. Wherein, the correspondence between information A and information B can be predefined, pre-stored, pre-burned, or pre-configured.
应理解,本申请中,信息C用于信息D的确定,既包括信息D仅基于信息C来确定,也包括基于信息C和其他信息来确定。此外,信息C用于信息D的确定,还可以间接确定的情况,比如,信息D基于信息E确定,而信息E基于信息C确定这种情况。It should be understood that in the present application, information C is used to determine information D, which includes information D being determined based only on information C, and information D being determined based on information C and other information. In addition, information C is used to determine information D, and it can also be indirectly determined, for example, information D is determined based on information E, and information E is determined based on information C.
此外,本申请各实施例中的“网元A向网元B发送信息A”,可以理解为该信息A的目的端或与目的端之间的传输路径中的中间网元是网元B,可以包括直接或间接的向网元B发送信息。“网元B从网元A接收信息A”,可以理解为该信息A的源端或与该源端之间的传输路径中的中间网元是网元A,可以包括直接或间接的从网元A接收信息。信息在信息发送的源端和目的端之间可能会被进行必要的处理,例如格式变化等,但目的端可以理解来自源端的有效信息。本申请中类似的表述可以做类似的理解,在此不予赘述。In addition, "network element A sends information A to network element B" in each embodiment of the present application can be understood as the destination end of the information A or the intermediate network element in the transmission path between the destination end and the network element B, which may include directly or indirectly sending information to network element B. "Network element B receives information A from network element A" can be understood as the source end of the information A or the intermediate network element in the transmission path between the source end and the network element A, which may include directly or indirectly receiving information from network element A. The information may be processed as necessary between the source end and the destination end of the information transmission, such as format changes, etc., but the destination end can understand the valid information from the source end. Similar expressions in the present application can be understood similarly and will not be repeated here.
下面对本申请实施例的方法进行详细介绍。The method of the embodiment of the present application is described in detail below.
参照图6所示,是本申请实施例提供的一种模型数据获取方法的流程示意图。可选的,该方法可以应用于前述的模型数据获取系统,例如图1所示的模型数据获取系统。可以理解的,图6所示示例是以第一节点(数据合成节点,或者可以称为中心节点、第一实体等)和第二节点(数据提供节点,或者可以称为待服务节点、第二实体等)作为该交互示意的执行主体为例进行示意的。进一步还可以包括模型处理(如模型训练或更新等)节点和模型使用(如模型推理等)节点。其中,模型处理节点可以是第一节点,还可以是其他节点。模型使用节点可以是第二节点,还可以是其他节点等。如图6所示的模型数据获取方法可以包括步骤601-602。步骤601-602具体如下:Referring to Figure 6, it is a flow chart of a model data acquisition method provided by an embodiment of the present application. Optionally, the method can be applied to the aforementioned model data acquisition system, such as the model data acquisition system shown in Figure 1. It can be understood that the example shown in Figure 6 is an example of the execution subject of the interaction diagram using the first node (data synthesis node, or it can be called a central node, a first entity, etc.) and the second node (data providing node, or it can be called a node to be served, a second entity, etc.) as an example. It can further include model processing (such as model training or updating, etc.) nodes and model use (such as model reasoning, etc.) nodes. Among them, the model processing node can be the first node, or it can be other nodes. The model use node can be the second node, or it can be other nodes, etc. The model data acquisition method shown in Figure 6 may include steps 601-602. Steps 601-602 are as follows:
601、第二节点向第一节点发送第一信息,该第一信息包括以下类型的信息:原始数据和原始数据的特征信息。相应地,第一节点接收该第一信息。601. A second node sends first information to a first node, where the first information includes the following types of information: original data and characteristic information of the original data. Correspondingly, the first node receives the first information.
示例性的,该第一节点可以是数据合成节点,或者可以称为数据生成节点、中心节点、第一实体等。第二节点可以是数据提供节点,或者可以称为待服务节点、第二实体等。可选的,第一节点为接入网设备,第二节点为终端设备。Exemplarily, the first node may be a data synthesis node, or may be referred to as a data generation node, a central node, a first entity, etc. The second node may be a data providing node, or may be referred to as a node to be served, a second entity, etc. Optionally, the first node is an access network device, and the second node is a terminal device.
该原始数据,即未被压缩或无信息损失的实测数据。第二节点通过向第一节点提供原始数据,有助于使第一节点生成的合成数据与原始数据达到同维度、同精度。其中,合成数据是一种模仿真实世界数据的非人工创建的数据。它是由基于生成式人工智能技术的计算算法和模拟创建而成。合成数据集具有与其所基于的实际数据相同的数学特性,但不包含相同信息。The original data is measured data that is not compressed or has no information loss. By providing the original data to the first node, the second node helps to make the synthetic data generated by the first node have the same dimension and accuracy as the original data. Among them, synthetic data is a kind of non-artificially created data that imitates real-world data. It is created by computing algorithms and simulations based on generative artificial intelligence technology. The synthetic data set has the same mathematical properties as the actual data on which it is based, but does not contain the same information.
该原始数据的特征信息,即对原始数据进行特征提取得到的信息。其中,原始数据的特征信息提取了原始数据中的关键信息。第二节点通过向第一节点提供原始数据的特征信息,可以使得生成模型理解生成的数据应与原始数据在哪些特征上相近。The feature information of the original data is the information obtained by extracting the features of the original data. The feature information of the original data extracts the key information in the original data. By providing the feature information of the original data to the first node, the second node can make the generation model understand in which features the generated data should be similar to the original data.
该原始数据的特征信息,可以包括特征量信息。或者,该原始数据的特征信息,不仅包括特征量信息,还可以包括特征量对应的特征信息。例如,对于信道信息而言,特征量信息可以为信道的角度分布、信道的时延分布等;特征量对应的特征信息,可以为信道角度分布的取值、信道时延分布的取值等。The characteristic information of the original data may include characteristic quantity information. Alternatively, the characteristic information of the original data may include not only characteristic quantity information but also characteristic information corresponding to the characteristic quantity. For example, for channel information, the characteristic quantity information may be the angle distribution of the channel, the delay distribution of the channel, etc.; the characteristic information corresponding to the characteristic quantity may be the value of the angle distribution of the channel, the value of the delay distribution of the channel, etc.
该第一信息的类型,即第一信息所包括的信息的类型。例如,第一信息的类型为A时,第一信息包括上述原始数据和原始数据的特征信息;第一信息的类型为B时,第一信息包括上述原始数据;第一信息的类型为C时,第一信息包括上述原始数据的特征信息。The type of the first information is the type of information included in the first information. For example, when the type of the first information is A, the first information includes the original data and the characteristic information of the original data; when the type of the first information is B, the first information includes the original data; when the type of the first information is C, the first information includes the characteristic information of the original data.
可选的,所述第一信息还包括以下类型的信息中的至少一项:所述原始数据的优先级、所述原始数据的数据标签。Optionally, the first information further includes at least one of the following types of information: the priority of the original data, and a data label of the original data.
通过上报原始数据的优先级,以便第一节点根据每个原始数据的优先级或者概率大小,确定要生成的与之相似的合成数据的数量等,以使得合成数据的分布与预期一致,例如合成数据的分布与原始数据所应用的场景一致。例如,以信道信息为例,若第二节点处于室外环境,基站到用户设备的遮挡比室内遮挡少,则可以以无线信号的视线传输(line-of-sight,LOS)径为主的实测信道信息赋予更高的优先级,而以无线信号的非视线传输(non-line-of-sight,NLOS)径为主的实测信道赋予较低的优先级,从而可使得合成的信道信息以LOS径下的稀疏信道为主。By reporting the priority of the original data, the first node determines the number of similar synthetic data to be generated according to the priority or probability of each original data, so that the distribution of the synthetic data is consistent with expectations, for example, the distribution of the synthetic data is consistent with the scenario to which the original data is applied. For example, taking channel information as an example, if the second node is in an outdoor environment and the obstruction from the base station to the user equipment is less than that indoors, the measured channel information based on the line-of-sight (LOS) path of the wireless signal can be given a higher priority, while the measured channel based on the non-line-of-sight (NLOS) path of the wireless signal can be given a lower priority, so that the synthesized channel information can be based on the sparse channel under the LOS path.
该原始数据的数据标签,可以理解为,将原始数据进行分类得到的原始数据所属的类别。例如,若按原始数据所属场景进行分类,则标签即为场景分类标签。The data label of the original data can be understood as the category to which the original data belongs obtained by classifying the original data. For example, if the original data is classified according to the scene to which it belongs, the label is the scene classification label.
其中,数据标签(或称数据注释)是开发机器学习(ML)模型时预处理阶段的一部分。它负责识别原始数据(如图像、文本文件、视频),然后可以向原始数据添加一个或多个数据标签,以指定模型的上下文,帮助机器学习模型做出准确的预测。数据标签支持各种不同的机器学习和深度学习用例,包括计算机视觉和自然语言处理(NLP)。Data labeling (or data annotation) is part of the preprocessing phase when developing a machine learning (ML) model. It is responsible for identifying raw data (such as images, text files, videos), and then one or more data labels can be added to the raw data to specify the context of the model and help the machine learning model make accurate predictions. Data labeling supports a variety of different machine learning and deep learning use cases, including computer vision and natural language processing (NLP).
原始数据的优先级,可以取值于预定义的P档。例如,可预定义每一档优先级的含义。例如,优先级为P1档的原始数据,生成模型生成N1个与之相似的合成数据,优先级为P2档的原始数据,生成模型生成N2个与之相似的合成数据。或者,生成模型生成的与优先级为P1档的原始数据相似的合成数据的数量为x1,生成模型生成的与优先级为P2档的原始数据相似的合成数据数量为x2,且x1与x2的比例等于N1与N2的比例,其中N1与N2为预定义的值。The priority of the original data can be taken from the predefined P level. For example, the meaning of each priority level can be predefined. For example, for the original data with priority level P1, the generation model generates N1 synthetic data similar to it, and for the original data with priority level P2, the generation model generates N2 synthetic data similar to it. Alternatively, the number of synthetic data generated by the generation model that is similar to the original data with priority level P1 is x1, and the number of synthetic data generated by the generation model that is similar to the original data with priority level P2 is x2, and the ratio of x1 to x2 is equal to the ratio of N1 to N2, where N1 and N2 are predefined values.
示例性的,第二节点向第一节点发送第一信息,所述第一信息包括:原始数据、原始数据的特征信息、原始数据的数据标签。例如,以原始数据为信道状态信息为例,该原始数据的特征信息可以是提取的信道特征向量(对原始信道做奇异值分解(Singular Value Decomposition,SVD)后得到的一个或多个特征向量)、或对信道特征向量进行压缩后的压缩信息。该原始数据的数据标签可以是信道质量信息(信道质量指示(Channel Quality Indication,CQI)、秩指示(Rank Indication,RI)等)、或场景信息(信道NLOS/LOS分类信息,用户信道高/中/低速分类信息,或其他场景分类信息)等。Exemplarily, the second node sends the first information to the first node, and the first information includes: original data, feature information of the original data, and a data label of the original data. For example, taking the original data as channel state information, the feature information of the original data can be an extracted channel feature vector (one or more feature vectors obtained after performing singular value decomposition (SVD) on the original channel), or compressed information after compressing the channel feature vector. The data label of the original data can be channel quality information (Channel Quality Indication (CQI), Rank Indication (RI), etc.), or scenario information (channel NLOS/LOS classification information, user channel high/medium/low speed classification information, or other scenario classification information), etc.
第二节点通过向第一节点发送第一信息,以便第一节点基于该第一信息生成多个合成数据,其中,第二节点以原始数据的特征信息作为合成数据的条件,使得合成数据在具体的一些特征维度模仿原始数据,可提高合成数据的多样性,也可以提高合成数据的准确性。The second node sends the first information to the first node so that the first node generates multiple synthetic data based on the first information, wherein the second node uses the characteristic information of the original data as the condition for synthesizing the data, so that the synthetic data imitates the original data in some specific characteristic dimensions, which can improve the diversity of the synthetic data and also improve the accuracy of the synthetic data.
在一种可能的实现方式中,还可以包括步骤603、第一节点向第二节点发送第二信息。所述第二信息指示所述第二节点得到所述第一信息的方式,比如,获得途径,获得来源,或,获得途径或来源所涉及的参数等中的一项或多项。例如,所述第一信息包括所述原始数据的特征信息的情况下,所述第二节点得到所述第一信息的方式包括所述原始数据的特征信息的获取方式。也就是说,第一节点向第二节点指示原始数据的特征信息的获取方式,以便第二节点基于该原始数据的特征信息的获取方式对原始数据进行处理,得到上述原始数据的特征信息。In a possible implementation, step 603 may also be included, in which the first node sends second information to the second node. The second information indicates a way in which the second node obtains the first information, such as one or more of an acquisition path, an acquisition source, or parameters involved in the acquisition path or source. For example, when the first information includes the characteristic information of the original data, the way in which the second node obtains the first information includes a way in which the characteristic information of the original data is acquired. That is, the first node indicates to the second node a way in which the characteristic information of the original data is acquired, so that the second node processes the original data based on the way in which the characteristic information of the original data is acquired to obtain the characteristic information of the original data.
例如,原始数据的特征信息的获取方式可以是D1:提取原始数据的N个特征向量,或者,D2:从原始数据矩阵的前M个最大特征值对应的特征向量中,提取N个特征向量。再如,D3:原始数据的特征信息的获取方式为采用R16/R17码本方式得到的压缩信息。再如,D4:原始数据的特征信息的获取方式为采用基于AI的编码器得到压缩信息。再例如,原始数据的特征信息的获取方式还可以是特征量,以指示第二节点计算相应特征量对应的特征信息。具体而言,第二信息可以包含获取方式标识(例如D1/D2/D3/D4),也可以包含某一获取方式下的参数(例如,对于方式D1而言,可指示特征向量个数N;对于方式D4而言,指示AI编码器的标识)。For example, the method for acquiring the feature information of the original data may be D1: extracting N feature vectors of the original data, or D2: extracting N feature vectors from the feature vectors corresponding to the first M largest eigenvalues of the original data matrix. For another example, D3: the method for acquiring the feature information of the original data is compressed information obtained by using the R16/R17 codebook method. For another example, D4: the method for acquiring the feature information of the original data is compressed information obtained by using an AI-based encoder. For another example, the method for acquiring the feature information of the original data may also be a feature quantity to indicate that the second node calculates the feature information corresponding to the corresponding feature quantity. Specifically, the second information may include an acquisition method identifier (e.g., D1/D2/D3/D4), or may include parameters under a certain acquisition method (e.g., for method D1, it may indicate the number of feature vectors N; for method D4, it may indicate the identifier of the AI encoder).
可选的,所述第一信息包括所述原始数据的情况下,所述第二节点得到所述第一信息的方式包括所述原始数据的获取方式。例如,原始数据的获取方式可以是获取指定时间内的特定数据,该指定时间例如可以是某一天内9点到15点等,该特定数据例如可以是温度数据、湿度数据、文本数据等。再如,原始数据的获取方式还可以是采用预设传感器获取预设区域的原始数据等。该预设传感器例如可以是摄像头、激光雷达等。Optionally, when the first information includes the raw data, the way in which the second node obtains the first information includes the way in which the raw data is acquired. For example, the way in which the raw data is acquired may be to acquire specific data within a specified time, where the specified time may be, for example, from 9:00 to 15:00 on a certain day, and the specific data may be, for example, temperature data, humidity data, text data, etc. For another example, the way in which the raw data is acquired may also be to acquire raw data of a preset area using a preset sensor, etc. The preset sensor may be, for example, a camera, a laser radar, etc.
可选的,所述第一信息包括所述原始数据的数据标签的情况下,所述第二节点得到所述第一信息的方式还包括所述原始数据的数据标签的获取方式。例如第二信息指示第二节点按哪种方式或标准确定数据的数据标签。以信道信息为例,E1:指示根据稀疏度确定数据标签(例如根据信道的稀疏度的一个阈值,确定信道标签为LOS或NLOS);或者,E2:根据时延分布确定数据标签等;E3:或者直接指示标签,例如指示数据标签即信道的CQI值,E4:或者指示数据标签即信道的RI等。具体而言,第二信息可以包含获取方式标识(例如E1/E2/E3/E4)。Optionally, when the first information includes the data label of the original data, the way in which the second node obtains the first information also includes a way to obtain the data label of the original data. For example, the second information indicates the way or standard by which the second node determines the data label of the data. Taking channel information as an example, E1: indicates that the data label is determined according to sparsity (for example, according to a threshold of the sparsity of the channel, the channel label is determined to be LOS or NLOS); or, E2: determines the data label according to the delay distribution, etc.; E3: or directly indicates the label, for example, indicates that the data label is the CQI value of the channel, E4: or indicates that the data label is the RI of the channel, etc. Specifically, the second information may include an acquisition method identifier (for example, E1/E2/E3/E4).
第一节点通过向第二节点发送第二信息,以便第二节点基于该第二信息对原始数据进行处理,进而得到第一信息。The first node sends the second information to the second node, so that the second node processes the original data based on the second information, thereby obtaining the first information.
在一种可能的实现方式中,还可以包括步骤604、第一节点还向第二节点发送第三信息,所述第三信息指示所述第一信息的上报配置。示例性的,该上报配置包括以下至少一项:所述第一信息的类型,一种或多种类型的所述第一信息的各自数量或总数量。也就是说,第一节点向第二节点发送该第三信息,以便第二节点基于第三信息中指示的第一信息的类型和/或第一信息的数量进行上报。例如,预定义多种第一信息的类型。进而,第三信息指示第一信息为类型标识时,第二节点可确定第一信息所包括的信息类型。例如根据前文预定义的第一信息的类型标识,第三信息指示第一信息的类型为A,则可知第一信息包括上述原始数据和原始数据的特征信息;或者,预定义第一信息所包含的信息类型标识,定义原始数据类型的标识为A_1,原始数据特征信息类型的标识为A_2,标签类型的标识为A_3,优先级类型的标识为A_4,则第三信息指示第一信息所包含信息类型的标识,例如第三信息指示第一信息的类型为{A_1,A_2,A_3},则第二节点可确定第一信息包括上述原始数据和原始数据的特征信息以及原始数据的标签;再如,第三信息还指示第一信息的数据量为1000时,表示第二节点需要上报该上述原始数据和原始数据的特征信息的数据量分别为1000;或者还可以表示原始数据和原始数据的特征信息的数据量之和为1000等,其中,比如,第二节点分别上报该上述原始数据的数据量为500,原始数据的特征信息的数据量为500,或者原始数据的特征信息的数据量多于或少于原始数据的数据量等,比如二者的比例为6:4,比例可以为预定义的或配置的,本方案对此不作限制。再例如,可以预定义数量的多种取值,第三信息通过指示预定义的数量取值的标识,以使第二节点确定一种或多种类型的第一信息的各自数量或总数量。In a possible implementation, step 604 may also be included, the first node also sends third information to the second node, and the third information indicates the reporting configuration of the first information. Exemplarily, the reporting configuration includes at least one of the following: the type of the first information, the respective quantity or total quantity of one or more types of the first information. That is, the first node sends the third information to the second node so that the second node reports based on the type of the first information indicated in the third information and/or the quantity of the first information. For example, multiple types of first information are predefined. Furthermore, when the third information indicates that the first information is a type identifier, the second node can determine the type of information included in the first information. For example, according to the type identifier of the first information predefined in the previous text, the third information indicates that the type of the first information is A, then it can be known that the first information includes the above-mentioned original data and the characteristic information of the original data; or, the information type identifier contained in the first information is predefined, and the identifier of the original data type is defined as A_1, the identifier of the original data characteristic information type is defined as A_2, the identifier of the tag type is defined as A_3, and the identifier of the priority type is defined as A_4, then the third information indicates the identifier of the information type contained in the first information, for example, the third information indicates that the type of the first information is {A_1, A_2, A_3}, then the second node can determine that the first information includes the above-mentioned original data and the characteristic information of the original data. information and labels of the original data; for another example, when the third information also indicates that the data volume of the first information is 1000, it means that the second node needs to report that the data volume of the above original data and the characteristic information of the original data are 1000 respectively; or it can also indicate that the sum of the data volume of the original data and the characteristic information of the original data is 1000, etc., wherein, for example, the second node reports that the data volume of the above original data is 500, and the data volume of the characteristic information of the original data is 500, or the data volume of the characteristic information of the original data is more or less than the data volume of the original data, etc., for example, the ratio of the two is 6:4, and the ratio can be predefined or configured, and this scheme does not limit this. For another example, multiple values of a quantity can be predefined, and the third information indicates an identifier of the predefined quantity value so that the second node determines the respective quantity or total quantity of one or more types of first information.
可选的,该上报配置还可以包括第一信息的时域资源信息或频域资源信息中的一项或多项。本方案对此不作限制。Optionally, the reporting configuration may also include one or more of the time domain resource information or the frequency domain resource information of the first information. This solution does not impose any limitation on this.
第一节点通过向第二节点发送该第三信息,以便第二节点基于第三信息中指示的第一信息的上报配置进行上报。这样使得相互分离的数据提供端与合成数据端可以对齐对数据的理解,提高合成数据的质量。The first node sends the third information to the second node so that the second node reports based on the reporting configuration of the first information indicated in the third information, so that the data provider and the synthesized data end separated from each other can align their understanding of the data and improve the quality of the synthesized data.
在一种可能的实现方式中,上述第一信息、第二信息和第三信息中的至少两项可以是第一节点发送的同一信息所包括的不同内容,或者,上述第一信息、第二信息和第三信息是第一节点发送的不同消息中的内容。本方案对此不作限制。In a possible implementation, at least two of the first information, the second information, and the third information may be different contents included in the same information sent by the first node, or the first information, the second information, and the third information may be contents in different messages sent by the first node. This solution does not limit this.
可以理解的是,以上第一信息、第二信息和第三信息中未被指示的内容,比如上报的时域资源信息或频域资源信息等,可以是协议预定义的,或者,第一节点和第二节点中预先设定或存储的,比如为默认的配置,或者,基于其他信息所确定的。It can be understood that the content not indicated in the above first information, second information and third information, such as reported time domain resource information or frequency domain resource information, etc., may be predefined by the protocol, or pre-set or stored in the first node and the second node, such as a default configuration, or determined based on other information.
基于上述第二信息和/或第三信息的指示,第二节点对原始数据进行处理,进而向第一节点发送第一信息。例如,第三信息指示第一信息的类型为A,即第一信息包括上述原始数据和原始数据的特征信息;第二信息指示原始数据的特征信息的获取方式为提取原始数据的N个特征向量,则第二节点确定原始数据和原始数据的特征信息,并根据第三信息的指示进行上报。可选的,若第三信息指示第一信息包含原始数据的特征信息,但第二节点未接收到特征信息的获取方式的指示,则可以采用预定义的特征信息的获取方式。同样,若第三信息指示第一信息包含原始数据的数据标签,但第二节点未接收到原始数据的数据标签的获取方式的指示,则可以采用预定义的获取方式。Based on the indication of the above-mentioned second information and/or third information, the second node processes the original data, and then sends the first information to the first node. For example, the third information indicates that the type of the first information is A, that is, the first information includes the above-mentioned original data and the characteristic information of the original data; the second information indicates that the method for obtaining the characteristic information of the original data is to extract N characteristic vectors of the original data, then the second node determines the original data and the characteristic information of the original data, and reports it according to the indication of the third information. Optionally, if the third information indicates that the first information contains the characteristic information of the original data, but the second node does not receive an indication of the method for obtaining the characteristic information, a predefined method for obtaining the characteristic information can be used. Similarly, if the third information indicates that the first information contains the data label of the original data, but the second node does not receive an indication of the method for obtaining the data label of the original data, a predefined method for obtaining can be used.
在一种可能的实现方式中,还可以包括步骤605、第一节点还向第二节点发送第四信息,所述第四信息指示所述第一节点具有将相同分组的多个原始数据的特征信息进行联合处理的能力。该联合处理,例如可以是将相同分组的多个原始数据的特征信息加权求和等。In a possible implementation, the method may further include step 605, wherein the first node further sends fourth information to the second node, wherein the fourth information indicates that the first node has the ability to jointly process feature information of multiple original data of the same group. The joint processing may, for example, be weighted summation of feature information of multiple original data of the same group.
第一节点通过向第二节点发送该第四信息,以便第二节点可以将可进行联合处理的原始数据划分为同一组,而将不可进行联合处理的原始数据不划分为同一组,从而可以避免将不同组的原始数据做融合,而生成不符合预期的合成数据。The first node sends the fourth information to the second node so that the second node can divide the original data that can be jointly processed into the same group, while dividing the original data that cannot be jointly processed into the same group, thereby avoiding the fusion of original data from different groups to generate unexpected synthetic data.
可选的,所述第一信息还包括以下类型的信息:所述原始数据的组信息。第二节点向第一节点上报原始数据的组信息,进而第一节点可将相同分组的多个原始数据的特征信息进行联合处理。Optionally, the first information further includes the following types of information: group information of the original data. The second node reports the group information of the original data to the first node, and then the first node can jointly process feature information of multiple original data in the same group.
可以理解的,该示例中将可以进行联合处理的原始数据按照同一组进行划分。该组信息与前述原始数据的数据标签可以是不相同的。例如,原始数据的组的划分标准和原始数据的数据标签的划分标准可以是不相同的。当然,两者的划分标准也可以是相同的,也即,组信息即为数据标签,本方案对此不作限制。It can be understood that in this example, the raw data that can be jointly processed are divided into the same group. The group information may be different from the data label of the raw data. For example, the division criteria of the raw data group and the division criteria of the data label of the raw data may be different. Of course, the division criteria of the two may also be the same, that is, the group information is the data label, and this solution does not limit this.
可以理解的,图6中步骤603、步骤604和步骤605为可选的一个或多个步骤,图6中所示各步骤仅为示例,对于其执行的先后顺序、执行的时间均不作限制。It can be understood that step 603, step 604 and step 605 in FIG. 6 are one or more optional steps. The steps shown in FIG. 6 are only examples, and there is no limitation on the order and time of their execution.
602、第一节点基于该第一信息,生成多个合成数据,该多个合成数据用于作为模型的输入。602. The first node generates a plurality of synthetic data based on the first information, and the plurality of synthetic data are used as input of the model.
该第一节点基于所述原始数据和所述原始数据的特征信息,生成多个合成数据。例如,第一节点将原始数据和所述原始数据的特征信息输入至生成模型中,进而生成多个合成数据。或者,第一节点对第一信息进行预处理,然后将处理后的信息输入到生成模型中,进而生成多个合成数据。该预处理例如可以是对第一信息进行归一化,或者将第一信息中的原始数据以及原始数据的特征信息进行拼接等。The first node generates a plurality of synthetic data based on the original data and the characteristic information of the original data. For example, the first node inputs the original data and the characteristic information of the original data into the generation model, thereby generating a plurality of synthetic data. Alternatively, the first node preprocesses the first information, and then inputs the processed information into the generation model, thereby generating a plurality of synthetic data. The preprocessing may, for example, be normalizing the first information, or splicing the original data in the first information and the characteristic information of the original data.
其中,在传统的数据生成方式中,以原始实测数据作为合成数据的条件,生成模型会模仿原始数据,包括数据维度精度和内容等方面,使得合成数据与实测数据相似度高。但是,仅以原始数据作为输入易导致合成数据多样性不足。而本申请的方案中,以原始数据的特征信息,或者以原始数据的特征信息以及数据标签作为合成数据的条件,生成模型在具体的一些特征维度模仿原始数据,可提高合成数据的多样性。因此,进一步地,基于原始数据、所述原始数据的特征信息的联合表示,或者,原始数据、所述原始数据的特征信息以及所述原始数据的数据标签的联合表示作为合成数据的条件,以少量数据进行数据生成,即可同时满足合成数据多样性高,且合成数据的特征分布与原始数据的特征分布接近的需求。Among them, in the traditional data generation method, the original measured data is used as the condition for synthesizing data, and the generation model will imitate the original data, including aspects such as data dimension accuracy and content, so that the synthesized data is highly similar to the measured data. However, using only the original data as input can easily lead to insufficient diversity of the synthesized data. In the scheme of the present application, the feature information of the original data, or the feature information of the original data and the data label are used as the condition for synthesizing data, and the generation model imitates the original data in some specific feature dimensions, which can improve the diversity of the synthesized data. Therefore, further, based on the joint representation of the original data and the feature information of the original data, or the joint representation of the original data, the feature information of the original data and the data label of the original data as the condition for synthesizing data, data generation is performed with a small amount of data, which can simultaneously meet the requirements of high diversity of the synthesized data and close feature distribution of the synthesized data to the feature distribution of the original data.
例如,以信道生成为例,原始数据为信道信息,原始数据的特征信息可包括信道角度功率谱特征、信道时延谱特征等,则生成模型可知生成的信道应在角度、时延功率谱分布上与原始信道信息相近,而其它特征值,例如多普勒频偏的随机性可较高。For example, taking channel generation as an example, the original data is the channel information, and the characteristic information of the original data may include channel angle power spectrum characteristics, channel delay spectrum characteristics, etc. Then the generation model can know that the generated channel should be similar to the original channel information in angle and delay power spectrum distribution, while other characteristic values, such as the randomness of Doppler frequency deviation, can be higher.
再如,将相同分组的多个原始数据的特征信息加权求和,可以提高合成数据的多样性。例如,以信道生成为例,原始数据为信道信息,原始数据的特征信息可包括信道角度功率谱特征、信道时延谱特征等,通过将多个角度功率谱分布不同的信道信息的特征信息加权求和输入至生成模型中可合成角度功率谱分布更多样的信道,从而可进一步提高合成数据的多样性,也可提高合成数据的准确性。For another example, weighted summing of feature information of multiple original data of the same group can improve the diversity of synthesized data. For example, taking channel generation as an example, the original data is channel information, and the feature information of the original data may include channel angle power spectrum features, channel delay spectrum features, etc. By weighted summing of feature information of multiple channel information with different angle power spectrum distributions and inputting them into the generation model, channels with more diverse angle power spectrum distributions can be synthesized, thereby further improving the diversity of synthesized data and the accuracy of synthesized data.
其中,上述多个合成数据可用于作为模型的输入,该输入可以用于模型的处理,例如模型训练、模型更新、模型推理或模型性能监控等。Among them, the above-mentioned multiple synthetic data can be used as the input of the model, and the input can be used for model processing, such as model training, model updating, model reasoning or model performance monitoring.
其中,可在第一节点进行模型训练、更新等处理,或者也可以将合成数据传输给模型处理节点等,进行模型训练、更新等处理。完成模型训练或更新后,可进行模型推理,也可以将模型传输给模型使用节点进行模型推理。例如,传输给第二节点,用于模型推理。The model training, updating, etc. can be performed at the first node, or the synthesized data can be transmitted to the model processing node, etc., for model training, updating, etc. After the model training or updating is completed, the model can be inferred, or the model can be transmitted to the model using node for model inference. For example, it can be transmitted to the second node for model inference.
在一种可能的实现方式中,第一节点还向第二节点发送所述多个合成数据中的至少一个子集。In a possible implementation manner, the first node further sends at least a subset of the plurality of synthesized data to the second node.
该多个合成数据中的至少一个子集,例如可以是从多个合成数据中随机采样多个样本作为合成数据的一个子集。再如,从根据多种数据标签中的至少一种得到的合成数据中,随机采样多个样本,得到合成数据的至少一个子集。一种可能的实现方式为,原始数据的数据标签有3种可能的取值(例如,数据标签a,数据标签b和数据标签c),合成数据的1个子集为以数据标签a为条件输入得到的合成数据中,随机采样多个样本得到的子集,合成数据的另一个子集为以数据标签b(或数据标签c)为条件输入得到的合成数据中,随机采样多个样本得到的子集等。或者,该合成数据的1个子集可以是以数据标签a为条件输入得到的合成数据中,随机采样得到多个样本S1,以数据标签b为条件输入得到的合成数据中,随机采样得到多个样本S2,以数据标签c为条件输入得到的合成数据中,随机采样得到多个样本S3,进而基于该多个样本S1、多个样本S2和多个样本S3得到其中1个子集,通过重复该方式再次进行采样,得到另一个子集。当然,还可以是其他方式获得该子集,本方案对此不作限制。At least one subset of the multiple synthetic data may be, for example, a subset of the synthetic data obtained by randomly sampling multiple samples from the multiple synthetic data. For another example, multiple samples are randomly sampled from the synthetic data obtained according to at least one of the multiple data labels to obtain at least one subset of the synthetic data. A possible implementation is that the data label of the original data has three possible values (for example, data label a, data label b and data label c), one subset of the synthetic data is a subset obtained by randomly sampling multiple samples from the synthetic data obtained by inputting data label a as a condition, another subset of the synthetic data is a subset obtained by randomly sampling multiple samples from the synthetic data obtained by inputting data label b (or data label c) as a condition, and so on. Alternatively, a subset of the synthetic data may be obtained by randomly sampling multiple samples S1 from the synthetic data obtained by inputting data label a as a condition, randomly sampling multiple samples S2 from the synthetic data obtained by inputting data label b as a condition, and randomly sampling multiple samples S3 from the synthetic data obtained by inputting data label c as a condition, and then obtaining one of the subsets based on the multiple samples S1, multiple samples S2, and multiple samples S3, and obtaining another subset by repeating the sampling method again. Of course, the subset may also be obtained in other ways, and this solution does not limit this.
第二节点基于接收到的该至少一个子集,向第一节点发送第五信息。其中,所述第五信息指示所述至少一个子集中至少一个合成数据的评估结果。所述评估结果指示所述至少一个子集中需要剔除的至少一个合成数据,或指示所述至少一个子集中不需要剔除的至少一个合成数据。相应的,第一节点接收该第五信息。The second node sends fifth information to the first node based on the received at least one subset. The fifth information indicates an evaluation result of at least one synthetic data in the at least one subset. The evaluation result indicates at least one synthetic data that needs to be eliminated in the at least one subset, or indicates at least one synthetic data that does not need to be eliminated in the at least one subset. Accordingly, the first node receives the fifth information.
该需要剔除的合成数据例如可以是与所述第二节点的原始数据的特征分布不相同的合成数据。在一种可能的实现方式中,第二节点通过判断接收到的合成数据是否与其实测数据相似,进而得到该评估结果。例如,第二节点计算至少一个子集中的一个合成数据与其本地数据间的距离。若存在一个本地数据,使得该合成数据与该本地数据间距离满足第一条件,则判断该合成数据与原始数据的特征分布相同,即不剔除;若该合成数据与所有本地数据间距离不满足第一条件,则判断该合成数据与原始数据的特征分布不同,即需要被剔除。其中,该第一条件可以是某一特定范围,或者某一阈值等。可选的,合成数据与本地数据间距离的计算,可以是将合成数据与本地数据间的差值的范数作为合成数据与本地数据间的距离。或者,分别提取合成数据与本地数据的数据特征信息,将合成数据与本地数据的数据特征信息间的差值的范数作为合成数据与本地数据间的距离。The synthetic data to be eliminated may be, for example, synthetic data having a different characteristic distribution from the original data of the second node. In a possible implementation, the second node obtains the evaluation result by determining whether the received synthetic data is similar to the actual measured data. For example, the second node calculates the distance between a synthetic data in at least one subset and its local data. If there is a local data such that the distance between the synthetic data and the local data satisfies the first condition, the synthetic data is determined to have the same characteristic distribution as the original data, i.e., it is not eliminated; if the distance between the synthetic data and all local data does not meet the first condition, the synthetic data is determined to have a different characteristic distribution from the original data, i.e., it needs to be eliminated. Among them, the first condition may be a certain specific range, or a certain threshold, etc. Optionally, the calculation of the distance between the synthetic data and the local data may be to use the norm of the difference between the synthetic data and the local data as the distance between the synthetic data and the local data. Alternatively, the data feature information of the synthetic data and the local data is extracted respectively, and the norm of the difference between the data feature information of the synthetic data and the local data is used as the distance between the synthetic data and the local data.
可选的,第五信息可以用0或1等来指示每个合成数据是否剔除。或者,第二节点用一个合成数据的标识的列表来指示合成数据子集中被剔除的合成数据或用一个合成数据的标识的列表来指示合成数据子集中不被剔除的合成数据。其中合成数据子集中每一个合成数据有一个唯一标识。Optionally, the fifth information may indicate whether each synthesized data is to be removed by using 0 or 1, etc. Alternatively, the second node indicates the synthesized data to be removed from the synthesized data subset by using a list of synthesized data identifiers or indicates the synthesized data not to be removed from the synthesized data subset by using a list of synthesized data identifiers. Each synthesized data in the synthesized data subset has a unique identifier.
第二节点还可以基于其他方式得到上述评估结果,本方案对此不作限制。The second node may also obtain the above evaluation result based on other methods, which is not limited in this solution.
第一节点基于接收到的该评估结果对所述多个合成数据进行筛选,进而得到筛选后的至少一个合成数据。The first node filters the multiple synthetic data based on the received evaluation result, and thereby obtains at least one synthetic data after filtering.
可选的,所述筛选后的至少一个合成数据是基于待剔除的第一合成数据与所述多个合成数据中的其他合成数据之间的距离确定的。所述待剔除的第一合成数据为根据所述第五信息确定的需要剔除的合成数据。Optionally, the at least one synthesized data after screening is determined based on the distance between the first synthesized data to be eliminated and other synthesized data in the plurality of synthesized data. The first synthesized data to be eliminated is the synthesized data to be eliminated determined according to the fifth information.
例如,第一节点计算需要剔除的合成数据与第一节点中其他合成数据间的距离。若存在一个合成数据,使得该合成数据与要剔除的合成数据间距离不大于某阈值,则判断该合成数据也剔除。For example, the first node calculates the distance between the synthetic data to be removed and other synthetic data in the first node. If there is a synthetic data such that the distance between the synthetic data and the synthetic data to be removed is not greater than a certain threshold, it is determined that the synthetic data is also removed.
通过合成数据的评估过程对合成数据进行筛选,可以有效提高合成数据的质量。By screening the synthetic data through the synthetic data evaluation process, the quality of the synthetic data can be effectively improved.
其中,所述筛选后的至少一个合成数据用于作为模型的输入。针对该部分的介绍,可参阅步骤601的记载,在此不再赘述。The at least one synthetic data after screening is used as the input of the model. For the introduction of this part, please refer to the description of step 601, which will not be repeated here.
可选的,第一节点为前述执行第一节点相关的动作的第三方设备。比如,上述步骤601,602均由第三方设备执行。Optionally, the first node is a third-party device that performs the aforementioned actions related to the first node. For example, the above steps 601 and 602 are both performed by a third-party device.
可选的,第二节点为前述执行第二节点相关的动作的第三方设备。比如,上述步骤601由第三方设备执行。Optionally, the second node is a third-party device that performs the aforementioned second node-related actions. For example, the above step 601 is performed by a third-party device.
可选的,第一节点为网络设备。比如,上述步骤601,602均由网络设备执行。Optionally, the first node is a network device. For example, the above steps 601 and 602 are both performed by the network device.
可选的,第二节点为终端设备。比如,上述步骤601由终端设备执行。Optionally, the second node is a terminal device. For example, the above step 601 is performed by the terminal device.
可选的,第一节点包括网络设备和第三方设备。在一个示例中,上述步骤602可以由第三方设备,如OTT,或,云服务器等执行,上述步骤601可以由网络设备执行。此外,网络设备与第三方设备之间也可以进行通信,进行上述步骤601中所传输的内容的传输。Optionally, the first node includes a network device and a third-party device. In one example, the above step 602 can be performed by a third-party device, such as an OTT, or a cloud server, and the above step 601 can be performed by the network device. In addition, the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above step 601.
本申请实施例,第一节点通过接收来自第二节点的原始数据和所述原始数据的特征信息;进而基于所述原始数据和所述原始数据的特征信息,生成多个合成数据。该多个合成数据用于模型的处理。该示例,基于原始数据和原始数据的特征信息可以生成满足多样性要求且数据分布与实测数据分布相同的合成数据,提高了合成数据的质量,进而有助于提高模型训练或更新的性能。In an embodiment of the present application, the first node receives the original data from the second node and the feature information of the original data; and then generates multiple synthetic data based on the original data and the feature information of the original data. The multiple synthetic data are used for model processing. In this example, synthetic data that meets the diversity requirements and has the same data distribution as the measured data distribution can be generated based on the original data and the feature information of the original data, thereby improving the quality of the synthetic data, and thus helping to improve the performance of model training or updating.
上述示例以第一节点向第二节点发送第二信息,该第二信息指示原始数据的特征信息的获取方式为例进行介绍。可替代的,该第二信息还可以是第三节点向第二节点发送的(图7所示实施例)。也就是说,第三节点向第二节点指示原始数据的特征信息的获取方式,以便第二节点基于该原始数据的特征信息的获取方式进行处理,得到原始数据的特征信息。其中,第三节点可以是模型训练节点或者模型使用节点,也可以称为第三实体,如,模型训练实体或模型使用实体。The above example is introduced by taking the first node sending the second information to the second node, and the second information indicates the method for obtaining the characteristic information of the original data as an example. Alternatively, the second information can also be sent by the third node to the second node (the embodiment shown in Figure 7). That is to say, the third node indicates the method for obtaining the characteristic information of the original data to the second node, so that the second node processes based on the method for obtaining the characteristic information of the original data to obtain the characteristic information of the original data. Among them, the third node can be a model training node or a model use node, and can also be called a third entity, such as a model training entity or a model use entity.
上述以第二节点对原始数据进行处理得到原始数据的特征信息为例进行介绍。可替代的,还可以是第一节点对原始数据进行处理得到原始数据的特征信息(图8所示实施例以及图9所示实施例)或第三节点对原始数据进行处理得到原始数据的特征信息(图10所示实施例)。The above is introduced by taking the second node processing the original data to obtain the characteristic information of the original data as an example. Alternatively, the first node may process the original data to obtain the characteristic information of the original data (the embodiment shown in FIG8 and the embodiment shown in FIG9 ) or the third node may process the original data to obtain the characteristic information of the original data (the embodiment shown in FIG10 ).
下面分别基于图7、图8、图9和图10依次对上述替代方案进行详细介绍。The above alternative solutions are described in detail below based on Figures 7, 8, 9 and 10 respectively.
参照图7所示,是本申请实施例提供的另一种模型数据获取方法的流程示意图。图7所示示例是以第一节点(数据合成节点,或者可以称为中心节点、第一实体等)、第二节点(数据提供节点,或者可以称为待服务节点、第二实体等)和第三节点(模型训练节点或者模型使用节点,也可以称为第三实体,如,模型训练实体或模型使用实体)作为该交互示意的执行主体为例进行示意的。该示例与图6所示示例的区别在于该第二信息是第三节点向第二节点发送的,而不是第一节点向第二节点发送的。如图7所示的模型数据获取方法可以包括步骤701-705。步骤701-705具体如下:Referring to Figure 7, it is a flow chart of another model data acquisition method provided by an embodiment of the present application. The example shown in Figure 7 is illustrated by taking the first node (data synthesis node, or it can be called a central node, a first entity, etc.), the second node (data providing node, or it can be called a node to be served, a second entity, etc.) and the third node (model training node or model use node, or it can be called a third entity, such as a model training entity or a model use entity) as an example of the execution subject of the interaction diagram. The difference between this example and the example shown in Figure 6 is that the second information is sent by the third node to the second node, rather than by the first node to the second node. The model data acquisition method shown in Figure 7 may include steps 701-705. Steps 701-705 are as follows:
701、第一节点向第二节点发送第三信息,该第三信息指示第一信息的上报配置。相应地,第二节点接收该第三信息。701. A first node sends third information to a second node, where the third information indicates a reporting configuration of the first information. Correspondingly, the second node receives the third information.
针对该部分的介绍,可参阅图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 601 in the embodiment shown in FIG6 , which will not be repeated here.
702、第三节点向第二节点发送第二信息,该第二信息指示原始数据的特征信息的获取方式。相应地,第二节点接收该第二信息。702. The third node sends second information to the second node, where the second information indicates a method for acquiring characteristic information of the original data. Correspondingly, the second node receives the second information.
第三节点根据模型需求确定出原始数据的特征提取方式。进而向第二节点发送该原始数据的特征信息的获取方式。The third node determines a feature extraction method for the original data according to the model requirements, and then sends a method for obtaining feature information of the original data to the second node.
703、第二节点基于该第二信息对原始数据进行处理,得到原始数据的特征信息。703. The second node processes the original data based on the second information to obtain characteristic information of the original data.
针对该部分的介绍可参阅前述图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 601 in the embodiment shown in FIG. 6 , which will not be repeated here.
可选的,在第一种可能的实现方式中,第三节点还向第二节点发送原始数据的数据标签的获取方式。例如,上述第二信息还指示原始数据的数据标签的获取方式。Optionally, in a first possible implementation manner, the third node further sends a method for obtaining the data label of the original data to the second node. For example, the second information further indicates a method for obtaining the data label of the original data.
在第二种可能的实现方式中,第一节点向第二节点发送第二信息,所述第二信息指示所述第二节点得到所述第一信息的处理方式,所述第二节点得到所述第一信息的处理方式包括原始数据的数据标签的获取方式。In a second possible implementation, the first node sends second information to the second node, where the second information indicates a processing method for the second node to obtain the first information, and the processing method for the second node to obtain the first information includes a method for obtaining a data label of the original data.
704、第二节点向第一节点发送第一信息,该第一信息包括以下类型的信息:原始数据和该原始数据的特征信息。相应地,第一节点接收该第一信息。704. The second node sends first information to the first node, where the first information includes the following types of information: original data and characteristic information of the original data. Correspondingly, the first node receives the first information.
可选的,所述第一信息还包括以下类型的信息:原始数据的数据标签。Optionally, the first information also includes the following types of information: data labels of original data.
针对该部分的介绍,可参阅图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 601 in the embodiment shown in FIG6 , which will not be repeated here.
705、第一节点基于第一信息生成多个合成数据,该多个合成数据用于模型的处理。705. The first node generates a plurality of synthetic data based on the first information, and the plurality of synthetic data are used for processing the model.
针对该部分的介绍,可参阅前述图6所示实施例中步骤602的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 602 in the embodiment shown in FIG. 6 , which will not be repeated here.
可选的,第一节点为前述执行第一节点相关的动作的第三方设备。比如,上述步骤701,704,705均由第三方设备执行。Optionally, the first node is a third-party device that performs the aforementioned actions related to the first node. For example, the above steps 701, 704, and 705 are all performed by a third-party device.
可选的,第二节点为前述执行第二节点相关的动作的第三方设备。比如,上述步骤701,702,703,704由第三方设备执行。Optionally, the second node is a third-party device that performs the aforementioned actions related to the second node. For example, the above steps 701, 702, 703, and 704 are performed by a third-party device.
可选的,第三节点为前述执行第三节点相关的动作的第三方设备。比如,上述步骤702由第三方设备执行。Optionally, the third node is a third-party device that performs the aforementioned third-node related actions. For example, the above step 702 is performed by a third-party device.
可选的,第一节点为网络设备。比如,上述步骤701,704,705均由网络设备执行。Optionally, the first node is a network device. For example, the above steps 701, 704, and 705 are all performed by the network device.
可选的,第二节点为终端设备。比如,上述步骤701,702,703,704由终端设备执行。Optionally, the second node is a terminal device. For example, the above steps 701, 702, 703, and 704 are performed by the terminal device.
可选的,第三节点为AI节点。比如,上述步骤702由AI节点执行。Optionally, the third node is an AI node. For example, the above step 702 is performed by the AI node.
可选的,第一节点包括网络设备和第三方设备。在一个示例中,上述步骤705可以由第三方设备,如OTT,或,云服务器等执行,上述步骤701,703可以由网络设备执行。此外,网络设备与第三方设备之间也可以进行通信,进行上述步骤701,703中所传输的内容的传输。Optionally, the first node includes a network device and a third-party device. In one example, the above step 705 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 701 and 703 can be performed by the network device. In addition, the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 701 and 703.
可选的,第二节点包括终端设备和第三方设备。在一个示例中,上述步骤703可以由第三方设备,如OTT,或,云服务器等执行,上述步骤701,702,704可以由终端设备执行。此外,终端设备与第三方设备之间也可以进行通信,进行上述步骤701,702,704中所传输的内容的传输。Optionally, the second node includes a terminal device and a third-party device. In one example, the above step 703 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 701, 702, and 704 can be performed by the terminal device. In addition, the terminal device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 701, 702, and 704.
可选的,第三节点为AI实体。在一个示例中,上述步骤702可以由第三方设备,如OTT,或,云服务器等执行。Optionally, the third node is an AI entity. In one example, the above step 702 can be performed by a third-party device, such as OTT, or a cloud server.
本申请实施例,第二节点基于来自第三节点的原始数据的特征信息的获取方式对原始数据进行处理,得到原始数据的特征信息。进而,第二节点向第一节点发送原始数据和原始数据的特征信息。第一节点基于来自第二节点的原始数据以及原始数据的特征信息,生成多个合成数据。该示例中,模型训练节点或模型使用节点(第三节点)根据模型需求确定出原始数据的特征信息的获取方式(或原始数据的数据标签的获取方式),可提高合成数据的质量,进而有助于提高模型训练或更新的性能。In an embodiment of the present application, the second node processes the original data based on the method for obtaining the characteristic information of the original data from the third node to obtain the characteristic information of the original data. Furthermore, the second node sends the original data and the characteristic information of the original data to the first node. The first node generates multiple synthetic data based on the original data from the second node and the characteristic information of the original data. In this example, the model training node or the model use node (third node) determines the method for obtaining the characteristic information of the original data (or the method for obtaining the data label of the original data) according to the model requirements, which can improve the quality of the synthetic data, thereby helping to improve the performance of model training or updating.
参照图8所示,是本申请实施例提供的另一种模型数据获取方法的流程示意图。图8所示示例是以第一节点(数据合成节点,或者可以称为中心节点、第一实体等)和第二节点(数据提供节点,或者可以称为待服务节点、第二实体等)作为该交互示意的执行主体为例进行示意的。如图8所示的模型数据获取方法可以包括步骤801-805。步骤801-805具体如下:Referring to FIG8 , it is a flowchart of another model data acquisition method provided by an embodiment of the present application. The example shown in FIG8 is illustrated by taking the first node (data synthesis node, or can be called a central node, a first entity, etc.) and the second node (data providing node, or can be called a node to be served, a second entity, etc.) as an example of the execution subject of the interaction. The model data acquisition method shown in FIG8 may include steps 801-805. Steps 801-805 are as follows:
801、第一节点向第二节点发送第七信息,该第七信息指示第六信息的上报配置。相应地,第二节点接收该第七信息。801. A first node sends seventh information to a second node, where the seventh information indicates a reporting configuration of sixth information. Correspondingly, the second node receives the seventh information.
例如,第七信息指示第六信息包括原始数据;或者第七信息指示第六信息包括原始数据、原始数据的数据标签;或者第七信息指示第六信息包括原始数据、原始数据的数据标签、原始数据优先级等。For example, the seventh information indicates that the sixth information includes original data; or the seventh information indicates that the sixth information includes original data and a data tag of the original data; or the seventh information indicates that the sixth information includes original data, a data tag of the original data, a priority of the original data, etc.
可选的,在第七信息指示第六信息包括原始数据的数据标签时,第一节点还向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。Optionally, when the seventh information indicates that the sixth information includes the data label of the original data, the first node also sends eighth information to the second node, and the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining the data label of the original data.
当然,该原始数据的数据标签的获取方式还可以是第二节点根据模型处理需求确定的,本方案对此不作限制。Of course, the method for obtaining the data label of the original data can also be determined by the second node according to the model processing requirements, and this solution does not limit this.
进而,第二节点基于该原始数据的数据标签的获取方式和原始数据,得到原始数据的数据标签。Furthermore, the second node obtains the data label of the original data based on the method for obtaining the data label of the original data and the original data.
第二节点还向第一节点上报该原始数据的数据标签。The second node also reports the data label of the original data to the first node.
针对该部分的介绍,可参阅图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 601 in the embodiment shown in FIG6 , which will not be repeated here.
802、第二节点向第一节点发送第六信息,该第六信息包括以下类型的信息:原始数据。相应地,第一节点接收该第六信息。802. The second node sends sixth information to the first node, where the sixth information includes the following type of information: original data. Correspondingly, the first node receives the sixth information.
803、第二节点还向第一节点发送第九信息,该第九信息指示所述第一节点得到该原始数据的特征信息的获取方式。相应地,第一节点接收该第九信息。803. The second node further sends ninth information to the first node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data. Correspondingly, the first node receives the ninth information.
也就是说,该示例中原始数据的特征信息的获取方式是第二节点确定的。第二节点可以根据模型需求确定该原始数据的特征信息的获取方式。That is to say, in this example, the method for acquiring the characteristic information of the original data is determined by the second node. The second node can determine the method for acquiring the characteristic information of the original data according to the model requirements.
804、第一节点基于该第六信息和第九信息,得到原始数据的特征信息。804. The first node obtains characteristic information of the original data based on the sixth information and the ninth information.
第一节点基于接收到的原始数据以及原始数据的特征信息的获取方式,进而可得到原始数据的特征信息。The first node can obtain the characteristic information of the original data based on the received original data and the method for obtaining the characteristic information of the original data.
805、第一节点基于原始数据的特征信息和第六信息生成多个合成数据,该多个合成数据用于模型的处理。805. The first node generates a plurality of synthetic data based on the feature information of the original data and the sixth information, and the plurality of synthetic data are used for processing the model.
针对该步骤的介绍可参阅前述图6所示实施例中步骤602的记载,在此不再赘述。For the introduction of this step, please refer to the description of step 602 in the embodiment shown in FIG. 6 , which will not be repeated here.
可选的,第一节点为前述执行第一节点相关的动作的第三方设备。比如,上述步骤801-805均由第三方设备执行。Optionally, the first node is a third-party device that performs the aforementioned actions related to the first node. For example, the above steps 801-805 are all performed by a third-party device.
可选的,第二节点为前述执行第二节点相关的动作的第三方设备。比如,上述步骤801-803由第三方设备执行。Optionally, the second node is a third-party device that performs the aforementioned actions related to the second node. For example, the above steps 801-803 are performed by a third-party device.
可选的,第一节点为网络设备。比如,上述步骤801-805均由网络设备执行。Optionally, the first node is a network device. For example, the above steps 801-805 are all performed by the network device.
可选的,第二节点为终端设备。比如,上述步骤801-803由终端设备执行。Optionally, the second node is a terminal device. For example, the above steps 801-803 are performed by the terminal device.
可选的,第一节点包括网络设备和第三方设备。在一个示例中,上述步骤804和805可以由第三方设备,如OTT,或,云服务器等执行,上述步骤801-803可以由网络设备执行。此外,网络设备与第三方设备之间也可以进行通信,进行上述步骤801-803中所传输的内容的传输。Optionally, the first node includes a network device and a third-party device. In one example, the above steps 804 and 805 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 801-803 can be performed by the network device. In addition, the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 801-803.
本申请实施例,第一节点基于来自第二节点的原始数据以及原始数据的特征信息的获取方式进行处理,得到原始数据的特征信息。进而第一节点基于原始数据的特征信息和原始数据生成多个合成数据。第二节点根据模型需求确定出原始数据的特征信息的获取方式,提高了合成数据的质量,进而有助于提高模型训练或更新的性能。In the embodiment of the present application, the first node processes the original data from the second node and the method for obtaining the characteristic information of the original data to obtain the characteristic information of the original data. Then the first node generates multiple synthetic data based on the characteristic information of the original data and the original data. The second node determines the method for obtaining the characteristic information of the original data according to the model requirements, improves the quality of the synthetic data, and then helps to improve the performance of model training or updating.
再如,下面以第三节点向第一节点指示原始数据的特征信息的获取方式为例进行介绍。参照图9所示,是本申请实施例提供的又一种模型数据获取方法的流程示意图。图9所示示例是以第一节点(数据合成节点,或者可以称为中心节点、第一实体等)、第二节点(数据提供节点,或者可以称为待服务节点、第二实体等)和第三节点(模型训练节点或者模型使用节点,也可以称为第三实体,如,模型训练实体或模型使用实体)作为该交互示意的执行主体为例进行示意的。如图9所示的模型数据获取方法可以包括步骤901-905。步骤901-905具体如下:For another example, the following is an introduction to the method of obtaining characteristic information of the original data indicated by the third node to the first node. Referring to Figure 9, it is a flow chart of another model data acquisition method provided by an embodiment of the present application. The example shown in Figure 9 is illustrated by taking the first node (data synthesis node, or it can be called a central node, a first entity, etc.), the second node (data providing node, or it can be called a node to be served, a second entity, etc.) and the third node (model training node or model using node, also referred to as a third entity, such as a model training entity or a model using entity) as an example of the execution subject of the interaction. The model data acquisition method shown in Figure 9 may include steps 901-905. Steps 901-905 are as follows:
901、第一节点向第二节点发送第七信息,该第七信息指示第六信息的上报配置。相应地,第二节点接收该第七信息。901. A first node sends seventh information to a second node, where the seventh information indicates a reporting configuration of sixth information. Correspondingly, the second node receives the seventh information.
例如,第七信息指示第六信息包括原始数据;或者第七信息指示第六信息包括原始数据、原始数据的数据标签;或者第七信息指示第六信息包括原始数据、原始数据的数据标签、原始数据优先级等。For example, the seventh information indicates that the sixth information includes original data; or the seventh information indicates that the sixth information includes original data and a data tag of the original data; or the seventh information indicates that the sixth information includes original data, a data tag of the original data, a priority of the original data, etc.
可选的,在第七信息指示第六信息包括原始数据的数据标签时,第一节点还向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。Optionally, when the seventh information indicates that the sixth information includes the data label of the original data, the first node also sends eighth information to the second node, and the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining the data label of the original data.
当然,该原始数据的数据标签的获取方式还可以是第二节点根据模型处理需求确定的,本方案对此不作限制。Of course, the method for obtaining the data label of the original data can also be determined by the second node according to the model processing requirements, and this solution does not limit this.
进而,第二节点基于该原始数据的数据标签的获取方式和原始数据,得到原始数据的数据标签。Furthermore, the second node obtains the data label of the original data based on the method for obtaining the data label of the original data and the original data.
针对该部分的介绍可参阅图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the record of step 601 in the embodiment shown in FIG6 , which will not be repeated here.
902、第二节点向第一节点发送第六信息,该第六信息包括以下类型的信息:原始数据。相应地,第一节点接收该第六信息。902. The second node sends sixth information to the first node, where the sixth information includes the following type of information: original data. Correspondingly, the first node receives the sixth information.
903、第三节点向第一节点发送第九信息,该第九信息指示第一节点得到上述原始数据的特征信息的获取方式。相应地,第一节点接收该第九信息。903. The third node sends ninth information to the first node, where the ninth information indicates a method for the first node to obtain the characteristic information of the original data. Correspondingly, the first node receives the ninth information.
该第三节点,可以是模型训练节点或者模型使用节点。模型训练节点或模型使用节点根据模型需求确定出原始数据的特征提取方式。进而向第一节点发送该原始数据的特征提取方式。The third node may be a model training node or a model use node. The model training node or the model use node determines a feature extraction method of the original data according to the model requirements, and then sends the feature extraction method of the original data to the first node.
在一种可能的实现方式中,在第七信息指示第六信息包括原始数据的数据标签时,第三节点还向第二节点发送原始数据的数据标签的获取方式。也就是说,该原始数据的数据标签的获取方式可以是第三节点指示给第二节点的。In a possible implementation, when the seventh information indicates that the sixth information includes the data label of the original data, the third node also sends the second node a method for obtaining the data label of the original data. That is, the method for obtaining the data label of the original data may be indicated by the third node to the second node.
在一种可能的实现方式中,第三节点还向第一节点发送原始数据的数据标签的获取方式。也就是说,该原始数据的数据标签的获取方式可以是第三节点指示给第一节点的。In a possible implementation, the third node also sends the first node a method for obtaining the data label of the original data. That is, the method for obtaining the data label of the original data may be indicated by the third node to the first node.
904、第一节点基于第九信息和第六信息,得到原始数据的特征信息。904. The first node obtains characteristic information of the original data based on the ninth information and the sixth information.
905、第一节点基于原始数据的特征信息和第六信息生成多个合成数据,该多个合成数据用于模型的处理。905. The first node generates a plurality of synthetic data based on the feature information of the original data and the sixth information, and the plurality of synthetic data are used for processing the model.
针对该部分的介绍,可参阅图6所示实施例中步骤602的记载,在此不再赘述。For the introduction of this part, please refer to the record of step 602 in the embodiment shown in FIG6 , which will not be repeated here.
可选的,第一节点为前述执行第一节点相关的动作的第三方设备。比如,上述步骤901-905均由第三方设备执行。Optionally, the first node is a third-party device that performs the aforementioned actions related to the first node. For example, the above steps 901-905 are all performed by a third-party device.
可选的,第二节点为前述执行第二节点相关的动作的第三方设备。比如,上述步骤901和902由第三方设备执行。Optionally, the second node is a third-party device that performs the aforementioned second-node-related actions. For example, the above steps 901 and 902 are performed by a third-party device.
可选的,第三节点为前述执行第三节点相关的动作的第三方设备。比如,上述步骤903由第三方设备执行。Optionally, the third node is a third-party device that performs the aforementioned third-node related actions. For example, the above step 903 is performed by a third-party device.
可选的,第一节点为网络设备。比如,上述步骤901-905均由网络设备执行。Optionally, the first node is a network device. For example, the above steps 901-905 are all performed by the network device.
可选的,第二节点为终端设备。比如,上述步骤901和902由终端设备执行。Optionally, the second node is a terminal device. For example, the above steps 901 and 902 are performed by the terminal device.
可选的,第三节点为AI节点。比如,上述步骤903由AI节点执行。Optionally, the third node is an AI node. For example, the above step 903 is performed by the AI node.
可选的,第一节点包括网络设备和第三方设备。在一个示例中,上述步骤904和905可以由第三方设备,如OTT,或,云服务器等执行,上述步骤901-903可以由网络设备执行。此外,网络设备与第三方设备之间也可以进行通信,进行上述步骤901,903中所传输的内容的传输。Optionally, the first node includes a network device and a third-party device. In one example, the above steps 904 and 905 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 901-903 can be performed by the network device. In addition, the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 901 and 903.
可选的,第三节点为AI实体。在一个示例中,上述步骤903可以由第三方设备,如OTT,或,云服务器等执行。Optionally, the third node is an AI entity. In one example, the above step 903 can be performed by a third-party device, such as OTT, or a cloud server.
本申请实施例,第一节点基于来自第二节点的原始数据以及来自第三节点的原始数据的特征信息的获取方式,得到原始数据的特征信息。进而第一节点基于原始数据的特征信息和原始数据生成多个合成数据。该示例,基于原始数据和原始数据的特征信息可以生成满足多样性要求且数据分布与实测数据分布相同的合成数据,提高了合成数据的质量,进而有助于提高模型训练或更新的性能。In an embodiment of the present application, the first node obtains characteristic information of the original data based on the original data from the second node and the characteristic information of the original data from the third node. Then the first node generates multiple synthetic data based on the characteristic information of the original data and the original data. In this example, synthetic data that meets the diversity requirements and has the same data distribution as the measured data distribution can be generated based on the original data and the characteristic information of the original data, thereby improving the quality of the synthetic data and thus helping to improve the performance of model training or updating.
下面以第三节点对原始数据进行处理得到原始数据的特征信息为例进行介绍。参照图10所示,是本申请实施例提供的又一种模型数据获取方法的流程示意图。可选的,该方法可以应用于前述的模型数据获取系统,例如图1所示的模型数据获取系统。图10所示示例是以第一节点(数据合成节点,或者可以称为中心节点、第一实体等)、第二节点(数据提供节点,或者可以称为待服务节点、第二实体等)和第三节点(模型训练节点或者模型使用节点,也可以称为第三实体,如,模型训练实体或模型使用实体)作为该交互示意的执行主体为例进行示意的。如图10所示的模型数据获取方法可以包括步骤1001-1007。步骤1001-1007具体如下:The following is an introduction using the example of a third node processing the original data to obtain the characteristic information of the original data. Referring to Figure 10, it is a flow chart of another model data acquisition method provided in an embodiment of the present application. Optionally, the method can be applied to the aforementioned model data acquisition system, such as the model data acquisition system shown in Figure 1. The example shown in Figure 10 is illustrated by taking the first node (data synthesis node, or it can be called a central node, a first entity, etc.), the second node (data providing node, or it can be called a node to be served, a second entity, etc.) and the third node (model training node or model using node, also referred to as a third entity, such as a model training entity or a model using entity) as an example of the execution subject of the interaction diagram. The model data acquisition method shown in Figure 10 may include steps 1001-1007. Steps 1001-1007 are as follows:
1001、第一节点向第二节点发送第七信息,该第七信息指示第六信息的上报配置。相应地,第二节点接收该第七信息。1001. A first node sends seventh information to a second node, where the seventh information indicates a reporting configuration of sixth information. Correspondingly, the second node receives the seventh information.
例如,第七信息指示第六信息包括原始数据;或者第七信息指示第六信息包括原始数据、原始数据的数据标签;或者第七信息指示第六信息包括原始数据、原始数据的数据标签、原始数据优先级等。For example, the seventh information indicates that the sixth information includes original data; or the seventh information indicates that the sixth information includes original data and a data tag of the original data; or the seventh information indicates that the sixth information includes original data, a data tag of the original data, a priority of the original data, etc.
针对该部分的介绍可参阅前述图6所示实施例中步骤601的记载,在此不再赘述。For the introduction of this part, please refer to the description of step 601 in the embodiment shown in FIG. 6 , which will not be repeated here.
1002、第二节点向第一节点发送第六信息,该第六信息包括以下类型的信息:原始数据。相应地,第一节点接收该第六信息。1002. The second node sends sixth information to the first node, where the sixth information includes the following type of information: original data. Correspondingly, the first node receives the sixth information.
1003、第二节点向第三节点发送原始数据。相应地,第三节点接收该原始数据。1003. The second node sends original data to the third node. Correspondingly, the third node receives the original data.
其中,第二节点不仅向第一节点发送原始数据。第二节点还向第三节点发送原始数据。The second node not only sends the original data to the first node, but also sends the original data to the third node.
该第三节点,可以是模型训练节点或者模型使用节点。The third node may be a model training node or a model use node.
1004、第三节点对该原始数据进行处理,得到原始数据的特征信息。1004. The third node processes the original data to obtain feature information of the original data.
其中,第三节点(模型训练节点或模型使用节点)根据模型需求确定出原始数据的特征信息的获取方式。进而基于该原始数据的特征信息的获取方式对接收到的原始数据进行处理得到原始数据的特征信息。The third node (model training node or model use node) determines a method for obtaining the characteristic information of the original data according to the model requirements, and then processes the received original data based on the method for obtaining the characteristic information of the original data to obtain the characteristic information of the original data.
1005、第一节点向第三节点发送第十信息,该第十信息指示第十一信息的上报配置。相应地,第三节点接收该第十信息。1005. The first node sends tenth information to the third node, where the tenth information indicates a reporting configuration of the eleventh information. Correspondingly, the third node receives the tenth information.
该第十一信息包括以下类型的信息:所述原始数据的特征信息。该上报配置例如可以包括原始数据的特征信息以及原始数据的特征信息的数据量等。The eleventh information includes the following types of information: characteristic information of the original data. The reporting configuration may include, for example, the characteristic information of the original data and the data volume of the characteristic information of the original data.
在一种可能的实现方式中,该第十信息还指示原始数据的数据标签的上报配置。In a possible implementation manner, the tenth information further indicates a reporting configuration of a data tag of the original data.
也就是说,原始数据的数据标签可以是第三节点上报的,还可以是上述第二节点上报的,本方案对此不作限制。That is to say, the data label of the original data may be reported by the third node or by the second node, and this solution does not impose any restriction on this.
1006、第三节点向第一节点发送第十一信息,该第十一信息包括以下类型的信息:原始数据的特征信息。相应地,第一节点接收该第十一信息。1006. The third node sends eleventh information to the first node, where the eleventh information includes the following type of information: characteristic information of original data. Correspondingly, the first node receives the eleventh information.
即第三节点向第一节点上报原始数据的特征信息。That is, the third node reports the characteristic information of the original data to the first node.
1007、第一节点基于第六信息和第十一信息生成多个合成数据,该多个合成数据用于模型的处理。1007. The first node generates a plurality of synthetic data based on the sixth information and the eleventh information, and the plurality of synthetic data are used for processing the model.
针对上述步骤的介绍可参阅前述图6所示实施例中步骤602的记载,在此不再赘述。For the introduction of the above steps, please refer to the description of step 602 in the embodiment shown in FIG. 6 , which will not be repeated here.
示例性的,在所述第十一信息包括原始数据的数据标签时,在一种可能的实现方式中,第一节点还向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。Exemplarily, when the eleventh information includes the data label of the original data, in a possible implementation, the first node also sends an eighth information to the second node, and the eighth information indicates a processing method for the second node to obtain the sixth information, and the processing method for the second node to obtain the sixth information includes a method for obtaining the data label of the original data.
或者,第三节点向第二节点发送第八信息,所述第八信息指示所述第二节点得到所述第六信息的处理方式,所述第二节点得到所述第六信息的处理方式包括原始数据的数据标签的获取方式。本方案对此不作限制。Alternatively, the third node sends the eighth information to the second node, the eighth information indicating the processing method of the second node to obtain the sixth information, and the processing method of the second node to obtain the sixth information includes the method of obtaining the data label of the original data. This solution does not limit this.
该示例以第二节点上报原始数据的数据标签为例进行介绍。This example takes the data label of the original data reported by the second node as an example.
可替代的,原始数据的数据标签还可以是第三节点上报的。示例性的,在所述第十一信息包括原始数据的数据标签时,第三节点根据模型需求确定出原始数据的数据标签的获取方式。第三节点基于原始数据的数据标签的获取方式对原始数据进行处理,得到原始数据的数据标签。进而,第三节点向第一节点发送该原始数据的数据标签。Alternatively, the data label of the original data may also be reported by the third node. Exemplarily, when the eleventh information includes the data label of the original data, the third node determines the method for obtaining the data label of the original data according to the model requirements. The third node processes the original data based on the method for obtaining the data label of the original data to obtain the data label of the original data. Then, the third node sends the data label of the original data to the first node.
可选的,第一节点为前述执行第一节点相关的动作的第三方设备。比如,上述步骤1001、1002、1005-1007均由第三方设备执行。Optionally, the first node is a third-party device that performs the aforementioned actions related to the first node. For example, the above steps 1001, 1002, 1005-1007 are all performed by a third-party device.
可选的,第二节点为前述执行第二节点相关的动作的第三方设备。比如,上述步骤1001-1003由第三方设备执行。Optionally, the second node is a third-party device that performs the aforementioned actions related to the second node. For example, the above steps 1001-1003 are performed by a third-party device.
可选的,第三节点为前述执行第三节点相关的动作的第三方设备。比如,上述步骤1003-1006由第三方设备执行。Optionally, the third node is a third-party device that performs the aforementioned actions related to the third node. For example, the above steps 1003-1006 are performed by a third-party device.
可选的,第一节点为网络设备。比如,上述步骤1001、1002、1005-1007均由网络设备执行。Optionally, the first node is a network device. For example, the above steps 1001, 1002, 1005-1007 are all performed by the network device.
可选的,第二节点为终端设备。比如,上述步骤1001-1003由终端设备执行。Optionally, the second node is a terminal device. For example, the above steps 1001-1003 are performed by the terminal device.
可选的,第三节点为AI节点。比如,上述步骤1003-1006由AI节点执行。Optionally, the third node is an AI node. For example, the above steps 1003-1006 are performed by the AI node.
可选的,第一节点包括网络设备和第三方设备。在一个示例中,上述步骤1007可以由第三方设备,如OTT,或,云服务器等执行,上述步骤1001、1002、1005、1006可以由网络设备执行。此外,网络设备与第三方设备之间也可以进行通信,进行上述步骤1001、1002、1005、1006中所传输的内容的传输。Optionally, the first node includes a network device and a third-party device. In one example, the above step 1007 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 1001, 1002, 1005, and 1006 can be performed by the network device. In addition, the network device and the third-party device can also communicate with each other to transmit the content transmitted in the above steps 1001, 1002, 1005, and 1006.
可选的,第三节点包括第三方设备,以及网络设备或终端设备。在一个示例中,上述步骤1004可以由第三方设备,如OTT,或,云服务器等执行,上述步骤1003、1005、1006可以由网络设备或终端设备执行。此外,网络设备或终端设备与第三方设备之间也可以进行通信,进行上述步骤1003、1005、1006中所传输的内容的传输。Optionally, the third node includes a third-party device, and a network device or a terminal device. In one example, the above step 1004 can be performed by a third-party device, such as an OTT, or a cloud server, and the above steps 1003, 1005, and 1006 can be performed by a network device or a terminal device. In addition, the network device or the terminal device can also communicate with the third-party device to transmit the content transmitted in the above steps 1003, 1005, and 1006.
本申请实施例,第一节点基于来自第二节点的原始数据以及来自第三节点的原始数据的特征信息,生成多个合成数据。该示例,基于原始数据和原始数据的特征信息可以生成满足多样性要求且数据分布与实测数据分布相同的合成数据,提高了合成数据的质量,进而有助于提高模型训练或更新的性能。In the embodiment of the present application, the first node generates multiple synthetic data based on the original data from the second node and the feature information of the original data from the third node. In this example, synthetic data that meets the diversity requirements and has the same data distribution as the measured data distribution can be generated based on the original data and the feature information of the original data, thereby improving the quality of the synthetic data and further helping to improve the performance of model training or updating.
需要说明的是,在本申请的各个实施例中,如果没有特殊说明以及逻辑冲突,各个实施例之间的术语和/或描述具有一致性、且可以相互引用,不同的实施例中的技术特征根据其内在的逻辑关系可以组合形成新的实施例。It should be noted that in the various embodiments of the present application, unless otherwise specified or there is a logical conflict, the terms and/or descriptions between the various embodiments are consistent and can be referenced mutually, and the technical features in different embodiments can be combined to form new embodiments according to their internal logical relationships.
需要说明的是,本申请实施例中的第二节点和第三节点可以是位于同一节点内,也可以是不同的两个节点,本方案对此不作限制。It should be noted that the second node and the third node in the embodiment of the present application may be located in the same node or in two different nodes, and this solution does not impose any restrictions on this.
上述详细阐述了本申请实施例的方法,下面提供了本申请实施例的装置。可以理解的,本申请各个装置实施例中,对多个单元或者模块的划分仅是一种根据功能进行的逻辑划分,不作为对装置具体的结构的限定。在具体实现中,其中部分功能模块可能被细分为更多细小的功能模块,部分功能模块也可能组合成一个功能模块,但无论这些功能模块是进行了细分还是组合,装置所执行的大致流程是相同的。例如,一些装置中包含接收单元和发送单元。一些设计中,发送单元和接收单元也可以集成为通信单元,该通信单元可以实现接收单元和发送单元所实现的功能。通常,每个单元都对应有各自的程序代码(或者说程序指令),这些单元各自对应的程序代码在处理器上运行时,使得该单元受处理单元的控制而执行相应的流程从而实现相应功能。The above detailed description of the method of the embodiment of the present application, the following provides a device of the embodiment of the present application. It can be understood that in the various device embodiments of the present application, the division of multiple units or modules is only a logical division according to function, and is not used as a limitation on the specific structure of the device. In a specific implementation, some functional modules may be subdivided into more small functional modules, and some functional modules may also be combined into one functional module, but no matter whether these functional modules are subdivided or combined, the general process performed by the device is the same. For example, some devices contain a receiving unit and a sending unit. In some designs, the sending unit and the receiving unit can also be integrated into a communication unit, which can implement the functions implemented by the receiving unit and the sending unit. Usually, each unit corresponds to its own program code (or program instruction), and when the program code corresponding to each of these units is run on the processor, the unit is controlled by the processing unit to execute the corresponding process to implement the corresponding function.
本申请实施例还提供用于实现以上任一种方法的装置,例如,提供一种模型数据获取装置包括用以实现以上任一种方法中第一节点所执行的各步骤的模块(或手段)。再如,提供一种模型数据获取装置包括用以实现以上任一种方法中第二节点所执行的各步骤的模块(或手段)。再如,提供一种模型数据获取装置包括用以实现以上任一种方法中第三节点所执行的各步骤的模块(或手段)。The embodiments of the present application also provide a device for implementing any of the above methods, for example, a model data acquisition device is provided, including a module (or means) for implementing each step performed by the first node in any of the above methods. For another example, a model data acquisition device is provided, including a module (or means) for implementing each step performed by the second node in any of the above methods. For another example, a model data acquisition device is provided, including a module (or means) for implementing each step performed by the third node in any of the above methods.
例如,参照图11所示,是本申请实施例提供的一种模型数据获取装置的结构示意图。如图11所示,该装置可包括收发模块1101和处理模块1102,其中:For example, referring to FIG11 , which is a schematic diagram of a structure of a model data acquisition device provided in an embodiment of the present application. As shown in FIG11 , the device may include a transceiver module 1101 and a processing module 1102, wherein:
该模型数据获取装置用于实现上述方法实施例中第一节点的功能时,收发模块1101用于执行如图6所示实施例的步骤601中第一节点的操作,以及处理模块1102用于执行如图6所示实施例的步骤602;或者,收发模块1101用于执行如图7所示实施例的步骤701、704中第一节点的操作中的一项或多项,以及处理模块1102用于执行如图7所示实施例的步骤705;或者,收发模块1101用于执行如图8所示实施例的步骤801、802、803中第一节点的操作中的一项或多项,以及处理模块1102用于执行如图8所示实施例的步骤804、805中的一项或多项;或者,收发模块1101用于执行如图9所示实施例的步骤901、902、903中第一节点的操作中的一项或多项,以及处理模块1102用于执行如图9所示实施例的步骤904、905中的一项或多项;或者,收发模块1101用于执行如图10所示实施例的步骤1001、1002、1005、1006中的第一节点的操作中的一项或多项,以及处理模块1102用于执行如图10所示实施例的步骤1007。When the model data acquisition device is used to implement the function of the first node in the above method embodiment, the transceiver module 1101 is used to perform the operation of the first node in step 601 of the embodiment shown in FIG6, and the processing module 1102 is used to perform step 602 of the embodiment shown in FIG6; or, the transceiver module 1101 is used to perform one or more of the operations of the first node in steps 701 and 704 of the embodiment shown in FIG7, and the processing module 1102 is used to perform step 705 of the embodiment shown in FIG7; or, the transceiver module 1101 is used to perform one or more of the operations of the first node in steps 801, 802, and 803 of the embodiment shown in FIG8, And the processing module 1102 is used to execute one or more of steps 804 and 805 of the embodiment shown in Figure 8; or, the transceiver module 1101 is used to execute one or more of the operations of the first node in steps 901, 902, and 903 of the embodiment shown in Figure 9, and the processing module 1102 is used to execute one or more of the steps 904 and 905 of the embodiment shown in Figure 9; or, the transceiver module 1101 is used to execute one or more of the operations of the first node in steps 1001, 1002, 1005, and 1006 of the embodiment shown in Figure 10, and the processing module 1102 is used to execute step 1007 of the embodiment shown in Figure 10.
该模型数据获取装置用于实现上述方法实施例中第二节点的功能时,收发模块1101用于执行如图6所示实施例的步骤601中第二节点的操作;或者,收发模块1101用于执行如图7所示实施例的步骤701、702、704中第二节点的操作中的一项或多项,以及处理模块1102用于执行如图7所示实施例的步骤703;或者,收发模块1101用于执行如图8所示实施例的步骤801、802、803中第二节点的操作中的一项或多项;或者,收发模块1101用于执行如图9所示实施例的步骤901、902中第二节点的操作中的一项或多项;或者,收发模块1101用于执行如图10所示实施例的步骤1001、1002、1003中的第二节点的操作中的一项或多项。When the model data acquisition device is used to implement the function of the second node in the above method embodiment, the transceiver module 1101 is used to execute the operation of the second node in step 601 of the embodiment as shown in Figure 6; or, the transceiver module 1101 is used to execute one or more of the operations of the second node in steps 701, 702, and 704 of the embodiment as shown in Figure 7, and the processing module 1102 is used to execute step 703 of the embodiment as shown in Figure 7; or, the transceiver module 1101 is used to execute one or more of the operations of the second node in steps 801, 802, and 803 of the embodiment as shown in Figure 8; or, the transceiver module 1101 is used to execute one or more of the operations of the second node in steps 901 and 902 of the embodiment as shown in Figure 9; or, the transceiver module 1101 is used to execute one or more of the operations of the second node in steps 1001, 1002, and 1003 of the embodiment as shown in Figure 10.
该模型数据获取装置用于实现上述方法实施例中第三节点的功能时,收发模块1101用于执行如图7所示实施例的步骤702中第三节点的操作;或者,收发模块1101用于执行如图9所示实施例的步骤903中第三节点的操作;或者,收发模块1101用于执行如图10所示实施例的步骤1003、1005、1006中第三节点的操作中的一项或多项,以及处理模块1102用于执行如图10所示实施例的步骤1004。When the model data acquisition device is used to implement the function of the third node in the above method embodiment, the transceiver module 1101 is used to execute the operation of the third node in step 702 of the embodiment as shown in Figure 7; or, the transceiver module 1101 is used to execute the operation of the third node in step 903 of the embodiment as shown in Figure 9; or, the transceiver module 1101 is used to execute one or more of the operations of the third node in steps 1003, 1005, and 1006 of the embodiment as shown in Figure 10, and the processing module 1102 is used to execute step 1004 of the embodiment as shown in Figure 10.
上述各模块所涉及的操作的具体描述可参阅前述实施例的记载,在此不再赘述。The specific description of the operations involved in the above modules can be found in the records of the above embodiments, which will not be repeated here.
应理解以上各个装置中各模块的划分仅是一种逻辑功能的划分,实际实现时可以全部或部分集成到一个物理实体上,也可以物理上分开。此外,模型数据获取装置中的模块可以以处理器调用软件的形式实现;例如模型数据获取装置包括处理电路,处理电路与存储器连接,存储器中存储有指令,处理器调用存储器中存储的指令,以实现以上任一种方法或实现该装置各模块的功能,其中处理电路为处理器或处理器中的部分处理电路,处理器例如为通用处理器,比如中央处理单元(central processing unit,CPU)或微处理器,存储器为装置内的存储器或装置外的存储器。或者,装置中的模块可以以硬件电路的形式实现,可以通过对硬件电路的设计实现部分或全部单元的功能,该硬件电路可以理解为一个或多个处理器;例如,在一种实现中,该硬件电路为专用集成电路(application-specific integrated circuit,ASIC),通过对电路内元件逻辑关系的设计,实现以上部分或全部单元的功能;再如,在另一种实现中,该硬件电路为可以通过可编程逻辑器件(programmable logic device,PLD)实现,以现场可编程门阵列(field programmable gate array,FPGA)为例,其可以包括大量逻辑门电路,通过配置文件来配置逻辑门电路之间的连接关系,从而实现以上部分或全部单元的功能。以上装置的所有模块可以全部通过处理器调用软件的形式实现,或全部通过硬件电路的形式实现,或部分通过处理器调用软件的形式实现,剩余部分通过硬件电路的形式实现。It should be understood that the division of each module in each of the above devices is only a division of logical functions. In actual implementation, they can be fully or partially integrated into one physical entity, or they can be physically separated. In addition, the modules in the model data acquisition device can be implemented in the form of a processor calling software; for example, the model data acquisition device includes a processing circuit, the processing circuit is connected to a memory, and instructions are stored in the memory. The processor calls the instructions stored in the memory to implement any of the above methods or realize the functions of each module of the device, wherein the processing circuit is a processor or a part of the processing circuit in the processor, and the processor is, for example, a general-purpose processor, such as a central processing unit (CPU) or a microprocessor, and the memory is a memory inside the device or a memory outside the device. Alternatively, the modules in the device may be implemented in the form of hardware circuits, and the functions of some or all units may be implemented by designing the hardware circuits, and the hardware circuits may be understood as one or more processors; for example, in one implementation, the hardware circuit is an application-specific integrated circuit (ASIC), and the functions of some or all of the above units may be implemented by designing the logical relationship of the components in the circuit; for another example, in another implementation, the hardware circuit may be implemented by a programmable logic device (PLD), and a field programmable gate array (FPGA) may be used as an example, which may include a large number of logic gate circuits, and the connection relationship between the logic gate circuits may be configured by a configuration file, so as to implement the functions of some or all of the above units. All modules of the above devices may be implemented in the form of a processor calling software, or in the form of hardware circuits, or in part by a processor calling software, and the rest by hardware circuits.
参照图12所示,是本申请实施例提供的又一种模型数据获取装置的硬件结构示意图。如图12所示的模型数据获取装置1200包括一个或多个处理电路1201(图中示例了一个处理电路)。其中,处理电路1201可以为一个或多个处理器,或者,一个或多个处理器中用于处理的电路。Referring to FIG. 12 , it is a schematic diagram of the hardware structure of another model data acquisition device provided in an embodiment of the present application. The model data acquisition device 1200 shown in FIG. 12 includes one or more processing circuits 1201 (one processing circuit is illustrated in the figure). The processing circuit 1201 can be one or more processors, or a circuit used for processing in one or more processors.
可选的,该模型数据获取装置1200还可以包括收发电路1202(图中以虚线表示)。处理电路1201和收发电路1202之间相互耦合。其中,收发电路1202可以为收发器或接口电路。比如,所述装置1200为网络设备或终端设备或核心网设备或AI实体时,收发电路1202可以为收发器或接口电路;所述装置1200为用于网络设备或终端设备或核心网设备或AI实体的芯片时,收发电路1202可以为接口电路。例如,AI实体可以为第三方设备,如OTT,或,云服务器等。Optionally, the model data acquisition device 1200 may further include a transceiver circuit 1202 (indicated by a dotted line in the figure). The processing circuit 1201 and the transceiver circuit 1202 are coupled to each other. Among them, the transceiver circuit 1202 may be a transceiver or an interface circuit. For example, when the device 1200 is a network device or a terminal device or a core network device or an AI entity, the transceiver circuit 1202 may be a transceiver or an interface circuit; when the device 1200 is a chip for a network device or a terminal device or a core network device or an AI entity, the transceiver circuit 1202 may be an interface circuit. For example, the AI entity may be a third-party device, such as an OTT, or a cloud server, etc.
可选的,该模型数据获取装置1200还可以包括存储器1203(图中以虚线表示)。该存储器1203用于存储处理电路1201执行的指令,或存储处理电路1201运行指令所需要的输入数据,或存储处理电路1201运行指令后产生的数据。Optionally, the model data acquisition device 1200 may further include a memory 1203 (indicated by dotted lines in the figure). The memory 1203 is used to store instructions executed by the processing circuit 1201, or to store input data required by the processing circuit 1201 to run instructions, or to store data generated after the processing circuit 1201 runs instructions.
可选的,存储器1203可以位于所述一个或多个处理器中,或者,位于所述一个或多个处理器外,或者,可以包括位于所述一个或多个处理器中的存储部分和位于所述一个或多个处理器外的存储部分。Optionally, the memory 1203 may be located in the one or more processors, or located outside the one or more processors, or may include a storage part located in the one or more processors and a storage part located outside the one or more processors.
存储器1203可以是只读存储器(read only memory,ROM),静态存储设备,动态存储设备或者随机存取存储器(random access memory,RAM)。Memory 1203 can be a read-only memory (ROM), a static storage device, a dynamic storage device or a random access memory (RAM).
存储器1203可以存储程序,当存储器1203中存储的程序被处理电路1201执行时,处理电路1201和收发电路1202用于执行本申请实施例的模型数据获取方法的各个步骤。The memory 1203 can store programs. When the program stored in the memory 1203 is executed by the processing circuit 1201, the processing circuit 1201 and the transceiver circuit 1202 are used to execute the various steps of the model data acquisition method of the embodiment of the present application.
处理电路1201是一种具有信号的处理能力的电路,在一种实现中,处理电路1201可以是具有指令读取与运行能力的电路,例如以下处理器中的一项或多项:中央处理单元CPU、微处理器、图形处理器(graphics processing unit,GPU)(可以理解为一种微处理器)、或数字信号处理器(digital signal processor,DSP)等或者前述处理器中的处理电路;在另一种实现中,处理电路1201可以通过硬件电路的逻辑关系实现一定功能,该硬件电路的逻辑关系是固定的或可以重构的,例如处理电路1201为以下处理器中的一项或多项:ASIC或可编程逻辑器件PLD实现的硬件电路,比如FPGA或者前述处理器中的处理电路。在可重构的硬件电路中,处理器加载配置文档,实现硬件电路配置的过程,可以理解为处理器加载指令,以实现以上部分或全部模块的功能的过程。此外,还可以是针对人工智能设计的硬件电路,其可以理解为一种ASIC或ASIC中的处理电路,例如以下处理器中的一项或多项:神经网络处理单元(neural network processing unit,NPU)、张量处理单元(tensor processing unit,TPU)、深度学习处理单元(deep learning processing unit,DPU)等或者前述处理器中的处理电路。处理电路1201用于执行相关程序,以实现本申请实施例的模型数据获取装置中的单元所需执行的功能,或者执行本申请方法实施例的模型数据获取方法。The processing circuit 1201 is a circuit with signal processing capability. In one implementation, the processing circuit 1201 may be a circuit with instruction reading and execution capability, such as one or more of the following processors: a central processing unit CPU, a microprocessor, a graphics processing unit (GPU) (which may be understood as a microprocessor), or a digital signal processor (DSP), etc., or a processing circuit in the aforementioned processors; in another implementation, the processing circuit 1201 may realize certain functions through the logical relationship of a hardware circuit, and the logical relationship of the hardware circuit is fixed or reconfigurable, for example, the processing circuit 1201 is one or more of the following processors: a hardware circuit implemented by an ASIC or a programmable logic device PLD, such as an FPGA or a processing circuit in the aforementioned processor. In a reconfigurable hardware circuit, the process of the processor loading a configuration document to implement the hardware circuit configuration may be understood as the process of the processor loading instructions to implement the functions of some or all of the above modules. In addition, it can also be a hardware circuit designed for artificial intelligence, which can be understood as an ASIC or a processing circuit in an ASIC, such as one or more of the following processors: a neural network processing unit (NPU), a tensor processing unit (TPU), a deep learning processing unit (DPU), etc. or a processing circuit in the aforementioned processor. The processing circuit 1201 is used to execute relevant programs to implement the functions required to be performed by the units in the model data acquisition device of the embodiment of the present application, or to execute the model data acquisition method of the method embodiment of the present application.
可见,以上装置中的各模块可以是被配置成实施以上方法的一个或多个处理器(或处理电路),例如:CPU、GPU、NPU、TPU、DPU、微处理器、DSP、ASIC、FPGA,或这些处理器形式中至少两种的组合或者,这些处理器中的部分处理电路。It can be seen that each module in the above device can be one or more processors (or processing circuits) configured to implement the above method, such as: CPU, GPU, NPU, TPU, DPU, microprocessor, DSP, ASIC, FPGA, or a combination of at least two of these processor forms or some processing circuits in these processors.
此外,以上装置中的各模块可以全部或部分可以集成在一起,或者可以独立实现。在一种实现中,这些模块集成在一起,以片上系统(system-on-a-chip,SOC)的形式实现。该SOC中可以包括至少一个处理器,用于实现以上任一种方法或实现该装置各模块的功能,该至少一个处理器的种类可以不同,例如包括CPU和FPGA,CPU和人工智能处理器,CPU和GPU等。In addition, the modules in the above device can be fully or partially integrated together, or can be implemented independently. In one implementation, these modules are integrated together and implemented in the form of a system-on-a-chip (SOC). The SOC may include at least one processor for implementing any of the above methods or implementing the functions of the modules of the device. The type of the at least one processor may be different, for example, including a CPU and an FPGA, a CPU and an artificial intelligence processor, a CPU and a GPU, etc.
收发电路1202使用例如但不限于收发器一类的收发装置,来实现装置1200与其他设备或通信网络之间的通信。例如,可以通过收发电路1202获取信息。The transceiver circuit 1202 uses a transceiver device such as, but not limited to, a transceiver to implement communication between the device 1200 and other devices or a communication network. For example, information can be obtained through the transceiver circuit 1202.
应注意,尽管图12所示的装置1200仅仅示出了处理电路,收发电路和存储器,但是在具体实现过程中,本领域的技术人员应当理解,装置1200还包括实现正常运行所必须的其他器件。同时,根据具体需要,本领域的技术人员应当理解,装置1200还可包括实现其他附加功能的硬件器件。此外,本领域的技术人员应当理解,装置1200也可仅仅包括实现本申请实施例所必须的器件,而不必包括图12中所示的全部器件。It should be noted that although the device 1200 shown in FIG. 12 only shows a processing circuit, a transceiver circuit and a memory, in the specific implementation process, those skilled in the art should understand that the device 1200 also includes other devices necessary for normal operation. At the same time, according to specific needs, those skilled in the art should understand that the device 1200 may also include hardware devices for implementing other additional functions. In addition, those skilled in the art should understand that the device 1200 may also only include the devices necessary for implementing the embodiments of the present application, and does not necessarily include all the devices shown in FIG. 12.
本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。An embodiment of the present application also provides a computer-readable storage medium, which stores instructions. When the computer-readable storage medium is executed on a computer or a processor, the computer or the processor executes one or more steps in any of the above methods.
本申请实施例还提供了一种包含指令的计算机程序产品。当该计算机程序产品在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。The embodiment of the present application further provides a computer program product including instructions. When the computer program product is executed on a computer or a processor, the computer or the processor executes one or more steps in any of the above methods.
应理解,在本申请的描述中,除非另有说明,“/”表示前后关联的对象是一种“或”的关系,例如,A/B可以表示A或B;其中A,B可以是单数或者复数。并且,在本申请的描述中,除非另有说明,“多个”是指两个或多于两个。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。另外,为了便于清楚描述本申请实施例的技术方案,在本申请的实施例中,采用了“第一”、“第二”等字样对功能和作用基本相同的相同项或相似项进行区分。本领域技术人员可以理解“第一”、“第二”等字样并不对数量和执行次序进行限定,并且“第一”、“第二”等字样也并不限定一定不同。同时,在本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念,便于理解。It should be understood that in the description of the present application, unless otherwise specified, "/" indicates that the objects associated before and after are in an "or" relationship, for example, A/B can represent A or B; wherein A and B can be singular or plural. Also, in the description of the present application, unless otherwise specified, "multiple" refers to two or more than two. "At least one of the following" or similar expressions refers to any combination of these items, including any combination of single items or plural items. For example, at least one of a, b, or c can represent: a, b, c, a-b, a-c, b-c, or a-b-c, wherein a, b, c can be single or multiple. In addition, in order to facilitate the clear description of the technical solutions of the embodiments of the present application, in the embodiments of the present application, the words "first", "second", etc. are used to distinguish the same items or similar items with substantially the same functions and effects. Those skilled in the art can understand that the words "first", "second", etc. do not limit the quantity and execution order, and the words "first", "second", etc. do not limit them to be necessarily different. Meanwhile, in the embodiments of the present application, words such as "exemplary" or "for example" are used to indicate examples, illustrations or descriptions. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the present application should not be interpreted as being more preferred or more advantageous than other embodiments or designs. Specifically, the use of words such as "exemplary" or "for example" is intended to present related concepts in a concrete manner for ease of understanding.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,该单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。所显示或讨论的相互之间的耦合、或直接耦合、或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the division of the unit is only a logical function division, and there may be other division methods in actual implementation, for example, multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. The mutual coupling, direct coupling, or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者通过该计算机可读存储介质进行传输。该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是只读存储器(read-only memory,ROM),或随机存取存储器(random access memory,RAM),或磁性介质,例如,软盘、硬盘、磁带、磁碟、或光介质,例如,数字通用光盘(digital versatile disc,DVD)、或者半导体介质,例如,固态硬盘(solid state disk,SSD)等。In the above embodiments, it can be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented using software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the process or function according to the embodiment of the present application is generated in whole or in part. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer instructions can be stored in a computer-readable storage medium or transmitted through the computer-readable storage medium. The computer instructions can be transmitted from a website site, computer, server or data center to another website site, computer, server or data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line (digital subscriber line, DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means. The computer-readable storage medium can be any available medium that a computer can access or a data storage device such as a server or data center that includes one or more available media integrated. The available medium can be a read-only memory (ROM), or a random access memory (RAM), or a magnetic medium, such as a floppy disk, a hard disk, a tape, a disk, or an optical medium, such as a digital versatile disc (DVD), or a semiconductor medium, such as a solid state disk (SSD), etc.
以上所述,仅为本申请实施例的具体实施方式,但本申请实施例的保护范围并不局限于此,任何在本申请实施例揭露的技术范围内的变化或替换,都应涵盖在本申请实施例的保护范围之内。因此,本申请实施例的保护范围应以所述权利要求的保护范围为准。The above is only a specific implementation of the embodiment of the present application, but the protection scope of the embodiment of the present application is not limited thereto, and any changes or replacements within the technical scope disclosed in the embodiment of the present application should be included in the protection scope of the embodiment of the present application. Therefore, the protection scope of the embodiment of the present application should be based on the protection scope of the claims.
Claims (48)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202311871100.7 | 2023-12-29 | ||
| CN202311871100.7A CN120234608A (en) | 2023-12-29 | 2023-12-29 | Model data acquisition method, device and system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025140663A1 true WO2025140663A1 (en) | 2025-07-03 |
Family
ID=96159352
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/143477 Pending WO2025140663A1 (en) | 2023-12-29 | 2024-12-28 | Model data acquisition method, apparatus and system |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN120234608A (en) |
| WO (1) | WO2025140663A1 (en) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113159145A (en) * | 2018-04-28 | 2021-07-23 | 华为技术有限公司 | Characteristic engineering arrangement method and device |
| CN114548416A (en) * | 2020-11-26 | 2022-05-27 | 华为技术有限公司 | Data model training method and device |
| CN115423031A (en) * | 2022-09-20 | 2022-12-02 | 腾讯科技(深圳)有限公司 | Model training method and related device |
| CN116318481A (en) * | 2021-12-20 | 2023-06-23 | 华为技术有限公司 | A communication method and device |
| CN116665219A (en) * | 2023-05-26 | 2023-08-29 | 华为技术有限公司 | Data processing method and device |
-
2023
- 2023-12-29 CN CN202311871100.7A patent/CN120234608A/en active Pending
-
2024
- 2024-12-28 WO PCT/CN2024/143477 patent/WO2025140663A1/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113159145A (en) * | 2018-04-28 | 2021-07-23 | 华为技术有限公司 | Characteristic engineering arrangement method and device |
| CN114548416A (en) * | 2020-11-26 | 2022-05-27 | 华为技术有限公司 | Data model training method and device |
| US20230281513A1 (en) * | 2020-11-26 | 2023-09-07 | Huawei Technologies Co., Ltd. | Data model training method and apparatus |
| CN116318481A (en) * | 2021-12-20 | 2023-06-23 | 华为技术有限公司 | A communication method and device |
| CN115423031A (en) * | 2022-09-20 | 2022-12-02 | 腾讯科技(深圳)有限公司 | Model training method and related device |
| CN116665219A (en) * | 2023-05-26 | 2023-08-29 | 华为技术有限公司 | Data processing method and device |
Also Published As
| Publication number | Publication date |
|---|---|
| CN120234608A (en) | 2025-07-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN118435671A (en) | Method, device and computer readable medium for communication | |
| EP4443939A1 (en) | Communication method and device | |
| US20250141525A1 (en) | Communication method and apparatus | |
| US20250211311A1 (en) | Beam management method | |
| WO2023115254A1 (en) | Data processing method and device | |
| CN117676630A (en) | Communication method and device | |
| KR20250125424A (en) | Method and communication device for transmitting training data of an AI model | |
| WO2025140663A1 (en) | Model data acquisition method, apparatus and system | |
| CN118509014A (en) | Communication method and communication device | |
| WO2025185425A1 (en) | Wireless model, information processing method and device, and system | |
| WO2025161853A1 (en) | Model monitoring method, apparatus and system | |
| US20250202560A1 (en) | Communication method and apparatus | |
| WO2024169600A1 (en) | Communication method and apparatus | |
| CN120786657A (en) | Information transmission method, device and system | |
| WO2025161598A1 (en) | Network training method, and communication apparatus | |
| WO2025209305A9 (en) | Information transmission method, apparatus and system | |
| WO2025067480A1 (en) | Communication method, apparatus and system | |
| WO2025140003A1 (en) | Communication method and communication apparatus | |
| WO2024208316A1 (en) | Compression information feedback method, and apparatus | |
| WO2025232813A1 (en) | Communication method and communication apparatus | |
| WO2025209114A1 (en) | Service identification method and apparatus, and system | |
| CN120474649A (en) | Communication method and communication device | |
| CN121013063A (en) | Communication method and communication device for decision | |
| CN120934717A (en) | Method and communication device for monitoring artificial intelligent model | |
| CN120835309A (en) | Communication method, communication device, chip module, storage medium, and program product |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24911577 Country of ref document: EP Kind code of ref document: A1 |