WO2025246373A1 - Information transmission method and related apparatus - Google Patents
Information transmission method and related apparatusInfo
- Publication number
- WO2025246373A1 WO2025246373A1 PCT/CN2025/070370 CN2025070370W WO2025246373A1 WO 2025246373 A1 WO2025246373 A1 WO 2025246373A1 CN 2025070370 W CN2025070370 W CN 2025070370W WO 2025246373 A1 WO2025246373 A1 WO 2025246373A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- model training
- data acquisition
- model
- acquisition device
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Definitions
- This application relates to the field of artificial intelligence technology, and more particularly to an information transmission method and related apparatus.
- Distributed learning can complete the learning task of AI models while fully ensuring user data privacy and security.
- Distributed learning mainly includes federated learning, partitioned learning, and decentralized learning. Assuming that the distributed nodes (e.g., terminal devices) have sufficient computing power, the distributed nodes train the model based on locally collected data to obtain a local model, and then transmit the local model to other nodes (e.g., the central node or other distributed nodes).
- the distributed nodes e.g., terminal devices
- the distributed nodes train the model based on locally collected data to obtain a local model, and then transmit the local model to other nodes (e.g., the central node or other distributed nodes).
- This application provides an information transmission method and related apparatus for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training to be completed based on the local data of the first data acquisition device.
- the first data acquisition device can be a terminal device, a component within the terminal device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the terminal device's functions.
- the method includes: the first data acquisition device sending a first request to a model training management device, the first request requesting the allocation of model training information for the first data acquisition device; and the first data acquisition device receiving first instruction information from the model training management device, the first instruction information indicating the allocation of first model training information for the first data acquisition device. This enables the allocation of first model training information to the first data acquisition device.
- the first data acquisition device can determine a first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data.
- the first data acquisition device is a terminal device
- the first model training device is the terminal manufacturer's server. Given the limited computing power of the terminal device and its data privacy protection requirements, the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
- the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device, facilitating the completion of the model training task based on the data provided by the first data acquisition device.
- the first indication information includes a first identifier, which indicates that the first model training device is a model training device for the first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
- the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier.
- the pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device.
- the first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device in determining whether to perform model training based on the data from the first data acquisition device.
- the method further includes: a first data acquisition device sending first data to a first model training device, the first data including a first identifier, the first data being used for model training. This facilitates the first model training device to perform model training based on the first identifier and the first data.
- the method further includes: a first data acquisition device receiving a first model from a first model training device, wherein the first model is trained based on the first data; or, the first data acquisition device receiving a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data.
- the first data acquisition device can receive the first model from the first model training device.
- the first data acquisition device can receive the second model from the model fusion device. This enables the first data acquisition device to obtain the trained model, facilitating model inference operations based on the first model or the second model, thereby improving model inference performance.
- the method further includes: a first data acquisition device receiving a third model from a second model training device or a second data acquisition device, the first data acquisition device sending the third model to the first model training device, and the third model being used for model fusion.
- the method further includes: a first data acquisition device receiving a fourth model from a first model training device, the fourth model being obtained by fusing the first model and the third model.
- the method before the first data acquisition device sends the first data to the first model training device, the method further includes: the first data acquisition device receiving scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device to send the first data. This achieves scheduling of the first data acquisition device.
- the method before the first data acquisition device receives scheduling information from the control device, the method further includes: the first data acquisition device sending second indication information to the control device, the second indication information being used to indicate the data status of the first data. This facilitates the control device in determining whether to schedule the first data acquisition device based on the data status of the first data. It also facilitates the control device in scheduling data acquisition devices with higher data quality to provide data, thereby improving the performance of model training.
- the second indication information is further used to indicate the first identifier. This facilitates the control device in determining the first model training device that matches the first data acquisition device based on the first identifier, so that the control device can determine whether to schedule the first data acquisition device.
- a second aspect of this application provides an information transmission method, which can be executed by a model training management device.
- the model training management device can be a model training management server, a component within a model training management server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a model training management server.
- the method includes: the model training management device receiving a first request from a first data acquisition device, the first request requesting the allocation of model training information to the first data acquisition device; and the model training management device sending first instruction information to the first data acquisition device, the first instruction information indicating the allocation of first model training information to the first data acquisition device.
- This method allocates first model training information to the first data acquisition device, facilitating the first data acquisition device to determine how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device.
- the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device to complete model training based on the local data.
- the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
- the method further includes: the model training management device sending first instruction information to the first model training device. This facilitates the first model training device in determining the first data acquisition device it is paired with.
- one possible implementation further includes: the model training management device sending first instruction information to the control device. This facilitates the control device in determining the pairing between the first data acquisition device and the first model training device. It also allows the control device to better schedule the data acquisition device, thereby improving the performance of model training.
- the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device, facilitating the provision of model training functionality to the first data acquisition device and thus completing the model training task.
- the first indication information includes a first identifier, which indicates that the first model training device is a model training device for the first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
- the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier.
- the pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device.
- the first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device to determine whether to perform model training based on the data from the first data acquisition device.
- a third aspect of this application provides an information transmission method, which can be executed by a first model training device.
- the first model training device can be an AI server, or a component within an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server.
- the method includes: the first model training device receiving first instruction information from a model training management device, the first instruction information indicating first model training information allocated to a first data acquisition device; and the first model training device determining the first model training information based on the first instruction information. This method allocates first model training information to the first data acquisition device, facilitating the first model training device to determine, based on local data from the first data acquisition device, to perform model training, thereby completing the model training.
- the first model training device determines itself as the model training device for the first data acquisition device based on the first model training information.
- the first model training device can receive local data from the first data acquisition device and complete model training based on that local data. This solves the problem of model training being impossible when the first data acquisition device has limited or no computing power.
- the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
- the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device. This facilitates the first model training device providing model training functionality to the first data acquisition device, thereby completing the model training task.
- the first indication information includes a first identifier, which identifies the first model training device as a model training device that is also a first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
- the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier.
- the pairing identifier is used to identify the pairing of the first data acquisition device and the first model training device.
- the first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device in determining whether to perform model training based on the data from the first data acquisition device.
- the method further includes: a first model training device receiving first data from a first data acquisition device, the first data including a first identifier; the first model training device performing model training based on the first data to obtain a first model. This enables the first model training device to provide model training functionality to the first data acquisition device, completing the model training task.
- one possible implementation further includes: the first model training device sending the first model to the first data acquisition device or the model fusion device. This facilitates model inference by the first data acquisition device, improving model inference performance. Alternatively, it facilitates model fusion by the model fusion device, improving model training performance.
- the method further includes: a first model training device receiving a second model from a model fusion device, the second model being obtained by fusing the first model by the model fusion device. This facilitates the first model training device performing model inference and/or model training based on the second model.
- one possible implementation further includes: a first model training device receiving a third model from a first data acquisition device, a second data acquisition device, or a second model training device, the third model being used for model fusion.
- a first model training device receiving a third model from a first data acquisition device, a second data acquisition device, or a second model training device, the third model being used for model fusion.
- this allows models from different data acquisition devices or different model training devices to be transferred to each other, facilitating model fusion.
- the first model training device fuses the first model and the third model to obtain a fourth model; the first model training device then sends the fourth model to the first data acquisition device.
- models from different data acquisition devices or different model training devices can be transferred to each other and fused, thereby improving model training performance.
- the method before the first model training device receives the first data from the first data acquisition device, the method further includes: the first model training device sending third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device. This facilitates better scheduling of the corresponding data acquisition devices by the control device, improving the performance of model training.
- the third indication information is also used to indicate the first identifier. This facilitates the control device in determining the first model training device that matches the first data acquisition device based on the first identifier, so that the control device can determine whether to schedule the first data acquisition device.
- a fourth aspect of this application provides an information transmission method, which can be executed by a control device.
- the control device can be a network device, a component within the network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device.
- the method includes: the control device receiving first instruction information from a model training management device, the first instruction information indicating first model training information allocated to a first data acquisition device; and the control device determining the first model training information based on the first instruction information. This facilitates the control device scheduling the first data acquisition device based on the first model training information, enabling model training to be completed based on the local data of the first data acquisition device.
- the above technical solution can solve the problem that the first data acquisition device cannot perform model training.
- the control device can schedule the terminal device, and the terminal device can send local data to the terminal manufacturer's server. Then, the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- the first model training information includes: information about the first model training device, and/or, the first model training resources.
- the control device determines the model training device and/or model training resources allocated to the first data acquisition device.
- the first indication information includes a first identifier, which indicates that the first model training device is used as the model training device for the first data acquisition device.
- This allows the first identifier to indicate pairing or matching between the first data acquisition device and the first model training device.
- This facilitates the control device in determining whether to schedule the first data acquisition device based on its data status and the training capability of the first model training device. Consequently, it allows the control device to schedule data acquisition devices with higher data quality, thereby improving model training performance.
- the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device and the first model training device are paired.
- the control device determines the pairing of the first model training device and the first data acquisition device based on the first identifier.
- the method further includes: the control device sending scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device to send the first data.
- the method before the control device sends scheduling information to the first data acquisition device, the method further includes: the control device receiving second indication information from the first data acquisition device, the second indication information indicating the data status of the first data; and the control device determining the scheduling of the first data acquisition device based on the second indication information. This is beneficial for the control device to schedule data acquisition devices with higher data quality, thereby improving the performance of model training.
- the method further includes: the control device receiving third instruction information from the first model training device; the control device determining the scheduling of the first data acquisition device based on the second instruction information, including: the control device determining the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
- the control device receives third instruction information from the first model training device; the control device determining the scheduling of the first data acquisition device based on the second instruction information, including: the control device determining the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
- the control device can be a network device, a component within the network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device.
- the method includes: the control device receiving second indication information from a first data acquisition device, the second indication information indicating the data status and a first identifier of first data, the first identifier indicating a first model training device as the model training device for the first data acquisition device; the control device receiving third indication information from the first model training device, the third indication information indicating training-related information and a first identifier for the first model training device; and the control device determining the first model training device as the model training device for the first data acquisition device based on the second and third indication information. This facilitates the control device scheduling the first data acquisition device based on the second and third indication information to achieve model training based on the local data of the first data acquisition device.
- the control device can schedule the terminal device based on the second and third instruction information.
- the terminal device then sends local data to the terminal manufacturer's server.
- the terminal manufacturer's server can then complete model training based on the terminal device's local data. This approach ensures both the data privacy protection requirements of the first data acquisition device and the successful training of the model.
- one possible implementation further includes: the control device determining the scheduling of the first data acquisition device according to the second and third instruction information. This enables the control device to better schedule data acquisition devices, ensuring that the model training device matched with the scheduled data acquisition device can provide model training functionality. Furthermore, it facilitates the control device scheduling data acquisition devices with higher data quality, thereby improving model training performance.
- the method further includes: the control device sending scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device.
- a sixth aspect of this application provides an information transmission method, which can be executed by a model fusion device.
- the model fusion device can be a network device, a component within a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a network device.
- the method includes: the model fusion device receiving a first model from a first model training device, the first model being obtained based on first data from a first data acquisition device; and the model fusion device fusing the first model to obtain a second model. This completes the training and fusion of the models.
- one possible implementation further includes: the model fusion device sending the second model to the first data acquisition device. This facilitates the first data acquisition device in performing model inference based on the second model, improving model inference performance.
- the method further includes: the model fusion device sending the second model to the first model training device. This facilitates the first model training device to perform model training and/or model inference based on the second model, thereby improving model trainability and/or model inference performance.
- a seventh aspect of this application provides a first data acquisition device, the first data acquisition device comprising:
- the transceiver module is used to send a first request to the model training management device, the first request being used to request the allocation of model training information to the first data acquisition device; and to receive first instruction information from the model training management device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
- the first model training information includes: information about the first model training device, and/or, the first model training resources.
- the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
- the transceiver module is further configured to: receive a first model from a first model training device, wherein the first model is trained based on first data, or receive a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data.
- the transceiver module is further configured to: receive a third model from a second model training device or a second data acquisition device; and send the third model to a first model training device, wherein the third model is used for model fusion.
- the transceiver module is further configured to: receive a fourth model from the first model training device, the fourth model being obtained by fusing the first model and the third model.
- the transceiver module is further configured to: receive scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device to send the first data.
- the transceiver module is further configured to: send a second indication message to the control device, the second indication message being used to indicate the data status of the first data.
- the second indication information is also used to indicate the first identifier.
- the eighth aspect of this application provides a model training management device, which includes:
- the transceiver module is used to receive a first request from a first data acquisition device, the first request being used to request the allocation of model training information to the first data acquisition device; and to send first instruction information to the first data acquisition device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
- the transceiver module is further configured to: send first instruction information to the first model training device.
- the transceiver module is also used to: send first instruction information to the control device.
- the first model training information includes information about the first model training device and/or, the first model training resources.
- the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
- a ninth aspect of this application provides a first model training apparatus, the first model training apparatus comprising:
- the transceiver module is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
- the processing module is used to determine the first model training information based on the first instruction information.
- the first model training information includes information about the first model training device and/or, the first model training resources.
- the first indication information includes a first identifier, which is used to identify the first model training device as a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to identify the pairing of the first data acquisition device and the first model training device.
- the transceiver module is further configured to: receive first data from the first data acquisition device, the first data including a first identifier; the processing module is further configured to: perform model training based on the first data to obtain a first model.
- the transceiver module is further configured to: send the first model to the first data acquisition device or the model fusion device.
- the transceiver module is further configured to: receive a second model from the model fusion device, the second model being obtained by the model fusion device fusing the first model.
- the transceiver module is further configured to: receive a third model from the first data acquisition device, the second data acquisition device, or the second model training device, wherein the third model is used for model fusion.
- the processing module is further configured to: fuse the first model and the third model to obtain a fourth model; the transceiver module is further configured to: send the fourth model to the first data acquisition device.
- the transceiver module is further configured to: send third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device.
- the third indication information is also used to indicate the first identifier.
- control device comprising:
- the transceiver module is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
- the processing module is used to determine the first model training information based on the first instruction information.
- the first model training information includes: information about the first model training device, and/or, the first model training resources.
- the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
- the transceiver module is further configured to: send scheduling information to the first data acquisition device, wherein the scheduling information is used to schedule the first data acquisition device to send the first data.
- the transceiver module is further configured to: receive second indication information from the first data acquisition device, the second indication information being used to indicate the data status of the first data; the processing module is further configured to: determine the scheduling of the first data acquisition device based on the second indication information.
- the transceiver module is further configured to: receive third instruction information from the first model training device; the processing module is specifically configured to: determine the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
- control device comprising:
- the transceiver module is used to receive second instruction information from the first data acquisition device, the second instruction information being used to indicate the data status and first identifier of the first data, the first identifier being used to indicate that the first model training device is the model training device of the first data acquisition device; and to receive third instruction information from the first model training device, the third instruction information being used to indicate the training-related information and first identifier of the first model training device.
- the processing module is used to determine the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
- the processing module is further configured to: determine the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
- the transceiver module is further configured to: send scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device.
- the twelfth aspect of this application provides a model fusion apparatus, the model fusion apparatus comprising:
- the transceiver module is used to receive a first model from a first model training device, wherein the first model is obtained based on first data from a first data acquisition device.
- the processing module is used to fuse the first model to obtain the second model.
- the transceiver module is also used to: send the second model to the first data acquisition device.
- the transceiver module is also used to: send the second model to the first model training device.
- the first data acquisition device may be a terminal device, or a component of a terminal device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the terminal device.
- the transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
- the first data acquisition device is a chip, chip system, or circuit configured in the terminal device.
- the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit;
- the processing module may be a processor, processing circuit, or logic circuit.
- the model training management device can be a server, or a component of a server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the server's functions.
- the transceiver module can be a transceiver, or an input/output interface; the processing module can be a processor.
- the model training management device is a chip, chip system, or circuit configured in a server.
- the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit;
- the processing module may be a processor, processing circuit, or logic circuit.
- the first model training device may be an AI server, or a component of an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server.
- the transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
- the first model training device is a chip, chip system, or circuit configured in an AI server.
- the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit;
- the processing module may be a processor, processing circuit, or logic circuit.
- control device may be a network device, or a component of a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device.
- the transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
- control device is a chip, chip system, or circuit configured in the network device.
- the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit;
- the processing module may be a processor, processing circuit, or logic circuit.
- the model fusion apparatus may be a network device, or a component of a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of a network device.
- the transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
- the model fusion device is a chip, chip system, or circuit configured in a network device.
- the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit;
- the processing module may be a processor, processing circuit, or logic circuit.
- the thirteenth aspect of this application provides an apparatus comprising a processor and a memory.
- the memory stores a computer program or computer instructions
- the processor is configured to invoke and execute the computer program or computer instructions stored in the memory, such that the processor implements any one of the implementations of the first to sixth aspects.
- the device may also include a transceiver, the processor of which controls the transceiver to transmit and receive signals.
- the fourteenth aspect of this application provides an apparatus including a processor and an interface circuit, the processor being configured to communicate with other devices via the interface circuit and to perform the method described in any one of the first to sixth aspects.
- the processor may include one or more devices.
- the fifteenth aspect of this application provides an apparatus including a processor connected to a memory for invoking a program stored in the memory to perform the method described in any one of the first to sixth aspects.
- the memory may be located within or outside the apparatus.
- the processor may include one or more processors.
- the first data acquisition device shown in the first and seventh aspects above can be a chip or a chip system.
- the model training management device shown in the second and eighth aspects above can be a chip or a chip system.
- the first model training device shown in the third and ninth aspects above can be a chip or a chip system.
- the control device shown in the fourth, fifth, tenth, and eleventh aspects above can be a chip or a chip system.
- the model fusion device shown in the sixth and twelfth aspects above can be a chip or a chip system.
- the sixteenth aspect of this application provides a computer program product including computer instructions, characterized in that, when run on a computer, it causes the computer to perform any of the implementations of any one of the first to sixth aspects.
- the seventeenth aspect of this application provides a computer-readable storage medium including computer instructions that, when executed on a computer, cause the computer to perform any of the implementations of any one of the first to sixth aspects.
- the eighteenth aspect of this application provides a chip device including a processor for calling a computer program or computer instructions in memory to cause the processor to execute any one of the implementations of the first to sixth aspects described above.
- the processor is coupled to the memory via an interface.
- the nineteenth aspect of this application provides a communication system comprising a first data acquisition device as shown in the first aspect and a model training management device as shown in the second aspect.
- the communication system further comprises a first model training device as described in the third aspect.
- the communication system further comprises a control device as shown in the fourth or fifth aspect and/or a model fusion device as shown in the sixth aspect.
- the first data acquisition device sends a first request to the model training management device.
- the first request requests the allocation of model training information for the first data acquisition device.
- the first data acquisition device receives first instruction information from the model training management device.
- This first instruction information indicates the allocation of first model training information to the first data acquisition device.
- the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device completing model training based on this local data.
- the terminal manufacturer's server can complete model training based on the terminal device's local data. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- Figure 1 is a schematic diagram of a communication system according to an embodiment of this application.
- Figure 2 is a schematic diagram of federated learning in an embodiment of this application.
- Figure 3 is a schematic diagram of segmentation learning in an embodiment of this application.
- Figure 4 is a schematic diagram of decentralized learning in an embodiment of this application.
- FIG. 5 is a schematic diagram of an embodiment of the information transmission method of this application.
- Figure 6A is a schematic diagram of a scenario of the information transmission method according to an embodiment of this application.
- Figure 6B is a schematic diagram of another scenario of the information transmission method according to an embodiment of this application.
- Figure 6C is a schematic diagram of another scenario of the information transmission method according to an embodiment of this application.
- FIG. 7 is a schematic diagram of another embodiment of the information transmission method of this application.
- FIG. 8 is a schematic diagram of another embodiment of the information transmission method of this application.
- FIG. 9 is a schematic diagram of another embodiment of the information transmission method of this application.
- Figure 10 is a structural schematic diagram of a first data acquisition device according to an embodiment of this application.
- Figure 11 is a schematic diagram of a model training management device according to an embodiment of this application.
- Figure 12 is a schematic diagram of the structure of a first model training device according to an embodiment of this application.
- Figure 13 is a schematic diagram of a control device according to an embodiment of this application.
- Figure 14 is a schematic diagram of a model fusion device according to an embodiment of this application.
- Figure 15 is a schematic diagram of a device according to an embodiment of this application.
- Figure 16 is a structural schematic diagram of a terminal device according to an embodiment of this application.
- Figure 17 is a schematic diagram of a network device according to an embodiment of this application.
- This application provides an information transmission method and related apparatus for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training to be completed based on the local data of the first data acquisition device.
- references to "one embodiment” or “some embodiments” as described in this application mean that one or more embodiments of this application include a specific feature, structure, or characteristic described in connection with that embodiment. Therefore, the phrases “in one embodiment,” “in some embodiments,” “in other embodiments,” “in still other embodiments,” etc., appearing in different parts of this specification do not necessarily refer to the same embodiment, but rather mean “one or more, but not all, embodiments,” unless otherwise specifically emphasized.
- the terms “comprising,” “including,” “having,” and variations thereof mean “including but not limited to,” unless otherwise specifically emphasized.
- A/B can mean A or B.
- “And/or” in this document is merely a description of the relationship between related objects, indicating that three relationships can exist.
- a and/or B can represent: A alone, A and B simultaneously, and B alone.
- "at least one” means one or more
- “multiple” means two or more.
- "At least one of the following” or similar expressions refer to any combination of these items, including any combination of single or multiple items.
- at least one of a, b, or c can represent: a, b, c; a and b; a and c; b and c; or a and b and c. Where a, b, and c can be single or multiple.
- instruction can include direct instruction, indirect instruction, explicit instruction, and implicit instruction.
- instruction information can include direct instruction, indirect instruction, explicit instruction, and implicit instruction.
- the information indicated by the instruction information is called the information to be instructed.
- the information to be instructed there are many ways to instruct the information to be instructed, such as, but not limited to, directly instructing the information to be instructed, such as the information to be instructed itself or its index; indirectly instructing the information to be instructed by instructing other information, where there is a relationship between the other information and the information to be instructed; or instructing only a part of the information to be instructed, while the other parts are known or pre-agreed upon.
- the instruction of specific information can be achieved by using a pre-agreed (e.g., protocol-defined) arrangement of various pieces of information, thereby reducing instruction overhead to some extent.
- a pre-agreed e.g., protocol-defined
- the information to be instructed can be sent as a whole or divided into multiple sub-information messages, and the sending period and/or timing of these sub-information messages can be the same or different.
- This application does not limit the specific sending method.
- the sending period and/or timing of these sub-information messages can be predefined, for example, according to a protocol, or configured by the transmitting device by sending configuration information to the receiving device.
- send and “receive” in this application refer to the direction of signal transmission.
- send information to XX can be understood as the destination of the information being XX, which can include direct transmission via the air interface or indirect transmission via the air interface from other units or modules.
- Receiveive information from YY can be understood as the source of the information being YY, which can include direct reception from YY via the air interface or indirect reception from YY via the air interface from other units or modules.
- Send can also be understood as the "output” of the chip interface, and “receive” can also be understood as the "input” of the chip interface.
- sending and receiving can occur between devices, such as between network devices and terminal devices, or within a device, such as between components, modules, chips, software modules, or hardware modules within the device via buses, wiring, or interfaces.
- the technical solution of this application can be applied to cellular communication systems related to the 3rd Generation Partnership Project (3GPP).
- 3GPP 3rd Generation Partnership Project
- 4th generation (4G) communication systems 5th generation (5G) communication systems, and communication systems beyond the 5th generation, such as 6th generation (6G) communication systems.
- 4th generation communication systems may include Long Term Evolution (LTE) communication systems.
- 5th generation communication systems may include New Radio (NR) communication systems.
- WiFi Wireless Fidelity
- communication systems supporting the convergence of multiple wireless technologies such as device-to-device (D2D) systems, or vehicle-to-everything (V2X) communication systems, etc.
- D2D device-to-device
- V2X vehicle-to-everything
- the communication system to which the technical solution of this application applies includes a first data acquisition device, a first model training device, and a model training management device.
- the first data acquisition device has a data acquisition function. For example, the first data acquisition device acquires local data. Then, the first data acquisition device can send the local data to the first model training device. This local data is used for model training.
- the first data acquisition device can be a terminal device, or a component in the terminal device (e.g., a processor, chip, or chip system), or a logic module or software that can implement all or part of the functions of the terminal device.
- the first model training device has a model training function. It receives local data from the first data acquisition device and trains the model based on the local data.
- the first model training device can send the trained model to the first data acquisition device.
- the first model training device can be an AI server, or a component within an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server.
- the first data acquisition device and the first model training device can jointly form a distributed node in distributed learning, but they are different physical devices.
- the first data acquisition device can be a terminal device 101
- the first model training device can be an AI server 103.
- the first model training device can be understood as the terminal manufacturer's server.
- the data collection (DC) module of the distributed node is deployed on the terminal device
- the model training (MT) module of the distributed node is deployed on the terminal manufacturer's server.
- the terminal manufacturer's server uses the model training module to train the model based on the data.
- the model training management device is used to manage model training devices and allocate model training devices to data acquisition devices.
- the model training management device can allocate a first model training device to a first data acquisition device. This enables the first model training device to perform model training using local data provided by the first data acquisition device. It also enables the first model training device to provide model training functionality to the first data acquisition device.
- the model training management device can be a model training management server or a core network element, or a component within a model training management server or core network element (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a model training management server or core network element.
- the communication system also includes a model fusion device and/or a control device.
- the model fusion device can be a central node, which has the following functions: fusing the models sent by the model training device and sending the fused model to the data acquisition device and/or the model training device.
- the control device can be used to schedule the data acquisition device.
- both the model fusion device and the control device can be network devices, or components within network devices (e.g., processors, chips, or chip systems), or logical modules or software capable of implementing all or part of the functions of network devices. For example, as shown in Figure 1, both the model fusion device and the control device are network devices 102.
- the model fusion device and the control device can jointly form a central node in distributed learning; or, the model fusion device can be the central node in distributed learning, while the control device can be the control node in distributed learning.
- the model fusion device and the control device can be two different physical devices.
- the communication system also includes a second data acquisition device and a second model training device.
- the function of the second data acquisition device is similar to that of the first data acquisition device; for details, please refer to the aforementioned description of the first data acquisition device, which will not be repeated here.
- the function of the second model training device is similar to that of the first model training device; for details, please refer to the aforementioned description of the first model training device, which will not be repeated here.
- the second data acquisition device is a terminal device 104
- the second model training device is an AI server 105.
- the communication system includes a model training management device, at least one data acquisition device, and at least one model training device; the specific details are not limited in this application. That is, the communication system shown in Figure 1 is just an example.
- the communication system includes at least one terminal device and at least one AI server.
- the communication system includes at least one network device.
- the number of data acquisition devices and the number of model training devices in the communication system may be the same or different.
- the number of data acquisition devices exceeds the number of model training devices.
- the data acquisition devices may be terminal equipment, and the model training devices may be server provided by the terminal manufacturer.
- the number of terminal equipment in the communication system may exceed the number of server provided by the terminal manufacturer.
- the data acquisition device has a data acquisition function.
- the data acquisition device can also be called a data acquisition device, data collection device, data acquisition node, data collection device, or data acquisition node, etc., and this application does not limit the specific name of the data acquisition device.
- the model training device has a model training function.
- the model training device can also be called a local training device, training device, local training node, or training node, etc., and this application does not limit the specific name of the model training device.
- the control device can be used to schedule the data acquisition device.
- the model fusion device has the following functions: fusing the model sent by the model training device and sending the fused model to the data acquisition device and/or the model training device.
- the control device and the model fusion device can also be called a central device, central control node, central control device, or control center, etc., and this application does not limit the specific name of the control device.
- the terminal device can be a wireless terminal device capable of receiving scheduling and instruction information from network devices.
- the wireless terminal device can be a device that provides voice and/or data connectivity to the user, a handheld device with wireless connectivity, or other processing devices connected to a wireless modem.
- Terminal devices can communicate with one or more core networks or the Internet via an access network.
- Terminal devices can be mobile terminal devices, such as mobile phones (or "cellular" phones), computers, and data cards.
- mobile phones or "cellular" phones
- computers and data cards.
- they can be portable, pocket-sized, handheld, computer-embedded, or vehicle-mounted mobile devices that exchange voice and/or data with the radio access network.
- Examples include personal communication service (PCS) phones, cordless phones, session initiation protocol phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), tablets, and computers with wireless transceiver capabilities.
- PCS personal communication service
- WLL wireless local loop
- PDAs personal digital assistants
- Wireless terminal equipment can also be referred to as a system, subscriber unit, subscriber station, mobile station, mobile station (MS), remote station, access point (AP), remote terminal, access terminal, user terminal, user agent, subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.
- the terminal device can also be a wearable device.
- Wearable devices also known as wearable smart devices or smart wearable devices, are a general term for devices that utilize wearable technology to intelligently design and develop everyday wearables. Examples include glasses, gloves, watches, clothing, and shoes. Wearable devices are portable devices that are worn directly on the body or integrated into the user's clothing or accessories. Wearable devices are not merely hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction.
- wearable smart devices include those that are feature-rich, large in size, and can achieve complete or partial functions without relying on a smartphone, such as smartwatches or smart glasses, as well as those that focus on a specific type of application function and require the use of other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
- Terminal devices can also be drones, robots, device-to-device (D2D) communication devices, vehicle-to-everything (V2X) devices, virtual reality (VR) devices, augmented reality (AR) devices, wireless devices in industrial control, wireless devices in self-driving, wireless devices in remote medical care, wireless devices in smart grids, wireless devices in transportation safety, wireless devices in smart cities, and wireless devices in smart homes, etc.
- D2D device-to-device
- V2X vehicle-to-everything
- VR virtual reality
- AR augmented reality
- wireless devices in industrial control wireless devices in self-driving
- wireless devices in remote medical care wireless devices in smart grids
- wireless devices in transportation safety wireless devices in smart cities, and wireless devices in smart homes, etc.
- terminal devices can also be terminal devices in communication systems evolved from fifth-generation (5G) communication systems (such as sixth-generation (6G) communication systems) or in future public land mobile networks (PLMNs).
- 5G fifth-generation
- 6G communication systems can further expand the form and function of 5G communication terminals; 6G terminals include, but are not limited to, vehicles, cellular network terminals (integrating satellite terminal functions), drones, or Internet of Things (IoT) devices.
- IoT Internet of Things
- the terminal device has artificial intelligence (AI) capabilities.
- AI artificial intelligence
- the terminal device can obtain AI services provided by network devices or servers.
- the terminal device also has AI processing capabilities.
- the terminal device may be a device or apparatus with a chip, or a device or apparatus with integrated circuitry, or a chip, module or control unit in the device or apparatus shown above. This application does not limit the specific device.
- Network devices can be devices within a wireless network.
- a network device can be an access network node (or access network equipment) that connects terminal devices to a wireless network, also known as a base station.
- access network equipment include: base stations (gNodeB, gNB), transmission reception points (TRP), evolved Node B (eNB), radio network controllers (RNC), Node Bs (NB), home base stations (e.g., home evolved Node B, or home Node B, HNB), base band units (BBU), or Wi-Fi access points (APs) in 5G communication systems.
- network devices may include centralized unit (CU) nodes, distributed unit (DU) nodes, CU-control plane (CP), CU-user plane (UP), or radio unit (RU), or RAN equipment including CU and DU nodes.
- CU and DU may be separate entities or included in the same network element, such as a baseband unit (BBU).
- BBU baseband unit
- RU may be included in radio equipment or radio units, such as remote radio units (RRU), active antenna units (AAU), or remote radio heads (RRH).
- RRU remote radio units
- AAU active antenna units
- RRH remote radio heads
- CU or CU-CP and CU-UP
- DU or RU may have different names, but their meanings will be understood by those skilled in the art.
- a CU can also be called an Open CU (O-CU)
- a DU can also be called an Open DU (O-DU)
- a CU-CP can also be called an Open CU-CP (O-CU-CP)
- a CU-UP can also be called an Open CU-UP (O-CU-UP)
- a RU can also be called an Open RU (O-RU).
- Any of the units among the CU (or CU-CP, CU-UP), DU, and RU can be implemented through software modules, hardware modules, or a combination of software and hardware modules.
- Network devices can be other devices that provide wireless communication functions for terminal devices.
- the embodiments of this application do not limit the specific technology or form of the network device. For ease of description, the embodiments of this application are not limited.
- Network equipment may also include core network equipment, such as the Mobility Management Entity (MME), Home Subscriber Server (HSS), Serving Gateway (S-GW), Policy and Charging Rules Function (PCRF), and Public Data Network Gateway (PDN Gateway) in 4G networks; and access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in 5G networks.
- MME Mobility Management Entity
- HSS Home Subscriber Server
- S-GW Serving Gateway
- PCRF Policy and Charging Rules Function
- PDN Gateway Public Data Network Gateway
- AMF access and mobility management function
- UPF user plane function
- SMF Public Data Network Gateway
- the network device can also be a network node with AI capabilities, which can provide AI services to terminal devices or other network devices.
- the network device can be an AI node, computing power node, access network node with AI capabilities, or core network element with AI capabilities on the network side (access network or core network).
- the network device can be the device or apparatus shown above, or a component (e.g., a chip), module, or unit in the device or apparatus shown above; this application does not limit the specifics.
- An AI server is a device equipped with AI capabilities.
- an AI server can train models based on data and manage models.
- Distributed learning can complete the learning tasks of AI models while fully ensuring user data privacy and security.
- Distributed learning mainly includes federated learning, partitioning learning, and decentralized learning. These three methods will be introduced below.
- the communication system includes a central node and one or more distributed nodes.
- the communication system includes distributed node n, distributed node k, and distributed node m.
- Each distributed node collects its local dataset and performs local training on the model to obtain local parameters. Then, the distributed node sends its local parameters to the central node.
- the central node itself does not have a dataset; it collects the local parameters reported by multiple distributed nodes, fuses these parameters to obtain global parameters, and then distributes them to the distributed nodes. This enables the training and learning of the model.
- the complete neural network model is divided into multiple parts. For example, taking a two-part model, the neural network model is divided into two sub-networks. One part is deployed on distributed nodes, and the other part is deployed on a central node. The part where the complete neural network model is segmented can be called a segmentation layer.
- distributed node 1 inputs local data into its local sub-network and infers it to the segmentation layer to obtain the segmentation layer's output result F1.
- Distributed node 1 sends this F1 to the central node via the communication link.
- the central node inputs the received F1 into another sub-network it has deployed and continues forward inference to obtain the final inference result.
- the central node performs backpropagation to the segmentation layer through another sub-network it has deployed, obtaining the backpropagation result G1. Then, the central node sends G1 to distributed node 1.
- Distributed node 1 continues gradient backpropagation based on G1 through one of its deployed sub-networks. The forward inference and gradient backpropagation between other distributed nodes and the central node are similar.
- the forward inference and backward gradient propagation processes of segmentation learning involve only one distributed node and one central node.
- the subnetworks trained on the distributed nodes can be stored locally on the distributed nodes or on a specific model storage server.
- a new distributed node joins the segmentation learning process it can download the trained subnetwork and then use its local data to further train that subnetwork.
- decentralized learning is a learning method without a central node.
- the design objective f(x) of decentralized learning is generally the average of the local objective fi (x) obtained by each node, i.e.
- n is the number of distributed nodes
- x is the parameter to be optimized.
- x is the parameter of the machine learning model (such as a neural network).
- Each node uses its local data and local objective f ⁇ sub>i ⁇ /sub> (x) to calculate its local gradient. and the local gradient The gradient is sent to its neighboring nodes.
- any node can update the parameters x of its local model according to Formula 1 below.
- the model learning task is completed through information exchange between nodes.
- N ⁇ sub>i ⁇ /sub> is the set of neighboring nodes of node i
- represents the number of elements in the set of neighboring nodes of node i, i.e. the number of neighboring nodes of node i
- ⁇ ⁇ sub>k ⁇ /sub> is the fusion weight of the k-th training round.
- distributed nodes e.g., terminal devices
- distributed nodes possess sufficient computing power.
- Distributed nodes train models based on locally collected data to obtain local models, which are then transmitted to other nodes (e.g., central nodes or other distributed nodes).
- nodes e.g., central nodes or other distributed nodes.
- This application provides a corresponding technical solution for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, enabling model training based on the local data of the first data acquisition device. This solves the problem of being unable to perform model training when the first data acquisition device has weak or no computing power. Please refer to the following description of the embodiments for details.
- FIG. 5 is a schematic diagram of an embodiment of the information transmission method of this application. Referring to Figure 5, the method includes:
- the first data acquisition device sends a first request to the model training management device.
- the model training management device receives the first request from the first data acquisition device.
- the first request is used to request the allocation of model training information for the first data acquisition device.
- the first request includes task information for the first data acquisition device.
- the task information includes at least one of the following: task type, computing power required to perform the task, computing accuracy, or computing energy efficiency.
- the first request can be used to request the allocation of model training equipment and/or model training resources for the first data acquisition device.
- the model training management device sends a first instruction message to the first data acquisition device.
- the first data acquisition device receives the first instruction message from the model training management device.
- the first instruction information is used to indicate the first model training information allocated to the first data acquisition device.
- the first model training information includes information about the first model training device and/or, first model training resources.
- the first model training resources include at least one of the following: first computing resources or first communication resources.
- the model training management device allocates the first model training device and/or the first model training resources to the first data acquisition device according to the first request.
- the first indication information includes a first identifier.
- the first identifier is used to indicate that the first model training device is a model training device that serves as the first data acquisition device. Some possible forms of the first identifier are described below.
- the first identifier is the identifier of the first data acquisition device.
- the identifier of the first data acquisition device is assigned by the model training management device.
- the model training management device determines that the first data acquisition device and the first model training device are paired, and then sends the identifier of the first data acquisition device to both the first data acquisition device and the first model training device, so that the first model training device can know that it is paired with the first data acquisition device.
- the identifier of the first data acquisition device can be carried in the data of the first data acquisition device.
- Forwarding nodes can forward the data carrying the identifier of the first data acquisition device to the first model training device according to the known pairing relationship (the method of knowing is not limited, it can be known through the model training management device); or, if the first data acquisition device sends its data in a broadcast manner, each model training device, after receiving the data, determines whether it is its paired data acquisition device based on the identifier of the first data acquisition device carried in it, and thus determines whether to perform model training based on the data. Therefore, the first model training device can perform model training based on the data of the first data acquisition device.
- the first identifier is the identifier of the first model training device.
- the identifier of the first model training device is assigned by the model training management device.
- the model training management device determines that the first data acquisition device and the first model training device are paired, and then sends the identifier of the first model training device to both the first data acquisition device and the first model training device.
- the first data acquisition device knows that it is paired with the first model training device.
- the identifier of the first model training device can be carried within the data.
- the first model training device determines that the data comes from the paired first data acquisition device based on the identifier of the first model training device carried in the data (i.e., its own identifier), thus determining that model training needs to be performed based on this data. Therefore, the first model training device can perform model training based on the data from the first data acquisition device.
- the first identifier is a pairing identifier.
- This pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device.
- the pairing identifier is generated based on the identifier of the first data acquisition device and/or the identifier of the first model training device.
- the model training management device determines that the first data acquisition device and the first model training device are paired, and then sends a pairing identifier to both the first data acquisition device and the first model training device.
- the pairing identifier can be carried in the data of the first data acquisition device.
- the first model training device determines that the pairing identifier carried in the data is the pairing identifier between the first data acquisition device and the first model training device. Therefore, the first model training device can perform model training based on this data.
- step 502 above describes the technical solution of this application by using the example of the model training management device allocating the first model training information to the first data acquisition device.
- the first model training information can also be pre-configured in the first data acquisition device, and this application does not limit this.
- the terminal manufacturer may pre-configure the first model training information offline in the first data acquisition device.
- step 501a may be performed before step 502.
- the first model training device sends a registration request to the model training management device.
- the model training management device receives the registration request from the first model training device.
- the registration request is used to request the registration of the first model training device.
- the registration request includes registration information of the first model training device.
- the registration information includes at least one of the following: the computing power, computing power type, computing power margin, computing accuracy, or computing energy efficiency of the first model training device.
- the computing power type includes at least one of the following: central processing unit (CPU), graphics processing unit (GPU), or neural network processing unit (NPU).
- Step 501a can be executed first, followed by step 501; or step 501 can be executed first, followed by step 501a; or, depending on the circumstances, steps 501 and 501a can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- step 503 may be performed after step 501.
- the model training management device sends a first instruction message to the first model training device.
- the first model training device receives the first instruction message from the model training management device.
- step 502 For information on the first instruction, please refer to the relevant description in step 502 above, which will not be repeated here.
- Step 502 can be executed first, followed by step 503; or step 503 can be executed first, followed by step 502; or, depending on the circumstances, steps 502 and 503 can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- Step 504 may be performed after step 501.
- the model training management device sends a first instruction message to the control device.
- the control device receives the first instruction message from the model training management device.
- step 502 For information on the first instruction, please refer to the relevant description in step 502 above, which will not be repeated here.
- control device can be a central node.
- step 504 can be executed first, then step 502, and finally step 503; or step 504 can be executed first, then step 502, and finally step 503; or step 502 can be executed first, then step 503, and finally step 504; or step 503 can be executed first, then step 502, and finally step 504.
- step 504 can be executed first, then step 502, and finally step 503; or step 504 can be executed first, then step 502, and finally step 503; or step 502 can be executed first, then step 503, and finally step 504; or step 503 can be executed first, then step 502, and finally step 504.
- This application does not limit the specific execution order.
- the embodiment shown in FIG5 further includes steps 505 to 506. Steps 505 to 506 may be performed after step 503.
- the first data acquisition device sends first data to the first model training device.
- the first data includes a first identifier.
- the first model training device receives the first data from the first data acquisition device.
- the first data is used for model training.
- the first data is model training data.
- the first data includes a first identifier, which instructs the first model training device to determine whether to train the model based on the first data according to the first identifier.
- the first data acquisition device is terminal device 1
- the first model training device is AI server 2.
- Terminal device 1 sends the first data to AI server 2.
- the first model training device trains the model based on the first data to obtain the first model.
- the first identifier may be the identifier of a first data acquisition device.
- the first model training device determines that model training can be performed based on the first data based on the identifier of the first data acquisition device carried in the first data.
- the first identifier may be the identifier of a first model training device.
- the first model training device determines that model training can be performed based on the first data based on the identifier of the first model training device carried in the first data.
- the first identifier may be a pairing identifier.
- the first model training device determines that the pairing identifier carried in the first data is a pairing identifier between the first data acquisition device and the first model training device; therefore, the first model training device can perform model training based on the first data.
- Steps 505 to 506 can be executed first, followed by step 504; or, step 504 can be executed first, followed by steps 506 to 506; or, depending on the circumstances, steps 504 and steps 505 to 506 can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- step 505a may be performed before step 505.
- the control device sends scheduling information to the first data acquisition device.
- the first data acquisition device receives the scheduling information from the control device.
- the scheduling information is used to schedule the first data acquisition device to send the first data.
- the embodiment shown in FIG5 further includes steps 505b to 505c. Steps 505b to 505c may be performed before step 505a.
- the first data acquisition device sends a second instruction to the control device.
- the control device receives the second instruction from the first data acquisition device.
- the second indication information is used to indicate the data status of the first data.
- the data state of the first data includes at least one of the following: the data type, data distribution, number of samples included in the first data, or data attribute.
- the data type includes at least one of the following: image data, voice data, text data, channel data, signal data, or radar data, etc.
- the data distribution of the first data conforms to any one of the following: Gaussian distribution, exponential distribution, uniform distribution, or Poisson distribution, etc.
- the data attribute includes at least one of the following: the time, location, or conditions of the first data acquisition.
- the control device determines the scheduling of the first data acquisition device based on the second instruction information.
- control device determines that the first data is helpful in improving model performance based on its type and distribution, it can choose to schedule the first data acquisition device. Therefore, the control device determines whether to schedule the first data acquisition device based on the second instruction information, which helps it schedule data acquisition devices with higher data quality to improve model training performance.
- the embodiment shown in Figure 5 further includes step 505d.
- Step 505d may be performed before step 505c.
- the first model training device sends a third instruction message to the control device.
- the control device receives the third instruction message from the first model training device.
- the third indication information is used to indicate training-related information of the first model training device.
- the third indication information is used to indicate the training resources of the first model training device, such as computing power type, computing power margin, computing accuracy, and/or computing energy efficiency.
- step 505c specifically includes: the control device determining whether to schedule the first data acquisition device based on the second and third indication information.
- the control device further combines the third indication information to determine whether to schedule the first data acquisition device. This achieves better scheduling of the data acquisition device and improves the performance of model training.
- control device determines the scheduling of the first data acquisition device based on the second instruction information.
- control device may further determine the scheduling of the first data acquisition device in conjunction with the third instruction information.
- control device can determine the scheduling of the first data acquisition device based on the third instruction information, or optionally, the control device may further determine the scheduling of the first data acquisition device in conjunction with the second instruction information.
- Step 507 may be performed after step 506.
- the first model training device sends the first model to the model fusion device.
- the model fusion device receives the first model from the first model training device.
- the first model training device is AI server 1
- the model fusion device is network device
- AI server 1 sends the first model to network device.
- the first model training device sends the first model to the model fusion device.
- the training epochs corresponding to the first model refer to the training epochs in which the first model training device trains the model based on the first data.
- step 508 which can be performed after step 507.
- the model fusion device fuses the first model to obtain the second model.
- the model fusion device is a network device that receives a first model from AI server 1.
- the network device also receives a fifth model from AI server 2. Then, the network device fuses the first and fifth models to obtain a second model.
- Step 509 may be performed after step 508.
- the model fusion device sends the second model to the first data acquisition device.
- the first data acquisition device receives the second model from the model fusion device.
- the second model is used for model inference.
- the model fusion device is a network device
- the first data acquisition device is terminal device 1.
- the network device sends the second model to terminal device 1.
- Terminal device 1 can perform model inference using the second model.
- Step 510 may be performed after step 508.
- the model fusion device sends the second model to the first model training device.
- the first model training device receives the second model from the model fusion device.
- the second model is used for model inference and/or model training.
- Step 509 can be executed first, followed by step 510; or step 510 can be executed first, followed by step 509; or, depending on the circumstances, steps 509 and 510 can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- the first data acquisition device sends a first request to the model training management device.
- the first request requests the allocation of model training information for the first data acquisition device.
- the first data acquisition device receives first instruction information from the model training management device.
- the first instruction information indicates the allocation of first model training information to the first data acquisition device.
- This enables the allocation of first model training information to the first data acquisition device.
- This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training.
- the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data.
- the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- FIG. 7 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 7, the method includes:
- the first data acquisition device sends a first request to the model training management device.
- the model training management device receives the first request from the first data acquisition device.
- the model training management device sends a first instruction message to the first data acquisition device.
- the first data acquisition device receives the first instruction message from the model training management device.
- Steps 701 to 702 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above.
- steps 501 to 502 in the embodiment shown in Figure 5 above please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 701a may be performed before step 702.
- the first model training device sends a registration request to the model training management device.
- the model training management device receives the registration request from the first model training device.
- Step 701a is similar to step 501a in the embodiment shown in Figure 5 above.
- Step 701a is similar to step 501a in the embodiment shown in Figure 5 above.
- step 703 may be performed after step 701.
- the model training management device sends a first instruction message to the first model training device.
- the first model training device receives the first instruction message from the model training management device.
- Step 703 is similar to step 503 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 503 in the embodiment shown in Figure 5 above, which will not be repeated here.
- the embodiment shown in FIG7 further includes steps 704a to 704c. Steps 704a to 704c may be performed after step 703.
- the first data acquisition device sends a second indication message to the control device.
- the second indication message is used to indicate the first identifier.
- the control device receives the second indication message from the first data acquisition device.
- the second indication information is used to indicate the data status and the first identifier of the first data.
- the first identifier is used to indicate that the first model training device is a model training device that serves as the first data acquisition device.
- the data status and the first identifier of the first data please refer to the relevant description in the embodiment shown in Figure 5 above; it will not be repeated here.
- the first model training device sends a third instruction message to the control device.
- the third instruction message is used to indicate the first identifier.
- the control device receives the third instruction message from the first model training device.
- the third instruction information is used to indicate the training-related information and the first identifier of the first model training device.
- the training-related information and the first identifier please refer to the relevant descriptions in the embodiment shown in Figure 5 above; they will not be repeated here.
- the control device determines the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
- control device can determine that the first model training device is the model training device for the first data acquisition device based on the first identifier carried in the second instruction information and the first identifier carried in the third instruction information.
- control device can determine that the first data acquisition device is paired with the first model training device based on the first identifier carried in the second instruction information and the first identifier carried in the third instruction information.
- the embodiment shown in FIG7 further includes steps 704 to 705.
- Steps 704 to 705 may be performed after step 703.
- the first data acquisition device sends first data to the first model training device.
- the first data includes a first identifier.
- the first model training device receives the first data from the first data acquisition device.
- the first model training device trains the model based on the first data to obtain the first model.
- Steps 704 to 705 are similar to steps 505 to 506 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 505 to 506 in the embodiment shown in Figure 5 above, which will not be repeated here.
- the embodiment shown in FIG7 further includes steps 704d to 704e.
- Steps 704d to 704e may be performed after step 704c and before step 704.
- the control device determines the scheduling of the first data acquisition device based on the second and third instruction information.
- Step 704d is similar to step 505c in the embodiment shown in Figure 5 above.
- step 505c in the embodiment shown in Figure 5 above please refer to the relevant description of step 505c in the embodiment shown in Figure 5 above, which will not be repeated here.
- the control device sends scheduling information to the first data acquisition device.
- the first data acquisition device receives the scheduling information from the control device.
- the scheduling information is used to schedule the first data acquisition device to send the first data.
- step 706 may be performed after step 705.
- the first model training device sends the first model to the model fusion device.
- the model fusion device receives the first model from the first model training device.
- Step 706 is similar to step 507 in the embodiment shown in Figure 5 above.
- step 507 in the embodiment shown in Figure 5 above please refer to the relevant description of step 507 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 707 may be performed after step 706.
- the model fusion device fuses the first model to obtain the second model.
- Step 707 is similar to step 508 in the embodiment shown in Figure 5 above.
- step 508 in the embodiment shown in Figure 5 above please refer to the relevant description of step 508 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 708 may be performed after step 707.
- the model fusion device sends the second model to the first data acquisition device.
- the first data acquisition device receives the second model from the model fusion device.
- Step 708 is similar to step 509 in the embodiment shown in Figure 5 above.
- step 509 in the embodiment shown in Figure 5 above please refer to the relevant description of step 509 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 709 may be performed after step 707.
- the model fusion device sends the second model to the first model training device.
- the first model training device receives the second model from the model fusion device.
- Step 709 is similar to step 510 in the embodiment shown in Figure 5 above.
- Step 510 in the embodiment shown in Figure 5 above please refer to the relevant description of step 510 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 708 can be executed first, followed by step 709; or step 709 can be executed first, followed by step 708; or, depending on the circumstances, steps 708 and 709 can be executed simultaneously.
- step 708 can be executed first, followed by step 709; or step 709 can be executed first, followed by step 708; or, depending on the circumstances, steps 708 and 709 can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- the first data acquisition device sends a first request to the model training management device.
- the first request is used to request the allocation of model training information for the first data acquisition device.
- the first data acquisition device receives first instruction information from the model training management device.
- the first instruction information is used to indicate the allocation of first model training information for the first data acquisition device. This realizes the allocation of first model training information to the first data acquisition device. This facilitates the first data acquisition device to determine how to complete model training based on the first model training information, so as to realize the completion of model training based on the local data of the first data acquisition device.
- the first data acquisition device can determine the first model training device based on the first model training information and send the local data of the first data acquisition device to the first model training device. This facilitates the first model training device to complete model training based on the local data.
- the server of the terminal manufacturer can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- the embodiments shown in Figures 5 and 7 above illustrate the technical solution of this application by taking the model fusion device as the central node in federated learning as an example.
- the model fusion device can be replaced by a model processing device, which can be the central node in segmentation learning.
- Step 506 in the embodiment shown in Figure 5 above or step 705 in the embodiment shown in Figure 7 above can be replaced by describing the following:
- the first model training device inputs the first data into the first model and pushes it to the segmentation layer to obtain the intermediate inference result.
- the first model is a sub-network of the complete neural network model.
- For information on the segmentation layer please refer to the relevant introduction in the aforementioned segmentation learning.
- Step 507 in the embodiment shown in Figure 5 above or step 706 in the embodiment shown in Figure 7 above can be replaced by:
- the model processing device receives the intermediate inference result from the first model training device.
- Step 508 in the embodiment shown in Figure 5 above or step 707 in the embodiment shown in Figure 7 above can be replaced by:
- the model processing device performs inference, gradient calculation, and/or parameter update of another part of the model based on the intermediate inference result.
- the other part of the model is another sub-network of the neural network model, and the first model and the other part of the model constitute the neural network model.
- Step 509 in the embodiment shown in Figure 5 or step 708 in the embodiment shown in Figure 7 may be omitted, while step 510 in the embodiment shown in Figure 5 or step 709 in the embodiment shown in Figure 7 may be replaced by: the model processing device sending the gradient backpropagation result to the first model training device.
- the first model training device updates the first model based on the gradient backpropagation result.
- the first model training device sends the updated first model to the first data acquisition device.
- FIG 8 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 8, the method includes:
- the first data acquisition device sends a first request to the model training management device.
- the model training management device receives the first request from the first data acquisition device.
- the model training management device sends a first instruction message to the first data acquisition device.
- the first data acquisition device receives the first instruction message from the model training management device.
- Steps 801 to 802 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above.
- steps 501 to 502 in the embodiment shown in Figure 5 above please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 801a may be performed before step 802.
- the first model training device sends a registration request to the model training management device.
- the model training management device receives the registration request from the first model training device.
- Step 801a is similar to step 501a in the embodiment shown in Figure 5 above.
- Step 801a is similar to step 501a in the embodiment shown in Figure 5 above.
- step 803 may be performed after step 801.
- the model training management device sends a first instruction message to the first model training device.
- the first model training device receives the first instruction message from the model training management device.
- Step 803 is similar to step 503 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 503 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 804 may be performed after step 801.
- the model training management device sends a first instruction message to the control device.
- the control device receives the first instruction message from the model training management device.
- Step 804 is similar to step 504 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 504 in the embodiment shown in Figure 5 above, which will not be repeated here.
- the embodiment shown in FIG8 further includes steps 805 to 806. Steps 805 to 806 may be performed after step 803.
- the first data acquisition device sends first data to the first model training device.
- the first data includes a first identifier.
- the first model training device receives the first data from the first data acquisition device.
- the first model training device trains the model based on the first data to obtain the first model.
- Steps 805 to 806 are similar to steps 505 to 506 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 505 to 506 in the embodiment shown in Figure 5 above, which will not be repeated here.
- Step 805a may be performed after step 804 and before step 805.
- the control device sends scheduling information to the first data acquisition device.
- the first data acquisition device receives the scheduling information from the control device.
- the scheduling information is used to schedule the first data acquisition device to send the first data.
- the embodiment shown in FIG8 further includes steps 805b to 805c. Steps 805b to 805c may be performed before step 805a.
- the first data acquisition device sends a second instruction to the control device.
- the control device receives the second instruction from the first data acquisition device.
- the control device determines the scheduling of the first data acquisition device based on the second instruction information.
- Steps 805b to 805c are similar to steps 505b to 505c in the embodiment shown in Figure 5 above.
- steps 505b to 505c are similar to steps 505b to 505c in the embodiment shown in Figure 5 above.
- steps 505b to 505c in the embodiment shown in Figure 5 above please refer to the relevant description of steps 505b to 505c in the embodiment shown in Figure 5 above, which will not be repeated here.
- Step 805d may be performed before step 805c.
- the first model training device sends a third instruction message to the control device.
- the control device receives the third instruction message from the first model training device.
- Step 805d is similar to step 505d in the embodiment shown in Figure 5 above.
- step 505d in the embodiment shown in Figure 5 above please refer to the relevant description of step 505d in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 807 may be performed after step 806.
- the first model training device sends the first model to the first data acquisition device.
- the first data acquisition device receives the first model from the first model training device.
- the first model can be used for model inference.
- the first model is used for model inference and/or model training.
- the first model training device trains the model based on the first data to obtain the first model and sends the first model to the first data acquisition device.
- the first model training device sends the first model to the first data acquisition device when the first model converges or when the training epochs corresponding to the first model reach a preset threshold. Instead of the first model training device sending the trained model to the first data acquisition device after each training iteration, the first model training device sends the trained model to the first data acquisition device.
- step 808 may be performed after step 806.
- the first model training device sends the first model to the second data acquisition device or the second model training device.
- the second data acquisition device or the second model training device receives the first model from the first model training device.
- the first model training device sends the first model to the second data acquisition device or the second model training device.
- the second data acquisition device can also send the first model to the second model training device, facilitating the fusion of the first model with its local model.
- the second model training device sends the fused model to the second data acquisition device.
- the second model training device is the model training unit of the second data acquisition device, used for model training based on data provided by the second data acquisition device.
- the fused model can be used for model inference.
- the fused model can be used for model inference and/or model training.
- step 807 can be executed first, followed by step 808; or step 808 can be executed first, followed by step 807; or, depending on the circumstances, steps 807 and 808 can be executed simultaneously.
- step 807 can be executed first, followed by step 808; or step 808 can be executed first, followed by step 807; or, depending on the circumstances, steps 807 and 808 can be executed simultaneously.
- This application does not impose any specific restrictions on this.
- the above step 808 illustrates the technical solution of this application by taking the implementation of the first model training device sending the first model to the second data acquisition device or the second model training device as an example.
- the first data acquisition device sends the first model to the second data acquisition device.
- the first data acquisition device sends the first model to the second model training device.
- the second data acquisition device sends the first model to the second model training device.
- the embodiment shown in FIG8 further includes steps 809 to 810. Steps 809 to 810 may be performed after step 806.
- the second data acquisition device or the second model training device sends the third model to the first model training device.
- the first model training device receives the third model from the second data acquisition device or the second model training device.
- the above step 809 illustrates the technical solution of this application by using the implementation method of the first model training device acquiring the third model from the second data acquisition device or the second model training device as an example.
- the first data acquisition device may also acquire the third model from the second data acquisition device or the second model training device. Then, the first data acquisition device sends the third model to the first model training device; this application does not limit the specific implementation.
- the first model training device merges the first model and the third model to obtain the fourth model.
- the first model training device can receive a third model from a second data acquisition device or a second model training device.
- the first model training device merges the first model and the third model to obtain a fourth model. This is beneficial for improving model performance.
- the fourth model can be used for model inference and/or model training.
- steps 809 to 810 may be performed after step 807.
- step 808 there is no fixed execution order between step 808 and steps 809 to 810.
- Step 808 can be executed first, followed by steps 809 to 810; or, steps 809 to 810 can be executed first, followed by step 808; or, depending on the situation, steps 808 and steps 809 to 810 can be executed simultaneously.
- This application does not limit the specific execution order.
- steps 807 and 808 can be executed after step 807. There is no fixed execution order between steps 808 and steps 809 to 810.
- step 811 may be performed after step 810.
- the first model training device sends the fourth model to the first data acquisition device.
- the first data acquisition device receives the fourth model from the first model training device.
- the fourth model can be used for model inference.
- the first data acquisition device sends a first request to the model training management device.
- the first request requests the allocation of model training information for the first data acquisition device.
- the first data acquisition device receives first instruction information from the model training management device.
- the first instruction information indicates the allocation of first model training information to the first data acquisition device.
- This enables the allocation of first model training information to the first data acquisition device.
- This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training.
- the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data.
- the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- FIG. 9 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 9, the method includes:
- the first data acquisition device sends a first request to the model training management device.
- the model training management device receives the first request from the first data acquisition device.
- the model training management device sends a first instruction message to the first data acquisition device.
- the first data acquisition device receives the first instruction message from the model training management device.
- Steps 901 to 902 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above.
- steps 501 to 502 in the embodiment shown in Figure 5 above please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
- step 901a may be performed before step 902.
- the first model training device sends a registration request to the model training management device.
- the model training management device receives the registration request from the first model training device.
- Step 901a is similar to step 501a in the embodiment shown in Figure 5 above.
- Step 901a is similar to step 501a in the embodiment shown in Figure 5 above.
- step 93 may be performed after step 901.
- the model training management device sends a first instruction message to the first model training device.
- the first model training device receives the first instruction message from the model training management device.
- Step 903 is similar to step 703 in the embodiment shown in Figure 7 above.
- step 703 in the embodiment shown in Figure 7 above please refer to the relevant description of step 703 in the embodiment shown in Figure 7 above, which will not be repeated here.
- the embodiment shown in FIG9 further includes steps 904a to 904c. Steps 904a to 904c may be performed after step 903.
- the first data acquisition device sends a second indication message to the control device.
- the second indication message is used to indicate the first identifier.
- the control device receives the second indication message from the first data acquisition device.
- the first model training device sends a third instruction message to the control device.
- the third instruction message is used to indicate the first identifier.
- the control device receives the third instruction message from the first model training device.
- the control device determines the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
- Steps 904a to 904c are similar to steps 704a to 704c in the embodiment shown in FIG7 above. For details, please refer to the relevant description of steps 704a to 704c in the embodiment shown in FIG7 above, which will not be repeated here.
- the embodiment shown in FIG9 further includes steps 904 to 905.
- Steps 904 to 905 may be performed after step 903.
- the first data acquisition device sends first data to the first model training device.
- the first data includes a first identifier.
- the first model training device receives the first data from the first data acquisition device.
- the first model training device trains the model based on the first data to obtain the first model.
- Steps 904 to 905 are similar to steps 704 to 705 in the embodiment shown in Figure 7 above. For details, please refer to the relevant descriptions of steps 704 to 705 in the embodiment shown in Figure 7 above, which will not be repeated here.
- step 904d may be performed after step 904c and before step 904.
- the control device sends scheduling information to the first data acquisition device.
- the first data acquisition device receives the scheduling information from the control device.
- the scheduling information is used to schedule the first data acquisition device to send the first data.
- step 96 may be performed after step 905.
- the first model training device sends the first model to the first data acquisition device.
- the first data acquisition device receives the first model from the first model training device.
- Step 906 is similar to step 807 in the embodiment shown in Figure 8 above. For details, please refer to the relevant description of step 807 in the embodiment shown in Figure 8. It will not be repeated here.
- step 97 may be performed after step 905.
- the first model training device sends the first model to the second data acquisition device or the second model training device.
- the second data acquisition device or the second model training device receives the first model from the first model training device.
- Step 907 is similar to step 808 in the embodiment shown in Figure 8 above.
- step 808 in the embodiment shown in Figure 8. It will not be repeated here.
- the embodiment shown in FIG9 further includes steps 908 to 909. Steps 908 to 909 may be performed after step 905.
- the second data acquisition device or the second model training device sends the third model to the first model training device.
- the first model training device receives the third model from the second data acquisition device or the second model training device.
- the first model training device merges the first model and the third model to obtain the fourth model.
- Steps 908 to 909 are similar to steps 809 to 810 in the embodiment shown in Figure 8 above. For details, please refer to the relevant descriptions of steps 809 to 810 in the embodiment shown in Figure 8 above, which will not be repeated here.
- step 910 may be performed after step 909.
- the first model training device sends the fourth model to the first data acquisition device.
- the first data acquisition device receives the fourth model from the first model training device.
- the fourth model can be used for model inference.
- the first data acquisition device sends a first request to the model training management device.
- the first request requests the allocation of model training information for the first data acquisition device.
- the first data acquisition device receives first instruction information from the model training management device.
- the first instruction information indicates the allocation of first model training information to the first data acquisition device.
- This enables the allocation of first model training information to the first data acquisition device.
- This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training.
- the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data.
- the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
- the first data acquisition device provided in the embodiments of this application is described below. Please refer to FIG10, which is a structural schematic diagram of the first data acquisition device in the embodiments of this application.
- the first data acquisition device 1000 can be used to execute the steps performed by the first data acquisition device in the embodiments shown in FIG5, FIG7 to FIG9.
- the first data acquisition device 1000 includes a transceiver module 1001.
- the first data acquisition device 1000 further includes a processing module 1002.
- the processing module 1002 is used for data processing.
- the transceiver module 1001 can implement the corresponding communication functions.
- the transceiver module 1001 can also be called a communication interface or a communication module.
- the first data acquisition device 1000 may further include a storage module, which can be used to store program code, program instructions and/or data.
- the processing module 1002 can read the instructions and/or data in the storage module so that the first data acquisition device 1000 can implement the aforementioned method embodiment.
- the first data acquisition device 1000 can be used to perform the actions performed by the first data acquisition device in the above method embodiment.
- the first data acquisition device 1000 can be a terminal device or a component configurable on a terminal device.
- the processing module 1002 is used to perform processing-related operations on the first data acquisition device side in the above method embodiment.
- the transceiver module 1001 is used to perform receiving-related operations on the first data acquisition device side in the above method embodiment.
- the transceiver module 1001 may include a sending module and a receiving module.
- the sending module is used to perform the sending operation in the above method embodiments.
- the receiving module is used to perform the receiving operation in the above method embodiments.
- the first data acquisition device 1000 may include a transmitting module but not a receiving module.
- the first data acquisition device 1000 may include a receiving module but not a transmitting module.
- the first data acquisition device 1000 is used to execute the actions performed by the first data acquisition device in the embodiments shown in Figures 5, 7 to 9.
- the first data acquisition device 1000 is used to execute the following scheme:
- the transceiver module 1001 is used to send a first request to the model training management device, the first request being used to request the allocation of model training information to the first data acquisition device 1000; and to receive first instruction information from the model training management device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device 1000.
- the first model training information includes: information about the first model training device, and/or, the first model training resources.
- the first indication information includes a first identifier, which indicates that the first model training device is a model training device of the first data acquisition device 1000.
- the first identifier is the identifier of the first data acquisition device 1000, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device 1000 is paired with the first model training device.
- the transceiver module 1001 is further configured to: receive a first model from a first model training device, wherein the first model is trained based on the first data; or receive a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data.
- the transceiver module 1001 is further configured to: receive a third model from the second model training device or the second data acquisition device; and send the third model to the first model training device, wherein the third model is used for model fusion.
- the transceiver module 1001 is also used to receive a fourth model from the first model training device, the fourth model being obtained by fusing the first model and the third model.
- the transceiver module 1001 is also used to: receive scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device 1000 to send the first data.
- the transceiver module 1001 is further configured to: send a second indication message to the control device, the second indication message being used to indicate the data status of the first data.
- the second indication information is also used to indicate the first identifier.
- the processing module 1002 in the above embodiments can be implemented by at least one processor or processor-related circuitry.
- the transceiver module 1001 can be implemented by a transceiver or transceiver-related circuitry.
- the transceiver module 1001 can also be referred to as a communication module or communication interface.
- the storage module can be implemented by at least one memory.
- the model training management device provided in the embodiments of this application is described below. Please refer to FIG11, which is a structural schematic diagram of the model training management device according to an embodiment of this application.
- the model training management device 1100 can be used to execute the steps performed by the model training management device in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments.
- the model training management device 1100 includes a transceiver module 1101.
- the model training management device 1100 also includes a processing module 1102.
- the processing module 1102 is used for data processing.
- the transceiver module 1101 can implement the corresponding communication functions.
- the transceiver module 1101 can also be called a communication interface or a communication module.
- the model training management device 1100 may further include a storage module, which can be used to store program code, program instructions and/or data.
- the processing module 1102 can read the instructions and/or data in the storage module so that the model training management device 1100 can implement the aforementioned method embodiment.
- the model training management device 1100 can be used to execute the actions performed by the model training management device in the above method embodiment.
- the model training management device 1100 can be a server or a component configurable on a server.
- the processing module 1102 is used to execute processing-related operations on the model training management device side in the above method embodiment.
- the transceiver module 1101 is used to execute receiving-related operations on the model training management device side in the above method embodiment.
- the transceiver module 1101 may include a sending module and a receiving module.
- the sending module is used to perform the sending operation in the above method embodiments.
- the receiving module is used to perform the receiving operation in the above method embodiments.
- the model training management device 1100 may include a sending module but not a receiving module.
- the model training management device 1100 may include a receiving module but not a sending module.
- the model training management device 1100 it depends on whether the above-described scheme executed by the model training management device 1100 includes both sending and receiving actions.
- the model training management device 1100 is used to execute the actions performed by the model training management device in the embodiments shown in Figures 5 and 7 to 9.
- the model training management device 1100 is used to execute the following scheme:
- the transceiver module 1101 is used to receive a first request from the first data acquisition device, the first request being used to request the allocation of model training information to the first data acquisition device; and to send first instruction information to the first data acquisition device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
- the transceiver module 1101 is further configured to: send first instruction information to the first model training device.
- the transceiver module 1101 is also used to: send first instruction information to the control device.
- the first model training information includes information about the first model training device and/or the first model training resources.
- the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
- the processing module 1102 in the above embodiments can be implemented by at least one processor or processor-related circuitry.
- the transceiver module 1101 can be implemented by a transceiver or transceiver-related circuitry.
- the transceiver module 1101 can also be referred to as a communication module or communication interface.
- the storage module can be implemented by at least one memory.
- the first model training apparatus provided in the embodiments of this application is described below. Please refer to FIG12, which is a structural schematic diagram of the first model training apparatus in the embodiments of this application.
- the first model training apparatus 1200 can be used to execute the steps performed by the first model training apparatus in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments.
- the first model training apparatus 1200 includes a transceiver module 1201 and a processing module 1202.
- the processing module 1202 is used for data processing.
- the transceiver module 1201 can implement the corresponding communication functions.
- the transceiver module 1201 can also be called a communication interface or a communication module.
- the first model training device 1200 may further include a storage module, which can be used to store program code, program instructions and/or data.
- the processing module 1202 can read the instructions and/or data in the storage module so that the first model training device 1200 can implement the aforementioned method embodiment.
- the first model training device 1200 can be used to execute the actions performed by the first model training device in the above method embodiment.
- the first model training device 1200 can be an AI server or a component configurable on an AI server.
- the processing module 1202 is used to execute processing-related operations on the first model training device side in the above method embodiment.
- the transceiver module 1201 is used to execute receiving-related operations on the first model training device side in the above method embodiment.
- the transceiver module 1201 may include a sending module and a receiving module.
- the sending module is used to perform the sending operation in the above method embodiments.
- the receiving module is used to perform the receiving operation in the above method embodiments.
- the first model training device 1200 may include a sending module but not a receiving module.
- the first model training device 1200 may include a receiving module but not a sending module.
- the first model training device 1200 is used to execute the actions performed by the first model training device in the embodiments shown in Figures 5 and 7 to 9.
- the first model training device 1200 is used to execute the following scheme:
- the transceiver module 1201 is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
- the processing module 1202 is used to determine the first model training information based on the first instruction information.
- the first model training information includes information about the first model training device 1200 and/or the first model training resources.
- the first indication information includes a first identifier, which is used to identify the first model training device 1200 as a model training device for the first data acquisition device.
- the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device 1200, or a pairing identifier, which is used to identify the pairing of the first data acquisition device and the first model training device 1200.
- the transceiver module 1201 is further configured to: receive first data from the first data acquisition device, the first data including a first identifier; and the processing module 1202 is further configured to: perform model training based on the first identifier and the first data to obtain a first model.
- the transceiver module 1201 is also used to: send the first model to the first data acquisition device or the model fusion device.
- the transceiver module 1201 is also used to: receive a second model from the model fusion device, the second model being obtained by the model fusion device fusing the first model.
- the transceiver module 1201 is also used to: receive a third model from the first data acquisition device, the second data acquisition device, or the second model training device, the third model being used for model fusion.
- processing module 1202 is further configured to: fuse the first model and the third model to obtain a fourth model; the transceiver module 1201 is further configured to: send the fourth model to the first data acquisition device.
- the transceiver module 1201 is also used to: send third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device 1200.
- the third indication information is also used to indicate the first identifier.
- the processing module 1202 in the above embodiments can be implemented by at least one processor or processor-related circuitry.
- the transceiver module 1201 can be implemented by a transceiver or transceiver-related circuitry.
- the transceiver module 1201 can also be referred to as a communication module or communication interface.
- the storage module can be implemented by at least one memory.
- the control device provided in the embodiments of this application is described below. Please refer to FIG13, which is a structural schematic diagram of the control device in the embodiments of this application.
- the control device 1300 can be used to execute the steps performed by the control device in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments.
- the control device 1300 includes a transceiver module 1301 and a processing module 1302.
- the processing module 1302 is used for data processing.
- the transceiver module 1301 can implement the corresponding communication functions.
- the transceiver module 1301 can also be called a communication interface or a communication module.
- control device 1300 may further include a storage module, which can be used to store program code, program instructions and/or data.
- the processing module 1302 can read the instructions and/or data in the storage module so that the control device 1300 can implement the aforementioned method embodiments.
- the control device 1300 can be used to execute the actions performed by the control device in the above method embodiments.
- the control device 1300 can be a network device or a component configurable on a network device.
- the processing module 1302 is used to execute processing-related operations on the control device side in the above method embodiments.
- the transceiver module 1301 is used to execute receiving-related operations on the control device side in the above method embodiments.
- the transceiver module 1301 may include a sending module and a receiving module.
- the sending module is used to perform the sending operation in the above method embodiments.
- the receiving module is used to perform the receiving operation in the above method embodiments.
- control device 1300 may include a transmitting module but not a receiving module.
- control device 1300 may include a receiving module but not a transmitting module.
- the control device 1300 is used to execute the actions performed by the control device in the embodiments shown in Figures 5 and 7 to 9.
- control device 1300 is used to execute the following scheme:
- the transceiver module 1301 is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
- the processing module 1302 is used to determine the first model training information based on the first instruction information.
- control device 1300 is used to execute the following scheme:
- the transceiver module 1301 is used to receive second instruction information from the first data acquisition device, the second instruction information being used to indicate the data status and first identifier of the first data, the first identifier being used to indicate the first model training device as the model training device of the first data acquisition device; and to receive third instruction information from the first model training device, the third instruction information being used to indicate the training-related information and first identifier of the first model training device.
- the processing module 1302 is used to determine the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
- the processing module 1302 in the above embodiments can be implemented by at least one processor or processor-related circuitry.
- the transceiver module 1301 can be implemented by a transceiver or transceiver-related circuitry.
- the transceiver module 1301 can also be referred to as a communication module or communication interface.
- the storage module can be implemented by at least one memory.
- the model fusion apparatus provided in the embodiments of this application is described below. Please refer to FIG14, which is a schematic diagram of the structure of the model fusion apparatus in the embodiments of this application.
- the model fusion apparatus 1400 can be used to perform the steps performed by the model fusion apparatus in the embodiments shown in FIG5 and FIG7. For details, please refer to the relevant description of the above method embodiments.
- the model fusion apparatus 1400 includes a transceiver module 1401 and a processing module 1402.
- the processing module 1402 is used for data processing.
- the transceiver module 1401 can implement the corresponding communication functions.
- the transceiver module 1401 can also be called a communication interface or a communication module.
- the model fusion apparatus 1400 may further include a storage module, which can be used to store program code, program instructions and/or data.
- the processing module 1402 can read the instructions and/or data in the storage module so that the model fusion apparatus 1400 can implement the aforementioned method embodiments.
- the model fusion device 1400 can be used to perform the actions performed by the model fusion device in the above method embodiments.
- the control device 1400 can be a network device or a component configurable on a network device.
- the processing module 1402 is used to perform processing-related operations on the model fusion device side in the above method embodiments.
- the transceiver module 1401 is used to perform receiving-related operations on the model fusion device side in the above method embodiments.
- the transceiver module 1401 may include a sending module and a receiving module.
- the sending module is used to perform the sending operation in the above method embodiments.
- the receiving module is used to perform the receiving operation in the above method embodiments.
- the model fusion device 1400 may include a transmitting module but not a receiving module.
- the model fusion device 1400 may include a receiving module but not a transmitting module.
- the model fusion device 1400 is used to execute the actions performed by the model fusion device in the embodiments shown in Figures 5 and 7.
- the model fusion device 1400 can be used to execute the following scheme:
- the transceiver module 1401 is used to receive a first model from the first model training device, the first model being obtained based on the first data from the first data acquisition device; the processing module 1402 is used to fuse the first model to obtain a second model.
- the transceiver module 1401 is also used to: send the second model to the first data acquisition device.
- the transceiver module 1401 is also used to send the second model to the first model training device.
- the processing module 1402 in the above embodiments can be implemented by at least one processor or processor-related circuitry.
- the transceiver module 1401 can be implemented by a transceiver or transceiver-related circuitry.
- the transceiver module 1401 can also be referred to as a communication module or communication interface.
- the storage module can be implemented by at least one memory.
- the apparatus 1500 includes a processor 1510, which is coupled to a memory 1520.
- the memory 1520 is used to store computer programs or instructions and/or data.
- the processor 1510 is used to execute the computer programs or instructions and/or data stored in the memory 1520, causing the methods in the above method embodiments to be executed.
- the apparatus 1500 is used to implement the operations performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above method embodiments.
- the device 1500 may include one or more processors 1510.
- the device 1500 may also include a memory 1520.
- the device 1500 may include one or more memory 1520.
- the memory 1520 can be integrated with the processor 1510 or set separately.
- the device 1500 may further include a transceiver 1530 for receiving and/or transmitting signals.
- the processor 1510 is used to control the transceiver 1530 to receive and/or transmit signals.
- This application also provides an apparatus 1600, which may be a terminal device, a processor in the terminal device, or a chip.
- the apparatus 1600 can be used to perform the operations performed by the first data acquisition device in the above method embodiments.
- Figure 16 shows a simplified schematic diagram of the terminal device.
- the terminal device includes a processor, a memory, and a transceiver.
- the memory can store computer program code
- the transceiver includes a transmitter 1631, a receiver 1632, radio frequency circuitry (not shown), an antenna 1633, and input/output devices (not shown).
- the processor is mainly used to process communication protocols and communication data; control terminal devices; execute software programs; and process data from software programs.
- Memory is mainly used to store software programs and data.
- Radio frequency (RF) circuits are mainly used for the conversion between baseband signals and RF signals, as well as for the processing of RF signals.
- Antennas are primarily used for transmitting and receiving radio frequency signals in the form of electromagnetic waves.
- Input/output devices can include touchscreens, displays, or keyboards. They are primarily used to receive user input and output data to the user. It should be noted that some types of terminal devices may not have input/output devices.
- the processor When data needs to be transmitted, the processor performs baseband processing on the data to be transmitted and outputs a baseband signal to the radio frequency (RF) circuit.
- the RF circuit then processes the baseband signal and transmits it outwards via an antenna as electromagnetic waves.
- the RF circuit receives the RF signal through the antenna.
- the RF circuit converts the RF signal back into a baseband signal and outputs it to the processor.
- the processor converts the baseband signal back into data and processes the data.
- Figure 16 only shows one memory, processor, and transceiver. In actual terminal device products, there may be one or more processors and one or more memories. Memory can also be called storage medium or storage device, etc. Memory can be independent of the processor or integrated with the processor; this embodiment does not limit this.
- the antenna and radio frequency circuit with transceiver function can be regarded as the transceiver module of the terminal device, and the processor with processing function can be regarded as the processing module of the terminal device.
- the terminal device includes a processor 1610, a memory 1620, and a transceiver 1630.
- the processor 1610 can also be referred to as a processing unit, processing board, processing module, or processing device, etc.
- the transceiver 1630 can also be referred to as a transceiver unit, transceiver, or transceiver device, etc.
- transceiver 1630 includes a receiver and a transmitter.
- a transceiver may also be called a transceiver unit, transceiver module, or transceiver circuit, etc.
- a receiver may also be called a receiver unit, receiving module, or receiving circuit, etc.
- a transmitter may also be called a transmitter, transmitting module, or transmitting circuit, etc.
- the processor 1610 is used to execute the processing operations on the first data acquisition device side of the embodiments shown in Figures 5, 7 to 9.
- the transceiver 1630 is used to execute the transmission and reception operations on the first data acquisition device side of the embodiments shown in Figures 5, 7 to 9.
- Figure 16 is merely an example and not a limitation, and the terminal device described above, including the transceiver module and the processing module, may not depend on the structure shown in Figure 10 or Figure 16.
- the chip When device 1600 is a chip, the chip includes a processor, a memory, and a transceiver.
- the transceiver can be an input/output circuit or a communication interface.
- the processor can be a processing module integrated on the chip, a microprocessor, or an integrated circuit.
- the transmitting operation of the first data acquisition device can be understood as the chip's output
- the receiving operation of the first data acquisition device in the above method embodiments can be understood as the chip's input.
- This application also provides a device 1700, which can be a network device or a chip.
- the device 1700 can be used to perform the operations performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9 above.
- Figure 17 shows a simplified schematic diagram of a base station structure.
- the base station includes parts 1710, 1720, and 1730.
- the 1710 section is mainly used for baseband processing and controlling the base station; the 1710 section is usually the control center of the base station, which can be called the processor, and is used to control the base station to perform the processing operations on the control device or model fusion device side in the above method embodiments.
- Part 1720 is primarily used to store computer program code and data.
- Section 1730 is primarily used for transmitting and receiving radio frequency (RF) signals, as well as converting RF signals to baseband signals.
- Section 1730 is commonly referred to as a transceiver module, transceiver, transceiver circuit, or transceiver unit.
- the transceiver module of section 1730 also known as a transceiver or transceiver unit, includes antenna 1733 and RF circuitry (not shown in the figure), where the RF circuitry is mainly used for RF processing.
- the device in section 1730 used for receiving can be considered a receiver, and the device used for transmitting can be considered a transmitter; that is, section 1730 includes receiver 1732 and transmitter 1731.
- the receiver can also be called a receiving module, receiver circuit, or receiving circuit
- the transmitter can be called a transmitting module, transmitter, or transmitting circuit.
- Sections 1710 and 1720 may include one or more circuit boards, each of which may include one or more processors and one or more memories.
- the processors are used to read and execute programs from the memories to implement baseband processing functions and control the base station. If multiple circuit boards exist, they can be interconnected to enhance processing capabilities. As an alternative implementation, multiple circuit boards may share one or more processors, multiple circuit boards may share one or more memories, or multiple circuit boards may simultaneously share one or more processors.
- the transceiver module of section 1730 is used to execute the transceiver-related processes performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9.
- the processor of section 1710 is used to execute the processing-related processes performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9.
- Figure 17 is merely an example and not a limitation, and the network device described above, including the processor, memory, and transceiver, may not depend on the structure shown in Figure 13 or Figure 17.
- the chip When device 1700 is a chip, the chip includes a transceiver, a memory, and a processor.
- the transceiver can be an input/output circuit or a communication interface;
- the processor can be an integrated processor, a microprocessor, or an integrated circuit on the chip.
- the transmitting operation of the control device or model fusion device can be understood as the chip's output, and the receiving operation of the control device or model fusion device in the above method embodiments can be understood as the chip's input.
- This application also provides a computer-readable storage medium storing computer instructions for implementing the methods executed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above-described method embodiments.
- the computer when the computer program is executed by a computer, the computer can implement the method performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above method embodiments.
- This application also provides a computer program product containing instructions that, when executed by a computer, cause the computer to implement the method described above, which is performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device.
- This application also provides a communication system, which includes a model training management device and a first data acquisition device.
- the model training management device is used to perform some or all of the operations performed by the model training management device in the embodiments shown in Figures 5, 7 to 9.
- the first data acquisition device is used to perform some or all of the operations performed by the first data acquisition device in the embodiments shown in Figures 5, 7 to 9.
- the communication system further includes a first model training device, which is used to perform some or all of the operations performed by the first model training device in the embodiments shown in Figures 5, 7 to 9.
- the communication system also includes a control device for performing some or all of the operations performed by the control device in the embodiments shown in Figures 5, 7 to 9.
- the communication system also includes a model fusion device, which performs some or all of the operations performed by the model fusion device in the embodiments shown in Figures 5 and 7.
- This application also provides a chip device, including a processor, for calling computer programs or computer instructions stored in the memory, so that the processor executes the method provided in the embodiments shown in Figures 5, 7 to 9 above.
- the input of the chip device corresponds to the receiving operation in any one of the embodiments shown in Figures 5, 7 to 9
- the output of the chip device corresponds to the sending operation in any one of the embodiments shown in Figures 5, 7 to 9.
- the processor is coupled to the memory via an interface.
- the chip device may also include a memory that stores computer programs or computer instructions.
- the processor mentioned above can be a general-purpose central processing unit, a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits used to control the execution of a program for controlling the method provided in any of the embodiments shown in Figures 5, 7 to 9.
- the memory mentioned above can be read-only memory (ROM) or other types of static storage devices capable of storing static information and instructions, such as random access memory (RAM).
- the disclosed systems, apparatuses, and methods can be implemented in other ways.
- the apparatus embodiments described above are merely illustrative; for instance, the division of units is only a logical functional division, and in actual implementation, there may be other division methods.
- multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
- the coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection between apparatuses or units through some interfaces, and may be electrical, mechanical, or other forms.
- the units described as separate components may or may not be physically separate.
- the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.
- the functional units in the various embodiments of this application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit.
- the integrated unit can be implemented in hardware or as a software functional unit.
- the integrated unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
- This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of this application.
- the aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, ROM, RAM, magnetic disks, or optical disks.
Landscapes
- Mobile Radio Communication Systems (AREA)
Abstract
Description
本申请要求于2024年05月31日提交国家知识产权局、申请号为202410705939.1、申请名称为“信息传输方法以及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to Chinese Patent Application No. 202410705939.1, filed with the State Intellectual Property Office of China on May 31, 2024, entitled "Information Transmission Method and Related Apparatus", the entire contents of which are incorporated herein by reference.
本申请涉及人工智能(artificial intelligence)技术领域,尤其涉及一种信息传输方法以及相关装置。This application relates to the field of artificial intelligence technology, and more particularly to an information transmission method and related apparatus.
分布式学习可以在充分保证用户数据隐私和安全的前提下完成AI模型的学习任务。分布式学习主要包括联邦学习、分割学习和去中心式学习。在分布式节点(例如,终端设备)具备足够计算能力的假设下,分布式节点基于本地采集的数据进行模型训练以获得本地模型,并将向其他节点(例如,中心节点或其他分布式节点)传输该本地模型。Distributed learning can complete the learning task of AI models while fully ensuring user data privacy and security. Distributed learning mainly includes federated learning, partitioned learning, and decentralized learning. Assuming that the distributed nodes (e.g., terminal devices) have sufficient computing power, the distributed nodes train the model based on locally collected data to obtain a local model, and then transmit the local model to other nodes (e.g., the central node or other distributed nodes).
但是,如果分布式节点的计算能力有限,该分布式节点无法进行模型训练,那么如何实现分布式学习以完成模型训练任务,是值得考虑的问题。However, if the computing power of a distributed node is limited and it is unable to train the model, then how to achieve distributed learning to complete the model training task is a question worth considering.
本申请提供了一种信息传输方法以及相关装置,用于实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。This application provides an information transmission method and related apparatus for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training to be completed based on the local data of the first data acquisition device.
本申请第一方面提供一种信息传输方法,该方法可以由第一数据采集装置执行,第一数据采集装置可以是终端设备,或者是终端设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分终端设备功能的逻辑模块或软件。方法包括:第一数据采集装置向模型训练管理装置发送第一请求,第一请求用于请求为第一数据采集装置分配模型训练信息;第一数据采集装置接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。This application provides an information transmission method, which can be executed by a first data acquisition device. The first data acquisition device can be a terminal device, a component within the terminal device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the terminal device's functions. The method includes: the first data acquisition device sending a first request to a model training management device, the first request requesting the allocation of model training information for the first data acquisition device; and the first data acquisition device receiving first instruction information from the model training management device, the first instruction information indicating the allocation of first model training information for the first data acquisition device. This enables the allocation of first model training information to the first data acquisition device. It facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, when the first data acquisition device has weak computing power or lacks computing power, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first data acquisition device can determine a first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data. For example, the first data acquisition device is a terminal device, and the first model training device is the terminal manufacturer's server. Given the limited computing power of the terminal device and its data privacy protection requirements, the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
基于第一方面,一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。从而实现为第一数据采集装置分配相应的模型训练装置以及训练资源。便于根据第一数据采集装置提供的数据完成模型训练任务。Based on the first aspect, in one possible implementation, the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device, facilitating the completion of the model training task based on the data provided by the first data acquisition device.
基于第一方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。实现通过第一标识指示第一数据采集装置与第一模型训练装置配对或匹配。便于后续该第一模型训练装置通过第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the first aspect, in one possible implementation, the first indication information includes a first identifier, which indicates that the first model training device is a model training device for the first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
基于第一方面,一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。第一标识可以类似于网络路由中的报文目的地址,便于第一模型训练装置基于第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the first aspect, in one possible implementation, the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier. The pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device. The first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device in determining whether to perform model training based on the data from the first data acquisition device.
基于第一方面,一种可能的实现方式中,方法还包括:第一数据采集装置向第一模型训练装置发送第一数据,第一数据包括第一标识,第一数据用于模型训练。从而便于第一模型训练装置基于第一标识和第一数据进行模型训练。Based on the first aspect, in one possible implementation, the method further includes: a first data acquisition device sending first data to a first model training device, the first data including a first identifier, the first data being used for model training. This facilitates the first model training device to perform model training based on the first identifier and the first data.
基于第一方面,一种可能的实现方式中,方法还包括:第一数据采集装置接收来自第一模型训练装置的第一模型,其中,第一模型是基于第一数据训练得到的,或者,第一数据采集装置接收来自模型融合装置的第二模型,第二模型是融合第一模型训练装置上报的第一模型得到的,第一模型是基于所述第一数据训练得到的。在该实现方式中,在去中心式学习中,第一数据采集装置可以接收来自第一模型训练装置的第一模型。而在联邦学习中,第一数据采集装置可以接收来自模型融合装置的第二模型。从而实现第一数据采集装置获取训练后的模型,便于第一数据采集装置基于第一模型或第二模型进行模型推理操作,提升模型推理性能。Based on the first aspect, in one possible implementation, the method further includes: a first data acquisition device receiving a first model from a first model training device, wherein the first model is trained based on the first data; or, the first data acquisition device receiving a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data. In this implementation, in decentralized learning, the first data acquisition device can receive the first model from the first model training device. In federated learning, the first data acquisition device can receive the second model from the model fusion device. This enables the first data acquisition device to obtain the trained model, facilitating model inference operations based on the first model or the second model, thereby improving model inference performance.
基于第一方面,一种可能的实现方式中,该方法还包括:第一数据采集装置接收来自第二模型训练装置或第二数据采集装置的第三模型,第一数据采集装置向第一模型训练装置发送该第三模型,第三模型用于模型融合。Based on the first aspect, in one possible implementation, the method further includes: a first data acquisition device receiving a third model from a second model training device or a second data acquisition device, the first data acquisition device sending the third model to the first model training device, and the third model being used for model fusion.
基于第一方面,一种可能的实现方式中,该方法还包括:第一数据采集装置接收来自第一模型训练装置的第四模型,第四模型是融合第一模型和第三模型得到的。Based on the first aspect, in one possible implementation, the method further includes: a first data acquisition device receiving a fourth model from a first model training device, the fourth model being obtained by fusing the first model and the third model.
基于第一方面,一种可能的实现方式中,在第一数据采集装置向第一模型训练装置发送第一数据之前,方法还包括:第一数据采集装置接收来自控制装置的调度信息,调度信息用于调度第一数据采集装置发送第一数据。从而实现对第一数据采集装置的调度。Based on the first aspect, in one possible implementation, before the first data acquisition device sends the first data to the first model training device, the method further includes: the first data acquisition device receiving scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device to send the first data. This achieves scheduling of the first data acquisition device.
基于第一方面,一种可能的实现方式中,在第一数据采集装置接收来自控制装置的调度信息之前,方法还包括:第一数据采集装置向控制装置发送第二指示信息,第二指示信息用于指示第一数据的数据状态。有利于控制装置基于第一数据的数据状态判断是否调度该第一数据采集装置。便于控制装置调度数据质量较高的数据采集装置以提供数据,从而有利于提升模型训练的性能。Based on the first aspect, in one possible implementation, before the first data acquisition device receives scheduling information from the control device, the method further includes: the first data acquisition device sending second indication information to the control device, the second indication information being used to indicate the data status of the first data. This facilitates the control device in determining whether to schedule the first data acquisition device based on the data status of the first data. It also facilitates the control device in scheduling data acquisition devices with higher data quality to provide data, thereby improving the performance of model training.
基于第一方面,一种可能的实现方式中,第二指示信息还用于指示第一标识。便于控制装置基于第一标识确定与该第一数据采集装置匹配的第一模型训练装置,以便于控制装置判断是否调度第一数据采集装置。Based on the first aspect, in one possible implementation, the second indication information is further used to indicate the first identifier. This facilitates the control device in determining the first model training device that matches the first data acquisition device based on the first identifier, so that the control device can determine whether to schedule the first data acquisition device.
本申请第二方面提供一种信息传输方法,该方法可以由模型训练管理装置执行,模型训练管理装置可以是模型训练管理服务器,或者是模型训练管理服务器中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分模型训练管理服务器功能的逻辑模块或软件。方法包括:模型训练管理装置接收来自第一数据采集装置的第一请求,第一请求用于请求为第一数据采集装置分配模型训练信息;模型训练管理装置向第一数据采集装置发送第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息,便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。A second aspect of this application provides an information transmission method, which can be executed by a model training management device. The model training management device can be a model training management server, a component within a model training management server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a model training management server. The method includes: the model training management device receiving a first request from a first data acquisition device, the first request requesting the allocation of model training information to the first data acquisition device; and the model training management device sending first instruction information to the first data acquisition device, the first instruction information indicating the allocation of first model training information to the first data acquisition device. This method allocates first model training information to the first data acquisition device, facilitating the first data acquisition device to determine how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, when the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps to solve the problem that the first data acquisition device cannot perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device to complete model training based on the local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server from the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
基于第二方面,一种可能的实现方式中,方法还包括:模型训练管理装置向第一模型训练装置发送第一指示信息。从而便于第一模型训练装置确定与其配对的第一数据采集装置。Based on the second aspect, in one possible implementation, the method further includes: the model training management device sending first instruction information to the first model training device. This facilitates the first model training device in determining the first data acquisition device it is paired with.
基于第二方面,一种可能的实现方式中,方法还包括:模型训练管理装置向控制装置发送第一指示信息。便于控制装置确定第一数据采集装置与第一模型训练装置之间配对。有利于控制装置更好的调度数据采集装置。从而提升模型训练的性能。Based on the second aspect, one possible implementation further includes: the model training management device sending first instruction information to the control device. This facilitates the control device in determining the pairing between the first data acquisition device and the first model training device. It also allows the control device to better schedule the data acquisition device, thereby improving the performance of model training.
基于第二方面,一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。从而实现为第一数据采集装置分配相应的模型训练装置以及训练资源。便于为第一数据采集装置提供模型训练功能,从而完成模型训练任务。Based on the second aspect, in one possible implementation, the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device, facilitating the provision of model training functionality to the first data acquisition device and thus completing the model training task.
基于第二方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。实现通过第一标识指示第一数据采集装置与第一模型训练装置配对或匹配。便于后续该第一模型训练装置通过第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the second aspect, in one possible implementation, the first indication information includes a first identifier, which indicates that the first model training device is a model training device for the first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
基于第二方面,一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。第一标识可以类似于网络路由中的报文目的地址,便于第一模型训练装置根据第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the second aspect, in one possible implementation, the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier. The pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device. The first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device to determine whether to perform model training based on the data from the first data acquisition device.
本申请第三方面提供一种信息传输方法,该方法可以由第一模型训练装置执行,第一模型训练装置可以是AI服务器,或者是AI服务器中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分AI服务器功能的逻辑模块或软件。方法包括:第一模型训练装置接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;第一模型训练装置根据第一指示信息确定第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息,便于第一模型训练装置根据第一模型训练信息确定基于第一数据采集装置的本地数据进行模型训练,以完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一模型训练装置根据第一模型训练信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。第一模型训练装置可以接收来自第一数据采集装置的本地数据,并基于该本地数据完成模型训练。从而解决第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下无法进行模型训练的问题。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。A third aspect of this application provides an information transmission method, which can be executed by a first model training device. The first model training device can be an AI server, or a component within an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server. The method includes: the first model training device receiving first instruction information from a model training management device, the first instruction information indicating first model training information allocated to a first data acquisition device; and the first model training device determining the first model training information based on the first instruction information. This method allocates first model training information to the first data acquisition device, facilitating the first model training device to determine, based on local data from the first data acquisition device, to perform model training, thereby completing the model training. For example, when the first data acquisition device has weak computing power or lacks computing power, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first model training device determines itself as the model training device for the first data acquisition device based on the first model training information. The first model training device can receive local data from the first data acquisition device and complete model training based on that local data. This solves the problem of model training being impossible when the first data acquisition device has limited or no computing power. For example, if the first data acquisition device is a terminal device and the first model training device is the terminal manufacturer's server, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the terminal device's local data. This ensures both the data privacy protection requirements of the first data acquisition device and the ability to train the model.
基于第三方面,一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。从而实现为第一数据采集装置分配相应的模型训练装置以及训练资源。便于第一模型训练装置为第一数据采集装置提供模型训练功能,从而完成模型训练任务。Based on the third aspect, in one possible implementation, the first model training information includes information about the first model training device and/or the first model training resources. This allows for the allocation of a corresponding model training device and training resources to the first data acquisition device. This facilitates the first model training device providing model training functionality to the first data acquisition device, thereby completing the model training task.
基于第三方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于标识第一模型训练装置作为第一数据采集装置的模型训练装置。实现通过第一标识指示第一数据采集装置与第一模型训练装置配对或匹配。便于后续该第一模型训练装置通过第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the third aspect, in one possible implementation, the first indication information includes a first identifier, which identifies the first model training device as a model training device that is also a first data acquisition device. This allows the first data acquisition device to be paired or matched with the first model training device via the first identifier. This facilitates the first model training device in subsequently determining whether to perform model training based on data from the first data acquisition device using the first identifier.
基于第三方面,一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于标识第一数据采集装置与第一模型训练装置配对。第一标识可以类似于网络路由中的报文目的地址,便于第一模型训练装置基于第一标识确定是否基于第一数据采集装置的数据进行模型训练。Based on the third aspect, in one possible implementation, the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier. The pairing identifier is used to identify the pairing of the first data acquisition device and the first model training device. The first identifier can be similar to the destination address of a packet in network routing, which facilitates the first model training device in determining whether to perform model training based on the data from the first data acquisition device.
基于第三方面,一种可能的实现方式中,方法还包括:第一模型训练装置接收来自第一数据采集装置的第一数据,第一数据包括第一标识;第一模型训练装置根据第一数据进行模型训练,得到第一模型。从而实现第一模型训练装置为第一数据采集装置提供模型训练功能,完成模型训练任务。Based on the third aspect, in one possible implementation, the method further includes: a first model training device receiving first data from a first data acquisition device, the first data including a first identifier; the first model training device performing model training based on the first data to obtain a first model. This enables the first model training device to provide model training functionality to the first data acquisition device, completing the model training task.
基于第三方面,一种可能的实现方式中,方法还包括:第一模型训练装置向第一数据采集装置或模型融合装置发送第一模型。从而便于第一数据采集装置进行模型推理,提升模型推理性能。或者便于模型融合装置进行模型融合,提升模型训练的性能。Based on the third aspect, one possible implementation further includes: the first model training device sending the first model to the first data acquisition device or the model fusion device. This facilitates model inference by the first data acquisition device, improving model inference performance. Alternatively, it facilitates model fusion by the model fusion device, improving model training performance.
基于第三方面,一种可能的实现方式中,方法还包括:第一模型训练装置接收来自模型融合装置的第二模型,第二模型是模型融合装置融合第一模型得到的。从而便于第一模型训练装置基于第二模型进行模型推理和/或模型训练等。Based on the third aspect, in one possible implementation, the method further includes: a first model training device receiving a second model from a model fusion device, the second model being obtained by fusing the first model by the model fusion device. This facilitates the first model training device performing model inference and/or model training based on the second model.
基于第三方面,一种可能的实现方式中,方法还包括:第一模型训练装置接收来自第一数据采集装置、第二数据采集装置或第二模型训练装置的第三模型,第三模型用于模型融合。在去中心式学习中,实现不同数据采集装置或不同模型训练装置中的模型可以相互传输,便于实现模型融合。Based on the third aspect, one possible implementation further includes: a first model training device receiving a third model from a first data acquisition device, a second data acquisition device, or a second model training device, the third model being used for model fusion. In decentralized learning, this allows models from different data acquisition devices or different model training devices to be transferred to each other, facilitating model fusion.
基于第三方面,一种可能的实现方式中,第一模型训练装置融合第一模型和第三模型,得到第四模型;第一模型训练装置向第一数据采集装置发送第四模型。在去中心式学习中,实现不同数据采集装置或不同模型训练装置中的模型可以相互传输,并进行融合。从而提升模型训练的性能。Based on the third aspect, in one possible implementation, the first model training device fuses the first model and the third model to obtain a fourth model; the first model training device then sends the fourth model to the first data acquisition device. In decentralized learning, models from different data acquisition devices or different model training devices can be transferred to each other and fused, thereby improving model training performance.
基于第三方面,一种可能的实现方式中,在第一模型训练装置接收来自第一数据采集装置的第一数据之前,方法还包括:第一模型训练装置向控制装置发送第三指示信息,第三指示信息用于指示第一模型训练装置的训练相关信息。有利于控制装置更好的调度相应的数据采集装置,提升模型训练的性能。Based on the third aspect, in one possible implementation, before the first model training device receives the first data from the first data acquisition device, the method further includes: the first model training device sending third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device. This facilitates better scheduling of the corresponding data acquisition devices by the control device, improving the performance of model training.
基于第三方面,一种可能的实现方式中,第三指示信息还用于指示第一标识。便于控制装置基于第一标识确定与该第一数据采集装置匹配的第一模型训练装置,以便于控制装置判断是否调度第一数据采集装置。Based on the third aspect, in one possible implementation, the third indication information is also used to indicate the first identifier. This facilitates the control device in determining the first model training device that matches the first data acquisition device based on the first identifier, so that the control device can determine whether to schedule the first data acquisition device.
本申请第四方面提供一种信息传输方法,该方法可以由控制装置执行,控制装置可以是网络设备,或者是网络设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备功能的逻辑模块或软件。方法包括:控制装置接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;控制装置根据第一指示信息确定第一模型训练信息。便于控制装置根据该第一模型训练信息调度该第一数据采集装置,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案能够解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,控制装置可以根据调度终端设备,终端设备向终端厂商的服务器发送本地数据。然后终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。A fourth aspect of this application provides an information transmission method, which can be executed by a control device. The control device can be a network device, a component within the network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device. The method includes: the control device receiving first instruction information from a model training management device, the first instruction information indicating first model training information allocated to a first data acquisition device; and the control device determining the first model training information based on the first instruction information. This facilitates the control device scheduling the first data acquisition device based on the first model training information, enabling model training to be completed based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution can solve the problem that the first data acquisition device cannot perform model training. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the control device can schedule the terminal device, and the terminal device can send local data to the terminal manufacturer's server. Then, the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
基于第四方面,一种可能的实现方式中,第一模型训练信息包括:第一模型训练装置的信息,和/或,第一模型训练资源。实现控制装置确定为第一数据采集装置分配的模型训练装置和/或模型训练资源。Based on the fourth aspect, in one possible implementation, the first model training information includes: information about the first model training device, and/or, the first model training resources. The control device determines the model training device and/or model training resources allocated to the first data acquisition device.
基于第四方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。实现通过第一标识指示第一数据采集装置与第一模型训练装置配对或匹配。便于控制装置结合第一数据采集装置的数据状态和第一模型训练装置的训练能力判断是否调度第一数据采集装置。从而便于控制装置调度数据质量较高的数据采集装置,以提升模型训练的性能。Based on the fourth aspect, in one possible implementation, the first indication information includes a first identifier, which indicates that the first model training device is used as the model training device for the first data acquisition device. This allows the first identifier to indicate pairing or matching between the first data acquisition device and the first model training device. This facilitates the control device in determining whether to schedule the first data acquisition device based on its data status and the training capability of the first model training device. Consequently, it allows the control device to schedule data acquisition devices with higher data quality, thereby improving model training performance.
基于第四方面,一种可能的实现方式中,第一标识为所述第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。实现控制装置基于第一标识确定第一模型训练装置与第一数据采集装置配对。Based on the fourth aspect, in one possible implementation, the first identifier is either the identifier of the first data acquisition device, the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device and the first model training device are paired. The control device determines the pairing of the first model training device and the first data acquisition device based on the first identifier.
基于第四方面,一种可能的实现方式中,方法还包括:控制装置向第一数据采集装置发送调度信息,调度信息用于调度第一数据采集装置发送第一数据。Based on the fourth aspect, in one possible implementation, the method further includes: the control device sending scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device to send the first data.
基于第四方面,一种可能的实现方式中,在控制装置向第一数据采集装置发送调度信息之前,方法还包括:控制装置接收来自第一数据采集装置的第二指示信息,第二指示信息用于指示第一数据的数据状态;控制装置根据第二指示信息确定调度第一数据采集装置。有利于控制装置调度数据质量较高的数据采集装置,以提升模型训练的性能。Based on the fourth aspect, in one possible implementation, before the control device sends scheduling information to the first data acquisition device, the method further includes: the control device receiving second indication information from the first data acquisition device, the second indication information indicating the data status of the first data; and the control device determining the scheduling of the first data acquisition device based on the second indication information. This is beneficial for the control device to schedule data acquisition devices with higher data quality, thereby improving the performance of model training.
基于第四方面,一种可能的实现方式中,方法还包括:控制装置接收来自第一模型训练装置的第三指示信息;控制装置根据第二指示信息确定调度第一数据采集装置,包括:控制装置根据第二指示信息和第三指示信息确定调度第一数据采集装置。实现控制装置更好的调度数据采集装置,保障被调度的数据采集装置所匹配的模型训练装置能够提供模型训练功能。Based on the fourth aspect, in one possible implementation, the method further includes: the control device receiving third instruction information from the first model training device; the control device determining the scheduling of the first data acquisition device based on the second instruction information, including: the control device determining the scheduling of the first data acquisition device based on the second instruction information and the third instruction information. This enables the control device to better schedule the data acquisition device, ensuring that the model training device matched with the scheduled data acquisition device can provide model training functionality.
本申请第五方面提供一种信息传输方法,该方法可以由控制装置执行,控制装置可以是网络设备,或者是网络设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备功能的逻辑模块或软件。方法包括:控制装置接收来自第一数据采集装置的第二指示信息,第二指示信息用于指示第一数据的数据状态和第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置;控制装置接收来自第一模型训练装置的第三指示信息,第三指示信息用于指示第一模型训练装置的训练相关信息和第一标识;控制装置根据第二指示信息和第三指示信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。从而便于控制装置基于第二指示信息和第三指示信息调度该第一数据采集装置,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案能够解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,控制装置可以基于第二指示信息和第三指示信息调度终端设备,终端设备向终端厂商的服务器发送本地数据。然后终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。This application provides a fifth aspect of an information transmission method, which can be executed by a control device. The control device can be a network device, a component within the network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device. The method includes: the control device receiving second indication information from a first data acquisition device, the second indication information indicating the data status and a first identifier of first data, the first identifier indicating a first model training device as the model training device for the first data acquisition device; the control device receiving third indication information from the first model training device, the third indication information indicating training-related information and a first identifier for the first model training device; and the control device determining the first model training device as the model training device for the first data acquisition device based on the second and third indication information. This facilitates the control device scheduling the first data acquisition device based on the second and third indication information to achieve model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution can solve the problem that the first data acquisition device cannot perform model training. For example, if the first data acquisition device is a terminal device and the first model training device is the terminal manufacturer's server, and the terminal device has limited computing power and data privacy protection requirements, the control device can schedule the terminal device based on the second and third instruction information. The terminal device then sends local data to the terminal manufacturer's server. The terminal manufacturer's server can then complete model training based on the terminal device's local data. This approach ensures both the data privacy protection requirements of the first data acquisition device and the successful training of the model.
基于第五方面,一种可能的实现方式中,方法还包括:控制装置根据第二指示信息和第三指示信息确定调度第一数据采集装置。实现控制装置更好的调度数据采集装置,保障被调度的数据采集装置所匹配的模型训练装置能够提供模型训练功能。进一步的,有利于控制装置调度数据质量较高的数据采集装置,以提升模型训练的性能。Based on the fifth aspect, one possible implementation further includes: the control device determining the scheduling of the first data acquisition device according to the second and third instruction information. This enables the control device to better schedule data acquisition devices, ensuring that the model training device matched with the scheduled data acquisition device can provide model training functionality. Furthermore, it facilitates the control device scheduling data acquisition devices with higher data quality, thereby improving model training performance.
基于第五方面,一种可能的实现方式中,方法还包括:控制装置向第一数据采集装置发送调度信息,调度信息用于调度第一数据采集装置。Based on the fifth aspect, in one possible implementation, the method further includes: the control device sending scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device.
本申请第六方面提供一种信息传输方法,该方法可以由模型融合装置执行,模型融合装置可以是网络设备,或者是网络设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备功能的逻辑模块或软件。方法包括:模型融合装置接收来自第一模型训练装置的第一模型,第一模型是基于第一数据采集装置的第一数据得到的;模型融合装置融合第一模型,得到第二模型。从而完成模型的训练和融合。A sixth aspect of this application provides an information transmission method, which can be executed by a model fusion device. The model fusion device can be a network device, a component within a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a network device. The method includes: the model fusion device receiving a first model from a first model training device, the first model being obtained based on first data from a first data acquisition device; and the model fusion device fusing the first model to obtain a second model. This completes the training and fusion of the models.
基于第六方面,一种可能的实现方式中,方法还包括:模型融合装置向第一数据采集装置发送第二模型。便于第一数据采集装置基于第二模型进行模型推理,提升模型推理性能。Based on the sixth aspect, one possible implementation further includes: the model fusion device sending the second model to the first data acquisition device. This facilitates the first data acquisition device in performing model inference based on the second model, improving model inference performance.
基于第六方面,一种可能的实现方式中,方法还包括:模型融合装置向第一模型训练装置发送第二模型。便于第一模型训练装置基于第二模型进行模型训练和/或模型推理,提升模型训练性和/或模型推理性能。Based on the sixth aspect, in one possible implementation, the method further includes: the model fusion device sending the second model to the first model training device. This facilitates the first model training device to perform model training and/or model inference based on the second model, thereby improving model trainability and/or model inference performance.
本申请第七方面提供一种第一数据采集装置,第一数据采集装置包括:A seventh aspect of this application provides a first data acquisition device, the first data acquisition device comprising:
收发模块,用于向模型训练管理装置发送第一请求,第一请求用于请求为第一数据采集装置分配模型训练信息;接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。The transceiver module is used to send a first request to the model training management device, the first request being used to request the allocation of model training information to the first data acquisition device; and to receive first instruction information from the model training management device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
基于第七方面,一种可能的实现方式中,第一模型训练信息包括:第一模型训练装置的信息,和/或,第一模型训练资源。Based on the seventh aspect, in one possible implementation, the first model training information includes: information about the first model training device, and/or, the first model training resources.
基于第七方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。Based on the seventh aspect, in one possible implementation, the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
基于第七方面,一种可能的实现方式中,第一标识为所述第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。Based on the seventh aspect, in one possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
基于第七方面,一种可能的实现方式中,收发模块还用于:接收来自第一模型训练装置的第一模型,其中,第一模型是基于第一数据训练得到的,或者,接收来自模型融合装置的第二模型,第二模型是融合第一模型训练装置上报的第一模型得到的,第一模型是基于所述第一数据训练得到的。Based on the seventh aspect, in one possible implementation, the transceiver module is further configured to: receive a first model from a first model training device, wherein the first model is trained based on first data, or receive a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data.
基于第七方面,一种可能的实现方式中,收发模块还用于:接收来自第二模型训练装置或第二数据采集装置的第三模型;向第一模型训练装置发送该第三模型,第三模型用于模型融合。Based on the seventh aspect, in one possible implementation, the transceiver module is further configured to: receive a third model from a second model training device or a second data acquisition device; and send the third model to a first model training device, wherein the third model is used for model fusion.
基于第七方面,一种可能的实现方式中,收发模块还用于:接收来自第一模型训练装置的第四模型,第四模型是融合第一模型和第三模型得到的。Based on the seventh aspect, in one possible implementation, the transceiver module is further configured to: receive a fourth model from the first model training device, the fourth model being obtained by fusing the first model and the third model.
基于第七方面,一种可能的实现方式中,收发模块还用于:接收来自控制装置的调度信息,调度信息用于调度第一数据采集装置发送第一数据。Based on the seventh aspect, in one possible implementation, the transceiver module is further configured to: receive scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device to send the first data.
基于第七方面,一种可能的实现方式中,收发模块还用于:向控制装置发送第二指示信息,第二指示信息用于指示第一数据的数据状态。Based on the seventh aspect, in one possible implementation, the transceiver module is further configured to: send a second indication message to the control device, the second indication message being used to indicate the data status of the first data.
基于第七方面,一种可能的实现方式中,第二指示信息还用于指示第一标识。Based on the seventh aspect, in one possible implementation, the second indication information is also used to indicate the first identifier.
本申请第八方面提供一种模型训练管理装置,模型训练管理装置包括:The eighth aspect of this application provides a model training management device, which includes:
收发模块,用于接收来自第一数据采集装置的第一请求,第一请求用于请求为第一数据采集装置分配模型训练信息;向第一数据采集装置发送第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。The transceiver module is used to receive a first request from a first data acquisition device, the first request being used to request the allocation of model training information to the first data acquisition device; and to send first instruction information to the first data acquisition device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
基于第八方面,一种可能的实现方式中,收发模块还用于:向第一模型训练装置发送第一指示信息。Based on the eighth aspect, in one possible implementation, the transceiver module is further configured to: send first instruction information to the first model training device.
基于第八方面,一种可能的实现方式中,收发模块还用于:向控制装置发送第一指示信息。Based on the eighth aspect, in one possible implementation, the transceiver module is also used to: send first instruction information to the control device.
基于第八方面,一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。Based on the eighth aspect, in one possible implementation, the first model training information includes information about the first model training device and/or, the first model training resources.
基于第八方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。Based on the eighth aspect, in one possible implementation, the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
基于第八方面,一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。Based on the eighth aspect, in one possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
本申请第九方面提供一种第一模型训练装置,第一模型训练装置包括:A ninth aspect of this application provides a first model training apparatus, the first model training apparatus comprising:
收发模块,用于接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;The transceiver module is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
处理模块,用于根据第一指示信息确定第一模型训练信息。The processing module is used to determine the first model training information based on the first instruction information.
基于第九方面,一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。Based on the ninth aspect, in one possible implementation, the first model training information includes information about the first model training device and/or, the first model training resources.
基于第九方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于标识第一模型训练装置作为第一数据采集装置的模型训练装置。Based on the ninth aspect, in one possible implementation, the first indication information includes a first identifier, which is used to identify the first model training device as a model training device for the first data acquisition device.
基于第九方面,一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于标识第一数据采集装置与第一模型训练装置配对。Based on the ninth aspect, in one possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to identify the pairing of the first data acquisition device and the first model training device.
基于第九方面,一种可能的实现方式中,收发模块还用于:接收来自第一数据采集装置的第一数据,第一数据包括第一标识;处理模块还用于:根据第一数据进行模型训练,得到第一模型。Based on the ninth aspect, in one possible implementation, the transceiver module is further configured to: receive first data from the first data acquisition device, the first data including a first identifier; the processing module is further configured to: perform model training based on the first data to obtain a first model.
基于第九方面,一种可能的实现方式中,收发模块还用于:向第一数据采集装置或模型融合装置发送第一模型。Based on the ninth aspect, in one possible implementation, the transceiver module is further configured to: send the first model to the first data acquisition device or the model fusion device.
基于第九方面,一种可能的实现方式中,收发模块还用于:接收来自模型融合装置的第二模型,第二模型是模型融合装置融合第一模型得到的。Based on the ninth aspect, in one possible implementation, the transceiver module is further configured to: receive a second model from the model fusion device, the second model being obtained by the model fusion device fusing the first model.
基于第九方面,一种可能的实现方式中,收发模块还用于:接收来自第一数据采集装置、第二数据采集装置或第二模型训练装置的第三模型,第三模型用于模型融合。Based on the ninth aspect, in one possible implementation, the transceiver module is further configured to: receive a third model from the first data acquisition device, the second data acquisition device, or the second model training device, wherein the third model is used for model fusion.
基于第九方面,一种可能的实现方式中,处理模块还用于:融合第一模型和第三模型,得到第四模型;收发模块还用于:向第一数据采集装置发送第四模型。Based on the ninth aspect, in one possible implementation, the processing module is further configured to: fuse the first model and the third model to obtain a fourth model; the transceiver module is further configured to: send the fourth model to the first data acquisition device.
基于第九方面,一种可能的实现方式中,收发模块还用于:向控制装置发送第三指示信息,第三指示信息用于指示第一模型训练装置的训练相关信息。Based on the ninth aspect, in one possible implementation, the transceiver module is further configured to: send third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device.
基于第九方面,一种可能的实现方式中,第三指示信息还用于指示第一标识。Based on the ninth aspect, in one possible implementation, the third indication information is also used to indicate the first identifier.
本申请第十方面提供一种控制装置,控制装置包括:The tenth aspect of this application provides a control device, the control device comprising:
收发模块,用于接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;The transceiver module is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
处理模块,用于根据第一指示信息确定第一模型训练信息。The processing module is used to determine the first model training information based on the first instruction information.
基于第十方面,一种可能的实现方式中,第一模型训练信息包括:第一模型训练装置的信息,和/或,第一模型训练资源。Based on the tenth aspect, in one possible implementation, the first model training information includes: information about the first model training device, and/or, the first model training resources.
基于第十方面,一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。Based on the tenth aspect, in one possible implementation, the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
基于第十方面,一种可能的实现方式中,第一标识为所述第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。Based on the tenth aspect, in one possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
基于第十方面,一种可能的实现方式中,收发模块还用于:向第一数据采集装置发送调度信息,调度信息用于调度第一数据采集装置发送第一数据。Based on the tenth aspect, in one possible implementation, the transceiver module is further configured to: send scheduling information to the first data acquisition device, wherein the scheduling information is used to schedule the first data acquisition device to send the first data.
基于第十方面,一种可能的实现方式中,收发模块还用于:接收来自第一数据采集装置的第二指示信息,第二指示信息用于指示第一数据的数据状态;处理模块还用于:根据第二指示信息确定调度第一数据采集装置。Based on the tenth aspect, in one possible implementation, the transceiver module is further configured to: receive second indication information from the first data acquisition device, the second indication information being used to indicate the data status of the first data; the processing module is further configured to: determine the scheduling of the first data acquisition device based on the second indication information.
基于第十方面,一种可能的实现方式中,收发模块还用于:接收来自第一模型训练装置的第三指示信息;处理模块具体用于:根据第二指示信息和第三指示信息确定调度第一数据采集装置。Based on the tenth aspect, in one possible implementation, the transceiver module is further configured to: receive third instruction information from the first model training device; the processing module is specifically configured to: determine the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
本申请第十一方面提供一种控制装置,控制装置包括:The eleventh aspect of this application provides a control device, the control device comprising:
收发模块,用于接收来自第一数据采集装置的第二指示信息,第二指示信息用于指示第一数据的数据状态和第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置;接收来自第一模型训练装置的第三指示信息,第三指示信息用于指示第一模型训练装置的训练相关信息和第一标识;The transceiver module is used to receive second instruction information from the first data acquisition device, the second instruction information being used to indicate the data status and first identifier of the first data, the first identifier being used to indicate that the first model training device is the model training device of the first data acquisition device; and to receive third instruction information from the first model training device, the third instruction information being used to indicate the training-related information and first identifier of the first model training device.
处理模块,用于根据第二指示信息和第三指示信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。The processing module is used to determine the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
基于第十一方面,一种可能的实现方式中,处理模块还用于:根据第二指示信息和第三指示信息确定调度第一数据采集装置。Based on the eleventh aspect, in one possible implementation, the processing module is further configured to: determine the scheduling of the first data acquisition device based on the second instruction information and the third instruction information.
基于第十一方面,一种可能的实现方式中,收发模块还用于:向第一数据采集装置发送调度信息,调度信息用于调度第一数据采集装置。Based on the eleventh aspect, in one possible implementation, the transceiver module is further configured to: send scheduling information to the first data acquisition device, the scheduling information being used to schedule the first data acquisition device.
本申请第十二方面提供一种模型融合装置,模型融合装置包括:The twelfth aspect of this application provides a model fusion apparatus, the model fusion apparatus comprising:
收发模块,用于接收来自第一模型训练装置的第一模型,第一模型是基于第一数据采集装置的第一数据得到的;The transceiver module is used to receive a first model from a first model training device, wherein the first model is obtained based on first data from a first data acquisition device.
处理模块,用于融合第一模型,得到第二模型。The processing module is used to fuse the first model to obtain the second model.
基于第十二方面,一种可能的实现方式中,收发模块还用于:向第一数据采集装置发送第二模型。Based on the twelfth aspect, in one possible implementation, the transceiver module is also used to: send the second model to the first data acquisition device.
基于第十二方面,一种可能的实现方式中,收发模块还用于:向第一模型训练装置发送第二模型。Based on the twelfth aspect, in one possible implementation, the transceiver module is also used to: send the second model to the first model training device.
针对上述第七方面,第一数据采集装置可以是终端设备,或者是终端设备的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分终端设备功能的逻辑模块或软件。所述收发模块可以是收发器,或,输入/输出接口;所述处理模块可以是处理器。Regarding the seventh aspect mentioned above, the first data acquisition device may be a terminal device, or a component of a terminal device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the terminal device. The transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
在一种实现方式中,第一数据采集装置为配置于终端设备中的芯片、芯片系统或电路。当第一数据采集装置为配置于终端设备中的芯片、芯片系统或电路时,所述收发模块可以是该芯片、芯片系统或电路上的输入/输出接口、接口电路、输出电路、输入电路、管脚或相关电路等;所述处理模块可以是处理器、处理电路或逻辑电路等。In one implementation, the first data acquisition device is a chip, chip system, or circuit configured in the terminal device. When the first data acquisition device is a chip, chip system, or circuit configured in the terminal device, the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit; the processing module may be a processor, processing circuit, or logic circuit.
针对上述第八方面,模型训练管理装置可以是服务器,或者是服务器的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分服务器功能的逻辑模块或软件。所述收发模块可以是收发器,或,输入/输出接口;所述处理模块可以是处理器。Regarding the eighth aspect above, the model training management device can be a server, or a component of a server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the server's functions. The transceiver module can be a transceiver, or an input/output interface; the processing module can be a processor.
在一种实现方式中,模型训练管理装置为配置于服务器中的芯片、芯片系统或电路。当模型训练管理装置为配置于服务器中的芯片、芯片系统或电路时,所述收发模块可以是该芯片、芯片系统或电路上的输入/输出接口、接口电路、输出电路、输入电路、管脚或相关电路等;所述处理模块可以是处理器、处理电路或逻辑电路等。In one implementation, the model training management device is a chip, chip system, or circuit configured in a server. When the model training management device is a chip, chip system, or circuit configured in a server, the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit; the processing module may be a processor, processing circuit, or logic circuit.
针对上述第九方面,第一模型训练装置可以是AI服务器,或者是AI服务器的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分AI服务器功能的逻辑模块或软件。所述收发模块可以是收发器,或,输入/输出接口;所述处理模块可以是处理器。Regarding the ninth aspect mentioned above, the first model training device may be an AI server, or a component of an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server. The transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
在一种实现方式中,第一模型训练装置为配置于AI服务器中的芯片、芯片系统或电路。当第一模型训练装置为配置于AI服务器中的芯片、芯片系统或电路时,所述收发模块可以是该芯片、芯片系统或电路上的输入/输出接口、接口电路、输出电路、输入电路、管脚或相关电路等;所述处理模块可以是处理器、处理电路或逻辑电路等。In one implementation, the first model training device is a chip, chip system, or circuit configured in an AI server. When the first model training device is a chip, chip system, or circuit configured in an AI server, the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit; the processing module may be a processor, processing circuit, or logic circuit.
针对上述第十方面或第十一方面,控制装置可以是网络设备,或者是网络设备的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备功能的逻辑模块或软件。所述收发模块可以是收发器,或,输入/输出接口;所述处理模块可以是处理器。Regarding the tenth or eleventh aspect above, the control device may be a network device, or a component of a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of the network device. The transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
在一种实现方式中,控制装置为配置于网络设备中的芯片、芯片系统或电路。当控制装置为配置于网络设备中的芯片、芯片系统或电路时,所述收发模块可以是该芯片、芯片系统或电路上的输入/输出接口、接口电路、输出电路、输入电路、管脚或相关电路等;所述处理模块可以是处理器、处理电路或逻辑电路等。In one implementation, the control device is a chip, chip system, or circuit configured in the network device. When the control device is a chip, chip system, or circuit configured in the network device, the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit; the processing module may be a processor, processing circuit, or logic circuit.
针对上述第十二方面,模型融合装置可以是网络设备,或者是网络设备的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备的逻辑模块或软件。所述收发模块可以是收发器,或,输入/输出接口;所述处理模块可以是处理器。Regarding the twelfth aspect above, the model fusion apparatus may be a network device, or a component of a network device (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of a network device. The transceiver module may be a transceiver, or an input/output interface; the processing module may be a processor.
在一种实现方式中,模型融合装置为配置于网络设备中的芯片、芯片系统或电路。当模型融合装置为配置于网络设备中的芯片、芯片系统或电路时,所述收发模块可以是该芯片、芯片系统或电路上的输入/输出接口、接口电路、输出电路、输入电路、管脚或相关电路等;所述处理模块可以是处理器、处理电路或逻辑电路等。In one implementation, the model fusion device is a chip, chip system, or circuit configured in a network device. When the model fusion device is a chip, chip system, or circuit configured in a network device, the transceiver module may be an input/output interface, interface circuit, output circuit, input circuit, pin, or related circuit on the chip, chip system, or circuit; the processing module may be a processor, processing circuit, or logic circuit.
本申请第十三方面提供一种装置,该装置包括:处理器和存储器。该存储器中存储有计算机程序或计算机指令,该处理器用于调用并运行该存储器中存储的计算机程序或计算机指令,使得处理器实现如第一方面至第六方面中任一方面中的任意一种实现方式。The thirteenth aspect of this application provides an apparatus comprising a processor and a memory. The memory stores a computer program or computer instructions, and the processor is configured to invoke and execute the computer program or computer instructions stored in the memory, such that the processor implements any one of the implementations of the first to sixth aspects.
可选的,该装置还包括收发器,该处理器用于控制该收发器收发信号。Optionally, the device may also include a transceiver, the processor of which controls the transceiver to transmit and receive signals.
本申请第十四方面提供一种装置,包括处理器和接口电路,所述处理器用于通过接口电路与其它装置通信,并执行上述第一方面至第六方面中任一方面所述的方法。该处理器包括一个或多个。The fourteenth aspect of this application provides an apparatus including a processor and an interface circuit, the processor being configured to communicate with other devices via the interface circuit and to perform the method described in any one of the first to sixth aspects. The processor may include one or more devices.
本申请第十五方面提供一种装置,包括处理器,用于与存储器相连,用于调用所述存储器中存储的程序,以执行上述第一方面至第六方面中任一方面所述的方法。该存储器可以位于该装置之内,也可以位于该装置之外。且该处理器包括一个或多个。The fifteenth aspect of this application provides an apparatus including a processor connected to a memory for invoking a program stored in the memory to perform the method described in any one of the first to sixth aspects. The memory may be located within or outside the apparatus. The processor may include one or more processors.
在一种实现方式中,上述第一方面、第七方面所示的第一数据采集装置可以是芯片或芯片系统。上述第二方面、第八方面所示的模型训练管理装置可以是芯片或芯片系统。上述第三方面、第九方面所示的第一模型训练装置可以是芯片或芯片系统。上述第四方面、第五方面、第十方面、第十一方面所示的控制装置可以是芯片或芯片系统。上述第六方面、第十二方面所示的模型融合装置可以是芯片或芯片系统。In one implementation, the first data acquisition device shown in the first and seventh aspects above can be a chip or a chip system. The model training management device shown in the second and eighth aspects above can be a chip or a chip system. The first model training device shown in the third and ninth aspects above can be a chip or a chip system. The control device shown in the fourth, fifth, tenth, and eleventh aspects above can be a chip or a chip system. The model fusion device shown in the sixth and twelfth aspects above can be a chip or a chip system.
本申请第十六方面提供一种包括计算机指令的计算机程序产品,其特征在于,当其在计算机上运行时,使得该计算机执行如第一方面至第六方面中任一方面中的任一种的实现方式。The sixteenth aspect of this application provides a computer program product including computer instructions, characterized in that, when run on a computer, it causes the computer to perform any of the implementations of any one of the first to sixth aspects.
本申请第十七方面提供一种计算机可读存储介质,包括计算机指令,当该指令在计算机上运行时,使得计算机执行如第一方面至第六方面中任一方面中的任一种实现方式。The seventeenth aspect of this application provides a computer-readable storage medium including computer instructions that, when executed on a computer, cause the computer to perform any of the implementations of any one of the first to sixth aspects.
本申请第十八方面提供一种芯片装置,包括处理器,用于调用存储器中的计算机程序或计算机指令,以使得该处理器执行上述第一方面至第六方面中任一方面中的任一种实现方式。The eighteenth aspect of this application provides a chip device including a processor for calling a computer program or computer instructions in memory to cause the processor to execute any one of the implementations of the first to sixth aspects described above.
可选的,该处理器通过接口与该存储器耦合。Optionally, the processor is coupled to the memory via an interface.
本申请第十九方面提供一种通信系统,该通信系统包括如第一方面所示的第一数据采集装置和如第二方面所示的模型训练管理装置。可选的,该通信系统还包括如第三方面所述的第一模型训练装置。可选的,该通信系统还包括如第四方面或第五方面所示的控制装置和/或如第六方面所示的模型融合装置。The nineteenth aspect of this application provides a communication system comprising a first data acquisition device as shown in the first aspect and a model training management device as shown in the second aspect. Optionally, the communication system further comprises a first model training device as described in the third aspect. Optionally, the communication system further comprises a control device as shown in the fourth or fifth aspect and/or a model fusion device as shown in the sixth aspect.
经由上述技术方案可知,第一数据采集装置向模型训练管理装置发送第一请求。第一请求用于请求为第一数据采集装置分配模型训练信息。然后,第一数据采集装置接收来自模型训练管理装置的第一指示信息。第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。As described in the above technical solution, the first data acquisition device sends a first request to the model training management device. The first request requests the allocation of model training information for the first data acquisition device. Then, the first data acquisition device receives first instruction information from the model training management device. This first instruction information indicates the allocation of first model training information to the first data acquisition device. This enables the first data acquisition device to allocate first model training information, facilitating its determination of how to complete model training based on the first model training information, thus enabling model training based on the device's local data. For example, if the first data acquisition device has weak computing power or lacks computing power, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device completing model training based on this local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the terminal device's local data. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
图1为本申请实施例通信系统的一个示意图;Figure 1 is a schematic diagram of a communication system according to an embodiment of this application;
图2为本申请实施例联邦学习的一个示意图;Figure 2 is a schematic diagram of federated learning in an embodiment of this application;
图3为本申请实施例分割学习的一个示意图;Figure 3 is a schematic diagram of segmentation learning in an embodiment of this application;
图4为本申请实施例去中心式学习的一个示意图;Figure 4 is a schematic diagram of decentralized learning in an embodiment of this application;
图5为本申请实施例信息传输方法的一个实施例示意图;Figure 5 is a schematic diagram of an embodiment of the information transmission method of this application;
图6A为本申请实施例信息传输方法的一个场景示意图;Figure 6A is a schematic diagram of a scenario of the information transmission method according to an embodiment of this application;
图6B为本申请实施例信息传输方法的另一个场景示意图;Figure 6B is a schematic diagram of another scenario of the information transmission method according to an embodiment of this application;
图6C为本申请实施例信息传输方法的再一个场景示意图;Figure 6C is a schematic diagram of another scenario of the information transmission method according to an embodiment of this application;
图7为本申请实施例信息传输方法的另一个实施例示意图;Figure 7 is a schematic diagram of another embodiment of the information transmission method of this application;
图8为本申请实施例信息传输方法的再一个实施例示意图;Figure 8 is a schematic diagram of another embodiment of the information transmission method of this application;
图9为本申请实施例信息传输方法的再一个实施例示意图;Figure 9 is a schematic diagram of another embodiment of the information transmission method of this application;
图10为本申请实施例第一数据采集装置的一个结构示意图;Figure 10 is a structural schematic diagram of a first data acquisition device according to an embodiment of this application;
图11为本申请实施例模型训练管理装置的一个结构示意图;Figure 11 is a schematic diagram of a model training management device according to an embodiment of this application;
图12为本申请实施例第一模型训练装置的一个结构示意图;Figure 12 is a schematic diagram of the structure of a first model training device according to an embodiment of this application;
图13为本申请实施例控制装置的一个结构示意图;Figure 13 is a schematic diagram of a control device according to an embodiment of this application;
图14为本申请实施例模型融合装置的一个结构示意图;Figure 14 is a schematic diagram of a model fusion device according to an embodiment of this application;
图15为本申请实施例装置的一个结构示意图;Figure 15 is a schematic diagram of a device according to an embodiment of this application;
图16为本申请实施例终端设备的一个结构示意图;Figure 16 is a structural schematic diagram of a terminal device according to an embodiment of this application;
图17为本申请实施例网络设备的一个结构示意图。Figure 17 is a schematic diagram of a network device according to an embodiment of this application.
本申请实施例提供了一种信息传输方法以及相关装置,用于实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。This application provides an information transmission method and related apparatus for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training to be completed based on the local data of the first data acquisition device.
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions of the embodiments of this application will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are only a part of the embodiments of this application, and not all of them. All other embodiments obtained by those skilled in the art based on the embodiments of this application without creative effort are within the scope of protection of this application.
在本申请中描述的参考“一个实施例”或“一些实施例”等意味着在本申请的一个或多个实施例中包括结合该实施例描述的特定特征、结构或特点。由此,在本说明书中的不同之处出现的语句“在一个实施例中”、“在一些实施例中”、“在其他一些实施例中”、“在另外一些实施例中”等不是必然都参考相同的实施例,而是意味着“一个或多个但不是所有的实施例”,除非是以其他方式另外特别强调。术语“包括”、“包含”、“具有”及它们的变形都意味着“包括但不限于”,除非是以其他方式另外特别强调。References to "one embodiment" or "some embodiments" as described in this application mean that one or more embodiments of this application include a specific feature, structure, or characteristic described in connection with that embodiment. Therefore, the phrases "in one embodiment," "in some embodiments," "in other embodiments," "in still other embodiments," etc., appearing in different parts of this specification do not necessarily refer to the same embodiment, but rather mean "one or more, but not all, embodiments," unless otherwise specifically emphasized. The terms "comprising," "including," "having," and variations thereof mean "including but not limited to," unless otherwise specifically emphasized.
在本申请的描述中,除非另有说明,“/”表示“或”的意思,例如,A/B可以表示A或B。本文中的“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。此外,“至少一个”是指一个或多个,“多个”是指两个或两个以上。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c;a和b;a和c;b和c;或a和b和c。其中a,b,c可以是单个,也可以是多个。In the description of this application, unless otherwise stated, "/" means "or". For example, A/B can mean A or B. "And/or" in this document is merely a description of the relationship between related objects, indicating that three relationships can exist. For example, A and/or B can represent: A alone, A and B simultaneously, and B alone. Furthermore, "at least one" means one or more, and "multiple" means two or more. "At least one of the following" or similar expressions refer to any combination of these items, including any combination of single or multiple items. For example, at least one of a, b, or c can represent: a, b, c; a and b; a and c; b and c; or a and b and c. Where a, b, and c can be single or multiple.
可以理解,在本申请中,“指示”可以包括直接指示、间接指示、显示指示、隐式指示。当描述某一指示信息用于指示A时,可以理解为该指示信息携带A、直接指示A,或间接指示A。It is understood that in this application, "instruction" can include direct instruction, indirect instruction, explicit instruction, and implicit instruction. When describing a certain instruction information to indicate A, it can be understood that the instruction information carries A, directly indicates A, or indirectly indicates A.
本申请中,指示信息所指示的信息,称为待指示信息。在具体实现过程中,对待指示信息进行指示的方式有很多种,例如但不限于,可以直接指示待指示信息,如待指示信息本身或者该待指示信息的索引等,也可以通过指示其他信息来间接指示待指示信息,其中,该其他信息与待指示信息之间存在关联关系。还可以仅仅指示待指示信息的一部分,而待指示信息的其他部分则是已知的或者提前约定的。例如,还可以借助预先约定(例如协议规定)的各个信息的排列顺序来实现对特定信息的指示,从而在一定程度上降低指示开销。In this application, the information indicated by the instruction information is called the information to be instructed. In specific implementations, there are many ways to instruct the information to be instructed, such as, but not limited to, directly instructing the information to be instructed, such as the information to be instructed itself or its index; indirectly instructing the information to be instructed by instructing other information, where there is a relationship between the other information and the information to be instructed; or instructing only a part of the information to be instructed, while the other parts are known or pre-agreed upon. For example, the instruction of specific information can be achieved by using a pre-agreed (e.g., protocol-defined) arrangement of various pieces of information, thereby reducing instruction overhead to some extent.
待指示信息可以作为一个整体一起发送,也可以分成多个子信息分开发送,而且这些子信息的发送周期和/或发送时机可以相同,也可以不同。具体发送方法本申请不进行限定。其中,这些子信息的发送周期和/或发送时机可以是预先定义的,例如根据协议预先定义的,也可以是发射端设备通过向接收端设备发送配置信息来配置的。The information to be instructed can be sent as a whole or divided into multiple sub-information messages, and the sending period and/or timing of these sub-information messages can be the same or different. This application does not limit the specific sending method. The sending period and/or timing of these sub-information messages can be predefined, for example, according to a protocol, or configured by the transmitting device by sending configuration information to the receiving device.
可以理解,本申请中的“发送”和“接收”,表示信号传递的走向。例如,“向XX发送信息”可以理解为该信息的目的端是XX,可以包括通过空口直接发送,也包括其他单元或模块通过空口间接发送。“接收来自YY的信息”可以理解为该信息的源端是YY,可以包括通过空口直接从YY接收,也可以包括通过空口从其他单元或模块间接地从YY接收。“发送”也可以理解为芯片接口的“输出”,“接收”也可以理解为芯片接口的“输入”。It is understood that "send" and "receive" in this application refer to the direction of signal transmission. For example, "send information to XX" can be understood as the destination of the information being XX, which can include direct transmission via the air interface or indirect transmission via the air interface from other units or modules. "Receive information from YY" can be understood as the source of the information being YY, which can include direct reception from YY via the air interface or indirect reception from YY via the air interface from other units or modules. "Send" can also be understood as the "output" of the chip interface, and "receive" can also be understood as the "input" of the chip interface.
换言之,发送和接收可以是在设备之间进行的,例如,网络设备和终端设备之间进行的,也可以是在设备内进行的,例如,通过总线、走线或接口在设备内的部件之间、模组之间、芯片之间、软件模块或者硬件模块之间发送或接收。In other words, sending and receiving can occur between devices, such as between network devices and terminal devices, or within a device, such as between components, modules, chips, software modules, or hardware modules within the device via buses, wiring, or interfaces.
可以理解的是,信息在信息发送的源端和目的端之间可能会被进行必要的处理,比如编码、调制等,但目的端可以理解来自源端的有效信息。本申请中类似的表述可以做相似的理解,不再赘述。It is understandable that information may undergo necessary processing, such as encoding and modulation, between the source and destination, but the destination can understand the valid information from the source. Similar statements in this application can be interpreted in a similar way and will not be elaborated further.
本申请的技术方案可以应用于第三代合作伙伴计划(3rd generation partnership project,3GPP)相关的蜂窝通信系统。例如,第四代(4th generation,4G)通信系统、第五代(5th generation,5G)通信系统、第五代通信系统之后的通信系统。例如,第六代(6th generation,6G)通信系统。例如,第四代通信系统可以包括长期演进(long term evolution,LTE)通信系统。第五代通信系统可以包括新无线(new radio,NR)通信系统。本申请的技术方案也可以应用于无线保真(wireless fidelity,WiFi)系统,支持多种无线技术融合的通信系统,设备到设备(device-to-device,D2D)系统,或车联网(vehicle to everything,V2X)通信系统等。The technical solution of this application can be applied to cellular communication systems related to the 3rd Generation Partnership Project (3GPP). For example, 4th generation (4G) communication systems, 5th generation (5G) communication systems, and communication systems beyond the 5th generation, such as 6th generation (6G) communication systems. For example, 4th generation communication systems may include Long Term Evolution (LTE) communication systems. 5th generation communication systems may include New Radio (NR) communication systems. The technical solution of this application can also be applied to Wireless Fidelity (WiFi) systems, communication systems supporting the convergence of multiple wireless technologies, device-to-device (D2D) systems, or vehicle-to-everything (V2X) communication systems, etc.
本申请的技术方案适用的通信系统包括第一数据采集装置、第一模型训练装置和模型训练管理装置。The communication system to which the technical solution of this application applies includes a first data acquisition device, a first model training device, and a model training management device.
第一数据采集装置具有数据采集功能。例如,第一数据采集装置采集本地数据。然后,第一数据采集装置可以向第一模型训练装置发送本地数据。该本地数据用于模型训练。可选的,第一数据采集装置可以是终端设备,或者是终端设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分终端设备功能的逻辑模块或软件。The first data acquisition device has a data acquisition function. For example, the first data acquisition device acquires local data. Then, the first data acquisition device can send the local data to the first model training device. This local data is used for model training. Optionally, the first data acquisition device can be a terminal device, or a component in the terminal device (e.g., a processor, chip, or chip system), or a logic module or software that can implement all or part of the functions of the terminal device.
第一模型训练装置具有模型训练功能。第一模型训练装置接收来自第一数据采集装置的本地数据,并基于本地数据进行模型训练。可选的,第一模型训练装置可以向第一数据采集装置发送训练得到的模型。可选的,第一模型训练装置可以是AI服务器,或者是AI服务器中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分AI服务器功能的逻辑模块或软件。The first model training device has a model training function. It receives local data from the first data acquisition device and trains the model based on the local data. Optionally, the first model training device can send the trained model to the first data acquisition device. Optionally, the first model training device can be an AI server, or a component within an AI server (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of an AI server.
可选的,第一数据采集装置和第一模型训练装置可以共同组成分布式学习中的一个分布式节点,但属于不同实体装置。例如,如图1所示,第一数据采集装置可以为终端设备101,第一模型训练装置可以为AI服务器103。第一模型训练装置可以理解为终端厂商服务器。换句话说,在分布式学习中,分布式节点的数据采集(data collecting,DC)模块部署在终端设备,分布式节点的模型训练(model training,MT)模块部署在终端厂商服务器。实现终端设备通过数据采集模块采集数据,并向模型训练模块提供数据。终端厂商服务器通过模型训练模块基于数据进行模型训练。Optionally, the first data acquisition device and the first model training device can jointly form a distributed node in distributed learning, but they are different physical devices. For example, as shown in Figure 1, the first data acquisition device can be a terminal device 101, and the first model training device can be an AI server 103. The first model training device can be understood as the terminal manufacturer's server. In other words, in distributed learning, the data collection (DC) module of the distributed node is deployed on the terminal device, and the model training (MT) module of the distributed node is deployed on the terminal manufacturer's server. This enables the terminal device to collect data through the data acquisition module and provide the data to the model training module. The terminal manufacturer's server then uses the model training module to train the model based on the data.
模型训练管理装置用于对模型训练装置进行管理、为数据采集装置分配模型训练装置。例如,模型训练管理装置可以为第一数据采集装置分配第一模型训练装置。从而实现第一模型训练装置通过第一数据采集装置提供的本地数据进行模型训练。实现第一模型训练装置为第一数据采集装置提供模型训练功能。可选的,模型训练管理装置可以是模型训练管理服务器或核心网网元,或者是模型训练管理服务器或核心网网元中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分模型训练管理服务器或核心网网元功能的逻辑模块或软件。The model training management device is used to manage model training devices and allocate model training devices to data acquisition devices. For example, the model training management device can allocate a first model training device to a first data acquisition device. This enables the first model training device to perform model training using local data provided by the first data acquisition device. It also enables the first model training device to provide model training functionality to the first data acquisition device. Optionally, the model training management device can be a model training management server or a core network element, or a component within a model training management server or core network element (e.g., a processor, chip, or chip system), or a logic module or software capable of implementing all or part of the functions of a model training management server or core network element.
可选的,通信系统还包括模型融合装置和/或控制装置。在联邦学习中,模型融合装置可以为中心节点,该模型融合装置具备以下功能:融合模型训练装置发送的模型,并向数据采集装置和/或模型训练装置发送融合后的模型。控制装置可以用于调度数据采集装置。可选的,模型融合装置和控制装置均可以是网络设备,或者是网络设备中的组件(例如,处理器、芯片、或芯片系统等),或者是由能实现全部或部分网络设备功能的逻辑模块或软件。例如,如图1所示,模型融合装置和控制装置均为网络设备102。可选的,模型融合装置和控制装置可以共同组成分布式学习中的中心节点;或者,模型融合装置为分布式学习中的中心节点,而控制装置为分布式学习中的控制节点。可选的,模型融合装置和控制装置可以是不同的两个实体装置。Optionally, the communication system also includes a model fusion device and/or a control device. In federated learning, the model fusion device can be a central node, which has the following functions: fusing the models sent by the model training device and sending the fused model to the data acquisition device and/or the model training device. The control device can be used to schedule the data acquisition device. Optionally, both the model fusion device and the control device can be network devices, or components within network devices (e.g., processors, chips, or chip systems), or logical modules or software capable of implementing all or part of the functions of network devices. For example, as shown in Figure 1, both the model fusion device and the control device are network devices 102. Optionally, the model fusion device and the control device can jointly form a central node in distributed learning; or, the model fusion device can be the central node in distributed learning, while the control device can be the control node in distributed learning. Optionally, the model fusion device and the control device can be two different physical devices.
可选的,通信系统还包括第二数据采集装置和第二模型训练装置。第二数据采集装置的功能与第一数据采集装置的功能类似,具体可以参阅前述第一数据采集装置的相关介绍,这里不再赘述。第二模型训练装置的功能与第一模型训练装置的功能类似,具体可以参阅前述第一模型训练装置的相关介绍,这里不再赘述。例如,如图1所示,第二数据采集装置为终端设备104,第二模型训练装置为AI服务器105。Optionally, the communication system also includes a second data acquisition device and a second model training device. The function of the second data acquisition device is similar to that of the first data acquisition device; for details, please refer to the aforementioned description of the first data acquisition device, which will not be repeated here. The function of the second model training device is similar to that of the first model training device; for details, please refer to the aforementioned description of the first model training device, which will not be repeated here. For example, as shown in Figure 1, the second data acquisition device is a terminal device 104, and the second model training device is an AI server 105.
需要说明的是,上述通信系统仅仅是一种示例。实际应用中,该通信系统包括模型训练管理装置、至少一个数据采集装置和至少一个模型训练装置,具体本申请不做限定。即图1所示的通信系统仅仅是一种示例。该通信系统包括至少一个终端设备和至少一个AI服务器。可选的,该通信系统包括至少一个网络设备。It should be noted that the above communication system is merely an example. In practical applications, the communication system includes a model training management device, at least one data acquisition device, and at least one model training device; the specific details are not limited in this application. That is, the communication system shown in Figure 1 is just an example. The communication system includes at least one terminal device and at least one AI server. Optionally, the communication system includes at least one network device.
本申请中,通信系统中,数据采集装置的数量和模型训练装置的数量可以相同,也可以不相同。通常情况下,数据采集装置的数量多于模型训练装置的数量。例如,数据采集装置为终端设备,模型训练装置为终端厂商服务器。通信系统中终端设备的数量多于终端厂商服务器。In this application, the number of data acquisition devices and the number of model training devices in the communication system may be the same or different. Typically, the number of data acquisition devices exceeds the number of model training devices. For example, the data acquisition devices may be terminal equipment, and the model training devices may be server provided by the terminal manufacturer. The number of terminal equipment in the communication system may exceed the number of server provided by the terminal manufacturer.
本申请中,数据采集装置具备数据采集功能。数据采集装置也可以称为数据获取装置,数据收集装置,数据获取节点,数据收集装置,或数据采集节点等,具体本申请对数据采集装置的名称不做限定。模型训练装置具备模型训练功能。模型训练装置也可以称为本地训练装置,训练装置,本地训练节点,或训练节点等,具体本申请对模型训练装置的名称不做限定。控制装置可以用于调度数据采集装置。模型融合装置具备以下功能:融合模型训练装置发送的模型并向数据采集装置和/或模型训练装置发送融合后的模型。控制装置和模型融合装置分别也可以称为中心装置,中心控制节点,中心控制装置,或控制中心等,具体本申请对控制装置的名称不做限定。In this application, the data acquisition device has a data acquisition function. The data acquisition device can also be called a data acquisition device, data collection device, data acquisition node, data collection device, or data acquisition node, etc., and this application does not limit the specific name of the data acquisition device. The model training device has a model training function. The model training device can also be called a local training device, training device, local training node, or training node, etc., and this application does not limit the specific name of the model training device. The control device can be used to schedule the data acquisition device. The model fusion device has the following functions: fusing the model sent by the model training device and sending the fused model to the data acquisition device and/or the model training device. The control device and the model fusion device can also be called a central device, central control node, central control device, or control center, etc., and this application does not limit the specific name of the control device.
下面介绍本申请涉及的终端设备、网络设备和AI服务器。The following describes the terminal equipment, network equipment, and AI server involved in this application.
终端设备可以是能够接收网络设备的调度信息和指示信息的无线终端设备。无线终端设备可以是指向用户提供语音和/或数据连通性的设备,或具有无线连接功能的手持式设备,或连接到无线调制解调器的其他处理设备。The terminal device can be a wireless terminal device capable of receiving scheduling and instruction information from network devices. The wireless terminal device can be a device that provides voice and/or data connectivity to the user, a handheld device with wireless connectivity, or other processing devices connected to a wireless modem.
终端设备可以经接入网与一个或多个核心网或者互联网进行通信。终端设备可以是移动终端设备,如移动电话(或称为“蜂窝”电话,手机(mobile phone))、计算机和数据卡,例如,可以是便携式、袖珍式、手持式、计算机内置的或者车载的移动装置,它们与无线接入网交换语音和/或数据。例如,个人通信业务(personal communication service,PCS)电话、无绳电话、会话发起协议话机、无线本地环路(wireless local loop,WLL)站、个人数字助理(personal digital assistant,PDA)、平板电脑(Pad)、带无线收发功能的电脑等设备。无线终端设备也可以称为系统、订户单元(subscriber unit)、订户站(subscriber station),移动站(mobile station)、移动台(mobile station,MS)、远程站(remote station)、接入点(access point,AP)、远程终端设备(remote terminal)、接入终端设备(access terminal)、用户终端设备(user terminal)、用户代理(user agent)、用户站(subscriber station,SS)、用户端设备(customer premises equipment,CPE)、终端(terminal)、用户设备(user equipment,UE)、移动终端(mobile terminal,MT)等。Terminal devices can communicate with one or more core networks or the Internet via an access network. Terminal devices can be mobile terminal devices, such as mobile phones (or "cellular" phones), computers, and data cards. For example, they can be portable, pocket-sized, handheld, computer-embedded, or vehicle-mounted mobile devices that exchange voice and/or data with the radio access network. Examples include personal communication service (PCS) phones, cordless phones, session initiation protocol phones, wireless local loop (WLL) stations, personal digital assistants (PDAs), tablets, and computers with wireless transceiver capabilities. Wireless terminal equipment can also be referred to as a system, subscriber unit, subscriber station, mobile station, mobile station (MS), remote station, access point (AP), remote terminal, access terminal, user terminal, user agent, subscriber station (SS), customer premises equipment (CPE), terminal, user equipment (UE), mobile terminal (MT), etc.
作为示例而非限定,在本申请实施例中,该终端设备还可以是可穿戴设备。可穿戴设备也可以称为穿戴式智能设备或智能穿戴式设备等,是应用穿戴式技术对日常穿戴进行智能化设计、开发出可以穿戴的设备的总称。例如,眼镜、手套、手表、服饰及鞋等。可穿戴设备即直接穿在身上,或是整合到用户的衣服或配件的一种便携式设备。可穿戴设备不仅仅是一种硬件设备,更是通过软件支持以及数据交互、云端交互来实现强大的功能。广义穿戴式智能设备包括功能全、尺寸大、可不依赖智能手机实现完整或者部分的功能,例如:智能手表或智能眼镜等,以及只专注于某一类应用功能,需要和其它设备如智能手机配合使用,如各类进行体征监测的智能手环、智能头盔、智能首饰等。By way of example and not limitation, in this embodiment, the terminal device can also be a wearable device. Wearable devices, also known as wearable smart devices or smart wearable devices, are a general term for devices that utilize wearable technology to intelligently design and develop everyday wearables. Examples include glasses, gloves, watches, clothing, and shoes. Wearable devices are portable devices that are worn directly on the body or integrated into the user's clothing or accessories. Wearable devices are not merely hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction. Broadly speaking, wearable smart devices include those that are feature-rich, large in size, and can achieve complete or partial functions without relying on a smartphone, such as smartwatches or smart glasses, as well as those that focus on a specific type of application function and require the use of other devices such as smartphones, such as various smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
终端设备还可以是无人机、机器人、设备到设备通信(device-to-device,D2D)中的终端设备、车到一切(vehicle to everything,V2X)中的终端设备、虚拟现实(virtual reality,VR)终端设备、增强现实(augmented reality,AR)终端设备、工业控制(industrial control)中的无线终端设备、无人驾驶(self driving)中的无线终端设备、远程医疗(remote medical)中的无线终端设备、智能电网(smart grid)中的无线终端、运输安全(transportation safety)中的无线终端设备、智慧城市(smart city)中的无线终端设备、智慧家庭(smart home)中的无线终端设备等。Terminal devices can also be drones, robots, device-to-device (D2D) communication devices, vehicle-to-everything (V2X) devices, virtual reality (VR) devices, augmented reality (AR) devices, wireless devices in industrial control, wireless devices in self-driving, wireless devices in remote medical care, wireless devices in smart grids, wireless devices in transportation safety, wireless devices in smart cities, and wireless devices in smart homes, etc.
此外,终端设备也可以是第五代(5th generation,5G)通信系统之后演进的通信系统(例如第六代(6th generation,6G)通信系统等)中的终端设备或者未来演进的公共陆地移动网络(public land mobile network,PLMN)中的终端设备等。示例性的,6G通信系统可以进一步扩展5G通信终端的形态和功能,6G终端包括但不限于车、蜂窝网络终端(融合卫星终端功能)、无人机、或物联网(internet of things,IoT)设备。Furthermore, terminal devices can also be terminal devices in communication systems evolved from fifth-generation (5G) communication systems (such as sixth-generation (6G) communication systems) or in future public land mobile networks (PLMNs). For example, 6G communication systems can further expand the form and function of 5G communication terminals; 6G terminals include, but are not limited to, vehicles, cellular network terminals (integrating satellite terminal functions), drones, or Internet of Things (IoT) devices.
在本申请实施例中,终端设备具有人工智能(artificial intelligence,AI)能力。例如,终端设备可以获得网络设备或服务器提供的AI服务。终端设备还具有AI处理能力。In this embodiment, the terminal device has artificial intelligence (AI) capabilities. For example, the terminal device can obtain AI services provided by network devices or servers. The terminal device also has AI processing capabilities.
需要说明的是,终端设备可以是带有芯片的设备或装置,或者集成有电路的设备或装置,或者是上述示出的设备或装置中的芯片、模块或控制单元,具体本申请不做限定。It should be noted that the terminal device may be a device or apparatus with a chip, or a device or apparatus with integrated circuitry, or a chip, module or control unit in the device or apparatus shown above. This application does not limit the specific device.
网络设备可以是无线网络中的设备。例如,网络设备可以为将终端设备接入到无线网络的接入网节点(或接入网设备),又可以称为基站。目前,一些接入网设备的举例为:5G通信系统中的基站(gNodeB,gNB)、传输接收点(transmission reception point,TRP)、演进型节点B(evolved Node B,eNB)、无线网络控制器(radio network controller,RNC)、节点B(Node B,NB)、家庭基站(例如,home evolved Node B,或home Node B,HNB)、基带单元(base band unit,BBU),或无线保真(wireless fidelity,Wi-Fi)接入点AP等。另外,在一种网络结构中,网络设备可以包括集中单元(centralized unit,CU)节点、分布单元(distributed unit,DU)节点、CU-控制面(control plane,CP)、CU-用户面(user plane,UP)、或者无线单元(radio unit,RU),或包括CU节点和DU节点的RAN设备。CU和DU可以是单独设置,或者也可以包括在同一个网元中,例如基带单元(baseband unit,BBU)中。RU可以包括在射频设备或者射频单元中,例如包括在射频拉远单元(remote radio unit,RRU)、有源天线处理单元(active antenna unit,AAU)或远程射频头(remote radio head,RRH)中。在不同系统中,CU(或CU-CP和CU-UP)、DU或RU也可以有不同的名称,但是本领域的技术人员可以理解其含义。例如,在开放式接入网(open RAN,ORAN)系统中,CU也可以称为开放式CU(O-CU),DU也可以称为开放式DU(O-DU),CU-CP也可以称为开放式CU-CP(O-CU-CP),CU-UP也可以称为开放式CU-UP(O-CU-UP),RU也可以称为开放式RU(O-RU)。其中,CU(或CU-CP、CU-UP)、DU和RU中的任一单元,可以是通过软件模块、硬件模块、或者软件模块与硬件模块结合来实现。Network devices can be devices within a wireless network. For example, a network device can be an access network node (or access network equipment) that connects terminal devices to a wireless network, also known as a base station. Currently, some examples of access network equipment include: base stations (gNodeB, gNB), transmission reception points (TRP), evolved Node B (eNB), radio network controllers (RNC), Node Bs (NB), home base stations (e.g., home evolved Node B, or home Node B, HNB), base band units (BBU), or Wi-Fi access points (APs) in 5G communication systems. In another network architecture, network devices may include centralized unit (CU) nodes, distributed unit (DU) nodes, CU-control plane (CP), CU-user plane (UP), or radio unit (RU), or RAN equipment including CU and DU nodes. CU and DU may be separate entities or included in the same network element, such as a baseband unit (BBU). RU may be included in radio equipment or radio units, such as remote radio units (RRU), active antenna units (AAU), or remote radio heads (RRH). In different systems, CU (or CU-CP and CU-UP), DU, or RU may have different names, but their meanings will be understood by those skilled in the art. For example, in an Open RAN (ORAN) system, a CU can also be called an Open CU (O-CU), a DU can also be called an Open DU (O-DU), a CU-CP can also be called an Open CU-CP (O-CU-CP), a CU-UP can also be called an Open CU-UP (O-CU-UP), and a RU can also be called an Open RU (O-RU). Any of the units among the CU (or CU-CP, CU-UP), DU, and RU can be implemented through software modules, hardware modules, or a combination of software and hardware modules.
网络设备可以是其它为终端设备提供无线通信功能的装置。本申请的实施例对网络设备所采用的具体技术和具体设备形态不做限定。为方便描述,本申请实施例并不限定。Network devices can be other devices that provide wireless communication functions for terminal devices. The embodiments of this application do not limit the specific technology or form of the network device. For ease of description, the embodiments of this application are not limited.
网络设备还可以包括核心网设备,核心网设备例如包括第四代(4th generation,4G)网络中的移动性管理实体(mobility management entity,MME),归属用户服务器(home subscriber server,HSS),服务网关(serving gateway,S-GW),策略和计费规则功能(policy and charging rules function,PCRF),公共数据网网关(public data network gateway,PDN gateway,P-GW);5G网络中的访问和移动管理功能(access and mobility management function,AMF)、用户面功能(user plane function,UPF)或会话管理功能(session management function,SMF)等网元。此外,该核心网设备还可以包括5G网络以及5G网络的下一代网络(如,6G网络)中的其他核心网设备。Network equipment may also include core network equipment, such as the Mobility Management Entity (MME), Home Subscriber Server (HSS), Serving Gateway (S-GW), Policy and Charging Rules Function (PCRF), and Public Data Network Gateway (PDN Gateway) in 4G networks; and access and mobility management function (AMF), user plane function (UPF), or session management function (SMF) in 5G networks. Furthermore, the core network equipment may also include other core network equipment in 5G networks and next-generation networks (e.g., 6G networks).
本申请实施例中,上述网络设备还可以具有AI能力的网络节点,可以为终端设备或其他网络设备提供AI服务。例如,网络设备可以为网络侧(接入网或核心网)的AI节点、算力节点、具有AI能力的接入网节点、或具有AI能力的核心网网元等。In this embodiment, the network device can also be a network node with AI capabilities, which can provide AI services to terminal devices or other network devices. For example, the network device can be an AI node, computing power node, access network node with AI capabilities, or core network element with AI capabilities on the network side (access network or core network).
需要说明的是,网络设备可以是上述示出的设备或装置,也可以是上述示出的设备或装置中的部件(例如,芯片)、模块、或单元,具体本申请不做限定。It should be noted that the network device can be the device or apparatus shown above, or a component (e.g., a chip), module, or unit in the device or apparatus shown above; this application does not limit the specifics.
AI服务器是具备AI功能的设备。例如,AI服务器能够基于数据进行模型训练,还可以对模型进行管理等。An AI server is a device equipped with AI capabilities. For example, an AI server can train models based on data and manage models.
分布式学习可以在充分保证用户数据隐私和安全的前提下完成AI模型的学习任务。分布式学习主要包括联邦学习、分割学习和去中心式学习。下面分别介绍联邦学习、分割学习和去中心式学习。Distributed learning can complete the learning tasks of AI models while fully ensuring user data privacy and security. Distributed learning mainly includes federated learning, partitioning learning, and decentralized learning. These three methods will be introduced below.
一、联邦学习。I. Federal Learning.
联邦学习是一种典型的分布式学习方法,在充分保障用户数据隐私和安全的前提下,通过促使各个分布式节点和中心节点协同来高效地完成模型的学习任务。如图2所示,通信系统包括中心节点和一个或多个分布式节点。例如,如图2所示,通信系统包括分布式节点n、分布式节点k和分布式节点m。各个分布式节点收集本地的数据集,并进行模型的本地训练得到本地参数。然后,该分布式节点向中心节点发送本地参数。中心节点本身没有数据集,中心节点收集多个分布式节点上报的本地参数,并通过多个分布式节点上报的本地参数进行融合处理得到全局参数,并下发给各个分布式节点。从而实现对模型的训练和学习。Federated learning is a typical distributed learning method that efficiently completes model learning tasks by facilitating collaboration between distributed nodes and a central node, while fully protecting user data privacy and security. As shown in Figure 2, the communication system includes a central node and one or more distributed nodes. For example, as shown in Figure 2, the communication system includes distributed node n, distributed node k, and distributed node m. Each distributed node collects its local dataset and performs local training on the model to obtain local parameters. Then, the distributed node sends its local parameters to the central node. The central node itself does not have a dataset; it collects the local parameters reported by multiple distributed nodes, fuses these parameters to obtain global parameters, and then distributes them to the distributed nodes. This enables the training and learning of the model.
二、分割学习。II. Segmented learning.
分割学习中,完整的神经网络模型被分割为多部分。例如,以两部分为例,即神经网络模型被分割为两个子网络。一部分部署在分布式节点上,另外一部分部署中心节点上。完整的神经网络模型被分割的地方可以称为分割层。In segmentation learning, the complete neural network model is divided into multiple parts. For example, taking a two-part model, the neural network model is divided into two sub-networks. One part is deployed on distributed nodes, and the other part is deployed on a central node. The part where the complete neural network model is segmented can be called a segmentation layer.
如图3所示,在模型的前向推理时,分布式节点1将本地数据输入本地的子网络,并推理到分割层得到分割层输出的结果F1。分布式节点1通过通信链路向中心节点发送该F1。中心节点将收到的F1输入自身部署的另一个子网络,并继续进行前向推理得到最终的推理结果。在模型训练的梯度反向传递中,中心节点通过其部署的另一个子网络进行反向传递到分割层,得到反向传递结果G1。然后,中心节点向分布式节点1发送G1。分布式节点1基于G1通过其部署的一个子网络继续进行梯度反向传递。对于其他分布式节点与中心节点之间的模型前向推理和梯度反向传递同样类似。As shown in Figure 3, during forward inference of the model, distributed node 1 inputs local data into its local sub-network and infers it to the segmentation layer to obtain the segmentation layer's output result F1. Distributed node 1 sends this F1 to the central node via the communication link. The central node inputs the received F1 into another sub-network it has deployed and continues forward inference to obtain the final inference result. In gradient backpropagation during model training, the central node performs backpropagation to the segmentation layer through another sub-network it has deployed, obtaining the backpropagation result G1. Then, the central node sends G1 to distributed node 1. Distributed node 1 continues gradient backpropagation based on G1 through one of its deployed sub-networks. The forward inference and gradient backpropagation between other distributed nodes and the central node are similar.
可以看到,分割学习的前向推理过程和梯度反向梯度传递过程,只涉及一个分布式节点和一个中心节点。分布式节点上训练好的子网络可以保存在分布式节点本地或特定的模型存储服务器上。当有新的分布式节点加入分割学习,该新的分布式节点可以下载训练好的子网络,再使用该新的分布式节点的本地数据对该子网络进行进一步训练。As can be seen, the forward inference and backward gradient propagation processes of segmentation learning involve only one distributed node and one central node. The subnetworks trained on the distributed nodes can be stored locally on the distributed nodes or on a specific model storage server. When a new distributed node joins the segmentation learning process, it can download the trained subnetwork and then use its local data to further train that subnetwork.
三、去中心式学习。3. Decentralized learning.
与联邦学习不同,去中心式学习是没有中心节点的学习方式。如图4所示,去中心式学习的设计目标f(x)一般是各个节点得到的本地目标fi(x)的均值,即其中,n是分布式节点的数量,x是待优化参数。在机器学习中,x是机器学习(如神经网络)模型的参数。各个节点利用本地数据和本地目标fi(x)计算本地梯度并将该本地梯度发送给其邻居节点。任一节点接收到其邻居节点的本地梯度后,可以按照下述公式1更新本地模型的参数x。从而通过节点间的信息交互完成模型的学习任务。
Unlike federated learning, decentralized learning is a learning method without a central node. As shown in Figure 4, the design objective f(x) of decentralized learning is generally the average of the local objective fi (x) obtained by each node, i.e. Here, n is the number of distributed nodes, and x is the parameter to be optimized. In machine learning, x is the parameter of the machine learning model (such as a neural network). Each node uses its local data and local objective f <sub>i</sub> (x) to calculate its local gradient. and the local gradient The gradient is sent to its neighboring nodes. Upon receiving the local gradient from its neighboring nodes, any node can update the parameters x of its local model according to Formula 1 below. Thus, the model learning task is completed through information exchange between nodes.
其中,Ni是节点i的邻居节点集合,|Ni|表示节点i的邻居节点集合中的元素数量,即节点i的邻居节点数量,αk为第k轮训练的融合权重。是节点i在第k+1轮训练得到的模型参数。是节点i在第k轮训练得到的模型参数。是邻居节点集合中邻居节点j在第k轮训练得到的模型参数。k为大于或等于1的整数。Where N <sub>i </sub> is the set of neighboring nodes of node i, |N<sub>i</sub> | represents the number of elements in the set of neighboring nodes of node i, i.e. the number of neighboring nodes of node i, and α <sub>k</sub> is the fusion weight of the k-th training round. These are the model parameters obtained by node i in the (k+1)th round of training. These are the model parameters obtained by node i in the kth round of training. These are the model parameters obtained from neighbor node j in the k-th round of training. k is an integer greater than or equal to 1.
目前,分布式节点(例如,终端设备)具备足够的计算能力。分布式节点基于本地采集的数据进行模型训练以获得本地模型,并将向其他节点(例如,中心节点或其他分布式节点)传输该本地模型。但是,如果分布式节点的计算能力有限,但该分布式节点具有数据隐私保护需求。因此,从数据隐私保护需求的角度考虑,分布式节点无法将本地数据直接发送到中心节点或其他具有计算能力的分布式节点。导致该分布式节点无法进行模型训练。因此如何实现分布式学习以完成模型训练任务,是值得考虑的问题。本申请提供了相应的技术方案,用于实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。从而解决第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下无法进行模型训练的问题。具体请参阅后文的实施例的相关介绍。Currently, distributed nodes (e.g., terminal devices) possess sufficient computing power. Distributed nodes train models based on locally collected data to obtain local models, which are then transmitted to other nodes (e.g., central nodes or other distributed nodes). However, if a distributed node has limited computing power but requires data privacy protection, it cannot directly send local data to a central node or other distributed nodes with computing power, thus preventing model training. Therefore, how to achieve distributed learning to complete the model training task is a problem worth considering. This application provides a corresponding technical solution for allocating first model training information to a first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, enabling model training based on the local data of the first data acquisition device. This solves the problem of being unable to perform model training when the first data acquisition device has weak or no computing power. Please refer to the following description of the embodiments for details.
下面结合具体实施例介绍本申请的技术方案。The technical solution of this application is described below with reference to specific embodiments.
图5为本申请实施例信息传输方法的一个实施例示意图。请参阅图5,方法包括:Figure 5 is a schematic diagram of an embodiment of the information transmission method of this application. Referring to Figure 5, the method includes:
501、第一数据采集装置向模型训练管理装置发送第一请求。相应的,模型训练管理装置接收来自第一数据采集装置的第一请求。501. The first data acquisition device sends a first request to the model training management device. Correspondingly, the model training management device receives the first request from the first data acquisition device.
其中,第一请求用于请求为第一数据采集装置分配模型训练信息。可选的,第一请求包括第一数据采集装置的任务信息。例如,任务信息包括以下至少一项:任务类型、用于执行任务所需的算力、计算精度、或计算能效。具体的,第一请求可以用于请求为第一数据采集装置分配模型训练装置和/或模型训练资源。The first request is used to request the allocation of model training information for the first data acquisition device. Optionally, the first request includes task information for the first data acquisition device. For example, the task information includes at least one of the following: task type, computing power required to perform the task, computing accuracy, or computing energy efficiency. Specifically, the first request can be used to request the allocation of model training equipment and/or model training resources for the first data acquisition device.
502、模型训练管理装置向第一数据采集装置发送第一指示信息。相应的,第一数据采集装置接收来自模型训练管理装置的第一指示信息。502. The model training management device sends a first instruction message to the first data acquisition device. Correspondingly, the first data acquisition device receives the first instruction message from the model training management device.
其中,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。可选的,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。第一模型训练资源包括以下至少一项:第一计算资源,或第一通信资源。具体的,模型训练管理装置根据第一请求为第一数据采集装置分配第一模型训练装置和/或第一模型训练资源。The first instruction information is used to indicate the first model training information allocated to the first data acquisition device. Optionally, the first model training information includes information about the first model training device and/or, first model training resources. The first model training resources include at least one of the following: first computing resources or first communication resources. Specifically, the model training management device allocates the first model training device and/or the first model training resources to the first data acquisition device according to the first request.
可选的,第一指示信息包括第一标识。第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。下面介绍第一标识的一些可能的形式。Optionally, the first indication information includes a first identifier. The first identifier is used to indicate that the first model training device is a model training device that serves as the first data acquisition device. Some possible forms of the first identifier are described below.
一、第一标识为第一数据采集装置的标识。I. The first identifier is the identifier of the first data acquisition device.
在该实现方式中,第一数据采集装置的标识是模型训练管理装置为第一数据采集装置分配的,模型训练管理装置确定第一数据采集装置和第一模型训练装置配对,之后将第一数据采集装置的标识分别发给第一数据采集装置和第一模型训练装置,从而第一模型训练装置可以知道与自己配对的是第一数据采集装置。当第一数据采集装置发送数据时,第一数据采集装置的标识可以携带于第一数据采集装置的数据。转发节点可以根据获知的配对关系(获知方式不做限定,可以通过模型训练管理装置获知),将携带有第一数据采集装置的标识的数据转发到第一模型训练装置;也可以在第一数据采集装置采用广播方式发送第一数据采集装置的数据的情况下,各个模型训练装置收到该数据后,根据其中携带的第一数据采集装置的标识来确定是否是自己配对的数据采集装置,从而确定是否基于该数据进行模型训练。因此,第一模型训练装置可以基于第一数据采集装置的该数据进行模型训练。In this implementation, the identifier of the first data acquisition device is assigned by the model training management device. The model training management device determines that the first data acquisition device and the first model training device are paired, and then sends the identifier of the first data acquisition device to both the first data acquisition device and the first model training device, so that the first model training device can know that it is paired with the first data acquisition device. When the first data acquisition device sends data, the identifier of the first data acquisition device can be carried in the data of the first data acquisition device. Forwarding nodes can forward the data carrying the identifier of the first data acquisition device to the first model training device according to the known pairing relationship (the method of knowing is not limited, it can be known through the model training management device); or, if the first data acquisition device sends its data in a broadcast manner, each model training device, after receiving the data, determines whether it is its paired data acquisition device based on the identifier of the first data acquisition device carried in it, and thus determines whether to perform model training based on the data. Therefore, the first model training device can perform model training based on the data of the first data acquisition device.
二、第一标识为第一模型训练装置的标识。II. The first identifier is the identifier of the first model training device.
在该实现方式中,第一模型训练装置的标识是模型训练管理装置为第一模型训练装置分配的,模型训练管理装置确定第一数据采集装置和第一模型训练装置配对,之后将第一模型训练装置的标识分别发给第一数据采集装置和第一模型训练装置,从而第一数据采集装置可以知道与自己配对的是第一模型训练装置。当第一数据采集装置发送数据时,第一模型训练装置的标识可以携带于第一数据采集装置的数据。第一模型训练装置根据该数据中携带的第一模型训练装置的标识(也就是其自身的标识)确定该数据来自配对的第一数据采集装置,也就是确定需要基于该数据进行模型训练。因此,第一模型训练装置可以基于第一数据采集装置的该数据进行模型训练。In this implementation, the identifier of the first model training device is assigned by the model training management device. The model training management device determines that the first data acquisition device and the first model training device are paired, and then sends the identifier of the first model training device to both the first data acquisition device and the first model training device. Thus, the first data acquisition device knows that it is paired with the first model training device. When the first data acquisition device sends data, the identifier of the first model training device can be carried within the data. The first model training device determines that the data comes from the paired first data acquisition device based on the identifier of the first model training device carried in the data (i.e., its own identifier), thus determining that model training needs to be performed based on this data. Therefore, the first model training device can perform model training based on the data from the first data acquisition device.
三、第一标识为配对标识。该配对标识用于指示第一数据采集装置与第一模型训练装置配对。可选的,配对标识是根据第一数据采集装置的标识和/或第一模型训练装置的标识生成的。Third, the first identifier is a pairing identifier. This pairing identifier is used to indicate that the first data acquisition device is paired with the first model training device. Optionally, the pairing identifier is generated based on the identifier of the first data acquisition device and/or the identifier of the first model training device.
在该实现方式中,模型训练管理装置确定第一数据采集装置和第一模型训练装置配对,之后将配对标识分别发给第一数据采集装置和第一模型训练装置。当第一数据采集装置发送数据时,配对标识可以携带于第一数据采集装置的数据中。第一模型训练装置确定该数据中携带的配对标识是第一数据采集装置与第一模型训练装置之间的配对标识。因此,第一模型训练装置可以基于该数据进行模型训练。In this implementation, the model training management device determines that the first data acquisition device and the first model training device are paired, and then sends a pairing identifier to both the first data acquisition device and the first model training device. When the first data acquisition device sends data, the pairing identifier can be carried in the data of the first data acquisition device. The first model training device determines that the pairing identifier carried in the data is the pairing identifier between the first data acquisition device and the first model training device. Therefore, the first model training device can perform model training based on this data.
需要说明的是,上述步骤502是模型训练管理装置为第一数据采集装置分配第一模型训练信息为例介绍本申请的技术方案。实际应用中,第一模型训练信息也可以预配置于第一数据采集装置中,具体本申请不做限定。例如,终端厂商离线预配置该第一模型训练信息于第一数据采集装置中。It should be noted that step 502 above describes the technical solution of this application by using the example of the model training management device allocating the first model training information to the first data acquisition device. In practical applications, the first model training information can also be pre-configured in the first data acquisition device, and this application does not limit this. For example, the terminal manufacturer may pre-configure the first model training information offline in the first data acquisition device.
可选的,图5所示的实施例还包括步骤501a。步骤501a可以在步骤502之前执行。Optionally, the embodiment shown in FIG5 further includes step 501a. Step 501a may be performed before step 502.
501a、第一模型训练装置向模型训练管理装置发送注册请求。相应的,模型训练管理装置接收来自第一模型训练装置的注册请求。501a. The first model training device sends a registration request to the model training management device. Correspondingly, the model training management device receives the registration request from the first model training device.
其中,注册请求用于请求注册该第一模型训练装置。可选的,注册请求包括第一模型训练装置的注册信息。例如,注册信息包括以下至少一项:第一模型训练装置的计算能力、算力类型、算力裕量、计算精度、或计算能效。可选的,算力类型包括以下至少一项:中央处理器(central processing unit,CPU)、图形处理器(graphics processing unit,GPU)、或神经网络处理器(neural network processing unit,NPU)。The registration request is used to request the registration of the first model training device. Optionally, the registration request includes registration information of the first model training device. For example, the registration information includes at least one of the following: the computing power, computing power type, computing power margin, computing accuracy, or computing energy efficiency of the first model training device. Optionally, the computing power type includes at least one of the following: central processing unit (CPU), graphics processing unit (GPU), or neural network processing unit (NPU).
需要说明的是,步骤501a与步骤501之间没有固定的执行顺序。可以先执行步骤501a,再执行步骤501;或者,先执行步骤501,再执行步骤501a;或者,依据情况同时执行步骤501和步骤501a,具体本申请不做限定。It should be noted that there is no fixed execution order between steps 501a and 501. Step 501a can be executed first, followed by step 501; or step 501 can be executed first, followed by step 501a; or, depending on the circumstances, steps 501 and 501a can be executed simultaneously. This application does not impose any specific restrictions on this.
可选的,图5所示的实施例还包括步骤503。步骤503可以在步骤501之后执行。Optionally, the embodiment shown in FIG5 further includes step 503. Step 503 may be performed after step 501.
503、模型训练管理装置向第一模型训练装置发送第一指示信息。相应的,第一模型训练装置接收来自模型训练管理装置的第一指示信息。503. The model training management device sends a first instruction message to the first model training device. Correspondingly, the first model training device receives the first instruction message from the model training management device.
关于第一指示信息请参阅前述步骤502中的相关介绍,这里不再赘述。For information on the first instruction, please refer to the relevant description in step 502 above, which will not be repeated here.
需要说明的是,步骤502与步骤503之间没有固定的执行顺序。可以先执行步骤502,再执行步骤503;或者,先执行步骤503,再执行步骤502;或者,依据情况同时执行步骤502和步骤503,具体本申请不做限定。It should be noted that there is no fixed execution order between steps 502 and 503. Step 502 can be executed first, followed by step 503; or step 503 can be executed first, followed by step 502; or, depending on the circumstances, steps 502 and 503 can be executed simultaneously. This application does not impose any specific restrictions on this.
可选的,图5所示的实施例还包括步骤504。步骤504可以在步骤501之后执行。Optionally, the embodiment shown in FIG5 further includes step 504. Step 504 may be performed after step 501.
504、模型训练管理装置向控制装置发送第一指示信息。相应的,控制装置接收来自模型训练管理装置的第一指示信息。504. The model training management device sends a first instruction message to the control device. Correspondingly, the control device receives the first instruction message from the model training management device.
关于第一指示信息请参阅前述步骤502中的相关介绍,这里不再赘述。For information on the first instruction, please refer to the relevant description in step 502 above, which will not be repeated here.
需要说明的是,可选的,在联邦学习中,控制装置可以为中心节点。It should be noted that, optionally, in federated learning, the control device can be a central node.
需要说明的是,步骤504与步骤502、步骤503之间没有固定的执行顺序。例如,可以先执行步骤504,再执行步骤502,最后执行步骤503;或者,先执行步骤504,再执行步骤502,最后执行步骤503;或者,先执行步骤502,再执行步骤503,最后执行步骤504;或者,先执行步骤503,再执行步骤502,最后执行步骤504,具体本申请不做限定。It should be noted that there is no fixed execution order between step 504 and steps 502 and 503. For example, step 504 can be executed first, then step 502, and finally step 503; or step 504 can be executed first, then step 502, and finally step 503; or step 502 can be executed first, then step 503, and finally step 504; or step 503 can be executed first, then step 502, and finally step 504. This application does not limit the specific execution order.
可选的,图5所示的实施例还包括步骤505至步骤506。步骤505至步骤506可以在步骤503之后执行。Optionally, the embodiment shown in FIG5 further includes steps 505 to 506. Steps 505 to 506 may be performed after step 503.
505、第一数据采集装置向第一模型训练装置发送第一数据。第一数据包括第一标识。相应的,第一模型训练装置接收来自第一数据采集装置的第一数据。505. The first data acquisition device sends first data to the first model training device. The first data includes a first identifier. Correspondingly, the first model training device receives the first data from the first data acquisition device.
第一数据用于模型训练。换句话说,第一数据为模型训练数据。第一数据中包括第一标识,从而指示第一模型训练装置根据第一标识确定基于第一数据进行模型训练。例如,如图6A所示,第一数据采集装置为终端设备1,第一模型训练装置为AI服务器2。终端设备1向AI服务器2发送第一数据。The first data is used for model training. In other words, the first data is model training data. The first data includes a first identifier, which instructs the first model training device to determine whether to train the model based on the first data according to the first identifier. For example, as shown in Figure 6A, the first data acquisition device is terminal device 1, and the first model training device is AI server 2. Terminal device 1 sends the first data to AI server 2.
506、第一模型训练装置根据第一数据进行模型训练,得到第一模型。506. The first model training device trains the model based on the first data to obtain the first model.
例如,第一标识为第一数据采集装置的标识,第一模型训练装置根据该第一数据中携带的第一数据采集装置的标识确定可以基于该第一数据进行模型训练。再例如,第一标识为第一模型训练装置的标识,第一模型训练装置根据该第一数据中携带的第一模型训练装置的标识确定可以基于该第一数据进行模型训练。再例如,第一标识为配对标识,第一模型训练装置确定第一数据携带的配对标识为第一数据采集装置与第一模型训练装置之间的配对标识,因此,第一模型训练装置可以基于该第一数据进行模型训练。For example, the first identifier may be the identifier of a first data acquisition device. The first model training device determines that model training can be performed based on the first data based on the identifier of the first data acquisition device carried in the first data. As another example, the first identifier may be the identifier of a first model training device. The first model training device determines that model training can be performed based on the first data based on the identifier of the first model training device carried in the first data. As yet another example, the first identifier may be a pairing identifier. The first model training device determines that the pairing identifier carried in the first data is a pairing identifier between the first data acquisition device and the first model training device; therefore, the first model training device can perform model training based on the first data.
需要说明的是,步骤505至步骤506与前述步骤504之间没有固定的执行顺序。可以先执行步骤505至步骤506,再执行步骤504;或者,先执行步骤504,再执行步骤506至步骤506;或者,依据情况同时执行步骤504以及步骤505至步骤506,具体本申请不做限定。It should be noted that there is no fixed execution order between steps 505 to 506 and the aforementioned step 504. Steps 505 to 506 can be executed first, followed by step 504; or, step 504 can be executed first, followed by steps 506 to 506; or, depending on the circumstances, steps 504 and steps 505 to 506 can be executed simultaneously. This application does not impose any specific restrictions on this.
可选的,图5所示的实施例还包括步骤505a。步骤505a可以在步骤505之前执行。Optionally, the embodiment shown in FIG5 further includes step 505a. Step 505a may be performed before step 505.
505a、控制装置向第一数据采集装置发送调度信息。相应的,第一数据采集装置接收来自控制装置的调度信息。505a. The control device sends scheduling information to the first data acquisition device. Correspondingly, the first data acquisition device receives the scheduling information from the control device.
其中,调度信息用于调度第一数据采集装置发送第一数据。The scheduling information is used to schedule the first data acquisition device to send the first data.
可选的,图5所示的实施例还包括步骤505b至步骤505c。步骤505b至步骤505c可以在步骤505a之前执行。Optionally, the embodiment shown in FIG5 further includes steps 505b to 505c. Steps 505b to 505c may be performed before step 505a.
505b、第一数据采集装置向控制装置发送第二指示信息。相应的,控制装置接收来自第一数据采集装置的第二指示信息。505b. The first data acquisition device sends a second instruction to the control device. Correspondingly, the control device receives the second instruction from the first data acquisition device.
第二指示信息用于指示第一数据的数据状态。The second indication information is used to indicate the data status of the first data.
可选的,第一数据的数据状态包括以下至少一项:第一数据的数据类型、数据分布、第一数据包括的样本数量、或数据属性。可选的,数据类型包括以下至少一项:图片数据、语音数据、文本数据、信道数据、信号数据、或雷达数据等。可选的,第一数据的数据分布符合以下任一项:高斯分布、指数分布、均匀分布、或泊松分布等。可选的,数据属性包括以下至少一项:第一数据采集的时间、位置、或条件。Optionally, the data state of the first data includes at least one of the following: the data type, data distribution, number of samples included in the first data, or data attribute. Optionally, the data type includes at least one of the following: image data, voice data, text data, channel data, signal data, or radar data, etc. Optionally, the data distribution of the first data conforms to any one of the following: Gaussian distribution, exponential distribution, uniform distribution, or Poisson distribution, etc. Optionally, the data attribute includes at least one of the following: the time, location, or conditions of the first data acquisition.
505c、控制装置根据第二指示信息确定调度第一数据采集装置。505c. The control device determines the scheduling of the first data acquisition device based on the second instruction information.
例如,控制装置根据第一数据的类型和数据分布确定该第一数据对于模型性能的提升有帮助,控制装置可以选择调度该第一数据采集装置。由此可知,控制装置根据第二指示信息确定是否调度第一数据采集装置,从而有利于控制装置调度数据质量较高的数据采集装置,以提升模型训练的性能。For example, if the control device determines that the first data is helpful in improving model performance based on its type and distribution, it can choose to schedule the first data acquisition device. Therefore, the control device determines whether to schedule the first data acquisition device based on the second instruction information, which helps it schedule data acquisition devices with higher data quality to improve model training performance.
可选的,图5所示的实施例还包括步骤505d。步骤505d可以在步骤505c之前执行。Optionally, the embodiment shown in Figure 5 further includes step 505d. Step 505d may be performed before step 505c.
505d、第一模型训练装置向控制装置发送第三指示信息。相应的,控制装置接收来自第一模型训练装置的第三指示信息。505d. The first model training device sends a third instruction message to the control device. Correspondingly, the control device receives the third instruction message from the first model training device.
第三指示信息用于指示第一模型训练装置的训练相关信息。例如,第三指示信息用于指示第一模型训练装置的训练资源,如算力类型、算力裕量、计算精度、和/或、计算能效。The third indication information is used to indicate training-related information of the first model training device. For example, the third indication information is used to indicate the training resources of the first model training device, such as computing power type, computing power margin, computing accuracy, and/or computing energy efficiency.
可选的,上述步骤505c具体包括:控制装置根据第二指示信息和第三指示信息确定调度第一数据采集装置。在该实现方式中,控制装置进一步结合第三指示信息确定是否调度第一数据采集装置。实现更好的调度数据采集装置,提升模型训练的性能。Optionally, step 505c specifically includes: the control device determining whether to schedule the first data acquisition device based on the second and third indication information. In this implementation, the control device further combines the third indication information to determine whether to schedule the first data acquisition device. This achieves better scheduling of the data acquisition device and improves the performance of model training.
需要说明的是,上述是以控制装置根据第二指示信息确定调度第一数据采集装置,可选的,控制装置还进一步结合第三指示信息确定调度第一数据采集装置的示例介绍本申请的技术方案。实际上,控制装置可以根据第三指示信息确定调度第一数据采集装置,可选的,控制装置进一步结合第二指示信息确定调度第一数据采集装置。具体本申请不做限定。It should be noted that the above example illustrates the technical solution of this application, where the control device determines the scheduling of the first data acquisition device based on the second instruction information. Optionally, the control device may further determine the scheduling of the first data acquisition device in conjunction with the third instruction information. In practice, the control device can determine the scheduling of the first data acquisition device based on the third instruction information, or optionally, the control device may further determine the scheduling of the first data acquisition device in conjunction with the second instruction information. This application does not impose any specific limitations on this.
可选的,图5所示的实施例还包括步骤507。步骤507可以在步骤506之后执行。Optionally, the embodiment shown in Figure 5 further includes step 507. Step 507 may be performed after step 506.
507、第一模型训练装置向模型融合装置发送第一模型。相应的,模型融合装置接收来自第一模型训练装置的第一模型。507. The first model training device sends the first model to the model fusion device. Correspondingly, the model fusion device receives the first model from the first model training device.
例如,如图6B所示,第一模型训练装置为AI服务器1,模型融合装置为网络设备,AI服务器1向网络设备发送第一模型。For example, as shown in Figure 6B, the first model training device is AI server 1, the model fusion device is network device, and AI server 1 sends the first model to network device.
可选的,当第一模型收敛时或第一模型对应的训练轮次达到预设阈值时,第一模型训练装置向模型融合装置发送第一模型。第一模型对应的训练轮次是指第一模型训练装置基于第一数据进行模型训练对应的训练轮次。Optionally, when the first model converges or the training epochs corresponding to the first model reach a preset threshold, the first model training device sends the first model to the model fusion device. The training epochs corresponding to the first model refer to the training epochs in which the first model training device trains the model based on the first data.
可选的,图5所示的实施例还包括步骤508,步骤508可以在步骤507之后执行。Optionally, the embodiment shown in FIG5 further includes step 508, which can be performed after step 507.
508、模型融合装置融合第一模型,得到第二模型。508. The model fusion device fuses the first model to obtain the second model.
例如,如图6B和图6C所示,模型融合装置为网络设备,网络设备接收来自AI服务器1的第一模型。网络设备接收来自AI服务器2的第五模型。然后,网络设备融合第一模型和第五模型,得到第二模型。For example, as shown in Figures 6B and 6C, the model fusion device is a network device that receives a first model from AI server 1. The network device also receives a fifth model from AI server 2. Then, the network device fuses the first and fifth models to obtain a second model.
可选的,图5所示的实施例还包括步骤509。步骤509可以在步骤508之后执行。Optionally, the embodiment shown in Figure 5 further includes step 509. Step 509 may be performed after step 508.
509、模型融合装置向第一数据采集装置发送第二模型。相应的,第一数据采集装置接收来自模型融合装置的第二模型。509. The model fusion device sends the second model to the first data acquisition device. Correspondingly, the first data acquisition device receives the second model from the model fusion device.
其中,第二模型用于模型推理。例如,如图6C所示,模型融合装置为网络设备,第一数据采集装置为终端设备1。网络设备向终端设备1发送第二模型。终端设备1可以通过第二模型进行模型推理。The second model is used for model inference. For example, as shown in Figure 6C, the model fusion device is a network device, and the first data acquisition device is terminal device 1. The network device sends the second model to terminal device 1. Terminal device 1 can perform model inference using the second model.
可选的,图5所示的实施例还包括步骤510。步骤510可以在步骤508之后执行。Optionally, the embodiment shown in Figure 5 further includes step 510. Step 510 may be performed after step 508.
510、模型融合装置向第一模型训练装置发送第二模型。相应的,第一模型训练装置接收来自模型融合装置的第二模型。510. The model fusion device sends the second model to the first model training device. Correspondingly, the first model training device receives the second model from the model fusion device.
其中,第二模型用于模型推理和/或模型训练。The second model is used for model inference and/or model training.
需要说明的是,步骤509和步骤510之间没有固定的执行顺序。可以先执行步骤509,再执行步骤510;或者,先执行步骤510,再执行步骤509;或者,依据情况同时执行步骤509和步骤510,具体本申请不做限定。It should be noted that there is no fixed execution order between steps 509 and 510. Step 509 can be executed first, followed by step 510; or step 510 can be executed first, followed by step 509; or, depending on the circumstances, steps 509 and 510 can be executed simultaneously. This application does not impose any specific restrictions on this.
上述图5所示的实施例中,第一数据采集装置向模型训练管理装置发送第一请求。第一请求用于请求为第一数据采集装置分配模型训练信息。然后,第一数据采集装置接收来自模型训练管理装置的第一指示信息。第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。In the embodiment shown in Figure 5 above, the first data acquisition device sends a first request to the model training management device. The first request requests the allocation of model training information for the first data acquisition device. Then, the first data acquisition device receives first instruction information from the model training management device. The first instruction information indicates the allocation of first model training information to the first data acquisition device. This enables the allocation of first model training information to the first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
图7为本申请实施例信息传输方法的另一个实施例示意图。请参阅图7,方法包括:Figure 7 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 7, the method includes:
701、第一数据采集装置向模型训练管理装置发送第一请求。相应的,模型训练管理装置接收来自第一数据采集装置的第一请求。701. The first data acquisition device sends a first request to the model training management device. Correspondingly, the model training management device receives the first request from the first data acquisition device.
702、模型训练管理装置向第一数据采集装置发送第一指示信息。相应的,第一数据采集装置接收来自模型训练管理装置的第一指示信息。702. The model training management device sends a first instruction message to the first data acquisition device. Correspondingly, the first data acquisition device receives the first instruction message from the model training management device.
步骤701至步骤702与前述图5所示的实施例中的步骤501至步骤502类似,具体可以参阅前述图5所示的实施例中的步骤501至步骤502的相关介绍,这里不再赘述。Steps 701 to 702 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤701a。步骤701a可以在步骤702之前执行。Optionally, the embodiment shown in FIG7 further includes step 701a. Step 701a may be performed before step 702.
701a、第一模型训练装置向模型训练管理装置发送注册请求。相应的,模型训练管理装置接收来自第一模型训练装置的注册请求。701a. The first model training device sends a registration request to the model training management device. Correspondingly, the model training management device receives the registration request from the first model training device.
步骤701a与前述图5所示的实施例中的步骤501a类似,具体可以参阅前述图5所示的实施例中的步骤501a的相关介绍,这里不再赘述。Step 701a is similar to step 501a in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 501a in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤703。步骤703可以在步骤701之后执行。Optionally, the embodiment shown in FIG7 further includes step 703. Step 703 may be performed after step 701.
703、模型训练管理装置向第一模型训练装置发送第一指示信息。相应的,第一模型训练装置接收来自模型训练管理装置的第一指示信息。703. The model training management device sends a first instruction message to the first model training device. Correspondingly, the first model training device receives the first instruction message from the model training management device.
步骤703与前述图5所示的实施例中的步骤503类似,具体可以参阅前述图5所示的实施例中的步骤503的相关介绍,这里不再赘述。Step 703 is similar to step 503 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 503 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤704a至步骤704c。步骤704a至步骤704c可以在步骤703之后执行。Optionally, the embodiment shown in FIG7 further includes steps 704a to 704c. Steps 704a to 704c may be performed after step 703.
704a、第一数据采集装置向控制装置发送第二指示信息。第二指示信息用于指示第一标识。相应的,控制装置接收来自第一数据采集装置的第二指示信息。704a. The first data acquisition device sends a second indication message to the control device. The second indication message is used to indicate the first identifier. Correspondingly, the control device receives the second indication message from the first data acquisition device.
第二指示信息用于指示第一数据的数据状态和第一标识。第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。关于第一数据的数据状态和第一标识请参阅前述图5所示的实施例中的相关介绍,这里不再赘述。The second indication information is used to indicate the data status and the first identifier of the first data. The first identifier is used to indicate that the first model training device is a model training device that serves as the first data acquisition device. For details regarding the data status and the first identifier of the first data, please refer to the relevant description in the embodiment shown in Figure 5 above; it will not be repeated here.
704b、第一模型训练装置向控制装置发送第三指示信息。第三指示信息用于指示第一标识。相应的,控制装置接收来自第一模型训练装置的第三指示信息。704b. The first model training device sends a third instruction message to the control device. The third instruction message is used to indicate the first identifier. Correspondingly, the control device receives the third instruction message from the first model training device.
第三指示信息用于指示第一模型训练装置的训练相关信息和第一标识。关于训练相关信息和第一标识请参阅前述图5所示的实施例中的相关介绍,这里不再赘述。The third instruction information is used to indicate the training-related information and the first identifier of the first model training device. For details regarding the training-related information and the first identifier, please refer to the relevant descriptions in the embodiment shown in Figure 5 above; they will not be repeated here.
704c、控制装置根据第二指示信息和第三指示信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。704c. The control device determines the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
具体的,控制装置根据第二指示信息携带的第一标识和第三指示信息携带的第一标识可以确定第一模型训练装置作为第一数据采集装置的模型训练装置。或者说,控制装置根据第二指示信息携带的第一标识和第三指示信息携带的第一标识可以确定第一数据采集装置与第一模型训练装置配对。Specifically, the control device can determine that the first model training device is the model training device for the first data acquisition device based on the first identifier carried in the second instruction information and the first identifier carried in the third instruction information. Alternatively, the control device can determine that the first data acquisition device is paired with the first model training device based on the first identifier carried in the second instruction information and the first identifier carried in the third instruction information.
可选的,图7所示的实施例还包括步骤704至步骤705。步骤704至步骤705可以在步骤703之后执行。Optionally, the embodiment shown in FIG7 further includes steps 704 to 705. Steps 704 to 705 may be performed after step 703.
704、第一数据采集装置向第一模型训练装置发送第一数据。第一数据包括第一标识。相应的,第一模型训练装置接收来自第一数据采集装置的第一数据。704. The first data acquisition device sends first data to the first model training device. The first data includes a first identifier. Correspondingly, the first model training device receives the first data from the first data acquisition device.
705、第一模型训练装置根据第一数据进行模型训练,得到第一模型。705. The first model training device trains the model based on the first data to obtain the first model.
步骤704至步骤705与前述图5所示的实施例中的步骤505至步骤506类似,具体可以参阅前述图5所示的实施例中的步骤505至步骤506的相关介绍,这里不再赘述。Steps 704 to 705 are similar to steps 505 to 506 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 505 to 506 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤704d至步骤704e。步骤704d至步骤704e可以在步骤704c之后执行且在步骤704之前执行。Optionally, the embodiment shown in FIG7 further includes steps 704d to 704e. Steps 704d to 704e may be performed after step 704c and before step 704.
704d、控制装置根据第二指示信息和第三指示信息确定调度第一数据采集装置。704d. The control device determines the scheduling of the first data acquisition device based on the second and third instruction information.
步骤704d与前述图5所示的实施例中的步骤505c类似,具体可以参阅前述图5所示的实施例中的步骤505c的相关介绍,这里不再赘述。Step 704d is similar to step 505c in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 505c in the embodiment shown in Figure 5 above, which will not be repeated here.
704e、控制装置向第一数据采集装置发送调度信息。相应的,第一数据采集装置接收来自控制装置的调度信息。704e. The control device sends scheduling information to the first data acquisition device. Correspondingly, the first data acquisition device receives the scheduling information from the control device.
其中,调度信息用于调度第一数据采集装置发送第一数据。The scheduling information is used to schedule the first data acquisition device to send the first data.
可选的,图7所示的实施例还包括步骤706。步骤706可以在步骤705之后执行。Optionally, the embodiment shown in FIG7 further includes step 706. Step 706 may be performed after step 705.
706、第一模型训练装置向模型融合装置发送第一模型。相应的,模型融合装置接收来自第一模型训练装置的第一模型。706. The first model training device sends the first model to the model fusion device. Correspondingly, the model fusion device receives the first model from the first model training device.
步骤706与前述图5所示的实施例中的步骤507类似,具体可以参阅前述图5所示的实施例中的步骤507的相关介绍,这里不再赘述。Step 706 is similar to step 507 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 507 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤707。步骤707可以在步骤706之后执行。Optionally, the embodiment shown in FIG7 further includes step 707. Step 707 may be performed after step 706.
707、模型融合装置融合第一模型,得到第二模型。707. The model fusion device fuses the first model to obtain the second model.
步骤707与前述图5所示的实施例中的步骤508类似,具体可以参阅前述图5所示的实施例中的步骤508的相关介绍,这里不再赘述。Step 707 is similar to step 508 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 508 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤708。步骤708可以在步骤707之后执行。Optionally, the embodiment shown in FIG7 further includes step 708. Step 708 may be performed after step 707.
708、模型融合装置向第一数据采集装置发送第二模型。相应的,第一数据采集装置接收来自模型融合装置的第二模型。708. The model fusion device sends the second model to the first data acquisition device. Correspondingly, the first data acquisition device receives the second model from the model fusion device.
步骤708与前述图5所示的实施例中的步骤509类似,具体可以参阅前述图5所示的实施例中的步骤509的相关介绍,这里不再赘述。Step 708 is similar to step 509 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 509 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图7所示的实施例还包括步骤709。步骤709可以在步骤707之后执行。Optionally, the embodiment shown in FIG7 further includes step 709. Step 709 may be performed after step 707.
709、模型融合装置向第一模型训练装置发送第二模型。相应的,第一模型训练装置接收来自模型融合装置的第二模型。709. The model fusion device sends the second model to the first model training device. Correspondingly, the first model training device receives the second model from the model fusion device.
步骤709与前述图5所示的实施例中的步骤510类似,具体可以参阅前述图5所示的实施例中的步骤510的相关介绍,这里不再赘述。Step 709 is similar to step 510 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 510 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,步骤708与步骤709之间没有固定的执行顺序。例如,可以先执行步骤708,再执行步骤709;或者,可以先执行步骤709,再执行步骤708;或者,依据情况同时执行步骤708和步骤709,具体本申请不做限定。Optionally, there is no fixed execution order between steps 708 and 709. For example, step 708 can be executed first, followed by step 709; or step 709 can be executed first, followed by step 708; or, depending on the circumstances, steps 708 and 709 can be executed simultaneously. This application does not impose any specific restrictions on this.
由上述图7所示的实施例可知,第一数据采集装置向模型训练管理装置发送第一请求。第一请求用于请求为第一数据采集装置分配模型训练信息。然后,第一数据采集装置接收来自模型训练管理装置的第一指示信息。第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。As shown in the embodiment of Figure 7 above, the first data acquisition device sends a first request to the model training management device. The first request is used to request the allocation of model training information for the first data acquisition device. Then, the first data acquisition device receives first instruction information from the model training management device. The first instruction information is used to indicate the allocation of first model training information for the first data acquisition device. This realizes the allocation of first model training information to the first data acquisition device. This facilitates the first data acquisition device to determine how to complete model training based on the first model training information, so as to realize the completion of model training based on the local data of the first data acquisition device. For example, when the computing power of the first data acquisition device is weak or the first data acquisition device does not have computing power, the above technical solution helps to solve the problem that the first data acquisition device cannot perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send the local data of the first data acquisition device to the first model training device. This facilitates the first model training device to complete model training based on the local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the server of the terminal manufacturer can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
需要说明的是,上述图5和图7所示的实施例是以模型融合装置是联邦学习中的中心节点为例介绍本申请的技术方案。实际应用中,在分割学习中,模型融合装置可以替换称为模型处理装置,模型处理装置可以是分割学习中的中心节点。上述图5所示的实施例中的步骤506或上述图7所示的实施例中的步骤705可以替换描述为:第一模型训练装置将第一数据输入第一模型,并推到至分割层得到推理中间结果。第一模型是完整的神经网络模型的一个子网络。关于分割层请参阅前述分割学习中的相关介绍。上述图5所示的实施例中的步骤507或上述图7所示的实施例中的步骤706可以替换为:模型处理装置接收来自第一模型训练装置的推理中间结果。上述图5所示的实施例中的步骤508或上述图7所示的实施例中的步骤707可以替换为:模型处理装置根据推理中间结果进行另一部分模型的推理、梯度计算、和/或,该另一部分模型的参数更新。该另一部分模型是该神经网络模型的另一个子网络,第一模型和该另一部分模组组成该神经网络模型。上述图5所示的实施例中的步骤509或上述图7所示的实施例中的步骤708可以不执行,而上述图5所示的实施例中的步骤510或上述图7所示的实施例中的步骤709可以替换为:模型处理装置可以向第一模型训练装置发送梯度反向传递结果。第一模型训练装置根据该梯度反向传递结果更新第一模型。可选的,第一模型训练装置向第一数据采集装置发送更新后的第一模型。It should be noted that the embodiments shown in Figures 5 and 7 above illustrate the technical solution of this application by taking the model fusion device as the central node in federated learning as an example. In practical applications, in segmentation learning, the model fusion device can be replaced by a model processing device, which can be the central node in segmentation learning. Step 506 in the embodiment shown in Figure 5 above or step 705 in the embodiment shown in Figure 7 above can be replaced by describing the following: The first model training device inputs the first data into the first model and pushes it to the segmentation layer to obtain the intermediate inference result. The first model is a sub-network of the complete neural network model. For information on the segmentation layer, please refer to the relevant introduction in the aforementioned segmentation learning. Step 507 in the embodiment shown in Figure 5 above or step 706 in the embodiment shown in Figure 7 above can be replaced by: The model processing device receives the intermediate inference result from the first model training device. Step 508 in the embodiment shown in Figure 5 above or step 707 in the embodiment shown in Figure 7 above can be replaced by: The model processing device performs inference, gradient calculation, and/or parameter update of another part of the model based on the intermediate inference result. The other part of the model is another sub-network of the neural network model, and the first model and the other part of the model constitute the neural network model. Step 509 in the embodiment shown in Figure 5 or step 708 in the embodiment shown in Figure 7 may be omitted, while step 510 in the embodiment shown in Figure 5 or step 709 in the embodiment shown in Figure 7 may be replaced by: the model processing device sending the gradient backpropagation result to the first model training device. The first model training device updates the first model based on the gradient backpropagation result. Optionally, the first model training device sends the updated first model to the first data acquisition device.
图8为本申请实施例信息传输方法的再一个实施例示意图。请参阅图8,方法包括:Figure 8 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 8, the method includes:
801、第一数据采集装置向模型训练管理装置发送第一请求。相应的,模型训练管理装置接收来自第一数据采集装置的第一请求。801. The first data acquisition device sends a first request to the model training management device. Correspondingly, the model training management device receives the first request from the first data acquisition device.
802、模型训练管理装置向第一数据采集装置发送第一指示信息。相应的,第一数据采集装置接收来自模型训练管理装置的第一指示信息。802. The model training management device sends a first instruction message to the first data acquisition device. Correspondingly, the first data acquisition device receives the first instruction message from the model training management device.
步骤801至步骤802与前述图5所示的实施例中的步骤501至步骤502类似,具体可以参阅前述图5所示的实施例中的步骤501至步骤502的相关介绍,这里不再赘述。Steps 801 to 802 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤801a。步骤801a可以在步骤802之前执行。Optionally, the embodiment shown in FIG8 further includes step 801a. Step 801a may be performed before step 802.
801a、第一模型训练装置向模型训练管理装置发送注册请求。相应的,模型训练管理装置接收来自第一模型训练装置的注册请求。801a. The first model training device sends a registration request to the model training management device. Correspondingly, the model training management device receives the registration request from the first model training device.
步骤801a与前述图5所示的实施例中的步骤501a类似,具体可以参阅前述图5所示的实施例中的步骤501a的相关介绍,这里不再赘述。Step 801a is similar to step 501a in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 501a in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤803。步骤803可以在步骤801之后执行。Optionally, the embodiment shown in FIG8 further includes step 803. Step 803 may be performed after step 801.
803、模型训练管理装置向第一模型训练装置发送第一指示信息。相应的,第一模型训练装置接收来自模型训练管理装置的第一指示信息。803. The model training management device sends a first instruction message to the first model training device. Correspondingly, the first model training device receives the first instruction message from the model training management device.
步骤803与前述图5所示的实施例中的步骤503类似,具体可以参阅前述图5所示的实施例中的步骤503的相关介绍,这里不再赘述。Step 803 is similar to step 503 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 503 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤804。步骤804可以在步骤801之后执行。Optionally, the embodiment shown in FIG8 further includes step 804. Step 804 may be performed after step 801.
804、模型训练管理装置向控制装置发送第一指示信息。相应的,控制装置接收来自模型训练管理装置的第一指示信息。804. The model training management device sends a first instruction message to the control device. Correspondingly, the control device receives the first instruction message from the model training management device.
步骤804与前述图5所示的实施例中的步骤504类似,具体可以参阅前述图5所示的实施例中的步骤504的相关介绍,这里不再赘述。Step 804 is similar to step 504 in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 504 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤805至步骤806。步骤805至步骤806可以在步骤803之后执行。Optionally, the embodiment shown in FIG8 further includes steps 805 to 806. Steps 805 to 806 may be performed after step 803.
805、第一数据采集装置向第一模型训练装置发送第一数据。第一数据包括第一标识。相应的,第一模型训练装置接收来自第一数据采集装置的第一数据。805. The first data acquisition device sends first data to the first model training device. The first data includes a first identifier. Correspondingly, the first model training device receives the first data from the first data acquisition device.
806、第一模型训练装置根据第一数据进行模型训练,得到第一模型。806. The first model training device trains the model based on the first data to obtain the first model.
步骤805至步骤806与前述图5所示的实施例中的步骤505至步骤506类似,具体可以参阅前述图5所示的实施例中的步骤505至步骤506的相关介绍,这里不再赘述。Steps 805 to 806 are similar to steps 505 to 506 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 505 to 506 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤805a。步骤805a可以在步骤804之后且在步骤805之前执行。Optionally, the embodiment shown in FIG8 further includes step 805a. Step 805a may be performed after step 804 and before step 805.
805a、控制装置向第一数据采集装置发送调度信息。相应的,第一数据采集装置接收来自控制装置的调度信息。805a. The control device sends scheduling information to the first data acquisition device. Correspondingly, the first data acquisition device receives the scheduling information from the control device.
其中,调度信息用于调度第一数据采集装置发送第一数据。The scheduling information is used to schedule the first data acquisition device to send the first data.
可选的,图8所示的实施例还包括步骤805b至步骤805c。步骤805b至步骤805c可以在步骤805a之前执行。Optionally, the embodiment shown in FIG8 further includes steps 805b to 805c. Steps 805b to 805c may be performed before step 805a.
805b、第一数据采集装置向控制装置发送第二指示信息。相应的,控制装置接收来自第一数据采集装置的第二指示信息。805b. The first data acquisition device sends a second instruction to the control device. Correspondingly, the control device receives the second instruction from the first data acquisition device.
805c、控制装置根据第二指示信息确定调度第一数据采集装置。805c. The control device determines the scheduling of the first data acquisition device based on the second instruction information.
步骤805b至步骤805c与前述图5所示的实施例中的步骤505b至步骤505c类似,具体可以参阅前述图5所示的实施例中的步骤505b至步骤505c的相关介绍,这里不再赘述。Steps 805b to 805c are similar to steps 505b to 505c in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of steps 505b to 505c in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤805d。步骤805d可以在步骤805c之前执行。Optionally, the embodiment shown in Figure 8 further includes step 805d. Step 805d may be performed before step 805c.
805d、第一模型训练装置向控制装置发送第三指示信息。相应的,控制装置接收来自第一模型训练装置的第三指示信息。805d. The first model training device sends a third instruction message to the control device. Correspondingly, the control device receives the third instruction message from the first model training device.
步骤805d与前述图5所示的实施例中的步骤505d类似,具体可以参阅前述图5所示的实施例中的步骤505d的相关介绍,这里不再赘述。Step 805d is similar to step 505d in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 505d in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图8所示的实施例还包括步骤807。步骤807可以在步骤806之后执行。Optionally, the embodiment shown in FIG8 further includes step 807. Step 807 may be performed after step 806.
807、第一模型训练装置向第一数据采集装置发送第一模型。相应的,第一数据采集装置接收来自第一模型训练装置的第一模型。807. The first model training device sends the first model to the first data acquisition device. Correspondingly, the first data acquisition device receives the first model from the first model training device.
其中,对于第一数据采集装置来说,第一模型可以用于模型推理。对于第一模型训练装置来说,第一模型用于模型推理和/或模型训练。在去中心式学习中,第一模型训练装置基于第一数据进行模型训练得到第一模型,并向第一数据采集装置发送第一模型。Specifically, for the first data acquisition device, the first model can be used for model inference. For the first model training device, the first model is used for model inference and/or model training. In decentralized learning, the first model training device trains the model based on the first data to obtain the first model and sends the first model to the first data acquisition device.
可选的,当第一模型收敛时或第一模型对应的训练轮次达到预设阈值时,第一模型训练装置向第一数据采集装置发送第一模型。而不是第一模型训练装置每次模型训练完成后都发送训练后的模型给第一数据采集装置。Optionally, the first model training device sends the first model to the first data acquisition device when the first model converges or when the training epochs corresponding to the first model reach a preset threshold. Instead of the first model training device sending the trained model to the first data acquisition device after each training iteration, the first model training device sends the trained model to the first data acquisition device.
可选的,图8所示的实施例还包括步骤808。步骤808可以在步骤806之后执行。Optionally, the embodiment shown in FIG8 further includes step 808. Step 808 may be performed after step 806.
808、第一模型训练装置向第二数据采集装置或第二模型训练装置发送第一模型。相应的,第二数据采集装置或第二模型训练装置接收来自第一模型训练装置的第一模型。808. The first model training device sends the first model to the second data acquisition device or the second model training device. Correspondingly, the second data acquisition device or the second model training device receives the first model from the first model training device.
在去中心式学习中,第一模型训练装置向第二数据采集装置或第二模型训练装置发送第一模型。对于第二数据采集装置来说,第二数据采集装置可以向第二模型训练装置发送第一模型。从而便于第二模型训练装置将第一模型与第二模型训练装置的本地模型进行融合。可选的,第二模型训练装置将融合后的模型发送给第二数据采集装置。其中,第二模型训练装置是第二数据采集装置的模型训练装置,用于基于第二数据采集装置提供的数据进行模型训练。对于第二数据采集装置来说,融合后的模型可以用于模型推理。对于第二模型训练装置来说,融合后的模型可以用于模型推理和/或模型训练。In decentralized learning, the first model training device sends the first model to the second data acquisition device or the second model training device. The second data acquisition device can also send the first model to the second model training device, facilitating the fusion of the first model with its local model. Optionally, the second model training device sends the fused model to the second data acquisition device. The second model training device is the model training unit of the second data acquisition device, used for model training based on data provided by the second data acquisition device. For the second data acquisition device, the fused model can be used for model inference. For the second model training device, the fused model can be used for model inference and/or model training.
需要说明的是,步骤807与步骤808之间没有固定的执行顺序。例如,可以先执行步骤807,再执行步骤808;或者,先执行步骤808,再执行步骤807;或者,依据情况同时执行步骤807和步骤808,具体本申请不做限定。It should be noted that there is no fixed execution order between steps 807 and 808. For example, step 807 can be executed first, followed by step 808; or step 808 can be executed first, followed by step 807; or, depending on the circumstances, steps 807 and 808 can be executed simultaneously. This application does not impose any specific restrictions on this.
上述步骤808是以第一模型训练装置向第二数据采集装置或第二模型训练装置发送第一模型的实现方式为例介绍本申请的技术方案。实际应用中,也可以是如下方案:第一数据采集装置向第二数据采集装置发送第一模型。一种可能的实现方式中,第一数据采集装置向第二模型训练装置发送该第一模型。另一种可能的实现方式中,第二数据采集装置向第二模型训练装置发送第一模型。The above step 808 illustrates the technical solution of this application by taking the implementation of the first model training device sending the first model to the second data acquisition device or the second model training device as an example. In practical applications, the following solution is also possible: the first data acquisition device sends the first model to the second data acquisition device. In one possible implementation, the first data acquisition device sends the first model to the second model training device. In another possible implementation, the second data acquisition device sends the first model to the second model training device.
可选的,图8所示的实施例还包括步骤809至步骤810。步骤809至步骤810可以在步骤806之后执行。Optionally, the embodiment shown in FIG8 further includes steps 809 to 810. Steps 809 to 810 may be performed after step 806.
809、第二数据采集装置或第二模型训练装置向第一模型训练装置发送第三模型。相应的,第一模型训练装置接收来自第二数据采集装置或第二模型训练装置的第三模型。809. The second data acquisition device or the second model training device sends the third model to the first model training device. Correspondingly, the first model training device receives the third model from the second data acquisition device or the second model training device.
上述步骤809是以第一模型训练装置从第二数据采集装置或第二模型训练装置获取第三模型的实现方式为例介绍本申请的技术方案。实际应用中,也可以是第一数据采集装置从第二数据采集装置或第二模型训练装置获取第三模型。然后,第一数据采集装置向第一模型训练装置发送该第三模型,具体本申请不做限定。The above step 809 illustrates the technical solution of this application by using the implementation method of the first model training device acquiring the third model from the second data acquisition device or the second model training device as an example. In practical applications, the first data acquisition device may also acquire the third model from the second data acquisition device or the second model training device. Then, the first data acquisition device sends the third model to the first model training device; this application does not limit the specific implementation.
810、第一模型训练装置融合第一模型和第三模型,得到第四模型。810. The first model training device merges the first model and the third model to obtain the fourth model.
在去中心式学习中,第一模型训练装置可以接收来自第二数据采集装置或第二模型训练装置的第三模型。第一模型训练装置融合第一模型和第三模型,得到第四模型。有利于提升模型的性能。对于第一模型训练装置来说,第四模型可以用于模型推理和/或模型训练。In decentralized learning, the first model training device can receive a third model from a second data acquisition device or a second model training device. The first model training device merges the first model and the third model to obtain a fourth model. This is beneficial for improving model performance. For the first model training device, the fourth model can be used for model inference and/or model training.
可选的,若图8所示的实施例还包括步骤807,步骤809至步骤810可以在步骤807之后执行。Optionally, if the embodiment shown in FIG8 further includes step 807, steps 809 to 810 may be performed after step 807.
可选的,若图8所示的实施例还包括步骤808,步骤808与步骤809至步骤810之间没有固定的执行顺序。可以先执行步骤808,再执行步骤809至步骤810;或者,先执行步骤809至步骤810,再执行步骤808;或者,依据情况同时执行步骤808与步骤809至步骤810,具体本申请不做限定。Optionally, if the embodiment shown in FIG8 further includes step 808, there is no fixed execution order between step 808 and steps 809 to 810. Step 808 can be executed first, followed by steps 809 to 810; or, steps 809 to 810 can be executed first, followed by step 808; or, depending on the situation, steps 808 and steps 809 to 810 can be executed simultaneously. This application does not limit the specific execution order.
可选的,若图8所示的实施例包括步骤807和步骤808,步骤809至步骤810可以在步骤807之后执行。而步骤808与步骤809至步骤810之间没有固定的执行顺序。Optionally, if the embodiment shown in FIG8 includes steps 807 and 808, steps 809 to 810 can be executed after step 807. There is no fixed execution order between steps 808 and steps 809 to 810.
可选的,图8所示的实施例还包括步骤811。步骤811可以在步骤810之后执行。Optionally, the embodiment shown in FIG8 further includes step 811. Step 811 may be performed after step 810.
811、第一模型训练装置向第一数据采集装置发送第四模型。相应的,第一数据采集装置接收来自第一模型训练装置的第四模型。811. The first model training device sends the fourth model to the first data acquisition device. Correspondingly, the first data acquisition device receives the fourth model from the first model training device.
对于第一数据采集装置来说,第四模型可以用于模型推理。For the first data acquisition device, the fourth model can be used for model inference.
由上述图8所示的实施例可知,第一数据采集装置向模型训练管理装置发送第一请求。第一请求用于请求为第一数据采集装置分配模型训练信息。然后,第一数据采集装置接收来自模型训练管理装置的第一指示信息。第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。As shown in Figure 8 above, in the embodiment, the first data acquisition device sends a first request to the model training management device. The first request requests the allocation of model training information for the first data acquisition device. Then, the first data acquisition device receives first instruction information from the model training management device. The first instruction information indicates the allocation of first model training information to the first data acquisition device. This enables the allocation of first model training information to the first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
图9为本申请实施例信息传输方法的再一个实施例示意图。请参阅图9,方法包括:Figure 9 is a schematic diagram of another embodiment of the information transmission method of this application. Referring to Figure 9, the method includes:
901、第一数据采集装置向模型训练管理装置发送第一请求。相应的,模型训练管理装置接收来自第一数据采集装置的第一请求。901. The first data acquisition device sends a first request to the model training management device. Correspondingly, the model training management device receives the first request from the first data acquisition device.
902、模型训练管理装置向第一数据采集装置发送第一指示信息。相应的,第一数据采集装置接收来自模型训练管理装置的第一指示信息。902. The model training management device sends a first instruction message to the first data acquisition device. Correspondingly, the first data acquisition device receives the first instruction message from the model training management device.
步骤901至步骤902与前述图5所示的实施例中的步骤501至步骤502类似,具体可以参阅前述图5所示的实施例中的步骤501至步骤502的相关介绍,这里不再赘述。Steps 901 to 902 are similar to steps 501 to 502 in the embodiment shown in Figure 5 above. For details, please refer to the relevant descriptions of steps 501 to 502 in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤901a。步骤901a可以在步骤902之前执行。Optionally, the embodiment shown in FIG9 further includes step 901a. Step 901a may be performed before step 902.
901a、第一模型训练装置向模型训练管理装置发送注册请求。相应的,模型训练管理装置接收来自第一模型训练装置的注册请求。901a. The first model training device sends a registration request to the model training management device. Correspondingly, the model training management device receives the registration request from the first model training device.
步骤901a与前述图5所示的实施例中的步骤501a类似,具体可以参阅前述图5所示的实施例中的步骤501a的相关介绍,这里不再赘述。Step 901a is similar to step 501a in the embodiment shown in Figure 5 above. For details, please refer to the relevant description of step 501a in the embodiment shown in Figure 5 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤903。步骤903可以在步骤901之后执行。Optionally, the embodiment shown in FIG9 further includes step 903. Step 903 may be performed after step 901.
903、模型训练管理装置向第一模型训练装置发送第一指示信息。相应的,第一模型训练装置接收来自模型训练管理装置的第一指示信息。903. The model training management device sends a first instruction message to the first model training device. Correspondingly, the first model training device receives the first instruction message from the model training management device.
步骤903与前述图7所示的实施例中的步骤703类似,具体可以参阅前述图7所示的实施例中的步骤703的相关介绍,这里不再赘述。Step 903 is similar to step 703 in the embodiment shown in Figure 7 above. For details, please refer to the relevant description of step 703 in the embodiment shown in Figure 7 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤904a至步骤904c。步骤904a至步骤904c可以在步骤903之后执行。Optionally, the embodiment shown in FIG9 further includes steps 904a to 904c. Steps 904a to 904c may be performed after step 903.
904a、第一数据采集装置向控制装置发送第二指示信息。第二指示信息用于指示第一标识。相应的,控制装置接收来自第一数据采集装置的第二指示信息。904a. The first data acquisition device sends a second indication message to the control device. The second indication message is used to indicate the first identifier. Correspondingly, the control device receives the second indication message from the first data acquisition device.
904b、第一模型训练装置向控制装置发送第三指示信息。第三指示信息用于指示第一标识。相应的,控制装置接收来自第一模型训练装置的第三指示信息。904b. The first model training device sends a third instruction message to the control device. The third instruction message is used to indicate the first identifier. Correspondingly, the control device receives the third instruction message from the first model training device.
904c、控制装置根据第二指示信息和第三指示信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。904c. The control device determines the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
步骤904a至步骤904c与前述图7所示的实施例中的步骤704a至步骤704c类似,具体可以参阅前述图7所示的实施例中的步骤704a至步骤704c的相关介绍,这里不再赘述。Steps 904a to 904c are similar to steps 704a to 704c in the embodiment shown in FIG7 above. For details, please refer to the relevant description of steps 704a to 704c in the embodiment shown in FIG7 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤904至步骤905。步骤904至步骤905可以在步骤903之后执行。Optionally, the embodiment shown in FIG9 further includes steps 904 to 905. Steps 904 to 905 may be performed after step 903.
904、第一数据采集装置向第一模型训练装置发送第一数据。第一数据包括第一标识。相应的,第一模型训练装置接收来自第一数据采集装置的第一数据。904. The first data acquisition device sends first data to the first model training device. The first data includes a first identifier. Correspondingly, the first model training device receives the first data from the first data acquisition device.
905、第一模型训练装置根据第一数据进行模型训练,得到第一模型。905. The first model training device trains the model based on the first data to obtain the first model.
步骤904至步骤905与前述图7所示的实施例中的步骤704至步骤705类似,具体可以参阅前述图7所示的实施例中的步骤704至步骤705的相关介绍,这里不再赘述。Steps 904 to 905 are similar to steps 704 to 705 in the embodiment shown in Figure 7 above. For details, please refer to the relevant descriptions of steps 704 to 705 in the embodiment shown in Figure 7 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤904d。步骤904d可以在步骤904c之后执行且在步骤904之前执行。Optionally, the embodiment shown in FIG9 further includes step 904d. Step 904d may be performed after step 904c and before step 904.
904d、控制装置向第一数据采集装置发送调度信息。相应的,第一数据采集装置接收来自控制装置的调度信息。904d. The control device sends scheduling information to the first data acquisition device. Correspondingly, the first data acquisition device receives the scheduling information from the control device.
其中,调度信息用于调度第一数据采集装置发送第一数据。The scheduling information is used to schedule the first data acquisition device to send the first data.
可选的,图9所示的实施例还包括步骤906。步骤906可以在步骤905之后执行。Optionally, the embodiment shown in FIG9 further includes step 906. Step 906 may be performed after step 905.
906、第一模型训练装置向第一数据采集装置发送第一模型。相应的,第一数据采集装置接收来自第一模型训练装置的第一模型。906. The first model training device sends the first model to the first data acquisition device. Correspondingly, the first data acquisition device receives the first model from the first model training device.
步骤906与前述图8所示的实施例中的步骤807类似,具体可以参阅图8所示的实施例中的步骤807的相关介绍,这里不再赘述。Step 906 is similar to step 807 in the embodiment shown in Figure 8 above. For details, please refer to the relevant description of step 807 in the embodiment shown in Figure 8. It will not be repeated here.
可选的,图9所示的实施例还包括步骤907。步骤907可以在步骤905之后执行。Optionally, the embodiment shown in FIG9 further includes step 907. Step 907 may be performed after step 905.
907、第一模型训练装置向第二数据采集装置或第二模型训练装置发送第一模型。相应的,第二数据采集装置或第二模型训练装置接收来自第一模型训练装置的第一模型。907. The first model training device sends the first model to the second data acquisition device or the second model training device. Correspondingly, the second data acquisition device or the second model training device receives the first model from the first model training device.
步骤907与前述图8所示的实施例中的步骤808类似,具体可以参阅图8所示的实施例中的步骤808的相关介绍,这里不再赘述。Step 907 is similar to step 808 in the embodiment shown in Figure 8 above. For details, please refer to the relevant description of step 808 in the embodiment shown in Figure 8. It will not be repeated here.
可选的,图9所示的实施例还包括步骤908至步骤909。步骤908至步骤909可以在步骤905之后执行。Optionally, the embodiment shown in FIG9 further includes steps 908 to 909. Steps 908 to 909 may be performed after step 905.
908、第二数据采集装置或第二模型训练装置向第一模型训练装置发送第三模型。相应的,第一模型训练装置接收来自第二数据采集装置或第二模型训练装置的第三模型。908. The second data acquisition device or the second model training device sends the third model to the first model training device. Correspondingly, the first model training device receives the third model from the second data acquisition device or the second model training device.
909、第一模型训练装置融合第一模型和第三模型,得到第四模型。909. The first model training device merges the first model and the third model to obtain the fourth model.
步骤908至步骤909与前述图8所示的实施例中的步骤809至步骤810类似,具体可以参阅前述图8所示的实施例中的步骤809至步骤810的相关介绍,这里不再赘述。Steps 908 to 909 are similar to steps 809 to 810 in the embodiment shown in Figure 8 above. For details, please refer to the relevant descriptions of steps 809 to 810 in the embodiment shown in Figure 8 above, which will not be repeated here.
可选的,图9所示的实施例还包括步骤910。步骤910可以在步骤909之后执行。Optionally, the embodiment shown in FIG9 further includes step 910. Step 910 may be performed after step 909.
910、第一模型训练装置向第一数据采集装置发送第四模型。相应的,第一数据采集装置接收来自第一模型训练装置的第四模型。910. The first model training device sends the fourth model to the first data acquisition device. Correspondingly, the first data acquisition device receives the fourth model from the first model training device.
对于第一数据采集装置来说,第四模型可以用于模型推理。For the first data acquisition device, the fourth model can be used for model inference.
由上述图9所示的实施例可知,第一数据采集装置向模型训练管理装置发送第一请求。第一请求用于请求为第一数据采集装置分配模型训练信息。然后,第一数据采集装置接收来自模型训练管理装置的第一指示信息。第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。实现为第一数据采集装置分配第一模型训练信息。便于第一数据采集装置根据第一模型训练信息确定如何完成模型训练,以实现基于第一数据采集装置的本地数据完成模型训练。例如,第一数据采集装置的计算能力较弱或第一数据采集装置不具备计算能力的情况下,通过上述技术方案有利于解决第一数据采集装置无法进行模型训练的问题。例如,第一数据采集装置根据第一模型训练信息可以确定第一模型训练装置,并向第一模型训练装置发送第一数据采集装置的本地数据。从而便于第一模型训练装置基于该本地数据完成模型训练。例如,第一数据采集装置为终端设备,第一模型训练装置为终端厂商的服务器,终端设备的计算能力有限,且终端设备具有数据隐私保护需求的情况下,终端厂商的服务器可以基于终端设备的本地数据完成模型训练。既能够保障第一数据采集装置的数据隐私保护需求,还能够实现模型的训练。As shown in Figure 9 above, in the embodiment, the first data acquisition device sends a first request to the model training management device. The first request requests the allocation of model training information for the first data acquisition device. Then, the first data acquisition device receives first instruction information from the model training management device. The first instruction information indicates the allocation of first model training information to the first data acquisition device. This enables the allocation of first model training information to the first data acquisition device. This facilitates the first data acquisition device in determining how to complete model training based on the first model training information, thereby enabling model training based on the local data of the first data acquisition device. For example, if the first data acquisition device has weak computing power or no computing power at all, the above technical solution helps solve the problem of the first data acquisition device being unable to perform model training. For example, the first data acquisition device can determine the first model training device based on the first model training information and send its local data to the first model training device. This facilitates the first model training device in completing model training based on the local data. For example, if the first data acquisition device is a terminal device and the first model training device is a server of the terminal manufacturer, and the terminal device has limited computing power and data privacy protection requirements, the terminal manufacturer's server can complete model training based on the local data of the terminal device. It can both ensure the data privacy protection needs of the primary data acquisition device and enable model training.
下面对本申请实施例提供的第一数据采集装置进行描述。请参阅图10,图10为本申请实施例第一数据采集装置的一个结构示意图。第一数据采集装置1000可以用于执行图5、图7至图9所示的实施例中第一数据采集装置执行的步骤,具体请参阅上述方法实施例的相关介绍。第一数据采集装置1000包括收发模块1001。可选的,第一数据采集装置1000还包括处理模块1002。The first data acquisition device provided in the embodiments of this application is described below. Please refer to FIG10, which is a structural schematic diagram of the first data acquisition device in the embodiments of this application. The first data acquisition device 1000 can be used to execute the steps performed by the first data acquisition device in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments. The first data acquisition device 1000 includes a transceiver module 1001. Optionally, the first data acquisition device 1000 further includes a processing module 1002.
处理模块1002用于进行数据处理。收发模块1001可以实现相应的通信功能。收发模块1001还可以称为通信接口或通信模块。The processing module 1002 is used for data processing. The transceiver module 1001 can implement the corresponding communication functions. The transceiver module 1001 can also be called a communication interface or a communication module.
可选的,第一数据采集装置1000还可以包括存储模块,该存储模块可以用于存储程序代码、程序指令和/或数据,处理模块1002可以读取存储模块中的指令和/或数据,以使得第一数据采集装置1000实现前述方法实施例。Optionally, the first data acquisition device 1000 may further include a storage module, which can be used to store program code, program instructions and/or data. The processing module 1002 can read the instructions and/or data in the storage module so that the first data acquisition device 1000 can implement the aforementioned method embodiment.
第一数据采集装置1000可以用于执行上文方法实施例中第一数据采集装置所执行的动作。第一数据采集装置1000可以为终端设备或者可配置于终端设备的部件。处理模块1002用于执行上文方法实施例中第一数据采集装置侧的处理相关的操作。收发模块1001用于执行上文方法实施例中第一数据采集装置侧的接收相关的操作。The first data acquisition device 1000 can be used to perform the actions performed by the first data acquisition device in the above method embodiment. The first data acquisition device 1000 can be a terminal device or a component configurable on a terminal device. The processing module 1002 is used to perform processing-related operations on the first data acquisition device side in the above method embodiment. The transceiver module 1001 is used to perform receiving-related operations on the first data acquisition device side in the above method embodiment.
可选的,收发模块1001可以包括发送模块和接收模块。发送模块用于执行上述方法实施例中的发送操作。接收模块用于执行上述方法实施例中的接收操作。Optionally, the transceiver module 1001 may include a sending module and a receiving module. The sending module is used to perform the sending operation in the above method embodiments. The receiving module is used to perform the receiving operation in the above method embodiments.
需要说明的是,第一数据采集装置1000可以包括发送模块,而不包括接收模块。或者,第一数据采集装置1000可以包括接收模块,而不包括发送模块。具体可以视第一数据采集装置1000执行的上述方案中是否包括发送动作和接收动作。例如,第一数据采集装置1000用于执行上述图5、图7至图9所示的实施例中第一数据采集装置所执行的动作。具体可以参阅上述图5、图7至图9所示的实施例中的相关介绍,这里不详细展开。例如,第一数据采集装置1000用于执行如下方案:It should be noted that the first data acquisition device 1000 may include a transmitting module but not a receiving module. Alternatively, the first data acquisition device 1000 may include a receiving module but not a transmitting module. Specifically, it depends on whether the above-described scheme executed by the first data acquisition device 1000 includes both transmitting and receiving actions. For example, the first data acquisition device 1000 is used to execute the actions performed by the first data acquisition device in the embodiments shown in Figures 5, 7 to 9. For details, please refer to the relevant descriptions in the embodiments shown in Figures 5, 7 to 9, which will not be elaborated here. For example, the first data acquisition device 1000 is used to execute the following scheme:
收发模块1001,用于向模型训练管理装置发送第一请求,第一请求用于请求为第一数据采集装置1000分配模型训练信息;接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置1000分配的第一模型训练信息。The transceiver module 1001 is used to send a first request to the model training management device, the first request being used to request the allocation of model training information to the first data acquisition device 1000; and to receive first instruction information from the model training management device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device 1000.
一种可能的实现方式中,第一模型训练信息包括:第一模型训练装置的信息,和/或,第一模型训练资源。In one possible implementation, the first model training information includes: information about the first model training device, and/or, the first model training resources.
另一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置1000的模型训练装置。In another possible implementation, the first indication information includes a first identifier, which indicates that the first model training device is a model training device of the first data acquisition device 1000.
另一种可能的实现方式中,第一标识为所述第一数据采集装置1000的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置1000与第一模型训练装置配对。In another possible implementation, the first identifier is the identifier of the first data acquisition device 1000, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device 1000 is paired with the first model training device.
另一种可能的实现方式中,收发模块1001还用于:接收来自第一模型训练装置的第一模型,其中,第一模型是基于第一数据训练得到的,或者,接收来自模型融合装置的第二模型,第二模型是融合第一模型训练装置上报的第一模型得到的,第一模型是基于所述第一数据训练得到的。In another possible implementation, the transceiver module 1001 is further configured to: receive a first model from a first model training device, wherein the first model is trained based on the first data; or receive a second model from a model fusion device, wherein the second model is obtained by fusing the first model reported by the first model training device, and the first model is trained based on the first data.
另一种可能的实现方式中,收发模块1001还用于:接收来自第二模型训练装置或第二数据采集装置的第三模型;向第一模型训练装置发送该第三模型,第三模型用于模型融合。In another possible implementation, the transceiver module 1001 is further configured to: receive a third model from the second model training device or the second data acquisition device; and send the third model to the first model training device, wherein the third model is used for model fusion.
另一种可能的实现方式中,收发模块1001还用于:接收来自第一模型训练装置的第四模型,第四模型是融合第一模型和第三模型得到的。In another possible implementation, the transceiver module 1001 is also used to receive a fourth model from the first model training device, the fourth model being obtained by fusing the first model and the third model.
另一种可能的实现方式中,收发模块1001还用于:接收来自控制装置的调度信息,调度信息用于调度第一数据采集装置1000发送第一数据。In another possible implementation, the transceiver module 1001 is also used to: receive scheduling information from the control device, the scheduling information being used to schedule the first data acquisition device 1000 to send the first data.
另一种可能的实现方式中,收发模块1001还用于:向控制装置发送第二指示信息,第二指示信息用于指示第一数据的数据状态。In another possible implementation, the transceiver module 1001 is further configured to: send a second indication message to the control device, the second indication message being used to indicate the data status of the first data.
另一种可能的实现方式中,第二指示信息还用于指示第一标识。In another possible implementation, the second indication information is also used to indicate the first identifier.
应理解,各模块执行上述相应过程的具体过程在上述方法实施例中已经详细说明,为了简洁,在此不再赘述。It should be understood that the specific procedures for each module to perform the above-mentioned corresponding processes have been described in detail in the above method embodiments, and will not be repeated here for the sake of brevity.
上文实施例中的处理模块1002可以由至少一个处理器或处理器相关电路实现。收发模块1001可以由收发器或收发器相关电路实现。收发模块1001还可称为通信模块或通信接口。存储模块可以通过至少一个存储器实现。The processing module 1002 in the above embodiments can be implemented by at least one processor or processor-related circuitry. The transceiver module 1001 can be implemented by a transceiver or transceiver-related circuitry. The transceiver module 1001 can also be referred to as a communication module or communication interface. The storage module can be implemented by at least one memory.
下面对本申请实施例提供的模型训练管理装置进行描述。请参阅图11,图11为本申请实施例模型训练管理装置的一个结构示意图。模型训练管理装置1100可以用于执行图5、图7至图9所示的实施例中模型训练管理装置执行的步骤,具体请参阅上述方法实施例的相关介绍。模型训练管理装置1100包括收发模块1101。可选的,模型训练管理装置1100还包括处理模块1102。The model training management device provided in the embodiments of this application is described below. Please refer to FIG11, which is a structural schematic diagram of the model training management device according to an embodiment of this application. The model training management device 1100 can be used to execute the steps performed by the model training management device in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments. The model training management device 1100 includes a transceiver module 1101. Optionally, the model training management device 1100 also includes a processing module 1102.
处理模块1102用于进行数据处理。收发模块1101可以实现相应的通信功能。收发模块1101还可以称为通信接口或通信模块。The processing module 1102 is used for data processing. The transceiver module 1101 can implement the corresponding communication functions. The transceiver module 1101 can also be called a communication interface or a communication module.
可选的,模型训练管理装置1100还可以包括存储模块,该存储模块可以用于存储程序代码、程序指令和/或数据,处理模块1102可以读取存储模块中的指令和/或数据,以使得模型训练管理装置1100实现前述方法实施例。Optionally, the model training management device 1100 may further include a storage module, which can be used to store program code, program instructions and/or data. The processing module 1102 can read the instructions and/or data in the storage module so that the model training management device 1100 can implement the aforementioned method embodiment.
模型训练管理装置1100可以用于执行上文方法实施例中模型训练管理装置所执行的动作。模型训练管理装置1100可以为服务器或者可配置于服务器的部件。处理模块1102用于执行上文方法实施例中模型训练管理装置侧的处理相关的操作。收发模块1101用于执行上文方法实施例中模型训练管理装置侧的接收相关的操作。The model training management device 1100 can be used to execute the actions performed by the model training management device in the above method embodiment. The model training management device 1100 can be a server or a component configurable on a server. The processing module 1102 is used to execute processing-related operations on the model training management device side in the above method embodiment. The transceiver module 1101 is used to execute receiving-related operations on the model training management device side in the above method embodiment.
可选的,收发模块1101可以包括发送模块和接收模块。发送模块用于执行上述方法实施例中的发送操作。接收模块用于执行上述方法实施例中的接收操作。Optionally, the transceiver module 1101 may include a sending module and a receiving module. The sending module is used to perform the sending operation in the above method embodiments. The receiving module is used to perform the receiving operation in the above method embodiments.
需要说明的是,模型训练管理装置1100可以包括发送模块,而不包括接收模块。或者,模型训练管理装置1100可以包括接收模块,而不包括发送模块。具体可以视模型训练管理装置1100执行的上述方案中是否包括发送动作和接收动作。例如,模型训练管理装置1100用于执行上述图5、图7至图9所示的实施例中模型训练管理装置所执行的动作。具体可以参阅上述图5、图7至图9所示的实施例中的相关介绍,这里不详细展开。例如,模型训练管理装置1100用于执行如下方案:It should be noted that the model training management device 1100 may include a sending module but not a receiving module. Alternatively, the model training management device 1100 may include a receiving module but not a sending module. Specifically, it depends on whether the above-described scheme executed by the model training management device 1100 includes both sending and receiving actions. For example, the model training management device 1100 is used to execute the actions performed by the model training management device in the embodiments shown in Figures 5 and 7 to 9. For details, please refer to the relevant descriptions in the embodiments shown in Figures 5, 7 to 9, which will not be elaborated here. For example, the model training management device 1100 is used to execute the following scheme:
收发模块1101,用于接收来自第一数据采集装置的第一请求,第一请求用于请求为第一数据采集装置分配模型训练信息;向第一数据采集装置发送第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息。The transceiver module 1101 is used to receive a first request from the first data acquisition device, the first request being used to request the allocation of model training information to the first data acquisition device; and to send first instruction information to the first data acquisition device, the first instruction information being used to indicate the allocation of first model training information to the first data acquisition device.
一种可能的实现方式中,收发模块1101还用于:向第一模型训练装置发送第一指示信息。In one possible implementation, the transceiver module 1101 is further configured to: send first instruction information to the first model training device.
另一种可能的实现方式中,收发模块1101还用于:向控制装置发送第一指示信息。In another possible implementation, the transceiver module 1101 is also used to: send first instruction information to the control device.
另一种可能的实现方式中,第一模型训练信息包括第一模型训练装置的信息,和/或,第一模型训练资源。In another possible implementation, the first model training information includes information about the first model training device and/or the first model training resources.
另一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置。In another possible implementation, the first indication information includes a first identifier, which is used to indicate that the first model training device is a model training device for the first data acquisition device.
另一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置的标识,或者为配对标识,配对标识用于指示第一数据采集装置与第一模型训练装置配对。In another possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device, or a pairing identifier, which is used to indicate that the first data acquisition device is paired with the first model training device.
应理解,各模块执行上述相应过程的具体过程在上述方法实施例中已经详细说明,为了简洁,在此不再赘述。It should be understood that the specific procedures for each module to perform the above-mentioned corresponding processes have been described in detail in the above method embodiments, and will not be repeated here for the sake of brevity.
上文实施例中的处理模块1102可以由至少一个处理器或处理器相关电路实现。收发模块1101可以由收发器或收发器相关电路实现。收发模块1101还可称为通信模块或通信接口。存储模块可以通过至少一个存储器实现。The processing module 1102 in the above embodiments can be implemented by at least one processor or processor-related circuitry. The transceiver module 1101 can be implemented by a transceiver or transceiver-related circuitry. The transceiver module 1101 can also be referred to as a communication module or communication interface. The storage module can be implemented by at least one memory.
下面对本申请实施例提供的第一模型训练装置进行描述。请参阅图12,图12为本申请实施例第一模型训练装置的一个结构示意图。第一模型训练装置1200可以用于执行图5、图7至图9所示的实施例中第一模型训练装置执行的步骤,具体请参阅上述方法实施例的相关介绍。第一模型训练装置1200包括收发模块1201和处理模块1202。The first model training apparatus provided in the embodiments of this application is described below. Please refer to FIG12, which is a structural schematic diagram of the first model training apparatus in the embodiments of this application. The first model training apparatus 1200 can be used to execute the steps performed by the first model training apparatus in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments. The first model training apparatus 1200 includes a transceiver module 1201 and a processing module 1202.
处理模块1202用于进行数据处理。收发模块1201可以实现相应的通信功能。收发模块1201还可以称为通信接口或通信模块。The processing module 1202 is used for data processing. The transceiver module 1201 can implement the corresponding communication functions. The transceiver module 1201 can also be called a communication interface or a communication module.
可选的,第一模型训练装置1200还可以包括存储模块,该存储模块可以用于存储程序代码、程序指令和/或数据,处理模块1202可以读取存储模块中的指令和/或数据,以使得第一模型训练装置1200实现前述方法实施例。Optionally, the first model training device 1200 may further include a storage module, which can be used to store program code, program instructions and/or data. The processing module 1202 can read the instructions and/or data in the storage module so that the first model training device 1200 can implement the aforementioned method embodiment.
第一模型训练装置1200可以用于执行上文方法实施例中第一模型训练装置所执行的动作。第一模型训练装置1200可以为AI服务器或者可配置于AI服务器的部件。处理模块1202用于执行上文方法实施例中第一模型训练装置侧的处理相关的操作。收发模块1201用于执行上文方法实施例中第一模型训练装置侧的接收相关的操作。The first model training device 1200 can be used to execute the actions performed by the first model training device in the above method embodiment. The first model training device 1200 can be an AI server or a component configurable on an AI server. The processing module 1202 is used to execute processing-related operations on the first model training device side in the above method embodiment. The transceiver module 1201 is used to execute receiving-related operations on the first model training device side in the above method embodiment.
可选的,收发模块1201可以包括发送模块和接收模块。发送模块用于执行上述方法实施例中的发送操作。接收模块用于执行上述方法实施例中的接收操作。Optionally, the transceiver module 1201 may include a sending module and a receiving module. The sending module is used to perform the sending operation in the above method embodiments. The receiving module is used to perform the receiving operation in the above method embodiments.
需要说明的是,第一模型训练装置1200可以包括发送模块,而不包括接收模块。或者,第一模型训练装置1200可以包括接收模块,而不包括发送模块。具体可以视第一模型训练装置1200执行的上述方案中是否包括发送动作和接收动作。例如,第一模型训练装置1200用于执行上述图5、图7至图9所示的实施例中第一模型训练装置所执行的动作。具体可以参阅上述图5、图7至图9所示的实施例中的相关介绍,这里不详细展开。例如,第一模型训练装置1200用于执行如下方案:It should be noted that the first model training device 1200 may include a sending module but not a receiving module. Alternatively, the first model training device 1200 may include a receiving module but not a sending module. Specifically, it depends on whether the above-described scheme executed by the first model training device 1200 includes both sending and receiving actions. For example, the first model training device 1200 is used to execute the actions performed by the first model training device in the embodiments shown in Figures 5 and 7 to 9. For details, please refer to the relevant descriptions in the embodiments shown in Figures 5, 7 to 9, which will not be elaborated here. For example, the first model training device 1200 is used to execute the following scheme:
收发模块1201,用于接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;The transceiver module 1201 is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
处理模块1202,用于根据第一指示信息确定第一模型训练信息。The processing module 1202 is used to determine the first model training information based on the first instruction information.
一种可能的实现方式中,第一模型训练信息包括第一模型训练装置1200的信息,和/或,第一模型训练资源。In one possible implementation, the first model training information includes information about the first model training device 1200 and/or the first model training resources.
另一种可能的实现方式中,第一指示信息包括第一标识,第一标识用于标识第一模型训练装置1200作为第一数据采集装置的模型训练装置。In another possible implementation, the first indication information includes a first identifier, which is used to identify the first model training device 1200 as a model training device for the first data acquisition device.
另一种可能的实现方式中,第一标识为第一数据采集装置的标识,或者为第一模型训练装置1200的标识,或者为配对标识,配对标识用于标识第一数据采集装置与第一模型训练装置1200配对。In another possible implementation, the first identifier is the identifier of the first data acquisition device, or the identifier of the first model training device 1200, or a pairing identifier, which is used to identify the pairing of the first data acquisition device and the first model training device 1200.
另一种可能的实现方式中,收发模块1201还用于:接收来自第一数据采集装置的第一数据,第一数据包括第一标识;处理模块1202还用于:基于第一标识和第一数据进行模型训练得到第一模型。In another possible implementation, the transceiver module 1201 is further configured to: receive first data from the first data acquisition device, the first data including a first identifier; and the processing module 1202 is further configured to: perform model training based on the first identifier and the first data to obtain a first model.
另一种可能的实现方式中,收发模块1201还用于:向第一数据采集装置或模型融合装置发送第一模型。In another possible implementation, the transceiver module 1201 is also used to: send the first model to the first data acquisition device or the model fusion device.
另一种可能的实现方式中,收发模块1201还用于:接收来自模型融合装置的第二模型,第二模型是模型融合装置融合第一模型得到的。In another possible implementation, the transceiver module 1201 is also used to: receive a second model from the model fusion device, the second model being obtained by the model fusion device fusing the first model.
另一种可能的实现方式中,收发模块1201还用于:接收来自第一数据采集装置、第二数据采集装置或第二模型训练装置的第三模型,第三模型用于模型融合。In another possible implementation, the transceiver module 1201 is also used to: receive a third model from the first data acquisition device, the second data acquisition device, or the second model training device, the third model being used for model fusion.
另一种可能的实现方式中,处理模块1202还用于:融合第一模型和第三模型,得到第四模型;收发模块1201还用于:向第一数据采集装置发送第四模型。In another possible implementation, the processing module 1202 is further configured to: fuse the first model and the third model to obtain a fourth model; the transceiver module 1201 is further configured to: send the fourth model to the first data acquisition device.
另一种可能的实现方式中,收发模块1201还用于:向控制装置发送第三指示信息,第三指示信息用于指示第一模型训练装置1200的训练相关信息。In another possible implementation, the transceiver module 1201 is also used to: send third instruction information to the control device, the third instruction information being used to indicate training-related information of the first model training device 1200.
另一种可能的实现方式中,第三指示信息还用于指示第一标识。In another possible implementation, the third indication information is also used to indicate the first identifier.
应理解,各模块执行上述相应过程的具体过程在上述方法实施例中已经详细说明,为了简洁,在此不再赘述。It should be understood that the specific procedures for each module to perform the above-mentioned corresponding processes have been described in detail in the above method embodiments, and will not be repeated here for the sake of brevity.
上文实施例中的处理模块1202可以由至少一个处理器或处理器相关电路实现。收发模块1201可以由收发器或收发器相关电路实现。收发模块1201还可称为通信模块或通信接口。存储模块可以通过至少一个存储器实现。The processing module 1202 in the above embodiments can be implemented by at least one processor or processor-related circuitry. The transceiver module 1201 can be implemented by a transceiver or transceiver-related circuitry. The transceiver module 1201 can also be referred to as a communication module or communication interface. The storage module can be implemented by at least one memory.
下面对本申请实施例提供的控制装置进行描述。请参阅图13,图13为本申请实施例控制装置的一个结构示意图。控制装置1300可以用于执行图5、图7至图9所示的实施例中控制装置执行的步骤,具体请参阅上述方法实施例的相关介绍。控制装置1300包括收发模块1301和处理模块1302。The control device provided in the embodiments of this application is described below. Please refer to FIG13, which is a structural schematic diagram of the control device in the embodiments of this application. The control device 1300 can be used to execute the steps performed by the control device in the embodiments shown in FIG5, FIG7 to FIG9. For details, please refer to the relevant description of the above method embodiments. The control device 1300 includes a transceiver module 1301 and a processing module 1302.
处理模块1302用于进行数据处理。收发模块1301可以实现相应的通信功能。收发模块1301还可以称为通信接口或通信模块。The processing module 1302 is used for data processing. The transceiver module 1301 can implement the corresponding communication functions. The transceiver module 1301 can also be called a communication interface or a communication module.
可选的,控制装置1300还可以包括存储模块,该存储模块可以用于存储程序代码、程序指令和/或数据,处理模块1302可以读取存储模块中的指令和/或数据,以使得控制装置1300实现前述方法实施例。Optionally, the control device 1300 may further include a storage module, which can be used to store program code, program instructions and/or data. The processing module 1302 can read the instructions and/or data in the storage module so that the control device 1300 can implement the aforementioned method embodiments.
控制装置1300可以用于执行上文方法实施例中控制装置所执行的动作。控制装置1300可以为网络设备或者可配置于网络设备的部件。处理模块1302用于执行上文方法实施例中控制装置侧的处理相关的操作。收发模块1301用于执行上文方法实施例中控制装置侧的接收相关的操作。The control device 1300 can be used to execute the actions performed by the control device in the above method embodiments. The control device 1300 can be a network device or a component configurable on a network device. The processing module 1302 is used to execute processing-related operations on the control device side in the above method embodiments. The transceiver module 1301 is used to execute receiving-related operations on the control device side in the above method embodiments.
可选的,收发模块1301可以包括发送模块和接收模块。发送模块用于执行上述方法实施例中的发送操作。接收模块用于执行上述方法实施例中的接收操作。Optionally, the transceiver module 1301 may include a sending module and a receiving module. The sending module is used to perform the sending operation in the above method embodiments. The receiving module is used to perform the receiving operation in the above method embodiments.
需要说明的是,控制装置1300可以包括发送模块,而不包括接收模块。或者,控制装置1300可以包括接收模块,而不包括发送模块。具体可以视控制装置1300执行的上述方案中是否包括发送动作和接收动作。例如,控制装置1300用于执行上述图5、图7至图9所示的实施例中控制装置所执行的动作。具体可以参阅上述图5、图7至图9所示的实施例中的相关介绍,这里不详细展开。It should be noted that the control device 1300 may include a transmitting module but not a receiving module. Alternatively, the control device 1300 may include a receiving module but not a transmitting module. Specifically, it depends on whether the above-described scheme executed by the control device 1300 includes both transmitting and receiving actions. For example, the control device 1300 is used to execute the actions performed by the control device in the embodiments shown in Figures 5 and 7 to 9. For details, please refer to the relevant descriptions in the embodiments shown in Figures 5, 7 to 9; these will not be elaborated upon here.
例如,控制装置1300用于执行如下方案:For example, the control device 1300 is used to execute the following scheme:
收发模块1301,用于接收来自模型训练管理装置的第一指示信息,第一指示信息用于指示为第一数据采集装置分配的第一模型训练信息;The transceiver module 1301 is used to receive first instruction information from the model training management device, the first instruction information being used to indicate the first model training information allocated to the first data acquisition device;
处理模块1302,用于根据第一指示信息确定第一模型训练信息。The processing module 1302 is used to determine the first model training information based on the first instruction information.
再例如,控制装置1300用于执行如下方案:For example, control device 1300 is used to execute the following scheme:
收发模块1301,用于接收来自第一数据采集装置的第二指示信息,第二指示信息用于指示第一数据的数据状态和第一标识,第一标识用于指示第一模型训练装置作为第一数据采集装置的模型训练装置;接收来自第一模型训练装置的第三指示信息,第三指示信息用于指示第一模型训练装置的训练相关信息和第一标识;The transceiver module 1301 is used to receive second instruction information from the first data acquisition device, the second instruction information being used to indicate the data status and first identifier of the first data, the first identifier being used to indicate the first model training device as the model training device of the first data acquisition device; and to receive third instruction information from the first model training device, the third instruction information being used to indicate the training-related information and first identifier of the first model training device.
处理模块1302,用于根据第二指示信息和第三指示信息确定第一模型训练装置作为第一数据采集装置的模型训练装置。The processing module 1302 is used to determine the first model training device as the model training device of the first data acquisition device based on the second instruction information and the third instruction information.
对于其他实现方式请参阅前述方法实施例的相关介绍。For other implementation methods, please refer to the relevant descriptions in the foregoing method embodiments.
应理解,各模块执行上述相应过程的具体过程在上述方法实施例中已经详细说明,为了简洁,在此不再赘述。It should be understood that the specific procedures for each module to perform the above-mentioned corresponding processes have been described in detail in the above method embodiments, and will not be repeated here for the sake of brevity.
上文实施例中的处理模块1302可以由至少一个处理器或处理器相关电路实现。收发模块1301可以由收发器或收发器相关电路实现。收发模块1301还可称为通信模块或通信接口。存储模块可以通过至少一个存储器实现。The processing module 1302 in the above embodiments can be implemented by at least one processor or processor-related circuitry. The transceiver module 1301 can be implemented by a transceiver or transceiver-related circuitry. The transceiver module 1301 can also be referred to as a communication module or communication interface. The storage module can be implemented by at least one memory.
下面对本申请实施例提供的模型融合装置进行描述。请参阅图14,图14为本申请实施例模型融合装置的一个结构示意图。模型融合装置1400可以用于执行图5和图7所示的实施例中模型融合装置执行的步骤,具体请参阅上述方法实施例的相关介绍。模型融合装置1400包括收发模块1401和处理模块1402。The model fusion apparatus provided in the embodiments of this application is described below. Please refer to FIG14, which is a schematic diagram of the structure of the model fusion apparatus in the embodiments of this application. The model fusion apparatus 1400 can be used to perform the steps performed by the model fusion apparatus in the embodiments shown in FIG5 and FIG7. For details, please refer to the relevant description of the above method embodiments. The model fusion apparatus 1400 includes a transceiver module 1401 and a processing module 1402.
处理模块1402用于进行数据处理。收发模块1401可以实现相应的通信功能。收发模块1401还可以称为通信接口或通信模块。The processing module 1402 is used for data processing. The transceiver module 1401 can implement the corresponding communication functions. The transceiver module 1401 can also be called a communication interface or a communication module.
可选的,模型融合装置1400还可以包括存储模块,该存储模块可以用于存储程序代码、程序指令和/或数据,处理模块1402可以读取存储模块中的指令和/或数据,以使得模型融合装置1400实现前述方法实施例。Optionally, the model fusion apparatus 1400 may further include a storage module, which can be used to store program code, program instructions and/or data. The processing module 1402 can read the instructions and/or data in the storage module so that the model fusion apparatus 1400 can implement the aforementioned method embodiments.
模型融合装置1400可以用于执行上文方法实施例中模型融合装置所执行的动作。控制装置1400可以为网络设备或者可配置于网络设备的部件。处理模块1402用于执行上文方法实施例中模型融合装置侧的处理相关的操作。收发模块1401用于执行上文方法实施例中模型融合装置侧的接收相关的操作。The model fusion device 1400 can be used to perform the actions performed by the model fusion device in the above method embodiments. The control device 1400 can be a network device or a component configurable on a network device. The processing module 1402 is used to perform processing-related operations on the model fusion device side in the above method embodiments. The transceiver module 1401 is used to perform receiving-related operations on the model fusion device side in the above method embodiments.
可选的,收发模块1401可以包括发送模块和接收模块。发送模块用于执行上述方法实施例中的发送操作。接收模块用于执行上述方法实施例中的接收操作。Optionally, the transceiver module 1401 may include a sending module and a receiving module. The sending module is used to perform the sending operation in the above method embodiments. The receiving module is used to perform the receiving operation in the above method embodiments.
需要说明的是,模型融合装置1400可以包括发送模块,而不包括接收模块。或者,模型融合装置1400可以包括接收模块,而不包括发送模块。具体可以视模型融合装置1400执行的上述方案中是否包括发送动作和接收动作。例如,模型融合装置1400用于执行上述图5和图7所示的实施例中模型融合装置所执行的动作。具体可以参阅上述图5和图7所示的实施例中的相关介绍,这里不详细展开。例如,模型融合装置1400可以用于执行如下方案:It should be noted that the model fusion device 1400 may include a transmitting module but not a receiving module. Alternatively, the model fusion device 1400 may include a receiving module but not a transmitting module. Specifically, it depends on whether the above-described scheme executed by the model fusion device 1400 includes both transmitting and receiving actions. For example, the model fusion device 1400 is used to execute the actions performed by the model fusion device in the embodiments shown in Figures 5 and 7. For details, please refer to the relevant descriptions in the embodiments shown in Figures 5 and 7, which will not be elaborated here. For example, the model fusion device 1400 can be used to execute the following scheme:
收发模块1401,用于接收来自第一模型训练装置的第一模型,第一模型是基于第一数据采集装置的第一数据得到的;处理模块1402,用于融合第一模型,得到第二模型。The transceiver module 1401 is used to receive a first model from the first model training device, the first model being obtained based on the first data from the first data acquisition device; the processing module 1402 is used to fuse the first model to obtain a second model.
一种可能的实现方式中,收发模块1401还用于:向第一数据采集装置发送第二模型。In one possible implementation, the transceiver module 1401 is also used to: send the second model to the first data acquisition device.
另一种可能的实现方式中,收发模块1401还用于:向第一模型训练装置发送第二模型。In another possible implementation, the transceiver module 1401 is also used to send the second model to the first model training device.
应理解,各模块执行上述相应过程的具体过程在上述方法实施例中已经详细说明,为了简洁,在此不再赘述。It should be understood that the specific procedures for each module to perform the above-mentioned corresponding processes have been described in detail in the above method embodiments, and will not be repeated here for the sake of brevity.
上文实施例中的处理模块1402可以由至少一个处理器或处理器相关电路实现。收发模块1401可以由收发器或收发器相关电路实现。收发模块1401还可称为通信模块或通信接口。存储模块可以通过至少一个存储器实现。The processing module 1402 in the above embodiments can be implemented by at least one processor or processor-related circuitry. The transceiver module 1401 can be implemented by a transceiver or transceiver-related circuitry. The transceiver module 1401 can also be referred to as a communication module or communication interface. The storage module can be implemented by at least one memory.
本申请实施例还提供一种装置1500。请参阅图15,装置1500包括处理器1510,处理器1510与存储器1520耦合,存储器1520用于存储计算机程序或指令和/或数据,处理器1510用于执行存储器1520存储的计算机程序或指令和/或数据,使得上文方法实施例中的方法被执行。装置1500用于实现上文方法实施例中由第一数据采集装置、第一模型训练装置、模型训练管理装置、控制装置、或模型融合装置执行的操作。This application embodiment also provides an apparatus 1500. Referring to FIG15, the apparatus 1500 includes a processor 1510, which is coupled to a memory 1520. The memory 1520 is used to store computer programs or instructions and/or data. The processor 1510 is used to execute the computer programs or instructions and/or data stored in the memory 1520, causing the methods in the above method embodiments to be executed. The apparatus 1500 is used to implement the operations performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above method embodiments.
可选的,装置1500包括的处理器1510为一个或多个。Optionally, the device 1500 may include one or more processors 1510.
可选的,如图15所示,装置1500还可以包括存储器1520。Optionally, as shown in Figure 15, the device 1500 may also include a memory 1520.
可选的,装置1500包括的存储器1520可以为一个或多个。Optionally, the device 1500 may include one or more memory 1520.
可选的,存储器1520可以与该处理器1510集成在一起,或者分离设置。Optionally, the memory 1520 can be integrated with the processor 1510 or set separately.
可选的,如图15所示,装置1500还可以包括收发器1530,收发器1530用于信号的接收和/或发送。例如,处理器1510用于控制收发器1530进行信号的接收和/或发送。Optionally, as shown in FIG15, the device 1500 may further include a transceiver 1530 for receiving and/or transmitting signals. For example, the processor 1510 is used to control the transceiver 1530 to receive and/or transmit signals.
本申请还提供一种装置1600,装置1600可以为终端设备、终端设备中的处理器、或芯片。装置1600可以用于执行上述方法实施例中由第一数据采集装置所执行的操作。This application also provides an apparatus 1600, which may be a terminal device, a processor in the terminal device, or a chip. The apparatus 1600 can be used to perform the operations performed by the first data acquisition device in the above method embodiments.
当装置1600为终端设备时,图16示出了一种简化的终端设备的结构示意图。如图16所示,终端设备包括处理器、存储器、以及收发器。存储器可以存储计算机程序代码,收发器包括发射机1631、接收机1632、射频电路(图中未示出)、天线1633以及输入输出装置(图中未示出)。When device 1600 is a terminal device, Figure 16 shows a simplified schematic diagram of the terminal device. As shown in Figure 16, the terminal device includes a processor, a memory, and a transceiver. The memory can store computer program code, and the transceiver includes a transmitter 1631, a receiver 1632, radio frequency circuitry (not shown), an antenna 1633, and input/output devices (not shown).
处理器主要用于对通信协议以及通信数据进行处理;对终端设备进行控制,执行软件程序以及处理软件程序的数据等。The processor is mainly used to process communication protocols and communication data; control terminal devices; execute software programs; and process data from software programs.
存储器主要用于存储软件程序和数据。Memory is mainly used to store software programs and data.
射频电路主要用于基带信号与射频信号的转换以及对射频信号的处理。Radio frequency (RF) circuits are mainly used for the conversion between baseband signals and RF signals, as well as for the processing of RF signals.
天线主要用于收发电磁波形式的射频信号。Antennas are primarily used for transmitting and receiving radio frequency signals in the form of electromagnetic waves.
输入输出装置可以包括触摸屏、显示屏,或键盘等。输入输出装置主要用于接收用户输入的数据以及对用户输出数据。需要说明的是,有些种类的终端设备可以不具有输入输出装置。Input/output devices can include touchscreens, displays, or keyboards. They are primarily used to receive user input and output data to the user. It should be noted that some types of terminal devices may not have input/output devices.
当需要发送数据时,处理器对待发送的数据进行基带处理后,输出基带信号至射频电路。然后,射频电路将基带信号进行射频处理后将射频信号通过天线以电磁波的形式向外发送。当有数据发送到终端设备时,射频电路通过天线接收到射频信号。射频电路将射频信号转换为基带信号,并将基带信号输出至处理器。处理器将基带信号转换为数据并对该数据进行处理。为便于说明,图16中仅示出了一个存储器、处理器和收发器。在实际的终端设备的产品中,可以存在一个或多个处理器和一个或多个存储器。存储器也可以称为存储介质或者存储设备等。存储器可以是独立于处理器设置,也可以是与处理器集成在一起,本申请实施例对此不做限制。When data needs to be transmitted, the processor performs baseband processing on the data to be transmitted and outputs a baseband signal to the radio frequency (RF) circuit. The RF circuit then processes the baseband signal and transmits it outwards via an antenna as electromagnetic waves. When data is sent to the terminal device, the RF circuit receives the RF signal through the antenna. The RF circuit converts the RF signal back into a baseband signal and outputs it to the processor. The processor converts the baseband signal back into data and processes the data. For ease of explanation, Figure 16 only shows one memory, processor, and transceiver. In actual terminal device products, there may be one or more processors and one or more memories. Memory can also be called storage medium or storage device, etc. Memory can be independent of the processor or integrated with the processor; this embodiment does not limit this.
在本申请实施例中,可以将具有收发功能的天线和射频电路视为终端设备的收发模块,将具有处理功能的处理器视为终端设备的处理模块。In this embodiment, the antenna and radio frequency circuit with transceiver function can be regarded as the transceiver module of the terminal device, and the processor with processing function can be regarded as the processing module of the terminal device.
如图16所示,终端设备包括处理器1610、存储器1620和收发器1630。处理器1610也可以称为处理单元,处理单板,处理模块,或处理装置等。收发器1630也可以称为收发单元,收发机,或收发装置等。As shown in Figure 16, the terminal device includes a processor 1610, a memory 1620, and a transceiver 1630. The processor 1610 can also be referred to as a processing unit, processing board, processing module, or processing device, etc. The transceiver 1630 can also be referred to as a transceiver unit, transceiver, or transceiver device, etc.
可选的,将收发器1630中用于实现接收功能的器件视为接收模块,将收发器1630中用于实现发送功能的器件视为发送模块,即收发器1630包括接收器和发送器。收发器有时也可以称为收发机、收发模块、或收发电路等。接收器有时也可以称为接收机、接收模块、或接收电路等。发送器有时也可以称为发射机、发射模块或者发射电路等。Optionally, the device in transceiver 1630 used to implement the receiving function can be considered a receiving module, and the device in transceiver 1630 used to implement the transmitting function can be considered a transmitting module. That is, transceiver 1630 includes a receiver and a transmitter. A transceiver may also be called a transceiver unit, transceiver module, or transceiver circuit, etc. A receiver may also be called a receiver unit, receiving module, or receiving circuit, etc. A transmitter may also be called a transmitter, transmitting module, or transmitting circuit, etc.
处理器1610用于执行上述图5、图7至图9所示的实施例中第一数据采集装置侧的处理动作。收发器1630用于执行上述图5、图7至图9所示的实施例中第一数据采集装置侧的收发动作。The processor 1610 is used to execute the processing operations on the first data acquisition device side of the embodiments shown in Figures 5, 7 to 9. The transceiver 1630 is used to execute the transmission and reception operations on the first data acquisition device side of the embodiments shown in Figures 5, 7 to 9.
应理解,图16仅为示例而非限定,上述包括收发模块和处理模块的终端设备可以不依赖于图10或图16所示的结构。It should be understood that Figure 16 is merely an example and not a limitation, and the terminal device described above, including the transceiver module and the processing module, may not depend on the structure shown in Figure 10 or Figure 16.
当装置1600为芯片时,该芯片包括处理器、存储器和收发器。其中,收发器可以是输入输出电路或通信接口。处理器可以为该芯片上集成的处理模块或者微处理器或者集成电路。上述方法实施例中第一数据采集装置的发送操作可以理解为芯片的输出,上述方法实施例中第一数据采集装置的接收操作可以理解为芯片的输入。When device 1600 is a chip, the chip includes a processor, a memory, and a transceiver. The transceiver can be an input/output circuit or a communication interface. The processor can be a processing module integrated on the chip, a microprocessor, or an integrated circuit. In the above method embodiments, the transmitting operation of the first data acquisition device can be understood as the chip's output, and the receiving operation of the first data acquisition device in the above method embodiments can be understood as the chip's input.
本申请还提供一种装置1700,装置1700可以是网络设备也可以是芯片。装置1700可以用于执行上述图5、图7至图9所示的实施例中控制装置或模型融合装置所执行的操作。This application also provides a device 1700, which can be a network device or a chip. The device 1700 can be used to perform the operations performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9 above.
当装置1700为网络设备时,例如为基站。图17示出了一种简化的基站结构示意图。基站包括1710部分、1720部分以及1730部分。When device 1700 is a network device, such as a base station, Figure 17 shows a simplified schematic diagram of a base station structure. The base station includes parts 1710, 1720, and 1730.
1710部分主要用于基带处理,对基站进行控制等;1710部分通常是基站的控制中心,通常可以称为处理器,用于控制基站执行上述方法实施例中控制装置或模型融合装置侧的处理操作。The 1710 section is mainly used for baseband processing and controlling the base station; the 1710 section is usually the control center of the base station, which can be called the processor, and is used to control the base station to perform the processing operations on the control device or model fusion device side in the above method embodiments.
1720部分主要用于存储计算机程序代码和数据。Part 1720 is primarily used to store computer program code and data.
1730部分主要用于射频信号的收发以及射频信号与基带信号的转换;1730部分通常可以称为收发模块、收发机、收发电路、或者收发器等。1730部分的收发模块,也可以称为收发机或收发器等,其包括天线1733和射频电路(图中未示出),其中射频电路主要用于进行射频处理。可选的,可以将1730部分中用于实现接收功能的器件视为接收机,将用于实现发送功能的器件视为发射机,即1730部分包括接收机1732和发射机1731。接收机也可以称为接收模块、接收器、或接收电路等,发送机可以称为发射模块、发射器或者发射电路等。Section 1730 is primarily used for transmitting and receiving radio frequency (RF) signals, as well as converting RF signals to baseband signals. Section 1730 is commonly referred to as a transceiver module, transceiver, transceiver circuit, or transceiver unit. The transceiver module of section 1730, also known as a transceiver or transceiver unit, includes antenna 1733 and RF circuitry (not shown in the figure), where the RF circuitry is mainly used for RF processing. Optionally, the device in section 1730 used for receiving can be considered a receiver, and the device used for transmitting can be considered a transmitter; that is, section 1730 includes receiver 1732 and transmitter 1731. The receiver can also be called a receiving module, receiver circuit, or receiving circuit, and the transmitter can be called a transmitting module, transmitter, or transmitting circuit.
1710部分与1720部分可以包括一个或多个单板,每个单板可以包括一个或多个处理器和一个或多个存储器。处理器用于读取和执行存储器中的程序以实现基带处理功能以及对基站的控制。若存在多个单板,各个单板之间可以互联以增强处理能力。作为一种可选的实施方式,也可以是多个单板共用一个或多个处理器,或者是多个单板共用一个或多个存储器,或者是多个单板同时共用一个或多个处理器。Sections 1710 and 1720 may include one or more circuit boards, each of which may include one or more processors and one or more memories. The processors are used to read and execute programs from the memories to implement baseband processing functions and control the base station. If multiple circuit boards exist, they can be interconnected to enhance processing capabilities. As an alternative implementation, multiple circuit boards may share one or more processors, multiple circuit boards may share one or more memories, or multiple circuit boards may simultaneously share one or more processors.
例如,在一种实现方式中,1730部分的收发模块用于执行图5、图7至图9所示的实施例中由控制装置或模型融合装置执行的收发相关的过程。1710部分的处理器用于执行图5、图7至图9所示的实施例中由控制装置或模型融合装置执行的处理相关的过程。For example, in one implementation, the transceiver module of section 1730 is used to execute the transceiver-related processes performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9. The processor of section 1710 is used to execute the processing-related processes performed by the control device or model fusion device in the embodiments shown in Figures 5, 7 to 9.
应理解,图17仅为示例而非限定,上述包括处理器、存储器以及收发器的网络设备可以不依赖于图13或图17所示的结构。It should be understood that Figure 17 is merely an example and not a limitation, and the network device described above, including the processor, memory, and transceiver, may not depend on the structure shown in Figure 13 or Figure 17.
当装置1700为芯片时,该芯片包括收发器、存储器和处理器。其中,收发器可以是输入输出电路、通信接口;处理器为该芯片上集成的处理器、或者微处理器、或者集成电路。上述方法实施例中控制装置或模型融合装置的发送操作可以理解为芯片的输出,上述方法实施例中控制装置或模型融合装置的接收操作可以理解为芯片的输入。When device 1700 is a chip, the chip includes a transceiver, a memory, and a processor. The transceiver can be an input/output circuit or a communication interface; the processor can be an integrated processor, a microprocessor, or an integrated circuit on the chip. In the above method embodiments, the transmitting operation of the control device or model fusion device can be understood as the chip's output, and the receiving operation of the control device or model fusion device in the above method embodiments can be understood as the chip's input.
本申请还提供一种计算机可读存储介质,其上存储有用于实现上述方法实施例中由第一数据采集装置、第一模型训练装置、模型训练管理装置、控制装置、或模型融合装置执行的方法的计算机指令。This application also provides a computer-readable storage medium storing computer instructions for implementing the methods executed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above-described method embodiments.
例如,该计算机程序被计算机执行时,使得该计算机可以实现上述方法实施例中由第一数据采集装置、第一模型训练装置、模型训练管理装置、控制装置、或模型融合装置执行的方法。For example, when the computer program is executed by a computer, the computer can implement the method performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device in the above method embodiments.
本申请还提供一种包含指令的计算机程序产品,该指令被计算机执行时使得该计算机实现上述方法实施例中由第一数据采集装置、第一模型训练装置、模型训练管理装置、控制装置、或模型融合装置执行的方法。This application also provides a computer program product containing instructions that, when executed by a computer, cause the computer to implement the method described above, which is performed by the first data acquisition device, the first model training device, the model training management device, the control device, or the model fusion device.
本申请还提供一种通信系统,该通信系统包括模型训练管理装置和第一数据采集装置。模型训练管理装置用于执行图5、图7至图9所示的实施例中模型训练管理装置执行的部分或全部操作。第一数据采集装置用于执行图5、图7至图9所示的实施例中第一数据采集装置执行的部分或全部操作。This application also provides a communication system, which includes a model training management device and a first data acquisition device. The model training management device is used to perform some or all of the operations performed by the model training management device in the embodiments shown in Figures 5, 7 to 9. The first data acquisition device is used to perform some or all of the operations performed by the first data acquisition device in the embodiments shown in Figures 5, 7 to 9.
可选的,通信系统还包括第一模型训练装置,第一模型训练装置用于执行图5、图7至图9所示的实施例中第一模型训练装置执行的部分或全部操作。Optionally, the communication system further includes a first model training device, which is used to perform some or all of the operations performed by the first model training device in the embodiments shown in Figures 5, 7 to 9.
可选的,通信系统还包括控制装置,控制装置用于执行图5、图7至图9所示的实施例中控制装置执行的部分或全部操作。Optionally, the communication system also includes a control device for performing some or all of the operations performed by the control device in the embodiments shown in Figures 5, 7 to 9.
可选的,通信系统还包括模型融合装置,模型融合装置用于执行图5和图7所示的实施例中模型融合装置执行的部分或全部操作。Optionally, the communication system also includes a model fusion device, which performs some or all of the operations performed by the model fusion device in the embodiments shown in Figures 5 and 7.
本申请实施例还提供一种芯片装置,包括处理器,用于调用该存储器中存储的计算机程度或计算机指令,以使得该处理器执行上述图5、图7至图9所示的实施例提供的方法。This application also provides a chip device, including a processor, for calling computer programs or computer instructions stored in the memory, so that the processor executes the method provided in the embodiments shown in Figures 5, 7 to 9 above.
一种可能的实现方式中,该芯片装置的输入对应上述图5、图7至图9所示的实施例中任一个实施例中的接收操作,该芯片装置的输出对应上述图5、图7至图9所示的实施例中任一个实施例中的发送操作。In one possible implementation, the input of the chip device corresponds to the receiving operation in any one of the embodiments shown in Figures 5, 7 to 9, and the output of the chip device corresponds to the sending operation in any one of the embodiments shown in Figures 5, 7 to 9.
可选的,该处理器通过接口与存储器耦合。Optionally, the processor is coupled to the memory via an interface.
可选的,该芯片装置还包括存储器,该存储器中存储有计算机程度或计算机指令。Optionally, the chip device may also include a memory that stores computer programs or computer instructions.
其中,上述任一处提到的处理器,可以是一个通用中央处理器,微处理器,特定应用集成电路(application-specific integrated circuit,ASIC),或一个或多个用于控制上述图5、图7至图9所示的实施例中任一个实施例提供的方法的程序执行的集成电路。上述任一处提到的存储器可以为只读存储器(read-only memory,ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(random access memory,RAM)等。The processor mentioned above can be a general-purpose central processing unit, a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits used to control the execution of a program for controlling the method provided in any of the embodiments shown in Figures 5, 7 to 9. The memory mentioned above can be read-only memory (ROM) or other types of static storage devices capable of storing static information and instructions, such as random access memory (RAM).
所属领域的技术人员可以清楚地了解到,为描述方便和简洁,上述提供的任一种装置中相关内容的解释及有益效果均可参考上文提供的对应的方法实施例,此处不再赘述。Those skilled in the art will clearly understand that, for the sake of convenience and brevity, the explanations and beneficial effects of the relevant contents in any of the above-mentioned devices can be referred to the corresponding method embodiments provided above, and will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods can be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; for instance, the division of units is only a logical functional division, and in actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. Furthermore, the coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection between apparatuses or units through some interfaces, and may be electrical, mechanical, or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separate. The components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。Furthermore, the functional units in the various embodiments of this application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit. The integrated unit can be implemented in hardware or as a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the essential contribution of the technical solution of this application, or all or part of the technical solution, can be embodied in the form of a software product. This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of this application. The aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, ROM, RAM, magnetic disks, or optical disks.
以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围。The above-described embodiments are only used to illustrate the technical solutions of this application, and are not intended to limit it. Although this application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that modifications can still be made to the technical solutions described in the foregoing embodiments, or equivalent substitutions can be made to some of the technical features. Such modifications or substitutions do not cause the essence of the corresponding technical solutions to deviate from the scope of the technical solutions of the embodiments of this application.
Claims (30)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410705939.1 | 2024-05-31 | ||
| CN202410705939.1A CN121051450A (en) | 2024-05-31 | 2024-05-31 | Information transmission methods and related devices |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025246373A1 true WO2025246373A1 (en) | 2025-12-04 |
Family
ID=97808565
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2025/070370 Pending WO2025246373A1 (en) | 2024-05-31 | 2025-01-03 | Information transmission method and related apparatus |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN121051450A (en) |
| WO (1) | WO2025246373A1 (en) |
-
2024
- 2024-05-31 CN CN202410705939.1A patent/CN121051450A/en active Pending
-
2025
- 2025-01-03 WO PCT/CN2025/070370 patent/WO2025246373A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| CN121051450A (en) | 2025-12-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN115484620A (en) | A method and related device for computing power resource scheduling | |
| US20230139345A1 (en) | Computing service implementation method and apparatus | |
| EP4192094A1 (en) | Service traffic shunting method and apparatus | |
| WO2023125599A1 (en) | Communication method and communication device | |
| US20250184938A1 (en) | Communication method and communication apparatus | |
| WO2023116350A1 (en) | Communication method and communication apparatus | |
| WO2023044904A1 (en) | Communication method, terminal device and network device | |
| US20210144630A1 (en) | Base station and data transmission method thereof for mobile communication system | |
| WO2023179231A1 (en) | Cell information configuration method and apparatus, and readable storage medium and chip system | |
| CN114731731A (en) | Communication method and device | |
| CN117223303A (en) | Communication methods, equipment and storage media | |
| WO2024197689A1 (en) | Communication method and related device | |
| WO2025246373A1 (en) | Information transmission method and related apparatus | |
| WO2023179262A1 (en) | Cell information configuration method and apparatus, and readable storage medium and chip system | |
| WO2023070684A1 (en) | Wireless communication method, and device | |
| WO2023272670A1 (en) | Network connection method, and apparatus and system | |
| WO2025236613A1 (en) | Model recognition method and related apparatus | |
| CN117580183B (en) | Network priority negotiation method and related equipment | |
| US20240414617A1 (en) | Handover method and related apparatus | |
| WO2025092059A1 (en) | Method, apparatus, and system for artificial intelligence (ai) model splitting | |
| WO2025152037A1 (en) | Wireless communication method, terminal device, and network device | |
| WO2025039271A1 (en) | Measurement methods and related apparatuses | |
| WO2025130521A1 (en) | Service generation method and apparatus, and network device | |
| WO2025050280A1 (en) | Communication method and related device | |
| WO2024140269A1 (en) | Communication method and communication apparatus |