CN119623510A

CN119623510A - An efficient online adaptation method for multi-device collaboration

Info

Publication number: CN119623510A
Application number: CN202411574116.6A
Authority: CN
Inventors: 谭明奎; 陈国昊; 陈德宇; 牛帅程; 张书海; 陈耀佛; 邓泽帅; 杜卿
Original assignee: Super Robot Research Institute Huangpu; South China University of Technology SCUT
Current assignee: Super Robot Research Institute Huangpu; South China University of Technology SCUT
Priority date: 2024-11-06
Filing date: 2024-11-06
Publication date: 2025-03-14

Abstract

The invention discloses an efficient online adaptation method for multi-device collaboration. For a device with abundant computing resources, a first model in the device is processed in an online collaborative adaptation mode based on forward and backward propagation: when a domain shift is detected, the weights learned by the model are dynamically saved to a shared knowledge base, and a new parameter matrix is introduced into the parameters of the first model to capture new knowledge during adaptation. For a device with low computing power, a second model in the device is processed in an online collaborative adaptation mode that uses forward propagation only, in which the model adapts online by directly using the existing shared knowledge base. By dynamically updating the shared knowledge base, the invention realizes the accumulation, sharing and dynamic utilization of knowledge among multiple devices; by applying different online collaborative adaptation modes to devices with different computing power, it greatly improves the energy efficiency and practicality of online adaptation in practical applications. The invention can be widely applied in the technical field of transfer learning.

Description

Efficient online adaptation method for multi-device collaboration

Technical Field

The invention relates to the technical field of transfer learning, and in particular to an efficient online adaptation method for multi-device collaboration.

Background

Existing deep learning relies on the assumption that training and test data are independent and identically distributed. It struggles with the unknown distribution shifts of dynamically changing scenes, where the performance of a deep model can degrade severely. How to adapt to test data online and achieve stable generalization in dynamic scenes is therefore a key prerequisite for deploying deep models in scenarios such as smart cities and smart transportation.

Online adaptation technology aims to perform scene adaptation using only unlabeled online data. Related methods mainly comprise 1) self-supervised online adaptation and 2) unsupervised online adaptation. Self-supervised online adaptation methods focus on learning to reconstruct information from scene data, that is, the deep model understands and generalizes to new application scenes by reducing the reconstruction loss of the scene online. However, these methods introduce additional self-supervised branches, which significantly increases the training and deployment cost of the model and makes them inefficient. Unsupervised online adaptation methods are based on Shannon entropy analysis: they optimize the decision uncertainty of the model and help it form clearer decision boundaries in a new scene, thereby improving its cross-scene decision capability. Some of these methods further analyze the influence of different samples on online adaptation, construct a reliable and non-redundant active sample selection mechanism, and solve the online adaptation problem only on the retained subset of samples, improving both the performance and the efficiency of adaptation.

In practical applications, however, there are often multiple cooperating devices, such as smartphones, surveillance cameras and unmanned vehicles. Existing online adaptation methods are limited to independent adaptation on a single device, which restricts their efficiency and performance in practice. The bottlenecks are as follows: 1) different devices adapt to similar scenes with high redundancy, so computing resources are seriously wasted and online adaptation efficiency and performance are poor; 2) existing online adaptation methods are prone to catastrophic forgetting, can hardly accumulate the knowledge learned during adaptation, and cannot support cross-device knowledge sharing and utilization, which makes multi-device collaboration difficult; 3) online adaptation learning relies on the computationally expensive backpropagation mechanism, whereas the computing power of edge devices is limited, and this conflict makes online adaptation technology difficult to apply on edge devices.

Disclosure of Invention

In order to solve at least one of the technical problems in the prior art to a certain extent, the invention aims to provide a multi-device collaborative efficient online adaptation method, electronic equipment and a medium.

The first technical scheme adopted by the invention is as follows:

An efficient online adaptation method for multi-device collaboration, used for realizing the accumulation, sharing and dynamic utilization of knowledge among multiple devices, wherein different devices share the same shared knowledge base, the method comprising the following steps:

for a device with abundant computing resources, processing a first model in the device in an online collaborative adaptation mode based on forward and backward propagation, and, when a domain shift is detected, dynamically saving the weights learned by the model to the shared knowledge base, thereby realizing forgetting-free knowledge accumulation in cross-domain learning;

for an edge device with low computing power, processing a second model in the device in an online collaborative adaptation mode that uses forward propagation only, wherein the model performs online adaptation by directly using the existing shared knowledge base, thereby avoiding the high computational cost of knowledge learning.

It should be noted that the first model in the above refers to a model deployed in a device with rich computing power resources, and the second model refers to a model deployed in a device with low computing power, i.e., a model deployed in a device with rich computing power and a device with low computing power are distinguished by "first" and "second".

Further, the expression of the parameters of the first model is:

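$$\theta = \theta^{0} + \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$$

wherein $\theta^{0}$ denotes the original parameters of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ are the M pieces of knowledge accumulated in the shared knowledge base shared by the multiple devices; $\alpha_i$ is the normalized learnable weighting coefficient of each piece of knowledge, namely $\sum_{i=1}^{M} \alpha_i = 1$; and $\Delta\theta_{new}$ is a newly introduced learnable parameter matrix.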

Further, the expression of the parameters of the second model is:

$$\theta = \theta^{0} + \sum_{i=1}^{M} \gamma_i \Delta\theta_i$$

wherein $\theta^{0}$ denotes the original parameters of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ are the M pieces of knowledge accumulated in the shared knowledge base shared by the multiple devices; and $\gamma_i$ is the normalized weighting coefficient of each piece of knowledge, namely $\sum_{i=1}^{M} \gamma_i = 1$, which supports the reuse of shared knowledge among devices.

Further, processing the first model in the device in the online collaborative adaptation mode based on forward and backward propagation comprises the following steps:

inputting a batch of target images to be adapted, $\{x_i\}_{i=1}^{N}$, into the first model in the device;

detecting whether a domain shift has currently occurred and, if so, saving the learned knowledge to the shared knowledge base;

generating a prediction result from the input images, and updating the model by using an online optimization objective and the backpropagation mechanism, until all image data have been processed, whereupon online adaptation of the model stops.

Further, detecting whether a domain shift has currently occurred and, if so, saving the learned knowledge to the shared knowledge base comprises:

measuring the distribution $\phi$ of the currently input image data by using the output features of the model's backbone network;

calculating the difference $D(\phi, \phi_d)$ between the distribution $\phi$ of the currently input image data and the historical data distribution $\phi_d$;

if the difference $D(\phi, \phi_d)$ is larger than a preset threshold, determining that a domain shift has occurred in the currently input image data, and saving the learned knowledge $\Delta\theta_{M+1}$ and the corresponding domain distribution $\phi_d$ to the shared knowledge base.

Further, the distribution $\phi$ of the currently input image data is calculated as:

$$\mu = \frac{1}{N}\sum_{i=1}^{N} B(x_i), \qquad \sigma^{2} = \frac{1}{N}\sum_{i=1}^{N} \bigl(B(x_i) - \mu\bigr)^{2}$$

wherein $B(\cdot)$ is the output feature of the backbone network, $x_i$ is the input data, and N is the number of data samples; $\mu$ and $\sigma^{2}$ are the mean and variance of the features, and $\phi$ consists of $\mu$ and $\sigma^{2}$.

Further, the historical data distribution $\phi_d$ is a moving average of the input image data distribution, in which $\lambda$ is a balance factor;

the difference between the image data distribution $\phi$ and the historical data distribution $\phi_d$ is measured by the KL divergence;

the learned knowledge $\Delta\theta_{M+1}$ is expressed as:

$$\Delta\theta_{M+1} = \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$$

wherein $\{\Delta\theta_i\}_{i=1}^{M}$ are the M pieces of knowledge accumulated in the shared knowledge base shared by the multiple devices; $\alpha_i$ is the normalized learnable weighting coefficient of each piece of knowledge, namely $\sum_{i=1}^{M} \alpha_i = 1$; and $\Delta\theta_{new}$ is a newly introduced learnable parameter matrix.

Further, updating the model by using the online optimization objective and the backpropagation mechanism is expressed as:

$$\min_{\alpha,\ \Delta\theta_{new}} \mathcal{L}(x; \theta)$$

wherein $\mathcal{L}$ represents any online self-supervised or unsupervised optimization objective, $\theta$ denotes the parameters of the first model, $x$ is the target data to be adapted, and $\alpha$ is the normalized vector of learnable weighting coefficients;

different devices can adaptively adjust $\alpha$ and $\Delta\theta_{new}$ according to their own input data, thereby achieving efficient adaptation to the application scenario.

Further, for the edge device with low computing power, processing the second model in the device in the online collaborative adaptation mode that uses forward propagation only comprises the following steps:

inputting a batch of target images to be adapted, $\{x_i\}_{i=1}^{N}$, into the second model in the device;

calculating the similarity between the distribution $\phi$ of the currently input image data and each distribution $\phi_i$ stored in the shared knowledge base;

according to the calculated similarity, acquiring the knowledge learned on similar domains from the shared knowledge base to update the model, until all image data have been processed, whereupon online adaptation of the model stops.

Further, the similarity is calculated as:

$$\rho_i = \frac{1}{D(\phi, \phi_i) + \epsilon}$$

wherein $D(\cdot,\cdot)$ is the difference between data distributions, $\phi_i$ is the data distribution information $\phi_d$ recorded when $\Delta\theta_i$ was learned and saved in the shared knowledge base, and $\epsilon$ is a small constant that prevents numerical overflow.

The second technical scheme adopted by the invention is as follows:

An electronic device comprising a processor and a memory having stored therein at least one instruction, at least one program, code set or instruction set loaded and executed by the processor to implement a multi-device collaborative, efficient online adaptation method as described above.

The third technical scheme adopted by the invention is as follows:

A computer readable storage medium having stored therein at least one instruction, at least one program, code set, or instruction set loaded and executed by a processor to implement a multi-device collaborative, efficient online adaptation method as described above.

The fourth technical scheme adopted by the invention is as follows:

A computer program product or computer program comprising computer instructions stored in a computer-readable storage medium. A processor of a computer device may read the computer instructions from the computer-readable storage medium and execute them, causing the computer device to perform the method described above.

The beneficial effects of the invention are as follows. The invention provides an efficient online adaptation framework for multi-device collaboration in which the shared knowledge base is dynamically updated, realizing the accumulation, sharing and dynamic utilization of knowledge among multiple devices. In addition, by applying different online collaborative adaptation modes to devices with different computing power, the invention greatly improves the energy efficiency and practicality of online adaptation in practical applications.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following description is made with reference to the accompanying drawings of the embodiments of the present invention or the related technical solutions in the prior art, and it should be understood that the drawings in the following description are only for convenience and clarity of describing some embodiments in the technical solutions of the present invention, and other drawings may be obtained according to these drawings without the need of inventive labor for those skilled in the art.

FIG. 1 is a schematic flow chart of a multi-device collaborative efficient online adaptation method in an embodiment of the invention;

FIG. 2 is a flow diagram of an online collaborative adaptation mode based on forward and backward propagation in an embodiment of the invention;

FIG. 3 is a flow diagram of an online collaborative adaptation mode using only forward propagation in an embodiment of the present invention.

Detailed Description

Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention. The step numbers in the following embodiments are set for convenience of illustration only, and the order between the steps is not limited in any way, and the execution order of the steps in the embodiments may be adaptively adjusted according to the understanding of those skilled in the art.

In the description of the present invention, it should be understood that references to orientation descriptions such as upper, lower, front, rear, left, right, etc. are based on the orientation or positional relationship shown in the drawings, are merely for convenience of description of the present invention and to simplify the description, and do not indicate or imply that the apparatus or elements referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus should not be construed as limiting the present invention.

In the description of the present invention, "several" means one or more and "a plurality" means two or more; "greater than", "less than", "exceeding" and the like are understood to exclude the stated number, while "above", "below", "within" and the like are understood to include it. The terms "first" and "second" are used only to distinguish technical features and should not be construed as indicating or implying relative importance, the number of the indicated technical features, or their order of precedence.

In the description of the present invention, unless explicitly defined otherwise, terms such as arrangement, installation, connection, etc. should be construed broadly and the specific meaning of the terms in the present invention can be reasonably determined by a person skilled in the art in combination with the specific contents of the technical scheme.

In view of the problems in the prior art, the invention provides an efficient online adaptation framework for multi-device collaboration. It constructs a dynamic parameter saving mechanism based on a shared knowledge base to realize the accumulation, sharing and dynamic utilization of knowledge among multiple devices, and further provides two online collaborative adaptation methods, one based on forward and backward propagation and one based on forward propagation only, thereby improving the energy efficiency and practicality of online adaptation in practical applications.

Example 1

As shown in fig. 1, the present embodiment provides a multi-device collaborative efficient online adaptation method, which is used for implementing accumulation, sharing and dynamic utilization of knowledge among multiple devices, and different devices share the same shared knowledge base, and includes the following steps:

In this embodiment, the online collaborative adaptation mode based on forward and backward propagation targets devices with abundant computing resources, and involves the learning, accumulation, sharing and utilization of cross-domain knowledge. In this mode, a knowledge-base-based dynamic knowledge saving mechanism is first constructed: the difference between the historical data distribution and the current test data distribution is measured, and the learned weights are dynamically saved to the knowledge base when a domain shift occurs, realizing forgetting-free knowledge accumulation in cross-domain learning. Different devices share the same knowledge base, which effectively supports cross-device knowledge sharing and subsequent knowledge utilization and addresses the catastrophic forgetting to which the prior art is prone. Secondly, a weighting coefficient is introduced for each piece of shared knowledge, converting the problem of reusing shared knowledge into an adaptive learning problem over the weighting coefficients. Further, a new parameter matrix $\Delta\theta_{new}$ is introduced to capture new knowledge during adaptation, which decouples knowledge reuse from new-knowledge learning and alleviates the optimization conflict between the two. These parameters can be optimized with an online optimization objective and the backpropagation mechanism, realizing dynamic utilization of knowledge among multiple devices, improving adaptation efficiency and performance, and addressing the prior-art problems of high redundancy when different devices adapt to similar scenes, serious waste of computing resources, and poor online adaptation efficiency and performance.

In this embodiment, the online collaborative adaptation mode that uses forward propagation only targets low-computing-power edge devices and involves the utilization of shared knowledge. The main idea is to perform online adaptation by directly using the existing shared knowledge, thereby avoiding the high computational cost of knowledge learning. In this mode, the embodiment approximates the weighting-coefficient optimization problem of knowledge utilization as a domain matching process: based on a similarity analysis of data distributions, a higher utilization weight is given to knowledge learned while adapting to similar domains, which realizes optimization-free online dynamic adaptation. This mode reduces the computational cost of online adaptation, addresses the prior-art difficulty of applying online adaptation technology on edge devices with limited computing power, and effectively extends online adaptation technology to edge devices with limited computing resources, such as smartphones.

The embodiment of the invention extends independent single-device online adaptation to efficient online adaptation with multi-device collaboration, overcomes the catastrophic forgetting problem of online adaptation, realizes the accumulation, sharing and dynamic utilization of knowledge among multiple devices, improves the efficiency and performance of online adaptation, and enables the otherwise costly online adaptation technology to be applied on resource-limited edge devices. The embodiment has low communication cost and high scalability, and can have an important influence on model deployment and online adaptation technology, which are widely used in fields such as edge computing, the Internet of Things and distributed systems, thereby improving the energy efficiency and practicality of model online adaptation in applications.

The foregoing method is explained in detail below with reference to the drawings and specific embodiments.

(1) On-line collaborative adaptation mode based on forward and backward propagation

This mode targets devices with abundant computing resources and involves the learning, accumulation, sharing and utilization of cross-domain knowledge. As an example, referring to fig. 2, on a device using this mode, the algorithm comprises the following steps:

Step one: a batch of target images to be adapted, $\{x_i\}_{i=1}^{N}$, is input into the model. The model parameters $\theta$ are formally expressed as:

$$\theta = \theta^{0} + \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$$

wherein $\theta^{0}$ denotes the original parameters of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ are the M pieces of knowledge accumulated in the knowledge base shared by the multiple devices; $\alpha_i$ is the normalized learnable weighting coefficient of each piece of knowledge, namely $\sum_{i=1}^{M} \alpha_i = 1$; and $\Delta\theta_{new}$ is a newly introduced learnable parameter matrix. This scheme decouples knowledge utilization and knowledge updating into the learning of $\alpha$ and $\Delta\theta_{new}$, and effectively supports both cross-device knowledge utilization and the learning of new knowledge.
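The parameter composition in step one can be illustrated with a minimal NumPy sketch. It is only an illustration under stated assumptions: the function names (`softmax`, `compose_parameters`) are hypothetical, and normalizing $\alpha$ by applying a softmax to unconstrained logits is an assumption, since the text only requires that the coefficients sum to 1.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    z = np.asarray(z, dtype=float)
    e = np.exp(z - z.max())
    return e / e.sum()

def compose_parameters(theta0, knowledge, alpha_logits, delta_new):
    """theta = theta0 + sum_i alpha_i * dtheta_i + dtheta_new.

    Assumption: alpha is obtained from learnable logits via softmax,
    which guarantees sum_i alpha_i = 1.
    """
    if len(knowledge) == 0:           # empty knowledge base: only new knowledge
        return theta0 + delta_new
    alpha = softmax(alpha_logits)
    shared = sum(a * d for a, d in zip(alpha, knowledge))
    return theta0 + shared + delta_new

# Toy example with M = 2 pieces of shared knowledge.
theta0 = np.zeros((2, 2))
knowledge = [np.ones((2, 2)), 3.0 * np.ones((2, 2))]
alpha_logits = np.zeros(2)            # equal logits give alpha = [0.5, 0.5]
delta_new = np.zeros((2, 2))
theta = compose_parameters(theta0, knowledge, alpha_logits, delta_new)
print(theta)
```

In this sketch only `alpha_logits` and `delta_new` would be trainable; `theta0` and the entries of `knowledge` stay fixed, which mirrors the decoupling described above.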

Step two: it is detected whether a domain shift has currently occurred; if so, the learned knowledge is saved to the shared knowledge base.

Specifically, domain shift detection comprises the following procedure:

1) The distribution $\phi$ of the currently input data is measured by using the output features of the model's backbone network, and is calculated as:

$$\mu = \frac{1}{N}\sum_{i=1}^{N} B(x_i), \qquad \sigma^{2} = \frac{1}{N}\sum_{i=1}^{N} \bigl(B(x_i) - \mu\bigr)^{2}$$

wherein $B(\cdot)$ is the output feature of the backbone network, $\mu$ and $\sigma^{2}$ are the mean and variance of the features, and $\phi$ consists of $\mu$ and $\sigma^{2}$.
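As a minimal sketch of this measurement, the statistics can be computed per feature dimension over a batch of backbone outputs. The function name `batch_distribution` and the `(N, H)` array layout are assumptions for illustration, as is the use of the population variance (division by N).

```python
import numpy as np

def batch_distribution(features):
    """Return (mu, var) of backbone features.

    features: array of shape (N, H), one row B(x_i) per input sample.
    Assumption: variance is the population variance (1/N), per dimension.
    """
    mu = features.mean(axis=0)
    var = features.var(axis=0)
    return mu, var

# Two samples with H = 2 feature dimensions.
features = np.array([[1.0, 2.0],
                     [3.0, 6.0]])
mu, var = batch_distribution(features)
print(mu, var)
```

Only these two vectors need to be kept per batch, so no raw images have to be stored.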

2) The difference between the current input data distribution and the historical data distribution is measured. The historical data distribution $\phi_d$ is a moving average of the input data distribution, in which $\lambda$ is a balance factor that may be set to 0.8 in implementation. The difference metric $D(\cdot,\cdot)$ is the KL divergence between the data distributions under the assumption that each data distribution is Gaussian, where H is the dimension of the statistics. When the data distribution difference $D(\phi, \phi_d)$ exceeds a preset threshold, a domain shift is considered to have occurred in the current input data. This domain shift detection scheme does not need to save the original data, has low computation and storage costs, and can effectively maintain the efficiency of online adaptation.
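The detection procedure can be sketched as follows. This is a hedged illustration, not the patented formula: the direction of the KL divergence, the averaging over the H dimensions, the moving-average convention $\phi_d \leftarrow \lambda\phi_d + (1-\lambda)\phi$, the threshold value and all function names are assumptions. Only the closed-form KL divergence between univariate Gaussians is standard.

```python
import numpy as np

def gaussian_kl(mu_p, var_p, mu_q, var_q):
    """KL(N(mu_p, var_p) || N(mu_q, var_q)) per dimension, averaged over H.

    The per-dimension closed form is standard; averaging over the
    H dimensions is an assumption made for this sketch.
    """
    kl = 0.5 * (np.log(var_q / var_p) + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0)
    return float(kl.mean())

def ema_update(phi_d, phi, lam=0.8):
    """Assumed convention: phi_d <- lam * phi_d + (1 - lam) * phi."""
    return tuple(lam * h + (1.0 - lam) * c for h, c in zip(phi_d, phi))

def domain_shift_detected(phi, phi_d, threshold):
    """True when the current batch statistics diverge from the history."""
    return gaussian_kl(*phi, *phi_d) > threshold

H = 4
phi_d = (np.zeros(H), np.ones(H))              # historical (mu, var)
phi_same = (np.zeros(H), np.ones(H))           # batch from the same domain
phi_shifted = (np.full(H, 3.0), np.ones(H))    # batch with a shifted mean
THRESHOLD = 1.0                                # hypothetical value

print(domain_shift_detected(phi_same, phi_d, THRESHOLD))
print(domain_shift_detected(phi_shifted, phi_d, THRESHOLD))
phi_d = ema_update(phi_d, phi_shifted)         # fold the new batch into the history
```

With a balance factor of 0.8 the history moves slowly, so a single unusual batch changes $\phi_d$ only slightly while a sustained change accumulates.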

Specifically, the learned knowledge $\Delta\theta_{M+1}$ and the corresponding domain distribution $\phi_d$ are saved to the shared knowledge base. The learned knowledge is formally expressed as:

$$\Delta\theta_{M+1} = \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$$

The knowledge is shared with other devices immediately after being saved, and an additional learnable weighting coefficient $\alpha_{M+1}$ is introduced into the model parameters $\theta$, so that the formal representation of $\theta$ is updated to:

$$\theta = \theta^{0} + \sum_{i=1}^{M+1} \alpha_i \Delta\theta_i + \Delta\theta_{new}$$

Because $\Delta\theta_{M+1}$ is frozen once stored, the catastrophic forgetting caused by continuous online adaptation is effectively alleviated, knowledge is accumulated gradually, and the stability of online adaptation is improved. Further, decoupling knowledge utilization and updating into the learning of $\alpha$ and $\Delta\theta_{new}$ has the following benefits: a) frequent updates of the shared knowledge base are avoided, which effectively reduces the expensive communication cost of synchronizing it across devices and achieves communication-efficient multi-device collaborative online adaptation; b) the multiple devices collaborate solely through the knowledge base, so any device can join or leave the collaboration group at any time (using either the latest or a non-latest shared knowledge base), achieving fully asynchronous multi-device collaborative adaptation and improving the practicality and scalability of the invention in real scenarios.
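The saving step can be sketched as below, assuming the knowledge base is a plain Python list of `(delta_theta, phi_d)` pairs; this data layout and the name `save_knowledge` are hypothetical. Marking the stored array read-only is just one way to mimic the statement that saved knowledge is frozen.

```python
import numpy as np

def save_knowledge(knowledge_base, alpha, delta_new, phi_d):
    """Append dtheta_{M+1} = sum_i alpha_i * dtheta_i + dtheta_new.

    knowledge_base: list of (dtheta_i, phi_i) pairs (assumed layout).
    The merged array is made read-only to reflect that stored
    knowledge is no longer updated.
    """
    merged = np.array(delta_new, dtype=float)    # copy of the new knowledge
    for a, (delta_i, _) in zip(alpha, knowledge_base):
        merged += a * delta_i
    merged.setflags(write=False)
    knowledge_base.append((merged, phi_d))
    return merged

# Knowledge base with M = 1 entry; a shift has just been detected.
knowledge_base = [(np.ones((2, 2)), (np.zeros(3), np.ones(3)))]
alpha = np.array([1.0])                          # normalized, M = 1
delta_new = np.full((2, 2), 2.0)
phi_d = (np.full(3, 0.5), np.ones(3))            # domain statistics to store

saved = save_knowledge(knowledge_base, alpha, delta_new, phi_d)
print(len(knowledge_base), saved)
```

After this call a device would extend $\alpha$ by one coefficient for the new entry; how $\alpha_{M+1}$ and $\Delta\theta_{new}$ are re-initialized is not specified in the text and is left out of the sketch.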

Step three: a prediction result is generated, and the model is updated by using an online optimization objective and the backpropagation mechanism, formally expressed as:

$$\min_{\alpha,\ \Delta\theta_{new}} \mathcal{L}(x; \theta)$$

wherein $\mathcal{L}$ represents any online self-supervised or unsupervised optimization objective. This patent focuses on breaking through the collaboration bottleneck of online adaptation among multiple devices, and can be flexibly combined with existing advanced online optimization objectives. Preferably, the entropy minimization loss can be used for parameter optimization, in the mathematical form $\mathcal{L}(x_i; \theta) = -\sum_{c} f_\theta(x_i)_c \log f_\theta(x_i)_c$, wherein $f_\theta(x_i)$ is the model output and $c$ indexes its classes. When the model is optimized with this loss, the devices share the same knowledge base, but different devices can adaptively adjust $\alpha$ and $\Delta\theta_{new}$ according to their own input data, thereby achieving efficient adaptation to the application scenario.
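To make the entropy objective concrete, the following sketch evaluates the Shannon entropy of softmax predictions with NumPy. It only computes the loss value; the gradient step on $\alpha$ and $\Delta\theta_{new}$ would be done by an automatic differentiation framework and is not shown. The function names, the batch averaging and the small constant inside the logarithm are assumptions of this sketch.

```python
import numpy as np

def softmax(logits):
    """Row-wise numerically stable softmax for an (N, C) array."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def entropy_loss(logits):
    """Mean Shannon entropy of the predicted class distributions.

    Averaging over the batch and adding 1e-12 inside the log
    are choices made for this sketch.
    """
    p = softmax(logits)
    return float(-(p * np.log(p + 1e-12)).sum(axis=1).mean())

uniform = np.zeros((4, 3))                       # maximally uncertain predictions
confident = np.array([[10.0, 0.0, 0.0]] * 4)     # sharply peaked predictions

print(entropy_loss(uniform))      # close to log(3)
print(entropy_loss(confident))    # much smaller
```

Minimizing this quantity pushes predictions toward the confident case, which is the sense in which the loss reduces decision uncertainty.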

Step four: steps one to three are repeated until all test data have been input, whereupon online adaptation of the model stops.

(2) Online collaborative adaptation mode with forward propagation only

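This mode targets low-computing-power edge devices and involves the utilization of shared knowledge. As an alternative embodiment, referring to fig. 3, on a device using this mode, the algorithm comprises the following steps:

Step one: a batch of target images to be adapted, $\{x_i\}_{i=1}^{N}$, is input into the model. The model parameters $\theta$ are formally expressed as:

$$\theta = \theta^{0} + \sum_{i=1}^{M} \gamma_i \Delta\theta_i$$

wherein $\theta^{0}$ denotes the original parameters of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ are the M pieces of knowledge accumulated in the knowledge base shared by the multiple devices; and $\gamma_i$ is the normalized weighting coefficient of each piece of knowledge, namely $\sum_{i=1}^{M} \gamma_i = 1$, which supports the reuse of shared knowledge across devices.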

Step two: the similarity between the distribution $\phi$ of the currently input data and each distribution $\phi_i$ stored in the shared knowledge base is measured. Here $\phi$ is calculated in the same way as above, and $\phi_i$ is the data distribution information $\phi_d$ recorded when $\Delta\theta_i$ was learned and saved in the shared knowledge base. The data distribution similarity $\rho_i$ is calculated as:

$$\rho_i = \frac{1}{D(\phi, \phi_i) + \epsilon}$$

where $D(\cdot,\cdot)$ is the data distribution difference metric defined above, and $\epsilon$ is a small constant that prevents numerical overflow.

Step three: based on the distribution similarity, knowledge acquired from similar domains in the shared knowledge base is preferentially selected. Specifically, $\gamma$ is formally defined as:

$$\gamma = \mathrm{softmax}(\rho)$$

By converting the online adaptation problem of the model into a dynamic utilization process of shared knowledge, and approximating the weighting-coefficient optimization problem as a domain matching process, this embodiment avoids the high computational cost of knowledge learning and coefficient optimization, reduces the computational cost of online adaptation, and realizes optimization-free online dynamic adaptation, thereby extending the otherwise costly online adaptation technology to edge devices with limited computing resources, such as smartphones.
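The forward-only mode can be sketched end to end as below. The sketch assumes that the similarity is the reciprocal of the distribution difference plus $\epsilon$, that the difference is a Gaussian KL divergence averaged over dimensions, and that the knowledge base is a list of `(delta_theta, phi_i)` pairs; these choices and all function names are hypothetical illustrations rather than the definitive formulas.

```python
import numpy as np

def gaussian_kl(mu_p, var_p, mu_q, var_q):
    """Gaussian KL divergence per dimension, averaged (assumed metric D)."""
    kl = 0.5 * (np.log(var_q / var_p) + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0)
    return float(kl.mean())

def softmax(z):
    z = np.asarray(z, dtype=float)
    e = np.exp(z - z.max())
    return e / e.sum()

def forward_only_parameters(theta0, knowledge_base, phi, eps=1e-6):
    """theta = theta0 + sum_i gamma_i * dtheta_i with gamma = softmax(rho).

    Assumption: rho_i = 1 / (D(phi, phi_i) + eps). No gradients are used.
    """
    rho = np.array([1.0 / (gaussian_kl(*phi, *phi_i) + eps)
                    for _, phi_i in knowledge_base])
    gamma = softmax(rho)
    theta = theta0 + sum(g * d for g, (d, _) in zip(gamma, knowledge_base))
    return theta, gamma

H = 2
knowledge_base = [
    (np.full((2, 2), 1.0), (np.zeros(H), np.ones(H))),        # knowledge from domain 1
    (np.full((2, 2), -1.0), (np.full(H, 5.0), np.ones(H))),   # knowledge from domain 2
]
theta0 = np.zeros((2, 2))
phi = (np.full(H, 0.1), np.ones(H))      # current batch resembles domain 1

theta, gamma = forward_only_parameters(theta0, knowledge_base, phi)
print(gamma)
print(theta)
```

Because the reciprocal grows quickly as the divergence approaches zero, the softmax in this sketch concentrates almost all weight on the closest stored domain; the whole update costs one statistics computation and M divergence evaluations per batch.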

In general, the invention aims to break through the bottlenecks of existing online adaptation technology: it is limited to independent single-device adaptation, suffers from prominent catastrophic forgetting, can hardly use the useful knowledge of other devices, has poor online adaptation performance and efficiency, and cannot be applied to edge devices. To this end, the invention provides an efficient online adaptation method for multi-device collaboration that realizes the accumulation, sharing and dynamic utilization of knowledge among multiple devices, with two collaboration modes:

1) The online collaborative adaptation mode based on forward and backward propagation. This mode reconstructs the model parameters as $\theta = \theta^{0} + \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$, thereby decoupling the reuse and updating of knowledge into the learning of $\alpha$ and $\Delta\theta_{new}$, which alleviates the optimization conflict between the two processes and supports both cross-device knowledge utilization and the learning of new knowledge. Further, a knowledge-base-based dynamic knowledge saving mechanism is constructed: when a domain shift occurs, the newly learned knowledge $\Delta\theta_{M+1}$ is saved to the shared knowledge base, which realizes gradual accumulation of knowledge, alleviates catastrophic forgetting under continuous online adaptation, supports efficient cross-device knowledge sharing, and improves the stability and efficiency of online adaptation.

2) The online collaborative adaptation mode using forward propagation only. This mode reconstructs the model parameters as $\theta = \theta^{0} + \sum_{i=1}^{M} \gamma_i \Delta\theta_i$, dynamically utilizes the knowledge shared by different devices, and solves for $\gamma_i$ through domain similarity analysis instead of optimization. This avoids the high computational cost of knowledge learning and coefficient optimization, so that the otherwise costly online adaptation technology is extended to edge devices with limited computing resources, such as smartphones.

In summary, the method extends independent single-device online adaptation to efficient online adaptation with multi-device collaboration, overcomes the catastrophic forgetting problem of online adaptation, realizes the accumulation, sharing and dynamic utilization of knowledge among multiple devices, improves the efficiency and performance of online adaptation, and enables the otherwise costly online adaptation technology to be applied on resource-limited edge devices. The method has low communication cost and high scalability, and can have an important influence on model deployment and online adaptation technology, which are widely used in fields such as edge computing, the Internet of Things and distributed systems, improving the energy efficiency and practicality of model online adaptation in applications.

Example 2

The embodiment of the invention also provides electronic equipment, which comprises a processor and a memory, wherein at least one instruction, at least one section of program, a code set or an instruction set is stored in the memory, and the at least one instruction, the at least one section of program, the code set or the instruction set is loaded and executed by the processor so as to realize the efficient online adaptation method of multi-equipment cooperation as shown in fig. 1.

It is understood that the Memory may include random access Memory (Random Access Memory, RAM) or Read-Only Memory (ROM). Optionally, the memory includes a non-transitory computer readable medium (non-transitory computer-readable storage medium). The memory may be used to store instructions, programs, code sets, or instruction sets. The memory may include a stored program area that may store instructions for implementing an operating system, instructions for at least one function, instructions for implementing the various method embodiments described above, and the like, and a stored data area that may store data created according to the use of the server, and the like.

The processor may include one or more processing cores. The processor uses various interfaces and lines to connect the various parts of the overall server, and performs the functions of the server and processes data by running or executing the instructions, programs, code sets, or instruction sets stored in the memory and by invoking the data stored in the memory. Optionally, the processor may be implemented in hardware in at least one of the following forms: digital signal processing (DSP), field-programmable gate array (FPGA), and programmable logic array (PLA). The processor may integrate one or a combination of a central processing unit (CPU), a modem, and the like. The CPU mainly handles the operating system, application programs and the like, and the modem handles wireless communication. It will be appreciated that the modem may also not be integrated into the processor and may instead be implemented by a separate chip.

Because the electronic device corresponds to the efficient online adaptation method for multi-device collaboration according to the embodiment of the present invention, and solves the problem on a principle similar to that of the method, the implementation of the electronic device may refer to the implementation of the method embodiment, and repeated description is omitted.

Example 3

An embodiment of the present invention also provides a computer-readable storage medium storing at least one instruction, at least one program, a code set, or an instruction set, the at least one instruction, the at least one program, the code set, or the instruction set being loaded and executed by a processor to implement the efficient online adaptation method for multi-device collaboration shown in Fig. 1.

Those of ordinary skill in the art will appreciate that all or part of the steps of the methods of the above embodiments may be implemented by a program that instructs the associated hardware. The program may be stored in a computer-readable storage medium, including read-only memory (ROM), random access memory (RAM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), one-time programmable read-only memory (OTPROM), electrically erasable programmable read-only memory (EEPROM), compact disc read-only memory (CD-ROM) or other optical disk storage, magnetic disk storage, tape storage, or any other medium that can be used for carrying or storing data.

Because the storage medium corresponds to the efficient online adaptation method for multi-device collaboration according to the embodiment of the present invention, and solves the problem on a principle similar to that of the method, the implementation of the storage medium may refer to the implementation of the method embodiment, and repeated description is omitted.

Example 4

In some possible implementations, aspects of the method of the embodiments of the present application may also be implemented in the form of a program product comprising program code which, when the program product is run on a computer device, causes the computer device to carry out the steps of the efficient online adaptation method for multi-device collaboration according to the various exemplary embodiments of the present application described above. The executable computer program code or "code" for performing the various embodiments may be written in a high-level programming language such as C, C++, C#, Smalltalk, Java, JavaScript, Visual Basic, Structured Query Language (e.g., Transact-SQL), or Perl, or in a variety of other programming languages.

It is to be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, the various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, they may be implemented using any one or a combination of the following techniques known in the art: discrete logic circuits with logic gates for implementing logic functions on data signals, application-specific integrated circuits with appropriate combinational logic gates, programmable gate arrays (PGAs), field-programmable gate arrays (FPGAs), and the like.

In the description of the present specification, a description referring to the terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, the different embodiments or examples described in this specification, and the features of the different embodiments or examples, may be combined by those skilled in the art without contradiction.

The above embodiments are only for illustrating the technical concept and features of the present invention, and are intended to enable those skilled in the art to understand the content of the present invention and implement the same, and are not intended to limit the scope of the present invention. All equivalent changes or modifications made in accordance with the essence of the present invention are intended to be included within the scope of the present invention.

Claims

1. An efficient online adaptation method for multi-device collaboration, characterized in that it is used to realize the accumulation, sharing and dynamic utilization of knowledge among multiple devices, and different devices share the same shared knowledge base, including the following steps:

For devices with abundant computing resources, an online collaborative adaptation mode based on forward and back propagation is used to process the first model in the device: when domain shift is detected, the weights learned by the model are dynamically saved to the shared knowledge base; a new parameter matrix is introduced into the first model parameters to capture new knowledge during the adaptation process;

For devices with low computing power, the second model in the device is processed in an online collaborative adaptation mode that only uses forward propagation: the model directly uses the shared knowledge base for online adaptation.

2. The efficient online adaptation method for multi-device collaboration according to claim 1, characterized in that the parameters of the first model are expressed as:

$\theta = \theta^0 + \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$

Where $\theta^0$ is the original parameter of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ is the M pieces of knowledge accumulated in the shared knowledge base shared by multiple devices; $\alpha_i$ is the learnable weight coefficient, normalized over the different pieces of knowledge, i.e. $\sum_{i=1}^{M} \alpha_i = 1$; and $\Delta\theta_{new}$ is a newly introduced learnable parameter matrix.

3. The efficient online adaptation method for multi-device collaboration according to claim 1, characterized in that the parameters of the second model are expressed as:

$\theta = \theta^0 + \sum_{i=1}^{M} \gamma_i \Delta\theta_i$

In the formula, $\theta^0$ is the original parameter of the model; $\{\Delta\theta_i\}_{i=1}^{M}$ is the M pieces of knowledge accumulated in the shared knowledge base shared by multiple devices; $\gamma_i$ is the learnable weight coefficient, normalized over the different pieces of knowledge, i.e. $\sum_{i=1}^{M} \gamma_i = 1$, and is used to support the reuse of shared knowledge across devices.
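The parameter expressions of claims 2 and 3 can be illustrated with a minimal Python sketch. It assumes that parameters are flat lists of floats and that the coefficients are normalized by dividing by their sum; the names `normalize` and `compose_parameters` are hypothetical and do not come from the patent.

```python
from typing import List, Optional, Sequence

Vector = List[float]


def normalize(weights: Sequence[float]) -> List[float]:
    """Rescale non-negative weights so that they sum to 1 (assumed normalization)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return [w / total for w in weights]


def compose_parameters(
    theta0: Sequence[float],
    deltas: Sequence[Sequence[float]],
    coeffs: Sequence[float],
    delta_new: Optional[Sequence[float]] = None,
) -> Vector:
    """Return theta0 + sum_i coeffs[i] * deltas[i], plus delta_new if given.

    With delta_new this mirrors the first-model expression (claim 2);
    without it, the second-model expression (claim 3).
    """
    if len(deltas) != len(coeffs):
        raise ValueError("one coefficient is required per piece of knowledge")
    theta = list(theta0)
    for coeff, delta in zip(coeffs, deltas):
        for j, value in enumerate(delta):
            theta[j] += coeff * value
    if delta_new is not None:
        for j, value in enumerate(delta_new):
            theta[j] += value
    return theta


if __name__ == "__main__":
    base = [1.0, 2.0]
    knowledge = [[1.0, 0.0], [0.0, 2.0]]
    gammas = normalize([1.0, 3.0])
    # Forward-only device: reuse shared knowledge only.
    print(compose_parameters(base, knowledge, gammas))
    # Device with back-propagation: also add its own new parameter matrix.
    print(compose_parameters(base, knowledge, gammas, [0.5, -0.5]))
```

In this sketch the only difference between the two device types is whether `delta_new` is supplied, which reflects the claim language that the low-power device uses the existing shared knowledge base directly.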

4. The efficient online adaptation method for multi-device collaboration according to claim 1, wherein the first model in the device is processed using an online collaborative adaptation mode based on forward and backward propagation, comprising:

Input a batch of target images to be adapted into the first model in the device;

Detect whether domain shift has occurred and, if so, save the learned knowledge to the shared knowledge base;

Generate prediction results based on the input images, and update the model using an online optimization objective and the back-propagation mechanism, until all image data have been processed and the model stops online adaptation.

5. The efficient online adaptation method for multi-device collaboration according to claim 4, characterized in that detecting whether domain shift has occurred and, if so, saving the learned knowledge to the shared knowledge base comprises:

Use the output features of the model backbone network to measure the distribution of the current input image data;

Calculate the difference between the current input image data distribution and the historical data distribution $\phi_d$;

If the difference is greater than a preset threshold, it is determined that domain shift has occurred in the current input image data, and the learned knowledge $\Delta\theta_{M+1}$ and the corresponding domain distribution $\phi_d$ are saved to the shared knowledge base.
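The threshold test of claim 5 can be sketched as follows. This is a minimal illustration under stated assumptions: the `SharedKnowledgeBase` class, the `maybe_save_on_shift` function and the scalar `(mean, variance)` representation of a distribution are hypothetical, and the difference measure is passed in as a function rather than fixed.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence, Tuple

# A distribution is summarized here as (mean, variance); scalars keep the sketch short.
Distribution = Tuple[float, float]


@dataclass
class SharedKnowledgeBase:
    """Hypothetical store of (learned knowledge, domain distribution) pairs."""

    entries: List[Tuple[List[float], Distribution]] = field(default_factory=list)

    def save(self, knowledge: Sequence[float], distribution: Distribution) -> None:
        self.entries.append((list(knowledge), distribution))


def maybe_save_on_shift(
    current: Distribution,
    historical: Distribution,
    learned_knowledge: Sequence[float],
    knowledge_base: SharedKnowledgeBase,
    difference: Callable[[Distribution, Distribution], float],
    threshold: float,
) -> bool:
    """Save the learned knowledge with the historical distribution if a shift is detected.

    Returns True when the difference exceeds the threshold (domain shift).
    """
    if difference(current, historical) > threshold:
        # The knowledge was learned on the previous domain, so it is stored
        # together with the historical distribution of that domain.
        knowledge_base.save(learned_knowledge, historical)
        return True
    return False


if __name__ == "__main__":
    kb = SharedKnowledgeBase()
    mean_gap = lambda a, b: abs(a[0] - b[0])  # stand-in difference measure
    print(maybe_save_on_shift((5.0, 1.0), (0.0, 1.0), [0.1, 0.2], kb, mean_gap, 1.0))
    print(len(kb.entries))
```

Passing the difference measure as a parameter keeps the sketch independent of the specific divergence; the claims that follow specify KL divergence for this role.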

6. The efficient online adaptation method for multi-device collaboration according to claim 5, characterized in that the current input image data distribution is calculated as follows:

In the formula, $B(\cdot)$ is the output feature of the backbone network, $x_i$ is the input data, and $N$ is the number of data samples; the mean and the variance of the features together compose the current input image data distribution.
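As an illustration of the statistics named in claim 6, the sketch below computes a per-dimension mean and variance over a batch of backbone features. The formula itself is not reproduced in the text, so the choice of the biased (divide by N) variance and the function name `feature_statistics` are assumptions.

```python
from typing import List, Sequence, Tuple


def feature_statistics(
    features: Sequence[Sequence[float]],
) -> Tuple[List[float], List[float]]:
    """Return the per-dimension mean and variance of a batch of feature vectors.

    `features[i]` stands for B(x_i), the backbone output for the i-th input.
    The variance divides by N (biased estimate), which is an assumption here.
    """
    n = len(features)
    if n == 0:
        raise ValueError("at least one feature vector is required")
    dim = len(features[0])
    mean = [sum(f[j] for f in features) / n for j in range(dim)]
    variance = [
        sum((f[j] - mean[j]) ** 2 for f in features) / n for j in range(dim)
    ]
    return mean, variance


if __name__ == "__main__":
    batch = [[1.0, 2.0], [3.0, 6.0]]
    print(feature_statistics(batch))
```

The returned pair plays the role of the current input image data distribution in the later comparison against the historical distribution.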

7. The efficient online adaptation method for multi-device collaboration according to claim 5, characterized in that the historical data distribution $\phi_d$ is a moving average of the input image data distribution, expressed as:

Where $\lambda$ is the balance factor;

The difference between the current input image data distribution and the historical data distribution $\phi_d$ is measured by KL divergence;

The learned knowledge $\Delta\theta_{M+1}$ is expressed as:

$\Delta\theta_{M+1} = \sum_{i=1}^{M} \alpha_i \Delta\theta_i + \Delta\theta_{new}$

In the formula, $\{\Delta\theta_i\}_{i=1}^{M}$ is the M pieces of knowledge accumulated in the shared knowledge base shared by multiple devices; $\alpha_i$ is the learnable weight coefficient, normalized over the different pieces of knowledge, i.e. $\sum_{i=1}^{M} \alpha_i = 1$; and $\Delta\theta_{new}$ is a newly introduced learnable parameter matrix.
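The moving average and the KL divergence of claim 7 can be sketched as below. The exact expressions are not reproduced in the text, so two assumptions are made: the moving average weights the historical value by the balance factor and the current value by one minus the balance factor, and each distribution is treated as a diagonal Gaussian given by its mean and variance. The function names are hypothetical.

```python
import math
from typing import List, Sequence


def ema_update(
    historical: Sequence[float], current: Sequence[float], lam: float
) -> List[float]:
    """Moving average: lam * historical + (1 - lam) * current (assumed form)."""
    return [lam * h + (1.0 - lam) * c for h, c in zip(historical, current)]


def gaussian_kl(
    mean_p: Sequence[float],
    var_p: Sequence[float],
    mean_q: Sequence[float],
    var_q: Sequence[float],
    eps: float = 1e-8,
) -> float:
    """KL(P || Q) between two diagonal Gaussians, summed over dimensions."""
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        vp = max(vp, eps)  # guard against zero variance
        vq = max(vq, eps)
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


if __name__ == "__main__":
    # Identical distributions have zero divergence; a mean gap increases it.
    print(gaussian_kl([0.0], [1.0], [0.0], [1.0]))
    print(gaussian_kl([1.0], [1.0], [0.0], [1.0]))
    print(ema_update([0.0, 2.0], [1.0, 2.0], 0.5))
```

KL divergence is asymmetric, so the order of the two arguments matters; which distribution takes which role is not stated in the text and is left to the caller in this sketch.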

8. The efficient online adaptation method for multi-device collaboration according to claim 4, characterized in that updating the model using the online optimization objective and the back-propagation mechanism is expressed as:

$\min_{\alpha,\, \Delta\theta_{new}} \mathcal{L}(x; \theta)$

In the formula, $\mathcal{L}$ represents any online self-supervised or unsupervised optimization objective; $\theta$ is the parameter of the first model; $x$ is the target dataset to be adapted; and $\alpha$ is the normalized learnable weight coefficient vector;

Different devices can adaptively adjust $\alpha$ and $\Delta\theta_{new}$ according to their own input data to achieve efficient adaptation to their application scenarios.
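Claim 8 leaves the optimization objective open. As one possible example, the sketch below computes the mean prediction entropy of a batch, an unsupervised objective commonly used in test-time adaptation. Choosing entropy is an assumption and is not stated in the patent; the sketch only evaluates the loss and does not perform the back-propagation update.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one vector of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mean_entropy(batch_logits: Sequence[Sequence[float]]) -> float:
    """Average Shannon entropy of the softmax predictions in a batch."""
    if not batch_logits:
        raise ValueError("batch must not be empty")
    total = 0.0
    for logits in batch_logits:
        probs = softmax(logits)
        total -= sum(p * math.log(p) for p in probs if p > 0.0)
    return total / len(batch_logits)


if __name__ == "__main__":
    # Confident predictions have lower entropy than uniform ones.
    print(mean_entropy([[10.0, 0.0]]))
    print(mean_entropy([[0.0, 0.0]]))
```

In a real system this loss would be minimized with respect to the weight coefficients and the new parameter matrix by an automatic differentiation framework; that part is outside the scope of this sketch.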

9. The efficient online adaptation method for multi-device collaboration according to claim 1, characterized in that, for edge devices with low computing power, an online collaborative adaptation mode using only forward propagation is used to process the second model in the device, comprising:

Input a batch of target images to be adapted into the second model in the device;

Calculate the similarity between the current input image data distribution and the distributions stored in the shared knowledge base;

According to the calculated similarity, the knowledge learned from similar domains is obtained from the shared knowledge base to update the model until all image data are processed and the model stops online adaptation.

10. The efficient online adaptation method for multi-device collaboration according to claim 9, wherein the similarity calculation formula is as follows:

Where $D(\cdot,\cdot)$ is the difference in data distribution; $\phi_i$ is the data distribution information $\phi_d$ recorded when $\Delta\theta_i$ was learned and stored in the shared knowledge base; and $\epsilon$ is a small constant that prevents numerical overflow.
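The similarity formula of claim 10 is not reproduced in the text. The sketch below shows one form that is consistent with the symbol definitions: the similarity is taken as the inverse of the distribution difference plus a small constant, and the similarities are then normalized to sum to 1 so that they can serve as the weight coefficients of the second model. Both the inverse form and the function name `similarity_weights` are assumptions.

```python
from typing import List, Sequence


def similarity_weights(
    differences: Sequence[float], eps: float = 1e-8
) -> List[float]:
    """Turn distribution differences D(current, phi_i) into normalized weights.

    Assumed form: s_i = 1 / (D_i + eps), weight_i = s_i / sum_j s_j.
    A smaller difference (more similar domain) yields a larger weight.
    """
    if not differences:
        raise ValueError("the shared knowledge base must not be empty")
    if any(d < 0 for d in differences):
        raise ValueError("differences must be non-negative")
    similarities = [1.0 / (d + eps) for d in differences]
    total = sum(similarities)
    return [s / total for s in similarities]


if __name__ == "__main__":
    # The first stored domain is three times closer than the second one.
    print(similarity_weights([1.0, 3.0]))
```

Because the weights come from a closed-form computation on feature statistics, no gradient is needed, which matches the forward-propagation-only mode described for low-power devices.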