[go: up one dir, main page]

WO2021114262A1 - Facial image clustering method and apparatus, and computer-readable storage medium - Google Patents

Facial image clustering method and apparatus, and computer-readable storage medium Download PDF

Info

Publication number
WO2021114262A1
WO2021114262A1 PCT/CN2019/125298 CN2019125298W WO2021114262A1 WO 2021114262 A1 WO2021114262 A1 WO 2021114262A1 CN 2019125298 W CN2019125298 W CN 2019125298W WO 2021114262 A1 WO2021114262 A1 WO 2021114262A1
Authority
WO
WIPO (PCT)
Prior art keywords
clustered
face image
deep
data set
clustering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2019/125298
Other languages
French (fr)
Chinese (zh)
Inventor
陈文胜
曾倩文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen University
Original Assignee
Shenzhen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen University filed Critical Shenzhen University
Priority to PCT/CN2019/125298 priority Critical patent/WO2021114262A1/en
Publication of WO2021114262A1 publication Critical patent/WO2021114262A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • This application relates to the field of face recognition technology, and in particular to a method, device and computer-readable storage medium for clustering face images.
  • Face clustering is usually to cluster the face image information in the database into some different sub-categories, so that the similarity between the sub-categories is as small as possible, and the similarity within the sub-categories is as large as possible.
  • the subcategories with high similarity to the searched target are identified one by one, and several records with the greatest similarity are retrieved.
  • NMF Non-negative Matrix Factorization
  • the essence of the NMF algorithm is to approximately decompose the non-negative matrix X into the product of the base image matrix W and the coefficient matrix H, that is, X ⁇ WH, and both W and H are non-negative matrices.
  • each column of matrix X can be expressed as a non-negative linear combination of matrix W column vectors, which is also in line with the construction basis of the NMF algorithm-the perception of the whole is composed of the perception of the parts that make up the whole (pure additive) .
  • scholars have proposed many algorithms for deforming NMF, such as local NMF algorithm that strengthens local restrictions, discriminative NMF algorithm that integrates discriminant information, and symmetric NMF algorithm for symmetric matrices.
  • the NMF algorithm and its variants have achieved certain results, the method only considers the shallow information of the data. For data with rich features, the single-layer structure decomposed at one time cannot learn the representation of features from multiple angles.
  • Deep learning has established a deep neural network with a hierarchical structure, which brings hope to solving optimization problems related to deep structure.
  • Deep NMF Deep Non-negative Matrix Factorization
  • this deep decomposition method can explore the underlying feature representation in complex data, so as to extract features that are more complete and more discriminative than single-layer learning.
  • the existing Deep NMF model has a deep layered structure
  • this structure is generally constructed by simply reusing the single-layer NMF algorithm, and its performance does not meet the ideal requirements, and the calculation efficiency of the Deep NMF calculation method is not High, poor performance in face data clustering tasks.
  • none of these methods are produced by using deep neural networks, so they cannot use the powerful feature expression and clustering capabilities of deep neural networks.
  • the main purpose of this application is to provide a face image clustering method, device, and computer-readable storage medium, aiming to solve the technical problem of poor feature expression ability and clustering ability of Deep NMF.
  • the face image clustering method includes the following steps:
  • the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image.
  • the present application also provides a face image clustering device, the face image clustering device includes: a memory, a processor, and stored in the memory and running on the processor When the computer-readable instructions are executed by the processor, the following steps are implemented:
  • the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image.
  • This application acquires the training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered; inputting the training sample data and the highest feature amount to
  • the BP-Deep NMF model is trained, and the base image amount of the face image to be clustered is determined; based on the base image amount and the data set to be clustered, the aggregation is performed by preset
  • the class rule classifies the face images to be clustered, determines the clustering results of the data sets to be clustered, and obtains the classification results of the face images to be clustered.
  • a high-performance depth based on BP neural network is proposed.
  • Non-negative matrix factorization (BP-Deep NMF) model uses radial basis functions (RBF) to construct the input signal of the neural network. This input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and uses the original
  • RBF radial basis functions
  • the training data is used as the expected output of the network, and the optimization of the model adopts the BP neural network algorithm to update the network weight matrix.
  • the finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.
  • FIG. 1 is a schematic structural diagram of a face image clustering device in a hardware operating environment involved in a solution of an embodiment of the present application;
  • FIG. 2 is a schematic flowchart of the first embodiment of the applicant's face image clustering method.
  • FIG. 1 is a schematic structural diagram of a face image clustering apparatus in a hardware operating environment involved in a solution of an embodiment of the present application.
  • the face image clustering apparatus may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002.
  • the communication bus 1002 is used to implement connection and communication between these components.
  • the user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface).
  • the memory 1005 may be a high-speed RAM memory, or a non-volatile memory (non-volatile memory), such as a magnetic disk memory.
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001.
  • FIG. 1 does not constitute a limitation on the face image clustering device, and may include more or less components than shown in the figure, or a combination of certain components. Components, or different component arrangements.
  • the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and a face image clustering program.
  • the network interface 1004 is mainly used to connect to the back-end server and communicate with the back-end server; the user interface 1003 is mainly used to connect to the client (user side) and conduct data with the client Communication; and the processor 1001 can be used to call the face image clustering program stored in the memory 1005.
  • the face image clustering device includes: a memory 1005, a processor 1001, and a face image clustering program stored on the memory 1005 and running on the processor 1001, wherein the processor When 1001 calls the face image clustering program stored in the memory 1005, and performs the following operations:
  • the face images to be clustered are classified by preset clustering rules, and the clustering result of the data set to be clustered is determined to obtain the data set to be clustered.
  • the classification result of the human-like face image is determined to obtain the data set to be clustered.
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the non-negative weight of each neuron in the BP-Deep NMF model is determined by the projection of the projection gradient method
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the second linear parameter Based on the training sample data and the non-negative weights, determine the second linear parameter corresponding to the activation function in the BP-Deep NMF model, and use the second linear parameter as the first linear parameter to perform all Said determining the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data and the first weight;
  • the first linear parameter and the first weight determine the step of determining the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model.
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the number of training times reaches the maximum number of training times or the loss function value is less than or equal to the model error threshold, stop training the BP-Deep NMF model, and use the non-negative weight as the second weight to determine the The second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and the second weight is used as the base image amount of the face image.
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the highest feature amount corresponding to the training sample data is determined through a preset calculation method.
  • the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:
  • the face image to be clustered is classified by a preset clustering rule, and the clustering result of the data set to be clustered is determined to obtain the clustering result of the face image.
  • FIG. 2 is a schematic flowchart of the first embodiment of the method in this application.
  • the face image clustering method includes the following steps:
  • a system architecture to which the embodiment of the present application is applicable includes but is not limited to one server or multiple servers.
  • the server may be a network device such as a computer.
  • the server can be an independent device or a server cluster formed by multiple servers.
  • the server can use cloud computing technology for information processing.
  • this patent introduces the neural network algorithm into the Deep NMF field, and proposes a new high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network. First, deep non-negative matrix factorization is constructed.
  • BP-Deep NMF deep non-negative matrix factorization
  • BP-Deep NMF BP-Deep NMF
  • the BP-Deep NMF method uses the radial basis function (RBF) to construct the input signal of the neural network.
  • the input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network.
  • the optimization of the model uses the BP neural network algorithm to update the network weight matrix.
  • the trained neural network model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition.
  • the BP-Deep NMF model proposed in this application shows superior performance in the task of clustering face data.
  • Deep non-negative matrix factorization (Deep NMF): NMF is only a single-layer decomposition learning process. It learns the base matrix W and the feature matrix H by decomposing the data matrix X.
  • the deep non-negative matrix factorization (Deep NMF) can further capture the intuitive hierarchical feature information hidden in the data set.
  • the basic idea is: the matrix H 1 obtained by the first shallow decomposition of the matrix X can be decomposed into the matrices W 2 and H 2 again , thereby expanding the original single-layer structure into a two-layer structure.
  • the shallow structure model can eventually be expanded to a multi-layer (deep) structure model.
  • the Deep NMF model decomposes the non-negative data matrix X into L+1 non-negative factor matrices, namely:
  • the hidden attributes H 1 ,..., H L of the data learned by the Deep NMF model can be expressed as H 1 ⁇ W 2 ...W L H L , H 2 ⁇ W 3 ...W L H L ,. .., H L-1 ⁇ W L H L , said H 1 ,..., H L is the characteristic matrix under non-negative constraints.
  • the Deep NMF algorithm In order to reduce the total reconstruction error of the entire model, the Deep NMF algorithm generally performs fine-tuning of the entire network model after the layer-by-layer decomposition is completed, that is, the following loss function based on the F-norm is minimized:
  • BP neural network Back-Propagation Algorithm of Neural Network: The neural network continuously changes the weight of the network through the sample signal, adjusts the error value of the network, and finally the output error reaches the expected error range.
  • the BP neural network is a multi-layer feedforward network based on error back propagation, while the standard BP neural network includes an input layer, a hidden layer, and an output layer. The layers are connected to each other through neurons. The neurons in the same layer of the network are not connected to each other.
  • Step S10 Obtain the training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered;
  • the face image used to train the BP-Deep NMF model is first converted into training sample data, and the radial basis function (RBF) is used to convert the training sample data into The highest feature quantity, the input signal of the radial basis function (RBF) to construct the neural network is the highest feature quantity.
  • This input is equivalent to the highest level feature in the deep non-negative matrix factorization.
  • the face image is used for training BP-Deep NMF
  • the facial image information of the model includes facial image information with different expressions, different identities, or different facial features. After the face image is converted into training sample data and the training sample data is converted into the highest feature quantity, the converted training sample data and the highest feature quantity are obtained to construct the BP-Deep NMF model.
  • RBF Radial Basis Function
  • the to-be-clustered data set of the face images to be clustered is first obtained for clustering the face images to be clustered.
  • Step S20 Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered;
  • the BP-Deep NMF model is built, that is, the BP-Deep NMF model is trained.
  • the training sample data X and the highest feature The quantity H is input into the BP-Deep NMF model, specifically, the training sample data X is used as the expected output of the BP neural network, that is, the original training sample data X is input into the output layer of the BP-Deep NMF model, and the highest feature quantity H is used as the input signal of the BP-Deep NMF model, that is, the highest feature quantity H is input to the input layer of the BP-Deep NMF model.
  • the BP-Deep NMF model determines the base image amount W of the face images to be clustered after the BP-Deep NMF model training is completed. Specifically, input the highest feature amount H to Calculate the forward output of the BP neural network. After obtaining the forward output of the BP neural network, input the highest feature quantity to the output layer of the BP neural network, and combine the forward output of the BP neural network to find the reverse output of the BP neural network. Update the model parameters of the BP-Deep NMF model, that is, the weight of each neuron in the BP-Deep NMF model.
  • the BP-Deep NMF model is continuously iterated until the number of training times reaches the maximum number of training times or the loss function value is less than or equal to a threshold, otherwise the BP-Deep NMF model continues to be iterated.
  • the number of training times reaches the maximum number of training times or the loss function value is less than or equal to a threshold, it indicates that the construction of the BP-Deep NMF model is completed.
  • the base image amount W of the face image to be clustered of the BP-Deep NMF model is output.
  • Step S30 based on the amount of base images and the data set to be clustered, classify the face images to be clustered by preset clustering rules, determine the clustering result of the data set to be clustered, and obtain The classification result of the face image to be clustered.
  • Use the preset clustering rules, namely the k-means clustering method to separately analyze the feature vector set of the i-th layer of the sample Perform clustering to determine the clustering result of the data set to be clustered, and finally output the clustering result to obtain the classification result of the face image to be clustered.
  • the hidden feature amount can be reflected in the identity or expression or posture characteristics of the face image information. For example, based on the first-level hidden features of posture features, the face images to be clustered can be classified into clustering results of different posture features, for example, the head orientation angles are 0 degrees, 30 degrees, or -30 degrees. Class results; based on the secondary hidden features of expression features, the face images to be clustered can be classified into clustering results of different expression features, such as the clustering results of different expression features such as happy, angry, or confused; based on identity features
  • the three-level hidden features can classify the face images to be clustered into clustering results of different people, that is to say, classify the face images to be clustered on the basis of human classification.
  • the face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network.
  • the BP-Deep NMF model uses radial basis functions (RBF)
  • the input signal of the neural network is constructed.
  • the input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network.
  • the optimization of the model adopts the BP neural network algorithm to update the network weight matrix.
  • the finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.
  • step S20 includes:
  • Step a Obtain the first linear parameter corresponding to the activation function in the BP-Deep NMF model and the first weight of each neuron in the BP-Deep NMF model;
  • the first linear parameter p acts on the input signal of each neuron in the BP-Deep NMF model Converted into an output signal, the sum of the product of the input of the neuron and the weight corresponding to the input, and the activation function f(x) is applied to it to obtain the output of the neuron of this layer and feed it as the input to the next neural network Floor.
  • Step b Determine the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight;
  • Step c Determine the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model based on the first output error, the first linear parameter and the first weight;
  • the first output error in the BP-Deep NMF model is output
  • the second output error of the input layer neuron or hidden layer neuron of the l layer the calculation formula of the second output error ⁇ l is as follows:
  • l L-1,L-2,...,1 and W is the weight of the BP neural network.
  • Step d Based on the first output error and the second output error, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and combine the The second weight is used as the base image amount of the face image to be clustered.
  • the second weight of each neuron after the BP-Deep NMF model training is completed is determined, and the second weight is output, and
  • the second weight after the training of the BP-Deep NMF model is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered, specifically, the highest feature amount H is input to obtain BP
  • the forward output of the neural network after the forward output of the BP neural network is obtained, the first output error and the second output error are determined based on the forward output to obtain the reverse output of the BP neural network to update the BP-Deep NMF model
  • the model parameter of the BP-Deep NMF model is the weight of each neuron in the BP-Deep NMF model, and the second weight after the completion of the BP-Deep NMF model training is used as the base image amount of the face image to be clustere
  • the determining the first output error of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed based on the first output error and the second output error Two weights, and the step of using the second weight as the base image amount of the face image to be clustered includes:
  • Step e Determine the weight offset of each neuron in the BP-Deep NMF model based on the first output error and the second output error;
  • each neuron of the BP neural network calculates according to the output first output error ⁇ L and the second output error ⁇ l
  • the weight bias of each neuron in the BP-Deep NMF model is as follows:
  • Step f Based on the weight bias, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the waiting The amount of base images for clustering face images.
  • the BP-Deep NMF model is iterated to determine the second weight of each neuron after the BP-Deep NMF model training is completed, and the second weight is output, and the BP-Deep NMF After the model training is completed, the second weight is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered.
  • the highest feature amount H is input to obtain the forward direction of the BP neural network Output, after obtaining the forward output of the BP neural network, determine the first output error and the second output error based on the forward output to obtain the reverse output of the BP neural network to update the model parameter of the BP-Deep NMF model, namely BP -The weight of each neuron in the Deep NMF model for subsequent output.
  • the second weight after the completion of the BP-Deep NMF model training is used as the base image amount of the face image to be clustered, so that the base image amount can be further calculated based on the base image amount.
  • the face images to be clustered are clustered.
  • the second weight of each neuron in the BP-Deep NMF model is determined after the training of the BP-Deep NMF model is completed, and
  • the step of using the second weight as the base image amount of the face images to be clustered includes:
  • Step g Obtain the learning rate of the BP-Deep NMF model
  • Step h based on the weight offset and the learning rate, determine the non-negative weight of each neuron in the BP-Deep NMF model through the projection of the projection gradient method;
  • the learning rate r of the BP-Deep NMF model is obtained for updating the weight parameters of the BP neural network, and the BP is updated based on the weight offset and the learning rate
  • the calculation formula of the weight parameter of the neural network is as follows:
  • the updated weight parameters of the BP neural network are projected by the projection gradient method to project into non-negative weight parameters.
  • the projection weight parameters determine the non-negative parameters of each neuron in the BP-Deep NMF model. Negative weight.
  • Step i Based on the non-negative weight, determine the second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the person The amount of the base image of the face image.
  • the second weight is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered.
  • the highest feature amount H is input to obtain the BP neural network Forward output, after obtaining the forward output of the BP neural network, determine the first output error and the second output error based on the forward output to obtain the reverse output of the BP neural network to update the model parameters of the BP-Deep NMF model That is, the weight of each neuron in the BP-Deep NMF model is used to output the second weight after the completion of the BP-Deep NMF model training as the base image amount of the face image to be clustered, so as to be further based on the base image
  • the face images to be clustered are clustered.
  • the step of determining the non-negative weight of each neuron in the BP-Deep NMF model through the projection of the projection gradient method based on the weight offset and the learning rate further include:
  • Step j Determine a second linear parameter corresponding to the activation function in the BP-Deep NMF model based on the training sample data and the non-negative weight, and use the second linear parameter as the first linear parameter , Performing the determining the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight, and determining the first output error based on the first output
  • the error, the first linear parameter, and the first weight determine the step of determining the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model.
  • the update formula of the linear parameter of the activation function is as follows:
  • ⁇ (p (t) ) is the step vector of p
  • the second weight of neurons in each layer of the BP-Deep NMF model after the completion of the training of the BP-Deep NMF model is determined based on the non-negative weight, and the The step of using the second weight as the base image amount of the face image includes:
  • Step k Obtain the number of training times of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model;
  • Step 1 If the number of training times reaches the maximum number of training times or the value of the loss function is less than or equal to the model error threshold, stop training the BP-Deep NMF model, use the non-negative weight as the second weight, and determine The second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and the second weight is used as the base image amount of the face image.
  • the number of training times of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model are obtained to detect whether the iteration of the BP-Deep NMF model is completed. It is detected that the number of training times reaches the maximum number of training times and the loss function value is less than or equal to the model error threshold.
  • the BP-Deep NMF model is trained to determine the base image amount of the face images to be clustered after the BP-Deep NMF model training is completed.
  • the loss function value is greater than the model error threshold, continue to obtain the training sample data of the face image and the highest feature amount corresponding to the training sample data, and input the training sample data and the highest feature amount into the BP-Deep NMF model, and based on the training sample Data and the highest feature amount, through training the BP-Deep NMF model, determine the base image amount of the face image to be clustered after the BP-Deep NMF model training is completed.
  • the face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network.
  • the BP-Deep NMF method uses radial basis functions (RBF)
  • the input signal of the neural network is constructed.
  • the input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network.
  • the optimization of the model adopts the BP neural network algorithm to update the network weight matrix.
  • the finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.
  • step S10 includes:
  • Step m Obtain the face image, and convert the face image into training sample data
  • the face image is the face image information used to train the BP-Deep NMF model, including face image information with different expressions, different identities, or different facial features.
  • Preparations before constructing the BP-Deep NMF model Work convert the face image used to train the BP-Deep NMF model into training sample data for subsequent construction of the BP-Deep NMF model.
  • the training sample data has data labels, and the data labels are generally different people. That is, the face image includes the face image information of different people.
  • Step n Obtain the face image to be clustered, and convert the face image to be clustered into a data set to be clustered;
  • the face image to be clustered is obtained, and the face image to be clustered is converted into a data set to be clustered for subsequent processing of the face image to be clustered. Clustering.
  • Step o based on the training sample data, determine the highest feature amount corresponding to the training sample data through a preset calculation method.
  • the radial basis function (RBF) is used to convert the training sample data into the highest feature quantity for subsequent construction of the BP-Deep NMF model.
  • RBF Radial Basis Function
  • the face image to be clustered is classified by a preset clustering rule based on the amount of base images and the data set to be clustered, and the data to be clustered is determined
  • the steps of obtaining the classification result of the face image to be clustered include:
  • Step p extracting the hidden feature amount of each layer feature in the to-be-clustered data set based on the base image amount and the to-be-clustered data set;
  • the calculation formula is as follows:
  • Step q Based on the hidden feature quantity, classify the face image to be clustered by preset clustering rules, determine the clustering result of the data set to be clustered, and obtain the clustering result of the face image.
  • the preset clustering rule namely the k-means clustering method, is used to separately analyze the feature vector set of the i-th layer of the sample. Perform clustering to determine the clustering result of the data set to be clustered, and finally output the clustering result to obtain the classification result of the face image to be clustered.
  • the hidden feature amount can be reflected in the identity or expression or posture characteristics of the face image information. For example, based on the first-level hidden features of posture features, the face images to be clustered can be classified into clustering results of different posture features, for example, the head orientation angles are 0 degrees, 30 degrees, or -30 degrees.
  • Class results based on the secondary hidden features of expression features, the face images to be clustered can be classified into clustering results of different expression features, such as the clustering results of different expression features such as happy, angry, or confused; based on identity features
  • the three-level hidden features can classify the face images to be clustered into clustering results of different people, that is to say, classify the face images to be clustered on the basis of human classification.
  • the face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP Deep NMF) clustering model based on BP neural network.
  • the BP Deep NMF method uses radial basis functions (RBF) to construct neural networks.
  • the input signal of the network is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network.
  • the optimization of the model uses the BP neural network algorithm to update the network weight matrix.
  • the finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.
  • an embodiment of the present application also proposes a computer-readable storage medium that stores a human face image clustering program, and the human face image clustering program may also be executed by a processor to implement the above-mentioned face image clustering program. The steps of each embodiment of the image clustering method.
  • the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM) as described above. , Magnetic disks, optical disks), including several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the method described in each embodiment of the present application.
  • a terminal device which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Image Analysis (AREA)

Abstract

Provided are a facial image clustering method and apparatus, and a computer-readable storage medium. The method comprises the following steps: acquiring training sample data of a facial image, the largest characteristic amount corresponding to the training sample data, and a data set to be clustered of a facial image to be clustered; inputting the training sample data and the largest characteristic amount into a BP-Deep NMF model, training the BP-Deep NMF model, and determining a base image amount of the facial image to be clustered; and on the basis of the base image amount and the data set to be clustered, classifying, by means of a preset clustering rule, the facial image to be clustered, and determining a clustering result of the data set to be clustered, so as to obtain a classification result of the facial image to be clustered.

Description

人脸图像聚类方法、装置及计算机可读存储介质Face image clustering method, device and computer readable storage medium 技术领域Technical field

本申请涉及人脸识别技术领域,尤其涉及一种人脸图像聚类方法、装置及计算机可读存储介质。This application relates to the field of face recognition technology, and in particular to a method, device and computer-readable storage medium for clustering face images.

背景技术Background technique

随着人脸识别和检索系统应用的推广,系统中人脸图像数据急剧地增长,人脸聚类技术已经成为提高系统检索效率的重要基础。人脸聚类通常是将数据库中的人脸图片信息聚成一些不同的子类,使得子类之间的相似性尽量小,子类内的相似性尽量大,这样在检索时,只需在与被检索目标相似度较高的子类内逐个识别,检索出与之相似性最大的若干记录。With the promotion of the application of face recognition and retrieval systems, the face image data in the system has increased dramatically, and face clustering technology has become an important foundation for improving the retrieval efficiency of the system. Face clustering is usually to cluster the face image information in the database into some different sub-categories, so that the similarity between the sub-categories is as small as possible, and the similarity within the sub-categories is as large as possible. The subcategories with high similarity to the searched target are identified one by one, and several records with the greatest similarity are retrieved.

特征提取这一步骤无论在人脸识别还是人脸聚类技术上都占有重要地位。主成分分析与奇异值分解都是较为经典的特征提取方法,但是这两种方法提出的特征向量通常含有负元素,因此在原始样本为非负数据下,这些方法不具有合理性与可解释性。NMF(Non-negative Matrix Factorization,非负矩阵分解)是一种处理非负数据的特征提取方法,它的应用非常广泛,比如高光谱数据处理、人脸图像识别等。NMF算法在原始样本非负数据矩阵分解过程中,对提取的特征具有非负性限制,即分解后的所有分量都是非负的,因而可以提取非负的稀疏特征。NMF算法的实质也就是将非负矩阵X近似分解为基图像矩阵W和系数矩阵H的乘积,即X≈WH,且W和H都是非负矩阵。这样矩阵X的每一列就可以表示成矩阵W列向量的非负线性组合,这也符合NMF算法的构造依据——对整体的感知是由对组成整体的部分的感知构成的(纯加性)。近年来,学者们提出了许多对NMF变形的算法,例如,加强局部限制的局部NMF算法、整合判别信息的判别NMF算法、针对对称矩阵提出的对称NMF算法。尽管NMF算法及其变体取得了一定的成效,但该方法只考虑了数据浅层信息,对于含有丰富特征的数据,一次分解而成的单层结构却无法从多角度学习特征的表示。The feature extraction step occupies an important position in face recognition and face clustering technology. Principal component analysis and singular value decomposition are relatively classic feature extraction methods, but the feature vectors proposed by these two methods usually contain negative elements, so when the original sample is non-negative data, these methods are not reasonable and interpretable . NMF (Non-negative Matrix Factorization) is a feature extraction method for processing non-negative data. It has a wide range of applications, such as hyperspectral data processing and face image recognition. In the process of matrix decomposition of non-negative data of the original sample, the NMF algorithm has non-negativity restrictions on the extracted features, that is, all components after decomposition are non-negative, so non-negative sparse features can be extracted. The essence of the NMF algorithm is to approximately decompose the non-negative matrix X into the product of the base image matrix W and the coefficient matrix H, that is, X≈WH, and both W and H are non-negative matrices. In this way, each column of matrix X can be expressed as a non-negative linear combination of matrix W column vectors, which is also in line with the construction basis of the NMF algorithm-the perception of the whole is composed of the perception of the parts that make up the whole (pure additive) . In recent years, scholars have proposed many algorithms for deforming NMF, such as local NMF algorithm that strengthens local restrictions, discriminative NMF algorithm that integrates discriminant information, and symmetric NMF algorithm for symmetric matrices. Although the NMF algorithm and its variants have achieved certain results, the method only considers the shallow information of the data. For data with rich features, the single-layer structure decomposed at one time cannot learn the representation of features from multiple angles.

目前,DL(Deep Learning,深度学习)已成为当前的研究热潮,深度学习通过建立了具有阶层结构的深度神经网络,为解决深层结构相关的优化难 题带来希望。受深度学习技术成功的启发,一些研究者在单层NMF算法的基础上提出了Deep NMF(Deep Non-negative Matrix Factorization,深度非负矩阵分解)模型。Deep NMF可以看作是通过将一个复杂的任务分解成几个简单的任务,然后在多层结构中一个接一个地处理它们。与此同时,这种深度分解方法可以探索复杂数据中的底层特征表示,从而提取到比单层学习更完整、更有辨别力的特征。At present, DL (Deep Learning, deep learning) has become the current research boom. Deep learning has established a deep neural network with a hierarchical structure, which brings hope to solving optimization problems related to deep structure. Inspired by the success of deep learning technology, some researchers have proposed the Deep Non-negative Matrix Factorization (Deep Non-negative Matrix Factorization) model based on the single-layer NMF algorithm. Deep NMF can be seen as decomposing a complex task into several simple tasks, and then processing them one after another in a multi-layer structure. At the same time, this deep decomposition method can explore the underlying feature representation in complex data, so as to extract features that are more complete and more discriminative than single-layer learning.

目前已有的Deep NMF模型虽然具有深度分层结构,但这种结构一般是简单重复使用单层NMF算法来构建的,其性能达不到理想的要求,而且Deep NMF的计算方法的计算效率不高,在人脸数据聚类任务中性能较差。特别地,这些方法均不是利用深度神经网络来产生的,因而不能利用深度神经网络强大的特征表达能力和聚类能力。Although the existing Deep NMF model has a deep layered structure, this structure is generally constructed by simply reusing the single-layer NMF algorithm, and its performance does not meet the ideal requirements, and the calculation efficiency of the Deep NMF calculation method is not High, poor performance in face data clustering tasks. In particular, none of these methods are produced by using deep neural networks, so they cannot use the powerful feature expression and clustering capabilities of deep neural networks.

上述内容仅用于辅助理解本申请的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist the understanding of the technical solution of the application, and does not mean that the above content is recognized as prior art.

发明内容Summary of the invention

本申请的主要目的在于提供一种人脸图像聚类方法、装置及计算机可读存储介质,旨在解决Deep NMF的特征表达能力和聚类能力较差的技术问题。The main purpose of this application is to provide a face image clustering method, device, and computer-readable storage medium, aiming to solve the technical problem of poor feature expression ability and clustering ability of Deep NMF.

为实现上述目的,本申请提供一种人脸图像聚类方法,所述人脸图像聚类方法包括以下步骤:To achieve the above objective, the present application provides a face image clustering method. The face image clustering method includes the following steps:

获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered;

将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered;

基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image.

此外,为实现上述目的,本申请还提供一种人脸图像聚类装置,所述人脸图像聚类装置包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,所述计算机可读指令被所述处理器执行时,实现如下步骤:In addition, in order to achieve the above object, the present application also provides a face image clustering device, the face image clustering device includes: a memory, a processor, and stored in the memory and running on the processor When the computer-readable instructions are executed by the processor, the following steps are implemented:

获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和 待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered;

将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;以及,Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered; and,

基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image.

本申请通过获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果,提出了一种基于BP神经网络的高性能深度非负矩阵分解(BP-Deep NMF)模型,BP-Deep NMF模型使用径向基函数(RBF)构造神经网络的输入信号,该输入等同于深度非负矩阵分解中的最高层特征,并以原始训练数据作为网络的期望输出,模型的优化则采用BP神经网络算法对网络权重矩阵更新的法则。最终训练出的BP-Deep NMF模型能直接得到数据的深度非负矩阵分解,而不需要对分解进行微调,并且在人脸数据聚类任务中显示出了优越的性能。This application acquires the training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered; inputting the training sample data and the highest feature amount to In the BP-Deep NMF model, the BP-Deep NMF model is trained, and the base image amount of the face image to be clustered is determined; based on the base image amount and the data set to be clustered, the aggregation is performed by preset The class rule classifies the face images to be clustered, determines the clustering results of the data sets to be clustered, and obtains the classification results of the face images to be clustered. A high-performance depth based on BP neural network is proposed. Non-negative matrix factorization (BP-Deep NMF) model. The BP-Deep NMF model uses radial basis functions (RBF) to construct the input signal of the neural network. This input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and uses the original The training data is used as the expected output of the network, and the optimization of the model adopts the BP neural network algorithm to update the network weight matrix. The finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.

附图说明Description of the drawings

图1是本申请实施例方案涉及的硬件运行环境的人脸图像聚类装置结构示意图;FIG. 1 is a schematic structural diagram of a face image clustering device in a hardware operating environment involved in a solution of an embodiment of the present application;

图2为本申请人脸图像聚类方法第一实施例的流程示意图。FIG. 2 is a schematic flowchart of the first embodiment of the applicant's face image clustering method.

本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.

具体实施方式Detailed ways

应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not used to limit the present application.

如图1所示,图1是本申请实施例方案涉及的硬件运行环境的人脸图像聚类装置结构示意图。As shown in FIG. 1, FIG. 1 is a schematic structural diagram of a face image clustering apparatus in a hardware operating environment involved in a solution of an embodiment of the present application.

如图1所示,该人脸图像聚类装置可以包括:处理器1001,例如CPU,网 络接口1004,用户接口1003,存储器1005,通信总线1002。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in Fig. 1, the face image clustering apparatus may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002. Among them, the communication bus 1002 is used to implement connection and communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface). The memory 1005 may be a high-speed RAM memory, or a non-volatile memory (non-volatile memory), such as a magnetic disk memory. Optionally, the memory 1005 may also be a storage device independent of the aforementioned processor 1001.

本领域技术人员可以理解,图1中示出的人脸图像聚类装置结构并不构成对人脸图像聚类装置的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the structure of the face image clustering device shown in FIG. 1 does not constitute a limitation on the face image clustering device, and may include more or less components than shown in the figure, or a combination of certain components. Components, or different component arrangements.

如图1所示,作为一种计算机存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及人脸图像聚类程序。As shown in FIG. 1, the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and a face image clustering program.

在图1所示的人脸图像聚类装置中,网络接口1004主要用于连接后台服务器,与后台服务器进行数据通信;用户接口1003主要用于连接客户端(用户端),与客户端进行数据通信;而处理器1001可以用于调用存储器1005中存储的人脸图像聚类程序。In the face image clustering device shown in FIG. 1, the network interface 1004 is mainly used to connect to the back-end server and communicate with the back-end server; the user interface 1003 is mainly used to connect to the client (user side) and conduct data with the client Communication; and the processor 1001 can be used to call the face image clustering program stored in the memory 1005.

在本实施例中,人脸图像聚类装置包括:存储器1005、处理器1001及存储在所述存储器1005上并可在所述处理器1001上运行的人脸图像聚类程序,其中,处理器1001调用存储器1005中存储的人脸图像聚类程序时,并执行以下操作:In this embodiment, the face image clustering device includes: a memory 1005, a processor 1001, and a face image clustering program stored on the memory 1005 and running on the processor 1001, wherein the processor When 1001 calls the face image clustering program stored in the memory 1005, and performs the following operations:

获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered;

将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,以获得所述待聚类人脸图像的基图像量;Inputting the training sample data and the highest feature amount into a BP-Deep NMF model, and training the BP-Deep NMF model to obtain the base image amount of the face image to be clustered;

基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,以得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, and the clustering result of the data set to be clustered is determined to obtain the data set to be clustered. The classification result of the human-like face image.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

获取所述BP-Deep NMF模型中的激活函数对应的第一线性参数以及所述 BP-Deep NMF模型中各个神经元的第一权重;Acquiring the first linear parameter corresponding to the activation function in the BP-Deep NMF model and the first weight of each neuron in the BP-Deep NMF model;

基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差;Determine the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight;

基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF模型中输入层神经元和隐含层神经元的第二输出误差;Determine the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model based on the first output error, the first linear parameter, and the first weight;

基于所述第一输出误差和所述第二输出误差,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Based on the first output error and the second output error, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and set the second weight As the base image amount of the face image to be clustered.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

基于所述第一输出误差和所述第二输出误差,确定所述BP-Deep NMF模型中各个神经元的权重偏置量;Determine the weight offset of each neuron in the BP-Deep NMF model based on the first output error and the second output error;

基于所述权重偏置量,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Based on the weight bias, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the person to be clustered The amount of the base image of the face image.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

获取所述BP-Deep NMF模型的学习率;Acquiring the learning rate of the BP-Deep NMF model;

基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重;Based on the weight offset and the learning rate, the non-negative weight of each neuron in the BP-Deep NMF model is determined by the projection of the projection gradient method;

基于所述非负权重,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。Based on the non-negative weight, determine the second weight of each layer of neurons in the BP-Deep NMF model after the BP-Deep NMF model training is completed, and use the second weight as the face image Base image volume.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

基于所述训练样本数据和所述非负权重,确定所述BP-Deep NMF模型中的激活函数对应的第二线性参数,并将所述第二线性参数作为所述第一线性参数,执行所述基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差,以及基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF 模型中输入层神经元和隐含层神经元的第二输出误差的步骤。Based on the training sample data and the non-negative weights, determine the second linear parameter corresponding to the activation function in the BP-Deep NMF model, and use the second linear parameter as the first linear parameter to perform all Said determining the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data and the first weight; The first linear parameter and the first weight determine the step of determining the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

获取所述BP-Deep NMF模型的训练次数以及所述BP-Deep NMF模型的损失函数值;Acquiring the number of times of training of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model;

若所述训练次数达到最大训练次数或者所述损失函数值小于或等于模型误差阈值,则停止训练所述BP-Deep NMF模型,将所述非负权重作为所述第二权重,以确定所述BP-Deep NMF模型训练完成后的所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。If the number of training times reaches the maximum number of training times or the loss function value is less than or equal to the model error threshold, stop training the BP-Deep NMF model, and use the non-negative weight as the second weight to determine the The second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and the second weight is used as the base image amount of the face image.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

获取所述人脸图像,并将所述人脸图像转化为训练样本数据;Acquiring the face image, and converting the face image into training sample data;

获取所述待聚类人脸图像,并将所述待聚类人脸图像转化为待聚类数据集;Acquiring the face image to be clustered, and converting the face image to be clustered into a data set to be clustered;

基于所述训练样本数据,通过预设计算方法,确定所述训练样本数据对应的最高特征量。Based on the training sample data, the highest feature amount corresponding to the training sample data is determined through a preset calculation method.

进一步地,处理器1001可以调用存储器1005中存储的人脸图像聚类程序,还执行以下操作:Further, the processor 1001 may call a face image clustering program stored in the memory 1005, and also perform the following operations:

基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;Extracting hidden feature quantities of features of each layer in the to-be-clustered data set based on the base image amount and the to-be-clustered data set;

基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,以得到人脸图像的聚类结果。Based on the hidden feature quantity, the face image to be clustered is classified by a preset clustering rule, and the clustering result of the data set to be clustered is determined to obtain the clustering result of the face image.

本申请还提供一种人脸图像聚类方法,参照图2,图2为本申请方法第一实施例的流程示意图,该人脸图像聚类方法包括以下步骤:The present application also provides a face image clustering method. Referring to FIG. 2, FIG. 2 is a schematic flowchart of the first embodiment of the method in this application. The face image clustering method includes the following steps:

本申请实施例所适用的一种系统架构,包括但不限于一个服务器或者多个服务器,服务器可以是计算机等网络设备。服务器可以是一个独立的设备,也可以是多个服务器所形成的服务器集群。优选地,服务器可以采用云计算技术进行信息处理。A system architecture to which the embodiment of the present application is applicable includes but is not limited to one server or multiple servers. The server may be a network device such as a computer. The server can be an independent device or a server cluster formed by multiple servers. Preferably, the server can use cloud computing technology for information processing.

目前已有的Deep NMF模型虽然具有深度分层结构,但这种结构一般是简单重复使用单层NMF算法来构建的,其性能达不到理想的要求,而且这些方法的计算效率不高。特别地,这些方法均不是利用深度神经网络来产生的,因而不能利用深度神经网络强大的特征表达能力和聚类能力。鉴于此,本专利将神经网络算法引入到Deep NMF领域,提出了一种新的基于BP神经网络的高性能深度非负矩阵分解(BP-Deep NMF)聚类模型,首先构建深度非负矩阵分解(BP-Deep NMF)模型,再利用该BP-Deep NMF模型对待聚类数据进行聚类,也就是将待聚类人脸图像进行聚类。BP-Deep NMF方法使用径向基函数(RBF)构造神经网络的输入信号,该输入等同于深度非负矩阵分解中的最高层特征,并以原始训练数据作为网络的期望输出。模型的优化则采用BP神经网络算法对网络权重矩阵更新的法则。最终训练出的神经网络模型能直接得到数据的深度非负矩阵分解,而不需要对分解进行微调。本申请提出的BP-Deep NMF模型在人脸数据聚类任务中显示出了优越的性能。Although the existing Deep NMF model has a deep hierarchical structure, this structure is generally constructed by simply reusing the single-layer NMF algorithm, and its performance does not meet the ideal requirements, and the computational efficiency of these methods is not high. In particular, none of these methods are produced by using deep neural networks, so they cannot use the powerful feature expression and clustering capabilities of deep neural networks. In view of this, this patent introduces the neural network algorithm into the Deep NMF field, and proposes a new high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network. First, deep non-negative matrix factorization is constructed. (BP-Deep NMF) model, and then use the BP-Deep NMF model to cluster the data to be clustered, that is, cluster the face images to be clustered. The BP-Deep NMF method uses the radial basis function (RBF) to construct the input signal of the neural network. The input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network. The optimization of the model uses the BP neural network algorithm to update the network weight matrix. Finally, the trained neural network model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition. The BP-Deep NMF model proposed in this application shows superior performance in the task of clustering face data.

为了便于理解,下面对本申请实施例中可能涉及的名词进行定义和解释。In order to facilitate understanding, the terms that may be involved in the embodiments of the present application are defined and explained below.

深度非负矩阵分解(Deep NMF):NMF仅是一个单层分解的学习过程,它通过分解数据矩阵X来学习基矩阵W和特征矩阵H。而深度非负矩阵分解(Deep NMF)能进一步捕捉数据集中隐藏的直观层次特征信息。其基本思想是:矩阵X在经过第一次浅层分解上得到的矩阵H 1可以再次分解为矩阵W 2和H 2,从而将原来的单层结构扩展为一个两层结构。以此类推,最终即可将浅层结构模型增广为多层(深度)结构模型。具体来说,Deep NMF模型将非负数据矩阵X分解为L+1个非负因子矩阵,即: Deep non-negative matrix factorization (Deep NMF): NMF is only a single-layer decomposition learning process. It learns the base matrix W and the feature matrix H by decomposing the data matrix X. The deep non-negative matrix factorization (Deep NMF) can further capture the intuitive hierarchical feature information hidden in the data set. The basic idea is: the matrix H 1 obtained by the first shallow decomposition of the matrix X can be decomposed into the matrices W 2 and H 2 again , thereby expanding the original single-layer structure into a two-layer structure. By analogy, the shallow structure model can eventually be expanded to a multi-layer (deep) structure model. Specifically, the Deep NMF model decomposes the non-negative data matrix X into L+1 non-negative factor matrices, namely:

X≈W 1W 2…W LH LX≈W 1 W 2 …W L H L ,

其中

Figure PCTCN2019125298-appb-000001
among them
Figure PCTCN2019125298-appb-000001

由上式知,Deep NMF模型学习得到的数据的隐藏属性H 1,...,H L可表示为H 1≈W 2…W LH L,H 2≈W 3…W LH L,...,H L-1≈W LH L,称H 1,...,H L是在非负约束下的特征矩阵。为降低整个模型的总重构误差,Deep NMF算法一般在逐层分解全部完成后,再对整个网络模型进行微调,即对以下基于F-范数的损失函数进行极小化: From the above formula, the hidden attributes H 1 ,..., H L of the data learned by the Deep NMF model can be expressed as H 1 ≈W 2 …W L H L , H 2 ≈W 3 …W L H L ,. .., H L-1 ≈W L H L , said H 1 ,..., H L is the characteristic matrix under non-negative constraints. In order to reduce the total reconstruction error of the entire model, the Deep NMF algorithm generally performs fine-tuning of the entire network model after the layer-by-layer decomposition is completed, that is, the following loss function based on the F-norm is minimized:

Figure PCTCN2019125298-appb-000002
Figure PCTCN2019125298-appb-000002

BP神经网络(Back-Propagation Algorithm of Neural Network,反向传播神经网络):神经网络通过样本信号不断改变网络的权值,调整网络的误差 值,最后输出误差达到预期的误差范围。其中,BP神经网络是一种基于误差反向传播的多层前馈网络,而标准的BP神经网络则包含有输入层、隐含层、输出层,其中层与层间通过神经元相互连接,而同一层网络的神经元之间互不相连。BP neural network (Back-Propagation Algorithm of Neural Network): The neural network continuously changes the weight of the network through the sample signal, adjusts the error value of the network, and finally the output error reaches the expected error range. Among them, the BP neural network is a multi-layer feedforward network based on error back propagation, while the standard BP neural network includes an input layer, a hidden layer, and an output layer. The layers are connected to each other through neurons. The neurons in the same layer of the network are not connected to each other.

步骤S10,获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Step S10: Obtain the training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered;

在本实施例中,在构建BP-Deep NMF模型之前,首先将用于训练BP-Deep NMF模型的人脸图像转换成训练样本数据,以及使用径向基函数(RBF)将训练样本数据转换成最高特征量,径向基函数(RBF)构造神经网络的输入信号即最高特征量,该输入等同于深度非负矩阵分解中的最高层特征,其中,人脸图像为用于训练BP-Deep NMF模型的人脸图像信息,包括不同表情、不同身份或者不同面部特征的人脸图像信息。将人脸图像转换成训练样本数据以及将训练样本数据转换成最高特征量后,获取转换完成后的训练样本数据和最高特征量,以进行对BP-Deep NMF模型的构建。In this embodiment, before constructing the BP-Deep NMF model, the face image used to train the BP-Deep NMF model is first converted into training sample data, and the radial basis function (RBF) is used to convert the training sample data into The highest feature quantity, the input signal of the radial basis function (RBF) to construct the neural network is the highest feature quantity. This input is equivalent to the highest level feature in the deep non-negative matrix factorization. Among them, the face image is used for training BP-Deep NMF The facial image information of the model includes facial image information with different expressions, different identities, or different facial features. After the face image is converted into training sample data and the training sample data is converted into the highest feature quantity, the converted training sample data and the highest feature quantity are obtained to construct the BP-Deep NMF model.

具体地,假设训练样本数目为n和训练样本数据矩阵即训练样本数据为X=(x 1,x 2,…,x n)。利用径向基函数(RBF)

Figure PCTCN2019125298-appb-000003
(其中为空间x,y中两个数据点,t≥0)生成算法最高层特征矩阵即最高特征量H=(H ij) n×n,其中,最高特征量
Figure PCTCN2019125298-appb-000004
当x i与x j属于同一个人的不同脸部特征或者不同表情的人脸图像时,x i与x j属于同一类,则H ij=k(x i,x j);当x i与x j属于不同人的脸部特征或者表情的人脸图像时,x i与x j属于不同类,则H ij=0。因此,显然,最高特征量H是对角分块矩阵,这种结构本身具有很好的聚类性质。 Specifically, it is assumed that the number of training samples is n and the training sample data matrix, that is, the training sample data is X=(x 1 , x 2 ,..., x n ). Utilize Radial Basis Function (RBF)
Figure PCTCN2019125298-appb-000003
(Among them are two data points in space x and y, t≥0) Generate the highest feature matrix of the algorithm, that is, the highest feature quantity H=(H ij ) n×n , where the highest feature quantity
Figure PCTCN2019125298-appb-000004
When x i and x j belong to the same person’s face images with different facial features or different expressions, x i and x j belong to the same category, then H ij =k(x i , x j ); when x i and x When j belongs to facial images of different people's facial features or expressions, and x i and x j belong to different classes, then H ij =0. Therefore, it is obvious that the highest feature quantity H is a diagonal block matrix, and this structure itself has good clustering properties.

构建BP-Deep NMF模型完成后,首先获取待聚类人脸图像的待聚类数据集,以供将待聚类人脸图像进行聚类。After the construction of the BP-Deep NMF model is completed, the to-be-clustered data set of the face images to be clustered is first obtained for clustering the face images to be clustered.

步骤S20,将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;Step S20: Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered;

在本实施例中,在获取人脸图像的训练样本数据和训练样本数据对应的最高特征量后,开始构建BP-Deep NMF模型即训练BP-Deep NMF模型,首先将训练样本数据X和最高特征量H输入至BP-Deep NMF模型中,具体地, 将训练样本数据X作为BP神经网络的期望输出,即把原始的训练样本数据X输入进BP-Deep NMF模型的输出层,将最高特征量H作为BP-Deep NMF模型的输入信号,即把最高特征量H输入至BP-Deep NMF模型的输入层。之后,基于训练样本数据和最高特征量,通过训练BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后待聚类人脸图像的基图像量W,具体地,输入最高特征量H以求BP神经网络的正向输出,求得BP神经网络的正向输出后,输入最高特征量至BP神经网络的输出层,结合BP神经网络的正向输出求BP神经网络的反向输出,以更新BP-Deep NMF模型的模型参数即BP-Deep NMF模型中每个神经元的权重。In this embodiment, after the training sample data of the face image and the highest feature amount corresponding to the training sample data are obtained, the BP-Deep NMF model is built, that is, the BP-Deep NMF model is trained. First, the training sample data X and the highest feature The quantity H is input into the BP-Deep NMF model, specifically, the training sample data X is used as the expected output of the BP neural network, that is, the original training sample data X is input into the output layer of the BP-Deep NMF model, and the highest feature quantity H is used as the input signal of the BP-Deep NMF model, that is, the highest feature quantity H is input to the input layer of the BP-Deep NMF model. After that, based on the training sample data and the highest feature amount, by training the BP-Deep NMF model, determine the base image amount W of the face images to be clustered after the BP-Deep NMF model training is completed. Specifically, input the highest feature amount H to Calculate the forward output of the BP neural network. After obtaining the forward output of the BP neural network, input the highest feature quantity to the output layer of the BP neural network, and combine the forward output of the BP neural network to find the reverse output of the BP neural network. Update the model parameters of the BP-Deep NMF model, that is, the weight of each neuron in the BP-Deep NMF model.

构建BP-Deep NMF模型期间,不断迭代该BP-Deep NMF模型,直至训练次数达到最大训练次数或者损失函数值小于或等于一阈值,否则继续对BP-Deep NMF模型进行迭代。当训练次数达到最大训练次数或者损失函数值小于或等于一阈值,说明BP-Deep NMF模型构建完成。BP-Deep NMF模型构建完成后,输出BP-Deep NMF模型的待聚类人脸图像的基图像量W。During the construction of the BP-Deep NMF model, the BP-Deep NMF model is continuously iterated until the number of training times reaches the maximum number of training times or the loss function value is less than or equal to a threshold, otherwise the BP-Deep NMF model continues to be iterated. When the number of training times reaches the maximum number of training times or the loss function value is less than or equal to a threshold, it indicates that the construction of the BP-Deep NMF model is completed. After the construction of the BP-Deep NMF model is completed, the base image amount W of the face image to be clustered of the BP-Deep NMF model is output.

步骤S30,基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Step S30, based on the amount of base images and the data set to be clustered, classify the face images to be clustered by preset clustering rules, determine the clustering result of the data set to be clustered, and obtain The classification result of the face image to be clustered.

在本实施例中,在BP-Deep NMF模型构建完成后,输出BP-Deep NMF模型的待聚类人脸图像的基图像量W,基于基图像量W和待聚类数据集Y=(y 1,y 2,…,y m),计算待聚类数据集中的每个样本y k(k=1,…,m)的第i层特征向量即隐藏特征量

Figure PCTCN2019125298-appb-000005
运用预设聚类规则即k均值聚类法分别对样本第i层特征向量集
Figure PCTCN2019125298-appb-000006
进行聚类确定待聚类数据集的聚类结果,最终输出聚类结果得到待聚类人脸图像的分类结果,隐藏特征量可以体现在人脸图像信息的身份或者表情或者姿势特征等。例如,基于姿势特征的一级隐藏特征,可以将待聚类人脸图像分类成不同姿势特征的聚类结果,例如头部朝向角度分别为0度、30度或者-30度等等角度的聚类结果;基于表情特征的二级隐藏特征,可以将待聚类人脸图像分类成不同表情特征的聚类结果,例如表情开心、愤怒或者疑惑等不同表情特征的聚类结果;基于身份特征的三级隐藏特征,可以将待聚类人脸图像分类成不同人的聚类结果,也就是说以人为分类依据将待聚类人脸图像分类。 In this embodiment, after the construction of the BP-Deep NMF model is completed, the base image amount W of the face image to be clustered of the BP-Deep NMF model is output, based on the base image amount W and the data set to be clustered Y=(y 1 ,y 2 ,...,y m ), calculate the ith layer feature vector of each sample y k (k=1,...,m) in the data set to be clustered, which is the hidden feature quantity
Figure PCTCN2019125298-appb-000005
Use the preset clustering rules, namely the k-means clustering method, to separately analyze the feature vector set of the i-th layer of the sample
Figure PCTCN2019125298-appb-000006
Perform clustering to determine the clustering result of the data set to be clustered, and finally output the clustering result to obtain the classification result of the face image to be clustered. The hidden feature amount can be reflected in the identity or expression or posture characteristics of the face image information. For example, based on the first-level hidden features of posture features, the face images to be clustered can be classified into clustering results of different posture features, for example, the head orientation angles are 0 degrees, 30 degrees, or -30 degrees. Class results; based on the secondary hidden features of expression features, the face images to be clustered can be classified into clustering results of different expression features, such as the clustering results of different expression features such as happy, angry, or confused; based on identity features The three-level hidden features can classify the face images to be clustered into clustering results of different people, that is to say, classify the face images to be clustered on the basis of human classification.

本实施例提出的人脸图像聚类方法提出了一种基于BP神经网络的高性能深度非负矩阵分解(BP-Deep NMF)聚类模型,BP-Deep NMF模型使用径向基函数(RBF)构造神经网络的输入信号,该输入等同于深度非负矩阵分解中的最高层特征,并以原始训练数据作为网络的期望输出,模型的优化则采用BP神经网络算法对网络权重矩阵更新的法则。最终训练出的BP-Deep NMF模型能直接得到数据的深度非负矩阵分解,而不需要对分解进行微调,并且在人脸数据聚类任务中显示出了优越的性能。The face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network. The BP-Deep NMF model uses radial basis functions (RBF) The input signal of the neural network is constructed. The input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network. The optimization of the model adopts the BP neural network algorithm to update the network weight matrix. The finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.

基于第一实施例,提出本申请人脸图像聚类方法的第二实施例,在本实施例中,步骤S20包括:Based on the first embodiment, a second embodiment of the applicant’s face image clustering method is proposed. In this embodiment, step S20 includes:

步骤a,获取所述BP-Deep NMF模型中的激活函数对应的第一线性参数以及所述BP-Deep NMF模型中各个神经元的第一权重;Step a: Obtain the first linear parameter corresponding to the activation function in the BP-Deep NMF model and the first weight of each neuron in the BP-Deep NMF model;

在本实施例中,先设定BP神经网络的层数L,即BP神经网络一共具有L层神经元,取f(x)=p 1/L·x(p>0)为构建BP神经网络的激活函数,第一线性参数即为激活函数f(x)=p 1/L·x中的线性参数p,第一线性参数p作用于将BP-Deep NMF模型中每个神经元的输入信号转换成一个输出信号,神经元的输入和输入对应的权重的乘积之和,并将激活函数f(x)应用于其获取该层神经元的输出并将其作为输入馈送到神经网络的下一个层。 In this embodiment, first set the number of layers L of the BP neural network, that is, the BP neural network has a total of L layers of neurons, and f(x)=p 1/L ·x(p>0) is used to construct the BP neural network The first linear parameter is the linear parameter p in the activation function f(x)=p 1/L ·x. The first linear parameter p acts on the input signal of each neuron in the BP-Deep NMF model Converted into an output signal, the sum of the product of the input of the neuron and the weight corresponding to the input, and the activation function f(x) is applied to it to obtain the output of the neuron of this layer and feed it as the input to the next neural network Floor.

步骤b,基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差;Step b: Determine the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight;

在本实施例中,取f(x)=p 1/L·x(参数p>0)为BP神经网络的激活函数,设定偏置向量b=0,BP神经网络的损失函数可定义为:

Figure PCTCN2019125298-appb-000007
构建BP-Deep NMF模型开始,将最高特征量H输入至BP-Deep NMF模型的输入层,则BP神经网络输入层的输入为a 0=H,BP神经网络的层数为L,BP神经网络第l层加权重后的值为Z l=W la l-1,BP神经网络第l层的输出为a l=p 1/L·Z l,其中,l=1,...,L。基于训练样本数据X、第一线性参数p和BP神经网络的第一权重W,输出BP神经网络输出层神经元的第一输出误差δ L=p 1/L·(X-p·W L·W L-1·...·W 1·H)。 In this embodiment, f(x)=p 1/L ·x (parameter p>0) is the activation function of the BP neural network, and the bias vector b=0 is set, and the loss function of the BP neural network can be defined as :
Figure PCTCN2019125298-appb-000007
Begin building the BP-Deep NMF model, input the highest feature amount H into the input layer of the BP-Deep NMF model, then the input of the BP neural network input layer is a 0 = H, the number of layers of the BP neural network is L, and the BP neural network The weighted value of the l layer is Z l =W l a l-1 , the output of the l layer of the BP neural network is a l =p 1/L ·Z l , where l=1,...,L . Based on the training sample data X, the first linear parameter p, and the first weight W of the BP neural network, output the first output error of the neurons in the output layer of the BP neural network δ L = p 1/L · (Xp · W L · W L -1 ·...·W 1 ·H).

步骤c,基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF模型中输入层神经元和隐含层神经元的第二输出误差;Step c: Determine the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model based on the first output error, the first linear parameter and the first weight;

在本实施例中,输出BP神经网络输出层神经元的第一输出误差后,基于所求得的第一输出误差、第一线性参数p和第一权重W,输出BP-Deep NMF模型中第l层输入层神经元或隐含层神经元的第二输出误差,第二输出误差δ l 的计算公式如下: In this embodiment, after outputting the first output error of the neurons in the output layer of the BP neural network, based on the obtained first output error, the first linear parameter p, and the first weight W, the first output error in the BP-Deep NMF model is output The second output error of the input layer neuron or hidden layer neuron of the l layer, the calculation formula of the second output error δ l is as follows:

Figure PCTCN2019125298-appb-000008
Figure PCTCN2019125298-appb-000008

其中,l=L-1,L-2,...,1,W为BP神经网络的权重。Among them, l=L-1,L-2,...,1, and W is the weight of the BP neural network.

步骤d,基于所述第一输出误差和所述第二输出误差,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Step d: Based on the first output error and the second output error, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and combine the The second weight is used as the base image amount of the face image to be clustered.

在本实施例中,基于第一输出误差和第二输出误差,通过迭代BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后各个神经元的第二权重,输出该第二权重,并将BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以确定待聚类人脸图像的基图像量W,具体地,输入最高特征量H以求BP神经网络的正向输出,求得BP神经网络的正向输出后,基于正向输出确定第一输出误差和第二输出误差,以求BP神经网络的反向输出,以更新BP-Deep NMF模型的模型参数即BP-Deep NMF模型中每个神经元的权重,以供后续输出BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以便于进一步基于该基图像量将待聚类人脸图像进行聚类。In this embodiment, based on the first output error and the second output error, by iterating the BP-Deep NMF model, the second weight of each neuron after the BP-Deep NMF model training is completed is determined, and the second weight is output, and The second weight after the training of the BP-Deep NMF model is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered, specifically, the highest feature amount H is input to obtain BP The forward output of the neural network, after the forward output of the BP neural network is obtained, the first output error and the second output error are determined based on the forward output to obtain the reverse output of the BP neural network to update the BP-Deep NMF model The model parameter of the BP-Deep NMF model is the weight of each neuron in the BP-Deep NMF model, and the second weight after the completion of the BP-Deep NMF model training is used as the base image amount of the face image to be clustered, so as to be further based on this The amount of base image clusters the face images to be clustered.

进一步地,一实施例中,所述基于所述第一输出误差和所述第二输出误差,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量的步骤包括:Further, in an embodiment, the determining the first output error of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed based on the first output error and the second output error Two weights, and the step of using the second weight as the base image amount of the face image to be clustered includes:

步骤e,基于所述第一输出误差和所述第二输出误差,确定所述BP-Deep NMF模型中各个神经元的权重偏置量;Step e: Determine the weight offset of each neuron in the BP-Deep NMF model based on the first output error and the second output error;

在本实施例中,在确定第一输出误差δ L和第二输出误差δ l后,BP神经网络的每个神经元根据所输出的第一输出误差δ L和第二输出误差δ l,计算BP-Deep NMF模型中各个神经元的权重偏置量,权重偏置量

Figure PCTCN2019125298-appb-000009
的计算公式如下: In this embodiment, after determining the first output error δ L and the second output error δ l , each neuron of the BP neural network calculates according to the output first output error δ L and the second output error δ l The weight bias of each neuron in the BP-Deep NMF model, the weight bias
Figure PCTCN2019125298-appb-000009
The calculation formula is as follows:

Figure PCTCN2019125298-appb-000010
Figure PCTCN2019125298-appb-000010

其中,

Figure PCTCN2019125298-appb-000011
为第l-1层神经元的输出a l-1的转置,δ l为第二输出误差,以及参数l为l=L,...,1。 among them,
Figure PCTCN2019125298-appb-000011
Is the transpose of the output a l-1 of the neuron of the l-1th layer , δ l is the second output error, and the parameter l is l=L,...,1.

步骤f,基于所述权重偏置量,确定在所述BP-Deep NMF模型训练完成 后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Step f: Based on the weight bias, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the waiting The amount of base images for clustering face images.

在本实施例中,基于权重偏置量,通过迭代BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后各个神经元的第二权重,输出该第二权重,并将BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以确定待聚类人脸图像的基图像量W,具体地,输入最高特征量H以求BP神经网络的正向输出,求得BP神经网络的正向输出后,基于正向输出确定第一输出误差和第二输出误差,以求BP神经网络的反向输出,以更新BP-Deep NMF模型的模型参数即BP-Deep NMF模型中每个神经元的权重,以供后续输出BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以便于进一步地基于该基图像量将待聚类人脸图像进行聚类。In this embodiment, based on the weight offset, the BP-Deep NMF model is iterated to determine the second weight of each neuron after the BP-Deep NMF model training is completed, and the second weight is output, and the BP-Deep NMF After the model training is completed, the second weight is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered. Specifically, the highest feature amount H is input to obtain the forward direction of the BP neural network Output, after obtaining the forward output of the BP neural network, determine the first output error and the second output error based on the forward output to obtain the reverse output of the BP neural network to update the model parameter of the BP-Deep NMF model, namely BP -The weight of each neuron in the Deep NMF model for subsequent output. The second weight after the completion of the BP-Deep NMF model training is used as the base image amount of the face image to be clustered, so that the base image amount can be further calculated based on the base image amount. The face images to be clustered are clustered.

进一步地,一实施例中,其中,所述基于所述权重偏置量,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量的步骤包括:Further, in an embodiment, wherein, based on the weight bias, the second weight of each neuron in the BP-Deep NMF model is determined after the training of the BP-Deep NMF model is completed, and The step of using the second weight as the base image amount of the face images to be clustered includes:

步骤g,获取所述BP-Deep NMF模型的学习率;Step g: Obtain the learning rate of the BP-Deep NMF model;

步骤h,基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重;Step h, based on the weight offset and the learning rate, determine the non-negative weight of each neuron in the BP-Deep NMF model through the projection of the projection gradient method;

在本实施例中,在确定BP-Deep NMF模型权重偏置量后,获取BP-Deep NMF模型的学习率r,以供更新BP神经网络的权重参数,基于权重偏置量和学习率更新BP神经网络的权重参数的计算公式如下:In this embodiment, after determining the weight offset of the BP-Deep NMF model, the learning rate r of the BP-Deep NMF model is obtained for updating the weight parameters of the BP neural network, and the BP is updated based on the weight offset and the learning rate The calculation formula of the weight parameter of the neural network is as follows:

Figure PCTCN2019125298-appb-000012
Figure PCTCN2019125298-appb-000012

其中,l=L,...,1。Among them, l=L,...,1.

更新BP神经网络的权重参数后,通过投影梯度法将所更新的BP神经网络的权重参数进行投影,以投影成非负的权重参数,投影权重参数确定BP-Deep NMF模型中各个神经元的非负权重。After updating the weight parameters of the BP neural network, the updated weight parameters of the BP neural network are projected by the projection gradient method to project into non-negative weight parameters. The projection weight parameters determine the non-negative parameters of each neuron in the BP-Deep NMF model. Negative weight.

步骤i,基于所述非负权重,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。Step i: Based on the non-negative weight, determine the second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the person The amount of the base image of the face image.

在本实施例中,基于所投影的非负权重,通过迭代BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后各个神经元的第二权重,输出该第二权 重,并将BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以确定待聚类人脸图像的基图像量W,具体地,输入最高特征量H以求BP神经网络的正向输出,求得BP神经网络的正向输出后,基于正向输出确定第一输出误差和第二输出误差,以求BP神经网络的反向输出,以更新BP-Deep NMF模型的模型参数即BP-Deep NMF模型中每个神经元的权重,以供后续输出BP-Deep NMF模型训练完成后的第二权重作为待聚类人脸图像的基图像量,以便于进一步地基于该基图像量将待聚类人脸图像进行聚类。In this embodiment, based on the projected non-negative weights, by iterating the BP-Deep NMF model, determine the second weight of each neuron after the BP-Deep NMF model training is completed, output the second weight, and add the BP-Deep NMF model. After Deep NMF model training is completed, the second weight is used as the base image amount of the face image to be clustered to determine the base image amount W of the face image to be clustered. Specifically, the highest feature amount H is input to obtain the BP neural network Forward output, after obtaining the forward output of the BP neural network, determine the first output error and the second output error based on the forward output to obtain the reverse output of the BP neural network to update the model parameters of the BP-Deep NMF model That is, the weight of each neuron in the BP-Deep NMF model is used to output the second weight after the completion of the BP-Deep NMF model training as the base image amount of the face image to be clustered, so as to be further based on the base image The face images to be clustered are clustered.

进一步地,一实施例中,所述基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重的步骤之后,还包括:Further, in an embodiment, after the step of determining the non-negative weight of each neuron in the BP-Deep NMF model through the projection of the projection gradient method based on the weight offset and the learning rate, further include:

步骤j,基于所述训练样本数据和所述非负权重,确定所述BP-Deep NMF模型中的激活函数对应的第二线性参数,并将所述第二线性参数作为所述第一线性参数,执行所述基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差,以及基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF模型中输入层神经元和隐含层神经元的第二输出误差的步骤。Step j: Determine a second linear parameter corresponding to the activation function in the BP-Deep NMF model based on the training sample data and the non-negative weight, and use the second linear parameter as the first linear parameter , Performing the determining the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight, and determining the first output error based on the first output The error, the first linear parameter, and the first weight determine the step of determining the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model.

在本实施例中,在所述基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重的步骤之后,即在求得BP-Deep NMF模型中各个神经元的非负权重的步骤之后,更新BP-Deep NMF模型中的激活函数的线性参数,也就是确定BP-Deep NMF模型中的激活函数对应的第二线性参数,将第二线性参数作为第一线性参数,激活函数的线性参数的更新公式如下:In this embodiment, after the step of determining the non-negative weight of each neuron in the BP-Deep NMF model through the projection of the projection gradient method based on the weight offset and the learning rate, that is, After obtaining the non-negative weight of each neuron in the BP-Deep NMF model, update the linear parameters of the activation function in the BP-Deep NMF model, that is, determine the second linearity corresponding to the activation function in the BP-Deep NMF model Parameters, taking the second linear parameter as the first linear parameter, the update formula of the linear parameter of the activation function is as follows:

Figure PCTCN2019125298-appb-000013
Figure PCTCN2019125298-appb-000013

可以理解的是,为了确定最优参数p,需要求解如下问题的子优化:It is understandable that in order to determine the optimal parameter p, sub-optimization of the following problem needs to be solved:

Figure PCTCN2019125298-appb-000014
Figure PCTCN2019125298-appb-000014

采用梯度下降法对参数p进行求解,有:Using gradient descent method to solve the parameter p, there are:

Figure PCTCN2019125298-appb-000015
Figure PCTCN2019125298-appb-000015

其中ρ(p (t))是关于p的步长向量,

Figure PCTCN2019125298-appb-000016
是C deep关于p (t)的导数,可以计算得: Where ρ(p (t) ) is the step vector of p,
Figure PCTCN2019125298-appb-000016
Is the derivative of C deep with respect to p (t) , which can be calculated as:

Figure PCTCN2019125298-appb-000017
Figure PCTCN2019125298-appb-000017

为了保证p (t+1)的非负性,令: In order to ensure the non-negativity of p (t+1), let:

Figure PCTCN2019125298-appb-000018
Figure PCTCN2019125298-appb-000018

结合以上公式,即可得到激活函数的线性参数的更新公式。Combining the above formula, the update formula of the linear parameter of the activation function can be obtained.

进一步地,一实施例中,所述基于所述非负权重,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量的步骤包括:Further, in an embodiment, the second weight of neurons in each layer of the BP-Deep NMF model after the completion of the training of the BP-Deep NMF model is determined based on the non-negative weight, and the The step of using the second weight as the base image amount of the face image includes:

步骤k,获取所述BP-Deep NMF模型的训练次数以及所述BP-Deep NMF模型的损失函数值;Step k: Obtain the number of training times of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model;

步骤l,若所述训练次数达到最大训练次数或者所述损失函数值小于或等于模型误差阈值,则停止训练所述BP-Deep NMF模型,将所述非负权重作为所述第二权重,确定所述BP-Deep NMF模型训练完成后的所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。Step 1. If the number of training times reaches the maximum number of training times or the value of the loss function is less than or equal to the model error threshold, stop training the BP-Deep NMF model, use the non-negative weight as the second weight, and determine The second weight of each layer of neurons in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and the second weight is used as the base image amount of the face image.

在本实施例中,获取BP-Deep NMF模型的训练次数以及BP-Deep NMF模型的损失函数值,以检测BP-Deep NMF模型迭代是否完成。检测训练次数达到最大训练次数以及损失函数值小于或等于模型误差阈值,若损失函数值小于或等于模型误差阈值即C deep≤ε或迭代次数即训练次数达到最大训练次数I max,说明BP-Deep NMF模型迭代完成,则停止迭代BP-Deep NMF模型,输出BP-Deep NMF模型的权重参数矩阵W i(i=1,…,L)。 In this embodiment, the number of training times of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model are obtained to detect whether the iteration of the BP-Deep NMF model is completed. It is detected that the number of training times reaches the maximum number of training times and the loss function value is less than or equal to the model error threshold. If the loss function value is less than or equal to the model error threshold value, ie C deep ≤ε or the number of iterations, that is, the number of training times reaches the maximum number of training times I max , indicating BP-Deep NMF model iteration is complete, the iteration is stopped BP-Deep NMF model output weight NMF model parameter BP-Deep weight matrix W i (i = 1, ... , L).

否则,若训练次数未达最大训练次数,继续获取人脸图像的训练样本数据和训练样本数据对应的最高特征量,将训练样本数据和最高特征量输入至BP-Deep NMF模型中,并基于训练样本数据和最高特征量,通过训练BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后待聚类人脸图像的基图像量。Otherwise, if the number of training times has not reached the maximum number of training times, continue to obtain the training sample data of the face image and the highest feature amount corresponding to the training sample data, and input the training sample data and the highest feature amount into the BP-Deep NMF model, and based on the training For the sample data and the highest feature amount, the BP-Deep NMF model is trained to determine the base image amount of the face images to be clustered after the BP-Deep NMF model training is completed.

或者,损失函数值大于模型误差阈值,继续获取人脸图像的训练样本数据和训练样本数据对应的最高特征量,将训练样本数据和最高特征量输入至BP-Deep NMF模型中,并基于训练样本数据和最高特征量,通过训练BP-Deep NMF模型,确定在BP-Deep NMF模型训练完成后待聚类人脸图像的基图像量。Or, the loss function value is greater than the model error threshold, continue to obtain the training sample data of the face image and the highest feature amount corresponding to the training sample data, and input the training sample data and the highest feature amount into the BP-Deep NMF model, and based on the training sample Data and the highest feature amount, through training the BP-Deep NMF model, determine the base image amount of the face image to be clustered after the BP-Deep NMF model training is completed.

由此可见,一旦BP神经网络训练完成,可自动得到如下深度非负矩阵分 解:X≈W LW L-1…W 1H W i≥0,i=1,2,…,L.H≥0。 It can be seen that once the BP neural network training is completed, the following deep non-negative matrix decomposition can be automatically obtained: X≈W L W L-1 …W 1 H W i ≥0, i=1, 2,...,LH≥0.

本实施例提出的人脸图像聚类方法提出了一种基于BP神经网络的高性能深度非负矩阵分解(BP-Deep NMF)聚类模型,BP-Deep NMF方法使用径向基函数(RBF)构造神经网络的输入信号,该输入等同于深度非负矩阵分解中的最高层特征,并以原始训练数据作为网络的期望输出,模型的优化则采用BP神经网络算法对网络权重矩阵更新的法则。最终训练出的BP-Deep NMF模型能直接得到数据的深度非负矩阵分解,而不需要对分解进行微调,并且在人脸数据聚类任务中显示出了优越的性能。The face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP-Deep NMF) clustering model based on BP neural network. The BP-Deep NMF method uses radial basis functions (RBF) The input signal of the neural network is constructed. The input is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network. The optimization of the model adopts the BP neural network algorithm to update the network weight matrix. The finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.

基于第一实施例,提出本申请人脸图像聚类方法的第三实施例,在本实施例中,步骤S10包括:Based on the first embodiment, a third embodiment of the applicant’s face image clustering method is proposed. In this embodiment, step S10 includes:

步骤m,获取所述人脸图像,并将所述人脸图像转化为训练样本数据;Step m: Obtain the face image, and convert the face image into training sample data;

在本实施例中,人脸图像为用于训练BP-Deep NMF模型的人脸图像信息,包括不同表情、不同身份或者不同面部特征的人脸图像信息,在构建BP-Deep NMF模型之前的准备工作,将用于训练BP-Deep NMF模型的人脸图像转换成训练样本数据,以供后续进行对BP-Deep NMF模型的构建,训练样本数据带有数据标签,数据标签一般为不同的人,即人脸图像包括不同人的人脸图像信息。In this embodiment, the face image is the face image information used to train the BP-Deep NMF model, including face image information with different expressions, different identities, or different facial features. Preparations before constructing the BP-Deep NMF model Work, convert the face image used to train the BP-Deep NMF model into training sample data for subsequent construction of the BP-Deep NMF model. The training sample data has data labels, and the data labels are generally different people. That is, the face image includes the face image information of different people.

步骤n,获取所述待聚类人脸图像,并将所述待聚类人脸图像转化为待聚类数据集;Step n: Obtain the face image to be clustered, and convert the face image to be clustered into a data set to be clustered;

在本实施例中,构建BP-Deep NMF模型完成后,获取待聚类人脸图像,并将待聚类人脸图像转化为待聚类数据集,以供后续将待聚类人脸图像进行聚类。In this embodiment, after the construction of the BP-Deep NMF model is completed, the face image to be clustered is obtained, and the face image to be clustered is converted into a data set to be clustered for subsequent processing of the face image to be clustered. Clustering.

步骤o,基于所述训练样本数据,通过预设计算方法,确定所述训练样本数据对应的最高特征量。Step o, based on the training sample data, determine the highest feature amount corresponding to the training sample data through a preset calculation method.

在本实施例中,在构建BP-Deep NMF模型之前,使用径向基函数(RBF)将训练样本数据转换成最高特征量,以供后续进行对BP-Deep NMF模型的构建,径向基函数(RBF)构造神经网络的输入信号即最高特征量,该输入等同于深度非负矩阵分解中的最高层特征。假设训练样本数目为n和训练样本数据矩阵即训练样本数据为X=(x 1,x 2,…,x n)。利用径向基函数(RBF)

Figure PCTCN2019125298-appb-000019
(其中为空间x,y中两个数据点,t≥0)生成算法最高层特征矩阵即最高特征量H=(H ij) n×n,其中,最高特征量
Figure PCTCN2019125298-appb-000020
当x i与x j属于同一个人的不同脸部特征或者不同表情的人脸图像时,x i与x j属于同一类,则H ij=k(x i,x j);当x i与x j属于不同人的脸部特征或者表情的人脸图像时,x i与x j属于不同类,则H ij=0。因此,显然,最高特征量H是对角分块矩阵,这种结构本身具有很好的聚类性质。 In this embodiment, before constructing the BP-Deep NMF model, the radial basis function (RBF) is used to convert the training sample data into the highest feature quantity for subsequent construction of the BP-Deep NMF model. The radial basis function (RBF) The input signal of the neural network is the highest feature quantity, which is equivalent to the highest level feature in the deep non-negative matrix factorization. Assume that the number of training samples is n and the training sample data matrix, that is, the training sample data is X=(x 1 , x 2 ,..., x n ). Utilize Radial Basis Function (RBF)
Figure PCTCN2019125298-appb-000019
(Among them are two data points in space x and y, t≥0) Generate the highest feature matrix of the algorithm, that is, the highest feature quantity H=(H ij ) n×n , where the highest feature quantity
Figure PCTCN2019125298-appb-000020
When x i and x j belong to the same person’s face images with different facial features or different expressions, x i and x j belong to the same category, then H ij =k(x i , x j ); when x i and x When j belongs to facial images of different people's facial features or expressions, and x i and x j belong to different classes, then H ij =0. Therefore, it is obvious that the highest feature quantity H is a diagonal block matrix, and this structure itself has good clustering properties.

进一步地,一实施例中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:Further, in an embodiment, the face image to be clustered is classified by a preset clustering rule based on the amount of base images and the data set to be clustered, and the data to be clustered is determined The steps of obtaining the classification result of the face image to be clustered include:

步骤p,基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;Step p, extracting the hidden feature amount of each layer feature in the to-be-clustered data set based on the base image amount and the to-be-clustered data set;

在本实施例中,在BP-Deep NMF模型构建完成后,输出BP-Deep NMF模型的待聚类人脸图像的基图像量W,基于基图像量W和待聚类数据集Y=(y 1,y 2,…,y m),计算待聚类数据集中的每个样本y k(k=1,…,m)的第i层特征向量即隐藏特征量

Figure PCTCN2019125298-appb-000021
隐藏特征量
Figure PCTCN2019125298-appb-000022
的计算公式如下: In this embodiment, after the construction of the BP-Deep NMF model is completed, the base image amount W of the face image to be clustered of the BP-Deep NMF model is output, based on the base image amount W and the data set to be clustered Y=(y 1 ,y 2 ,...,y m ), calculate the i-th layer feature vector of each sample y k (k=1,...,m) in the data set to be clustered, that is, the hidden feature quantity
Figure PCTCN2019125298-appb-000021
Hidden feature
Figure PCTCN2019125298-appb-000022
The calculation formula is as follows:

Figure PCTCN2019125298-appb-000023
Figure PCTCN2019125298-appb-000023

步骤q,基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Step q: Based on the hidden feature quantity, classify the face image to be clustered by preset clustering rules, determine the clustering result of the data set to be clustered, and obtain the clustering result of the face image.

在本实施例中,基于隐藏特征量,运用预设聚类规则即k均值聚类法分别对样本第i层特征向量集

Figure PCTCN2019125298-appb-000024
进行聚类确定待聚类数据集的聚类结果,最终输出聚类结果得到待聚类人脸图像的分类结果,隐藏特征量可以体现在人脸图像信息的身份或者表情或者姿势特征等。例如,基于姿势特征的一级隐藏特征,可以将待聚类人脸图像分类成不同姿势特征的聚类结果,例如头部朝向角度分别为0度、30度或者-30度等等角度的聚类结果;基于表情特征的二级隐藏特征,可以将待聚类人脸图像分类成不同表情特征的聚类结果,例如表情开心、愤怒或者疑惑等不同表情特征的聚类结果;基于身份特征的三级隐藏特征,可以将待聚类人脸图像分类成不同人的聚类结果,也就是说以人为分类依据将待聚类人脸图像分类。 In this embodiment, based on the hidden feature amount, the preset clustering rule, namely the k-means clustering method, is used to separately analyze the feature vector set of the i-th layer of the sample.
Figure PCTCN2019125298-appb-000024
Perform clustering to determine the clustering result of the data set to be clustered, and finally output the clustering result to obtain the classification result of the face image to be clustered. The hidden feature amount can be reflected in the identity or expression or posture characteristics of the face image information. For example, based on the first-level hidden features of posture features, the face images to be clustered can be classified into clustering results of different posture features, for example, the head orientation angles are 0 degrees, 30 degrees, or -30 degrees. Class results; based on the secondary hidden features of expression features, the face images to be clustered can be classified into clustering results of different expression features, such as the clustering results of different expression features such as happy, angry, or confused; based on identity features The three-level hidden features can classify the face images to be clustered into clustering results of different people, that is to say, classify the face images to be clustered on the basis of human classification.

本实施例提出的人脸图像聚类方法提出了一种基于BP神经网络的高性能深度非负矩阵分解(BP Deep NMF)聚类模型,BP Deep NMF方法使用径向基函数(RBF)构造神经网络的输入信号,该输入等同于深度非负矩阵分解中的 最高层特征,并以原始训练数据作为网络的期望输出。模型的优化则采用BP神经网络算法对网络权重矩阵更新的法则。最终训练出的BP-Deep NMF模型能直接得到数据的深度非负矩阵分解,而不需要对分解进行微调,并且在人脸数据聚类任务中显示出了优越的性能。The face image clustering method proposed in this embodiment proposes a high-performance deep non-negative matrix factorization (BP Deep NMF) clustering model based on BP neural network. The BP Deep NMF method uses radial basis functions (RBF) to construct neural networks. The input signal of the network is equivalent to the highest-level feature in the deep non-negative matrix factorization, and the original training data is used as the expected output of the network. The optimization of the model uses the BP neural network algorithm to update the network weight matrix. The finally trained BP-Deep NMF model can directly obtain the deep non-negative matrix decomposition of the data without fine-tuning the decomposition, and shows superior performance in the task of face data clustering.

此外,本申请实施例还提出一种计算机可读存储介质,所述计算机存储介质存储有人脸图像聚类程序,所述人脸图像聚类程序还可被处理器执行以用于实现上述人脸图像聚类方法各实施例的步骤。In addition, an embodiment of the present application also proposes a computer-readable storage medium that stores a human face image clustering program, and the human face image clustering program may also be executed by a processor to implement the above-mentioned face image clustering program. The steps of each embodiment of the image clustering method.

本申请计算机存储介质的具体实施方式的拓展内容与上述人脸图像聚类方法各实施例基本相同,在此不做赘述。The expanded content of the specific implementation of the computer storage medium of this application is basically the same as each embodiment of the above-mentioned face image clustering method, and will not be repeated here.

需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or system including a series of elements not only includes those elements, It also includes other elements that are not explicitly listed, or elements inherent to the process, method, article, or system. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, article, or system that includes the element.

上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the foregoing embodiments of the present application are for description only, and do not represent the superiority or inferiority of the embodiments.

通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。Through the description of the above implementation manners, those skilled in the art can clearly understand that the above-mentioned embodiment method can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM) as described above. , Magnetic disks, optical disks), including several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the method described in each embodiment of the present application. The above are only the preferred embodiments of the application, and do not limit the scope of the patent for this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of the application, or directly or indirectly applied to other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims (16)

一种人脸图像聚类方法,其中,所述人脸图像聚类方法包括以下步骤:A face image clustering method, wherein the face image clustering method includes the following steps: 获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered; 将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;以及,Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered; and, 基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image. 如权利要求1所述的人脸图像聚类方法,其中,所述将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量的步骤包括:The face image clustering method according to claim 1, wherein said inputting said training sample data and said highest feature amount into a BP-Deep NMF model, training said BP-Deep NMF model, and determining The step of the amount of base images of the face images to be clustered includes: 获取所述BP-Deep NMF模型中的激活函数对应的第一线性参数以及所述BP-Deep NMF模型中各个神经元的第一权重;Acquiring the first linear parameter corresponding to the activation function in the BP-Deep NMF model and the first weight of each neuron in the BP-Deep NMF model; 基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差;Determine the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data, and the first weight; 基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF模型中输入层神经元和隐含层神经元的第二输出误差;以及,Determine the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model based on the first output error, the first linear parameter, and the first weight; and, 基于所述第一输出误差和所述第二输出误差,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Based on the first output error and the second output error, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and set the second weight As the base image amount of the face image to be clustered. 如权利要求2所述的人脸图像聚类方法,其中,所述基于所述第一输出误差和所述第二输出误差,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量的步骤包括:3. The face image clustering method according to claim 2, wherein, based on the first output error and the second output error, it is determined that the BP-Deep NMF model is trained after the completion of the BP-Deep NMF model training. The second weight of each neuron in the NMF model, and the step of using the second weight as the base image amount of the face image to be clustered includes: 基于所述第一输出误差和所述第二输出误差,确定所述BP-Deep NMF模型中各个神经元的权重偏置量;以及,Determining the weight offset of each neuron in the BP-Deep NMF model based on the first output error and the second output error; and, 基于所述权重偏置量,确定在所述BP-Deep NMF模型训练完成后所述 BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量。Based on the weight bias, determine the second weight of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed, and use the second weight as the person to be clustered The amount of the base image of the face image. 如权利要求3所述的人脸图像聚类方法,其中,所述基于所述权重偏置量,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各个神经元的第二权重,并将所述第二权重作为所述待聚类人脸图像的基图像量的步骤包括:The face image clustering method of claim 3, wherein the weight bias is used to determine the value of each neuron in the BP-Deep NMF model after the training of the BP-Deep NMF model is completed. The second weight, and the step of using the second weight as the base image amount of the face image to be clustered includes: 获取所述BP-Deep NMF模型的学习率;Acquiring the learning rate of the BP-Deep NMF model; 基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重;以及,Based on the weight offset and the learning rate, the non-negative weight of each neuron in the BP-Deep NMF model is determined by the projection of the projection gradient method; and, 基于所述非负权重,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。Based on the non-negative weight, determine the second weight of each layer of neurons in the BP-Deep NMF model after the BP-Deep NMF model training is completed, and use the second weight as the face image Base image volume. 如权利要求4所述的人脸图像聚类方法,其中,所述基于所述权重偏置量和所述学习率,通过投影梯度法的投影确定所述BP-Deep NMF模型中各个神经元的非负权重的步骤之后,还包括:5. The face image clustering method according to claim 4, wherein, based on the weight offset and the learning rate, the projection of the projection gradient method is used to determine the value of each neuron in the BP-Deep NMF model. After the step of non-negative weighting, it also includes: 基于所述训练样本数据和所述非负权重,确定所述BP-Deep NMF模型中的激活函数对应的第二线性参数,并将所述第二线性参数作为所述第一线性参数,执行所述基于所述第一线性参数、所述训练样本数据和所述第一权重,确定所述BP-Deep NMF模型中输出层神经元的第一输出误差,以及基于所述第一输出误差、所述第一线性参数和所述第一权重,确定所述BP-Deep NMF模型中输入层神经元和隐含层神经元的第二输出误差的步骤。Based on the training sample data and the non-negative weights, determine the second linear parameter corresponding to the activation function in the BP-Deep NMF model, and use the second linear parameter as the first linear parameter to perform all Said determining the first output error of the output layer neuron in the BP-Deep NMF model based on the first linear parameter, the training sample data and the first weight; The first linear parameter and the first weight determine the step of determining the second output error of the input layer neuron and the hidden layer neuron in the BP-Deep NMF model. 如权利要求4所述的人脸图像聚类方法,其中,所述基于所述非负权重,确定在所述BP-Deep NMF模型训练完成后所述BP-Deep NMF模型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量的步骤包括:The face image clustering method according to claim 4, wherein the determination is based on the non-negative weight to determine the level of neurons in each layer of the BP-Deep NMF model after the training of the BP-Deep NMF model is completed. The second weight, and the step of using the second weight as the base image amount of the face image includes: 获取所述BP-Deep NMF模型的训练次数以及所述BP-Deep NMF模型的损失函数值;以及,Acquiring the number of training times of the BP-Deep NMF model and the loss function value of the BP-Deep NMF model; and, 若所述训练次数达到最大训练次数或者所述损失函数值小于或等于模型误差阈值,则停止训练所述BP-Deep NMF模型,将所述非负权重作为所述第二权重,确定所述BP-Deep NMF模型训练完成后的所述BP-Deep NMF模 型中各层神经元的第二权重,并将所述第二权重作为所述人脸图像的基图像量。If the number of training times reaches the maximum number of training times or the value of the loss function is less than or equal to the model error threshold, stop training the BP-Deep NMF model, use the non-negative weight as the second weight, and determine the BP -The second weight of each layer of neurons in the BP-Deep NMF model after the Deep NMF model training is completed, and the second weight is used as the base image amount of the face image. 如权利要求1所述的人脸图像聚类方法,其中,所述获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集的步骤包括:The face image clustering method according to claim 1, wherein the acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered The steps include: 获取所述人脸图像,并将所述人脸图像转化为训练样本数据;Acquiring the face image, and converting the face image into training sample data; 获取所述待聚类人脸图像,并将所述待聚类人脸图像转化为待聚类数据集;以及,Acquiring the face image to be clustered, and converting the face image to be clustered into a data set to be clustered; and, 基于所述训练样本数据,通过预设计算方法,确定所述训练样本数据对应的最高特征量。Based on the training sample data, the highest feature amount corresponding to the training sample data is determined through a preset calculation method. 如权利要求1所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 1, wherein the face image to be clustered is classified by a preset clustering rule based on the amount of the base image and the data set to be clustered , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求2所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 2, wherein the face image to be clustered is classified by a preset clustering rule based on the amount of the base image and the data set to be clustered , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求3所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 3, wherein the face image to be clustered is classified by a preset clustering rule based on the amount of the base image and the data set to be clustered , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求4所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 4, wherein the face image to be clustered is classified by a preset clustering rule based on the amount of the base image and the data set to be clustered , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature amount of each layer feature in the to-be-clustered data set based on the base image amount and the to-be-clustered data set; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求5所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method of claim 5, wherein the face image to be clustered is classified based on the amount of the base image and the data set to be clustered by using a preset clustering rule , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求6所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 6, wherein the face image to be clustered is classified based on the amount of the base image and the data set to be clustered by using a preset clustering rule , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 如权利要求7所述的人脸图像聚类方法,其中,所述基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分 类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果的步骤包括:The face image clustering method according to claim 7, wherein the face image to be clustered is classified by a preset clustering rule based on the amount of the base image and the data set to be clustered , The step of determining the clustering result of the data set to be clustered and obtaining the classification result of the face image to be clustered includes: 基于所述基图像量和所述待聚类数据集,提取所述待聚类数据集中各层特征的隐藏特征量;以及,Extracting the hidden feature quantities of the features of each layer in the data set to be clustered based on the base image amount and the data set to be clustered; and, 基于所述隐藏特征量,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到人脸图像的聚类结果。Based on the hidden feature amount, the face image to be clustered is classified by a preset clustering rule, the clustering result of the data set to be clustered is determined, and the clustering result of the face image is obtained. 一种人脸图像聚类装置,其中,所述人脸图像聚类装置包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,所述计算机可读指令被所述处理器执行时,实现如下步骤:A face image clustering device, wherein the face image clustering device includes: a memory, a processor, and computer-readable instructions stored on the memory and running on the processor, the computer When the readable instruction is executed by the processor, the following steps are implemented: 获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered; 将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;以及,Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered; and, 基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有计算机可读指令,所述计算机可读指令被处理器执行时,实现如下步骤:A computer-readable storage medium, wherein computer-readable instructions are stored on the computer-readable storage medium, and when the computer-readable instructions are executed by a processor, the following steps are implemented: 获取人脸图像的训练样本数据、所述训练样本数据对应的最高特征量和待聚类人脸图像的待聚类数据集;Acquiring training sample data of the face image, the highest feature amount corresponding to the training sample data, and the to-be-clustered data set of the face image to be clustered; 将所述训练样本数据和所述最高特征量输入至BP-Deep NMF模型中,训练所述BP-Deep NMF模型,并确定所述待聚类人脸图像的基图像量;以及,Input the training sample data and the highest feature amount into a BP-Deep NMF model, train the BP-Deep NMF model, and determine the base image amount of the face image to be clustered; and, 基于所述基图像量和所述待聚类数据集,通过预设聚类规则将所述待聚类人脸图像进行分类,确定所述待聚类数据集的聚类结果,得到待聚类人脸图像的分类结果。Based on the amount of base images and the data set to be clustered, the face images to be clustered are classified by preset clustering rules, the clustering result of the data set to be clustered is determined, and the data set to be clustered is obtained The classification result of the face image.
PCT/CN2019/125298 2019-12-13 2019-12-13 Facial image clustering method and apparatus, and computer-readable storage medium Ceased WO2021114262A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/125298 WO2021114262A1 (en) 2019-12-13 2019-12-13 Facial image clustering method and apparatus, and computer-readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/125298 WO2021114262A1 (en) 2019-12-13 2019-12-13 Facial image clustering method and apparatus, and computer-readable storage medium

Publications (1)

Publication Number Publication Date
WO2021114262A1 true WO2021114262A1 (en) 2021-06-17

Family

ID=76328802

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/125298 Ceased WO2021114262A1 (en) 2019-12-13 2019-12-13 Facial image clustering method and apparatus, and computer-readable storage medium

Country Status (1)

Country Link
WO (1) WO2021114262A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116245157A (en) * 2023-01-16 2023-06-09 北京百度网讯科技有限公司 Human facial expression representation model training method, human facial expression recognition method and device
CN113642607B (en) * 2021-07-13 2023-07-14 深圳大学 Face image clustering method, device, equipment and computer-readable storage medium
CN118246744A (en) * 2024-04-12 2024-06-25 中国水利水电第四工程局有限公司 Risk assessment method and system for construction site of extra-long tunnel

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8515879B2 (en) * 2010-08-11 2013-08-20 Seiko Epson Corporation Supervised nonnegative matrix factorization
CN109508737A (en) * 2018-10-31 2019-03-22 江苏理工学院 Constrained concept based on matrix of depths decomposes clustering method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8515879B2 (en) * 2010-08-11 2013-08-20 Seiko Epson Corporation Supervised nonnegative matrix factorization
CN109508737A (en) * 2018-10-31 2019-03-22 江苏理工学院 Constrained concept based on matrix of depths decomposes clustering method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHENG JINGTIAN: "Based on NMFs and RBF Neural Nctwork Face Recognition Algorithm Research", CHINESE MASTER'S THESES FULL-TEXT DATABASE, INFORMATION SCIENCE, VOL. S2, 2013, I138-1217, 1 January 2013 (2013-01-01), XP055819861 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113642607B (en) * 2021-07-13 2023-07-14 深圳大学 Face image clustering method, device, equipment and computer-readable storage medium
CN116245157A (en) * 2023-01-16 2023-06-09 北京百度网讯科技有限公司 Human facial expression representation model training method, human facial expression recognition method and device
CN118246744A (en) * 2024-04-12 2024-06-25 中国水利水电第四工程局有限公司 Risk assessment method and system for construction site of extra-long tunnel

Similar Documents

Publication Publication Date Title
CN110942108B (en) Face image clustering method, device and computer-readable storage medium
CN111626128B (en) A Pedestrian Detection Method Based on Improved YOLOv3 in Orchard Environment
CN110909820B (en) Image classification method and system based on self-supervision learning
CN103605972B (en) Non-restricted environment face verification method based on block depth neural network
CN111291678B (en) Face image clustering method and device based on multi-feature fusion
CN108960127B (en) Re-identification of occluded pedestrians based on adaptive deep metric learning
CN110782015A (en) Training method, device and storage medium for network structure optimizer of neural network
WO2021022521A1 (en) Method for processing data, and method and device for training neural network model
CN110163258A (en) A kind of zero sample learning method and system reassigning mechanism based on semantic attribute attention
CN111291604A (en) Facial attribute recognition method, device, storage medium and processor
CN113255543B (en) Facial expression recognition method based on graph convolution network
CN115100476B (en) Fine granularity classification method based on structure modeling local proposal network
CN113076905B (en) A Method of Emotion Recognition Based on Context Interaction Relationship
CN107563407B (en) Feature representation learning system for multi-modal big data of network space
CN112732921B (en) False user comment detection method and system
CN112926645B (en) Electricity stealing detection method based on edge calculation
CN111126389A (en) Text detection method, device, electronic device and storage medium
WO2021003637A1 (en) Kernel non-negative matrix factorization face recognition method, device and system based on additive gaussian kernel, and storage medium
CN118378128A (en) Multi-mode emotion recognition method based on staged attention mechanism
CN115063832A (en) A cross-modal person re-identification method based on adversarial learning based on global and local features
CN115100709B (en) Feature separation image face recognition and age estimation method
WO2021114262A1 (en) Facial image clustering method and apparatus, and computer-readable storage medium
CN107403191A (en) A kind of semi-supervised learning machine sorting technique that transfinites with depth structure
CN113704528B (en) Cluster center determining method, device and equipment and computer storage medium
CN112381047B (en) Enhanced recognition method for facial expression image

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19955458

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24.10.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19955458

Country of ref document: EP

Kind code of ref document: A1