CN111325268A

CN111325268A - Image classification method and device based on multi-level feature representation and integrated learning

Info

Publication number: CN111325268A
Application number: CN202010099868.7A
Authority: CN
Inventors: 刘锦; 王建新; 项艺珍; 王宇菲
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2020-02-18
Filing date: 2020-02-18
Publication date: 2020-06-23

Abstract

The invention discloses an image classification method and device based on multi-level feature representation and integrated learning. The method includes the following steps: Step 1: acquiring multiple images and preprocessing them; image, extract the feature vectors of three different levels of points, edges and networks respectively; Step 3: Perform feature selection on the feature vectors of the three different levels respectively, and obtain the optimal feature vectors of the three different levels; Step 4: Construct the Three weak classifiers based on single-level features are trained using the optimal feature vectors of the three levels of the image sample respectively, and three trained weak classifiers are obtained; Step 5: The optimal features of the three levels of the image to be tested are used for training. The vectors are respectively input to the corresponding three trained weak classifiers to obtain three classification results of the image to be tested; the classification results of the three weak classifiers are fused to obtain the final classification label of the image to be tested. The present invention has high classification accuracy.

Description

Image classification method and device based on multi-level feature representation and ensemble learning

技术领域technical field

本发明具体涉及一种基于多层次特征表示和集成学习的影像分类方法及装置。The invention specifically relates to an image classification method and device based on multi-level feature representation and integrated learning.

背景技术Background technique

近些年来，神经影像技术在神经学研究中的广泛应用，加速了神经影像学的发展，推动脑科学研究进入高速发展时期。磁共振成像技术的发展为人们提供了一种无创的方式去研究人脑解剖结构和功能机制，常用的几种脑成像技术包括有结构磁共振成像(structural Magnetic Resonance Imaging，sMRI)，功能磁共振成像(functionalMagnetic Resonance Imaging，fMRI)，弥散张量成像(Diffusion Tensor Imaging，DTI)以及正电子发射断层扫描(Positron Emission Tomography，PET)等。不同的神经成像技术提供对大脑不同层面不同角度的刻画，从大脑的整体结构到脑区之间的功能协作，甚至到神经元的活动状态，为各种脑科学研究提供了数据基础。In recent years, the extensive application of neuroimaging technology in neurological research has accelerated the development of neuroimaging and pushed brain science research into a period of rapid development. The development of magnetic resonance imaging technology provides people with a non-invasive way to study the anatomical structure and functional mechanism of the human brain. Several commonly used brain imaging techniques include structural magnetic resonance imaging (sMRI), functional magnetic resonance imaging Imaging (functionalMagnetic Resonance Imaging, fMRI), diffusion tensor imaging (Diffusion Tensor Imaging, DTI) and positron emission tomography (Positron Emission Tomography, PET) and so on. Different neuroimaging technologies provide different perspectives on different levels of the brain, from the overall structure of the brain to the functional cooperation between brain regions, and even the activity state of neurons, providing a data basis for various brain research.

而目前基于脑成像数据特征来进行影像分类的方法通常只采用脑成像数据单一层次的特征，没有考虑使用多层次特征信息互补的优势来提高分类的准确性。因此，有必要提供一种能够充分利用多层次特征的信息互补优势，对影像进行准确分类的方法和装置。However, the current methods for image classification based on the features of brain imaging data usually only use the features of a single level of brain imaging data, without considering the complementary advantages of multi-level feature information to improve the accuracy of classification. Therefore, it is necessary to provide a method and device that can make full use of the information complementary advantages of multi-level features to accurately classify images.

发明内容SUMMARY OF THE INVENTION

本发明的目的是，针对现有技术的不足提供一种基于多层次特征表示和集成学习的影像分类方法及装置，提高了影像分类准确率。The purpose of the present invention is to provide an image classification method and device based on multi-level feature representation and integrated learning to improve the accuracy of image classification in view of the shortcomings of the prior art.

为实现上述目的，本发明的技术方案如下：For achieving the above object, technical scheme of the present invention is as follows:

一方面，提供一种基于多层次特征表示和集成学习的影像分类方法，包括以下步骤：In one aspect, an image classification method based on multi-level feature representation and ensemble learning is provided, including the following steps:

步骤1：获取多个影像，并对其进行预处理；Step 1: Acquire multiple images and preprocess them;

步骤2：根据预处理后的影像，分别提取其点、边及网络三种不同层次的特征向量：Step 2: According to the preprocessed image, extract the feature vectors of three different levels: point, edge and network:

将预处理后的影像划分为多个感兴趣区域(ROI)；Divide the preprocessed image into multiple regions of interest (ROI);

提取预处理后的影像上各个感兴趣区域的多种不同角度的功能激活谱特征，并将各个感兴趣区域的多种功能激活谱特征级联成一个特征向量，作为该影像的点特征向量RVF；Extract the functional activation spectral features of different angles of each region of interest on the preprocessed image, and cascade the functional activation spectral features of each region of interest into a feature vector, which is used as the point feature vector RVF of the image. ;

基于预处理后的影像上各个感兴趣区域之间的相关性提取全脑功能连接特征，作为该影像的边特征向量FC；Based on the correlation between each region of interest on the preprocessed image, the whole-brain functional connectivity feature is extracted as the edge feature vector FC of the image;

基于预处理后的影像上各个感兴趣区域之间的相关性构建功能连接网络，基于功能连接网络提取各个感兴趣区域的多种不同角度的网络属性特征，并将各个感兴趣区域的多种网络属性级联成一个特征向量，作为该影像的网络特征向量NET；Based on the correlation between the regions of interest on the preprocessed images, a functional connection network is constructed. Based on the functional connection network, various network attribute features of each region of interest are extracted from different angles. The attributes are concatenated into a feature vector as the network feature vector NET of the image;

步骤3：分别对点、边及网络三种不同层次的特征向量进行特征选择，得到不同层次的最优特征向量；Step 3: Perform feature selection on the feature vectors of three different levels of point, edge and network respectively, and obtain the optimal feature vector of different levels;

步骤4：构建三个基于单层次特征的弱分类器，分别利用影像样本三个层次的最优特征向量进行训练，得到训练好的三个弱分类器；Step 4: Build three weak classifiers based on single-level features, and use the optimal feature vectors of the three levels of image samples for training respectively, and obtain three trained weak classifiers;

步骤5：采用基于单层次特征的弱分类器集成学习方法最大限度利用多层次特征的信息互补优势，具体为：将待测影像三个层次的最优特征向量分别输入训练好的相应的三个弱分类器，得到该待测影像的三个分类结果；然后通过一个分类器集成策略融合三个弱分类器的分类结果，得到待测影像的最终分类标签。Step 5: Use the weak classifier ensemble learning method based on single-level features to maximize the information complementary advantages of multi-level features, specifically: input the optimal feature vectors of the three levels of the image to be tested into the trained corresponding three Weak classifier is used to obtain three classification results of the image to be tested; and then the classification results of the three weak classifiers are fused through a classifier integration strategy to obtain the final classification label of the image to be tested.

进一步地，所述步骤1中，采用DPABI工具对影像数据执行时间层校正、头动校正、空间标准化、平滑、滤波等预处理操作，得到预处理后的影像数据。Further, in the step 1, the DPABI tool is used to perform preprocessing operations such as temporal layer correction, head motion correction, spatial normalization, smoothing, filtering, etc. on the image data, to obtain preprocessed image data.

进一步地，所述步骤2中，先根据脑网络组(Brainnetome,BN246)图谱，将预处理后的影像划分为246个感兴趣区域，再基于这些感兴趣区域提取其特征向量。Further, in the step 2, according to the brain network group (Brainnetome, BN246) atlas, the preprocessed image is divided into 246 regions of interest, and then the feature vectors are extracted based on these regions of interest.

进一步地，所述步骤2中，所述不同角度的功能激活谱特征包括ReHo、VMHC以及fALFF这三种功能激活谱特征，提取影像的点特征向量RVF具体包括以下步骤：Further, in the step 2, the functional activation spectral features of the different angles include the three functional activation spectral features of ReHo, VMHC and fALFF, and the point feature vector RVF of the extracted image specifically includes the following steps:

首先，计算预处理后的影像上全脑体素的三种功能激活谱特征，即ReHo值、VMHC值以及fALFF值，构成三种功能激活谱图，这三种功能激活谱特征可从不同角度反映影像数据的局部区域活动；First, calculate the three functional activation spectral features of the whole brain voxels on the preprocessed image, namely ReHo value, VMHC value and fALFF value, to form three functional activation spectral maps. These three functional activation spectral features can be viewed from different angles. Reflect the local area activity of image data;

随后，计算影像上每个感兴趣区域内所有体素的平均ReHo值、VMHC值以及fALFF值作为量化该感兴趣区域的特征(ReHo特征、VMHC特征以及fALFF特征)，得到维度分别为D的ReHo特征向量、VMHC特征向量以及fALFF特征向量；将这三种功能激活谱特征向量级联成一个维度3D的特征向量，作为该影像的点特征向量RVF，其中D为影像上感兴趣区域的个数。Then, the average ReHo value, VMHC value and fALFF value of all voxels in each region of interest on the image are calculated as the features (ReHo feature, VMHC feature and fALFF feature) of the quantified region of interest, and ReHo with dimension D is obtained. Feature vector, VMHC feature vector and fALFF feature vector; these three function activation spectrum feature vectors are concatenated into a 3D feature vector, which is used as the point feature vector RVF of the image, where D is the number of regions of interest on the image. .

进一步地，所述步骤2中，提取影像的边特征向量FC具体包括以下步骤：Further, in the step 2, extracting the edge feature vector FC of the image specifically includes the following steps:

首先提取影像上每个感兴趣区域的平均时间序列(每个感兴趣区域内所有体素的时间序列的平均)，计算感兴趣区域对平均时间序列之间的皮尔逊相关性系数，然后构建出D×D的功能连接矩阵，矩阵中的每个元素即感兴趣区域之间的连接强度，定义感兴趣区域之间的连接强度为边，将该矩阵的上三角矩阵元素展平成一个向量，作为该影像的边特征向量FC。感兴趣区域对平均时间序列之间的皮尔逊相关性系数计算公式如下：First, extract the average time series of each region of interest on the image (the average of the time series of all voxels in each region of interest), calculate the Pearson correlation coefficient between the region of interest and the average time series, and then construct D×D functional connection matrix, each element in the matrix is the connection strength between the regions of interest, the connection strength between the regions of interest is defined as an edge, and the upper triangular matrix elements of the matrix are flattened into a vector, as The edge feature vector FC of the image. The formula for calculating the Pearson correlation coefficient between the region of interest and the average time series is as follows:

其中r_ij表示感兴趣区域i与感兴趣区域j平均时间序列的皮尔逊相关性系数；x_i和x_j分别为感兴趣区域i和感兴趣区域j的平均时间序列；

和

分别表示感兴趣区域i和感兴趣区域j的平均时间序列均值；n为影像的扫描时间点数；则x_it和x_jt分别为在第t个时间点处的感兴趣区域i和感兴趣区域j的平均时间序列值。where r _ij represents the Pearson correlation coefficient of the average time series of the region of interest i and the region of interest j; x _i and x _j are the average time series of the region of interest i and the region of interest j, respectively;

and

Represents the average time series mean of the region of interest i and region of interest j respectively; n is the number of scanning time points of the image; then x _it and x _jt are the region of interest i and the region of interest j at the t-th time point, respectively The average time series value of .

进一步地，所述步骤2中，提取影像的网络特征向量NET具体包括以下步骤：Further, in the described step 2, the network feature vector NET of the extraction image specifically includes the following steps:

首先将影像上的感兴趣区域定义为其功能连接网络中的节点，感兴趣区域之间的相关性定义为其功能连接网络中的边；Firstly, the region of interest on the image is defined as a node in its functional connection network, and the correlation between regions of interest is defined as an edge in its functional connection network;

然后利用图论方法计算影像的功能连接网络中每个节点五种不同角度的网络属性特征，即中介中心性(Betweenness centrality，BC)、节点局部效率(Nodal localefficiency，Eloc)、节点聚类系数(Nodal clustering coefficient，NCC)、参与者系数(Participant coefficient，PC)以及度(Degree，DG)。这些网络属性可从不同角度表征网络节点的信息交互能力。Then, the graph theory method is used to calculate the network attribute characteristics of each node from five different angles in the functional connection network of the image, namely between centrality (Betweenness centrality, BC), node local efficiency (Nodal local efficiency, Eloc), node clustering coefficient ( Nodal clustering coefficient, NCC), participant coefficient (Participant coefficient, PC) and degree (Degree, DG). These network attributes can characterize the information interaction capability of network nodes from different perspectives.

依据影像上的D个感兴趣区域，即其功能连接网络中D个节点，得到维度为D的BC特征向量、Eloc特征向量、NCC特征向量、PC特征向量以及DG特征向量，将这5种特征向量级联成一个维度为5D的特征向量，作为该影像的网络特征向量NET。According to the D regions of interest on the image, that is, their functions connect D nodes in the network, the BC feature vector, Eloc feature vector, NCC feature vector, PC feature vector and DG feature vector with dimension D are obtained. The vectors are concatenated into a feature vector with a dimension of 5D, which is used as the network feature vector NET of the image.

进一步地，所述步骤3中，采用Least absolute shrinkage and selectionoperator(Lasso)方法分别对三种不同层次的特征向量进行特征选择，Further, in the step 3, the Least absolute shrinkage and selection operator (Lasso) method is used to perform feature selection on the feature vectors of three different levels respectively,

Lasso方法进行特征选择的目标函数如下：The objective function of Lasso method for feature selection is as follows:

其中X表示N*P的特征矩阵，N为样本数量，P为特征向量维度，Y表示样本对应的分类标签向量，Y＝(y₁,y₂,…,y_N)^T，y_i表示第i个影像样本的分类标签；α表示P*1的权重系数向量，α中非0元素位置表示在该位置的特征将被选中(即若α中某一维度的元素非0，则将特征向量中相应维度的元素保持不变，若α中某一维度的元素为0，则将特征向量中相应维度的元素置0，或删除该维度的元素，由此得到相应的最优特征向量)；λ₁是一个正则化参数，用于模型稀疏。Where X represents the feature matrix of N*P, N is the number of samples, P is the feature vector dimension, Y represents the classification label vector corresponding to the sample, Y=(y ₁ , y ₂ ,...,y _N ) ^T , y _i represents the first The classification labels of i image samples; α represents the weight coefficient vector of P*1, and the non-zero element position in α indicates that the feature at this position will be selected (that is, if the element of a dimension in α is not 0, then the feature vector The elements of the corresponding dimension in α remain unchanged. If the element of a certain dimension in α is 0, the element of the corresponding dimension in the feature vector is set to 0, or the element of the dimension is deleted, thereby obtaining the corresponding optimal feature vector); λ ₁ is a regularization parameter for model sparsity.

进一步地，所述步骤4中，弱分类器为SVM分类器。Further, in the step 4, the weak classifier is an SVM classifier.

进一步地，所述步骤5中，分类器集成策略定义为：Further, in the step 5, the classifier integration strategy is defined as:

其中，y表示待测影像的分类标签，f_m(x)表示第m个弱分类器基于待测影像x的第m个层次的最优特征向量进行预测得到的该待测影像分类标签为1的概率值，m＝1,2,3；u_m是第m个层次的最优特征向量的预测概率的权重，所有权重之和为1。Among them, y represents the classification label of the image to be tested, and f _m (x) represents the classification label of the image to be tested obtained by the m-th weak classifier based on the optimal feature vector of the m-th level of the image to be tested x is predicted to be 1 The probability value of , m=1, 2, 3; _um is the weight of the predicted probability of the optimal feature vector of the mth level, and the sum of all weights is 1.

另一方面，提供一种基于多层次特征表示和集成学习的影像分类装置，包括以下模块：On the other hand, an image classification device based on multi-level feature representation and ensemble learning is provided, including the following modules:

数据预处理模块，用于获取影像，并对其进行预处理；Data preprocessing module for acquiring images and preprocessing them;

特征提取模块，用于根据预处理后的影像，分别提取其点、边及网络三种不同层次的特征向量：The feature extraction module is used to extract three different levels of feature vectors of points, edges and networks according to the preprocessed images:

特征选择模块，用于分别对点、边及网络三种不同层次的特征向量进行特征选择，得到不同层次的最优特征向量；The feature selection module is used to perform feature selection on the feature vectors of three different levels of point, edge and network respectively, and obtain the optimal feature vector of different levels;

弱分类器构建及训练模块，用于构建三个基于单层次特征的弱分类器，分别利用影像样本三个层次的最优特征向量进行训练，得到训练好的三个弱分类器；The weak classifier construction and training module is used to construct three weak classifiers based on single-level features, and use the optimal feature vectors of the three levels of image samples for training, and obtain three trained weak classifiers;

分类模块，用于先将待测影像三个层次的最优特征向量分别输入训练好的相应的三个弱分类器，得到该待测影像的三个分类结果；然后通过一个分类器集成策略融合三个弱分类器分类结果，得到待测影像的最终分类标签；The classification module is used to input the optimal feature vectors of the three levels of the image to be tested into the corresponding three trained weak classifiers to obtain three classification results of the image to be tested; and then fuse them through a classifier integration strategy Three weak classifier classification results to obtain the final classification label of the image to be tested;

所述装置采用上述的方法实现影像分类。The apparatus adopts the above-mentioned method to realize image classification.

另一方面，提供一种电子设备，包括处理器和存储器，所述存储器上存储有计算机程序，当所述计算机程序被处理器执行时，使得处理器实现如权利要求1至7中任一项所述的方法。In another aspect, an electronic device is provided, comprising a processor and a memory, the memory having a computer program stored thereon, when the computer program is executed by the processor, the processor is made to implement any one of claims 1 to 7 the method described.

另一方面，提供一种计算机可读存储介质，其上存储有计算机程序，当所述计算机程序被处理器执行时，使得处理器实现如权利要求1至7中任一项所述的方法。In another aspect, there is provided a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, causes the processor to implement the method as claimed in any one of claims 1 to 7.

有益效果：Beneficial effects:

本发明上述技术方案通过从不同层次提取不同角度特征，充分挖掘这些特征之间的互补信息，并采用基于单层次特征的弱分类器集成学习方法最大限度利用不同层次特征信息互补优势，进一步提升模型分类性能。具体而言，本发明首先获取rs-fMRI影像数据并进行一系列预处理操作，从预处理后的rs-fMRI数据中提取不同层次不同角度特征，包括点特征向量RVF、边特征向量FC以及网络特征向量NET。然后分别使用Lasso方法对不同层次特征进行特征选择，得到多个不同层次特征最优子集并提出一个基于单层次特征的弱分类器集成学习方法，该方法主要是首先采用分别基于点、边及网络特征向量的弱分类器进行预测，然后通过一个分类器集成策略融合基于单层次特征的弱分类器的预测概率去决定最终样本分类标签。本发明上述技术方案提出的多层次多角度特征表示可从不同层面刻画影像数据的结构和功能特性，通过基于单层次特征的弱分类器集成学习方法提高分类准确率。The above technical solution of the present invention fully exploits the complementary information between these features by extracting features from different angles from different levels, and adopts the integrated learning method of weak classifiers based on single-level features to maximize the complementary advantages of feature information at different levels to further improve the model. Classification performance. Specifically, the present invention first acquires rs-fMRI image data and performs a series of preprocessing operations, and extracts features at different levels and angles from the preprocessed rs-fMRI data, including point feature vector RVF, edge feature vector FC and network Eigenvectors NET. Then, the Lasso method is used to select features at different levels, and multiple optimal subsets of different levels of features are obtained, and an ensemble learning method of weak classifiers based on single-level features is proposed. This method is mainly based on points, edges and The weak classifier of the network feature vector is used to predict, and then the predicted probability of the weak classifier based on single-level features is fused through a classifier integration strategy to determine the final sample classification label. The multi-level and multi-angle feature representation proposed by the above technical solutions of the present invention can describe the structure and functional characteristics of image data from different levels, and improve the classification accuracy through the integrated learning method of weak classifiers based on single-level features.

本发明可以使影像分类准确率得到显著提升。The present invention can significantly improve the accuracy of image classification.

附图说明Description of drawings

图1为本发明实施例中一种基于多层次特征表示和集成学习的影像分类方法的流程图。FIG. 1 is a flowchart of an image classification method based on multi-level feature representation and ensemble learning in an embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图对本发明的实施例进行详细阐述。The embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

本发明可以应用于适用于所有包括大脑图像时间序列并可以从中提取出感兴趣区域(脑区)的影像数据的分类，即能对具有较高时间分辨率和空间分辨率的影像数据(包括rs-fMRI(静息态fMRI)、ts-fMRI(任务态fMRI)等)进行分类，从这些影像数据中可以同时提取多层次多角度的大脑图像特征。The present invention can be applied to the classification of all image data including brain image time series and from which regions of interest (brain regions) can be extracted, that is, it can classify image data with higher temporal resolution and spatial resolution (including rs -fMRI (resting-state fMRI), ts-fMRI (task-state fMRI), etc.), and from these image data, multi-level and multi-angle brain image features can be simultaneously extracted.

实施例1：Example 1:

本实施例中的影像数据为rs-fMRI数据。The image data in this embodiment is rs-fMRI data.

静息态fMRI(resting-state fMRI，rs-fMRI)技术根据脑组织中血氧水平依赖(Blood Oxygenation-Level Dependent，BOLD)信号变化来研究大脑活动状态，因其相对于其它成像技术具有较高的时空分辨率，对激活脑区的准确定位以及非侵入式的无创测量方式的优点，使其被广泛应用于各种生物信息研究领域中，并且通过探索Rs-fMRI特征为研究神经精神疾病问题带来了新的视角。Resting-state fMRI (rs-fMRI) technology studies the state of brain activity according to the blood oxygen level-dependent (Blood Oxygenation-Level Dependent, BOLD) signal changes in brain tissue, because it has higher relative to other imaging technologies. The advantages of high spatial and temporal resolution, accurate localization of activated brain regions, and non-invasive non-invasive measurement methods make it widely used in various bioinformatics research fields, and by exploring the characteristics of Rs-fMRI for the study of neuropsychiatric diseases. brings a new perspective.

研究rs-fMRI特征，可以从不同层次上提取不同角度特征用于分类研究。其中从点的层次上可提取基于fMRI自身BOLD信号反映大脑局部区域活动情况的功能激活谱图，包括有局部一致性(Regional homogeneity，ReHo)、比率低频振幅(amplitude of lowfrequency fluctuations，fALFF)以及体素镜像同伦对称性(Voxel-Mirrored HomotopicConnectivity，VMHC)。这些功能激活谱图均是基于体素分析计算得到，为了提取感兴趣区域上不同功能激活谱特征，则计算这些区域上所有体素功能激活谱特征均值来量化该感兴趣区域。进一步地，为了研究大脑感兴趣区域之间的功能协作关系，则提取感兴趣区域时间序列并计算区域时间序列之间的相关性，构建大脑功能连接矩阵，提取该矩阵的上三角阵元素作为边的特征。在功能连接基础上构建功能连接网络，通过图论方法，从网络层次上进一步计算一些网络属性，常用的网络属性包括有度(Degree)、中介中心性(Betweennesscentrality)、节点聚类系数(Nodal clustering coefficient)、参与者系数(Participation coefficient)、节点局部效率(Nodal local efficient)、小世界性(Smallworldness)等，这些网络属性能够从不同角度表征网络内在功能变化，利于研究大脑感兴趣区域之间的功能信息交互。以上从不同层次上提取的多角度特征之间既是相互独立的，同时又是彼此相关联的，他们可以从不同层面刻画fMRI数据特性。本实施例通过从不同层次提取不同角度特征，充分挖掘这些特征之间的互补信息，并采用基于单层次特征的弱分类器集成学习方法最大限度利用不同层次特征信息互补优势，进一步提升模型分类性能。To study rs-fMRI features, features from different perspectives can be extracted from different levels for classification research. At the point level, the functional activation spectrum that reflects the activity of local regions of the brain based on the fMRI BOLD signal can be extracted, including regional homogeneity (ReHo), ratio of low frequency fluctuations (fALFF) and body Prime mirror homotopy symmetry (Voxel-Mirrored Homotopic Connectivity, VMHC). These functional activation spectra are calculated based on voxel analysis. In order to extract different functional activation spectral features in the region of interest, the average of all voxel functional activation spectral features in these regions is calculated to quantify the region of interest. Further, in order to study the functional cooperative relationship between regions of interest in the brain, extract the time series of the region of interest and calculate the correlation between the time series of the regions, construct a brain functional connectivity matrix, and extract the upper triangular element of the matrix as an edge. Characteristics. A functional connection network is constructed on the basis of functional connection, and some network attributes are further calculated from the network level through graph theory. Common network attributes include degree, betweenness centrality, node clustering coefficient (Nodal clustering). coefficient), participant coefficient (Participation coefficient), node local efficiency (Nodal local efficient), small worldness (Smallworldness), etc. These network attributes can represent the internal functional changes of the network from different perspectives, which is conducive to the study of brain regions of interest. Functional information interaction. The above multi-angle features extracted from different levels are both independent and related to each other, and they can describe the characteristics of fMRI data from different levels. In this embodiment, by extracting features from different angles from different levels, the complementary information between these features is fully exploited, and the ensemble learning method of weak classifiers based on single-level features is used to maximize the complementary advantages of feature information at different levels to further improve the classification performance of the model. .

参见图1，本实施例提供的基于多层次特征表示和集成学习的影像分类方法具体包括以下步骤：Referring to FIG. 1 , the image classification method based on multi-level feature representation and ensemble learning provided in this embodiment specifically includes the following steps:

步骤1：获取多个rs-fMRI影像数据，采用Data Processing&Analysis for BrainImaging(DPABI)工具对影像数据完成一系列预处理流程，包括有删除前十帧图像、时间层校正、头动校正、空间标准化、平滑、滤波操作，得到预处理后的影像数据。Step 1: Acquire multiple rs-fMRI image data, and use the Data Processing & Analysis for Brain Imaging (DPABI) tool to complete a series of preprocessing processes on the image data, including deleting the first ten frames of images, temporal layer correction, head motion correction, spatial standardization, Smoothing and filtering operations to obtain preprocessed image data.

(1)提取影像的点特征向量(1) Extract the point feature vector of the image

根据预处理后的影像数据，基于感兴趣区域从不同角度提取ReHo、VMHC以及fALFF三种功能激活谱特征，具体是计算预处理后图像上全脑体素的ReHo值、VMHC值以及fALFF值，构成三种功能激活谱图，这三种功能激活谱特征可从不同角度反映影像数据的局部区域活动。随后采用脑网络组(Brainnetome,BN246)图谱对三种功能激活谱图划分246个感兴趣区域，计算每个感兴趣区域下所有体素的平均ReHo值、VMHC值以及fALFF值作为量化该感兴趣区域的特征，得到维度分别为246的ReHo特征向量、VMHC特征向量以及fALFF特征向量。将这三种功能激活谱特征向量级联成一个维度738的点特征向量，命名为RVF。According to the preprocessed image data, the three functional activation spectrum features of ReHo, VMHC and fALFF are extracted from different angles based on the region of interest. Specifically, the ReHo value, VMHC value and fALFF value of the whole brain voxels on the preprocessed image are calculated. Three functional activation spectra are formed, and these three functional activation spectral features can reflect the local area activities of image data from different angles. Then, the brain network group (Brainnetome, BN246) map was used to divide the three functional activation spectra into 246 regions of interest, and the average ReHo value, VMHC value and fALFF value of all voxels under each region of interest were calculated as the quantification of the interest. The features of the region are obtained to obtain ReHo feature vector, VMHC feature vector and fALFF feature vector with dimensions of 246 respectively. These three functional activation spectral feature vectors are concatenated into a point feature vector of dimension 738, named RVF.

对于任意体素的ReHo值常通过计算体素的时间序列的肯德尔和谐系数(Kendall’s Coefficient Concordance,KCC)得到，计算公式如下：The ReHo value for any voxel is usually obtained by calculating the Kendall's Coefficient Concordance (KCC) of the time series of the voxel. The calculation formula is as follows:

其中K代表特定体素和最近邻域体素的总个数，通常取值为27，即27个体素组成一个体素簇；n为时间点数；R_i表示第i个时间点的秩和；

表示R_i的平均。KCC的取值范围在0到1之间，度量了相邻体素时间序列的一致性与相似性，当某个脑部区域的ReHo值越接近1时，表示该大脑局部区域的时间序列一致性越高，反映了该局部区域的脑部神经元的活动具有很高的同步性，反之亦然。where K represents the total number of specific voxels and the nearest neighbor voxels, usually 27, that is, 27 voxels form a voxel cluster; n is the number of time points; R _i represents the rank sum of the i-th time point;

represents the average of R _i . The value range of KCC is between 0 and 1, which measures the consistency and similarity of the time series of adjacent voxels. When the ReHo value of a certain brain area is closer to 1, it means that the time series of the local area of the brain are consistent The higher the sex, the higher the synchronization of the activity of the brain neurons in the local area, and vice versa.

对于任意体素的VMHC值计算，即通过计算一个体素与大脑对称半球相应体素时间序列之间的皮尔逊相关性系数，计算公式如下：For the calculation of the VMHC value of any voxel, that is, by calculating the Pearson correlation coefficient between a voxel and the corresponding voxel time series in the symmetric hemisphere of the brain, the calculation formula is as follows:

其中r_ij表示大脑半球两对称体素i,j时间序列之间的皮尔逊相关性；x_i和x_j分别表示两对称体素i,j的时间序列。cov(x_i,x_j)为体素i,j时间序列的协方差；where r _ij represents the Pearson correlation between the time series of two symmetrical voxels i, j in the cerebral hemisphere; _xi and x _j represent the time series of two symmetrical voxels i, j, respectively. cov(x _i ,x _j ) is the covariance of voxel i,j time series;

分别表示体素i，j时间序列的标准差。

are the standard deviations of the time series of voxels i and j, respectively.

对于任意体素的fALFF值计算公式如下：The formula for calculating the fALFF value for any voxel is as follows:

其中f_k表示频率，N表示体素的个数，a_k和b_k分别代表不同频率所对应的系数。fALFF值的增减与血氧水平依赖信号强度相一致，当大脑某个区域的fALFF值升高，表明该区域的神经元活动增强，反之则减弱。where f _k represents the frequency, N represents the number of voxels, and a _k and b _k represent the coefficients corresponding to different frequencies, respectively. The increase or decrease of the fALFF value is consistent with the blood oxygen level-dependent signal strength. When the fALFF value in a certain area of the brain increases, it indicates that the neuronal activity in that area is increased, and vice versa.

(2)提取影像的边特征向量(2) Extract the edge feature vector of the image

基于感兴趣区域之间的相关性提取全脑功能连接特征，具体是首先采用BN246图谱将预处理后的影像数据划分为246个感兴趣区域(脑区)，然后提取每个感兴趣区域的平均时间序列，计算感兴趣区域对平均时间序列之间的皮尔逊相关性系数，最后构建出246*246的功能连接矩阵，矩阵中的每个元素即一个感兴趣区域对平均时间序列之间的皮尔逊相关性系数，即两个感兴趣区域之间的连接强度，定义感兴趣区域之间的连接强度为边，将该矩阵的上三角矩阵元素展平作为每个影像的边特征向量，命名为FC。感兴趣区域对平均时间序列之间的皮尔逊相关性系数计算公式如下：The whole brain functional connectivity feature is extracted based on the correlation between the regions of interest. Specifically, the preprocessed image data is firstly divided into 246 regions of interest (brain regions) by using the BN246 atlas, and then the average value of each region of interest is extracted. Time series, calculate the Pearson correlation coefficient between the region of interest and the average time series, and finally construct a 246*246 functional connection matrix. Each element in the matrix is the Pearson correlation between a region of interest and the average time series. Correlation coefficient, that is, the connection strength between two regions of interest, defines the connection strength between regions of interest as an edge, and flattens the upper triangular matrix elements of the matrix as the edge feature vector of each image, named as fc. The formula for calculating the Pearson correlation coefficient between the region of interest and the average time series is as follows:

其中r_ij表示感兴趣区域i与感兴趣区域j的平均时间序列之间的皮尔逊相关性系数；x_i和x_j分别为感兴趣区域i和感兴趣区域j的平均时间序列；

和

分别表示感兴趣区域i和感兴趣区域j的平均时间序列均值；n为rs-fMRI影像数据的扫描时间点数；则x_it和x_jt分别为在某个时间点处的感兴趣区域i和感兴趣区域j的平均时间序列值。where r _ij represents the Pearson correlation coefficient between the average time series of the region of interest i and the region of interest j; x _i and x _j are the average time series of the region of interest i and the region of interest j, respectively;

and

Represents the average time series mean value of region of interest i and region of interest j, respectively; n is the number of scanning time points of rs-fMRI image data; then x _it and x _jt are the region of interest i and the sense of interest at a certain time point, respectively. Average time series value for region of interest j.

(3)提取影像的网络特征向量(3) Extract the network feature vector of the image

基于感兴趣区域之间的相关性构建功能连接网络，具体是首先将感兴趣区域定义为网络的节点，感兴趣区域之间的相关性定义为网络中的边，利用图论方法计算中介中心性(Betweeness centrality，BC)、节点局部效率(Nodal local efficiency，Eloc)、节点聚类系数(Nodal clustering coefficient，NCC)、参与者系数(Participant coefficient，PC)以及度(Degree，DG)五种不同角度的网络属性特征。中介中心性是衡量节点在网络中传输的重要性，定义为网络中任意两个节点之间的最短路径中，经过该节点的条数占总路径数的比值，计算公式为：The functional connection network is constructed based on the correlation between the regions of interest. Specifically, the region of interest is first defined as the node of the network, the correlation between the regions of interest is defined as the edge in the network, and the betweenness centrality is calculated by using the graph theory method. (Betweeness centrality, BC), node local efficiency (Nodal local efficiency, Eloc), node clustering coefficient (Nodal clustering coefficient, NCC), participant coefficient (Participant coefficient, PC) and degree (Degree, DG) five different angles characteristics of the network properties. Betweenness centrality is a measure of the importance of a node's transmission in the network. It is defined as the ratio of the number of paths passing through the node to the total number of paths in the shortest path between any two nodes in the network. The calculation formula is:

其中BC_i表示节点i的中介中心性；n_mj表示节点m和节点j之间最短路径的数量,；n_mj(i)表示节点m和节点j之间最短路径中经过节点i的数量。where BC _i represents the betweenness centrality of node i; n _mj represents the number of shortest paths between node m and node j, and n _mj (i) represents the number of nodes passing through node i in the shortest path between node m and node j.

节点局部效率用于衡量网络在局部范围内的信息传输能力，与节点最短路径相关，节点最短路径长度越短，节点局部效率就越高，节点及其邻居节点共同形成的子网络信息交换越快。节点局部效率计算公式如下：The local efficiency of a node is used to measure the information transmission capability of the network in the local area, and it is related to the shortest path of the node. . The calculation formula of node local efficiency is as follows:

其中E_loc(i)表示节点i的局部效率；G_i为节点i及其邻居节点共同形成的子网络；

为子网络G_i的节点数量；l_jh为子网络中节点j与节点h的最短路径长度。where E _loc (i) represents the local efficiency of node i; G _i is the sub-network formed by node i and its neighbors;

is the number of nodes in the sub-network _Gi ; l _jh is the shortest path length between node j and node h in the sub-network.

节点聚类系数用于衡量给定节点的任意两个邻居节点也为邻居的可能性，是网络功能分割的度量指标之一。其计算公式如下：The node clustering coefficient is used to measure the possibility that any two neighboring nodes of a given node are also neighbors, and it is one of the metrics of network function segmentation. Its calculation formula is as follows:

其中C_i表示节点i的聚类系数；K_i为节点i的度；sw_i表示节点i的所有邻居节点之间边的权重和。where C _i represents the clustering coefficient of node i; K _i is the degree of node i; s _i represents the weight sum of edges between all neighbor nodes of node i.

参与者系数是在网络模块化下，衡量节点在网络模块化连接的多样性，节点具有较低的参与者系数但具有较高的度则表明节点在模块划分中具有重要作用。计算公式为：Participant coefficient is a measure of the diversity of nodes in the network modular connection under the network modularity. A node with a low participation coefficient but a high degree indicates that the node plays an important role in the module division. The calculation formula is:

其中PC_i表示节点i的参与者系数；M表示网络划分的模块集合；k_i(m)表示节点i与模块m中所有节点的连接权重之和；k_i表示节点i的度。where PC _i represents the participant coefficient of node i; M represents the module set divided by the network; _ki (m) represents the sum of the connection weights between node i and all nodes in module m; _ki represents the degree of node i.

度是用来衡量节点在网络中中心性的指标之一，节点具有较高的度则表明该节点在网络中的重要性越强。其计算公式如下：Degree is one of the indicators used to measure the centrality of a node in the network. The higher the degree of a node, the stronger the importance of the node in the network. Its calculation formula is as follows:

其中K_i为节点i的度；N是网络中的节点数量；a_ij＝1表示节点i与节点j存在边相连，反之则a_ij＝0。where K _i is the degree of node i; N is the number of nodes in the network; a _ij =1 means that node i and node j are connected by an edge, otherwise a _ij =0.

根据上述公式计算功能连接网络中每个节点的五种网络属性，这些网络属性可从不同角度表征网络节点的信息交互能力。依据网络中246个感兴趣区域，可分别为每个影像计算得到维度为246的BC特征向量、Eloc特征向量、NCC特征向量、PC特征向量以及DG特征向量，并将这5种特征向量级联成一个维度为1230的网络特征向量，命名为NET。The five network attributes of each node in the functional connection network are calculated according to the above formula, and these network attributes can characterize the information interaction ability of network nodes from different angles. According to the 246 regions of interest in the network, BC eigenvectors, Eloc eigenvectors, NCC eigenvectors, PC eigenvectors, and DG eigenvectors with a dimension of 246 can be calculated for each image respectively, and these 5 eigenvectors can be cascaded. into a network feature vector with dimension 1230, named NET.

步骤3：采用Lasso方法分别对RVF、FC以及NET三种从不同层次提取得到的特征向量进行特征选择，Lasso特征选择方法的目标函数如下：Step 3: Use the Lasso method to perform feature selection on the three feature vectors extracted from different levels: RVF, FC and NET. The objective function of the Lasso feature selection method is as follows:

其中X表示N*P的特征矩阵，N为样本数量，P为特征向量维度，Y表示样本对应的分类标签向量，Y＝(y₁,y₂,…,y_N)^T，y_i表示第i个影像样本的分类标签；α表示P*1的权重系数向量，α中非0元素位置表示在该位置的特征将被选中(即若α中某一维度的元素非0，则将特征向量中相应维度的元素保持不变，若α中某一维度的元素为0，则将特征向量中相应维度的元素置0，或删除该维度的元素，由此得到相应的最优特征向量)；λ₁是一个正则化参数，用于模型稀疏，为经验参数。Where X represents the feature matrix of N*P, N is the number of samples, P is the feature vector dimension, Y represents the classification label vector corresponding to the sample, Y=(y ₁ , y ₂ ,...,y _N ) ^T , y _i represents the first The classification labels of i image samples; α represents the weight coefficient vector of P*1, and the non-zero element position in α indicates that the feature at this position will be selected (that is, if the element of a dimension in α is not 0, then the feature vector The elements of the corresponding dimension in α remain unchanged. If the element of a certain dimension in α is 0, the element of the corresponding dimension in the feature vector is set to 0, or the element of the dimension is deleted, thereby obtaining the corresponding optimal feature vector); λ ₁ is a regularization parameter for model sparseness and is an empirical parameter.

步骤4：首先利用影像样本经步骤3特征选择后得到的三个层次的最优特征向量分别训练一个SVM分类器，用于利用单层次特征进行预测；Step 4: First, use the optimal feature vectors of the three levels obtained by the image samples through the feature selection in Step 3 to train an SVM classifier respectively, which is used for prediction by using single-level features;

步骤5：采用一个基于单层次特征的弱分类器集成学习方法用于最大限度利用多层次特征的信息互补优势，具体是采用一个分类器集成策略融合多个基于单层次特征的SVM分类器预测概率，决定待测影像的最终分类标签。分类器集成策略定义为：Step 5: A weak classifier ensemble learning method based on single-level features is used to maximize the information complementary advantages of multi-level features. Specifically, a classifier ensemble strategy is used to fuse multiple SVM classifiers based on single-level features to predict the probability , which determines the final classification label of the image to be tested. The classifier integration strategy is defined as:

其中，y表示待测影像的分类标签，f_m(x)表示第m个弱分类器基于待测影像x的第m个层次的最优特征向量进行预测得到的该待测影像分类标签为1的概率值，m＝1,2,3；u_m是第m个层次的最优特征向量的预测概率的权重，为经验参数，所有权重之和为1。Among them, y represents the classification label of the image to be tested, and f _m (x) represents the classification label of the image to be tested obtained by the m-th weak classifier based on the optimal feature vector of the m-th level of the image to be tested x is predicted to be 1 The probability value of , m=1, 2, 3; _um is the weight of the predicted probability of the optimal feature vector of the mth level, which is an empirical parameter, and the sum of all weights is 1.

实施例2：Example 2:

本实施例公开了一种基于多层次特征表示和集成学习的影像分类装置，包括以下模块：This embodiment discloses an image classification device based on multi-level feature representation and integrated learning, including the following modules:

所述装置采用实施例1所述的方法实现影像分类。The apparatus adopts the method described in Embodiment 1 to realize image classification.

实施例3：Example 3:

本实施例公开了一种电子设备，包括处理器和存储器，所述存储器上存储有计算机程序，当所述计算机程序被处理器执行时，使得处理器实现如实施例1所述的方法。This embodiment discloses an electronic device, including a processor and a memory, where a computer program is stored in the memory, and when the computer program is executed by the processor, the processor is made to implement the method described in Embodiment 1.

实施例4：Example 4:

本实施例公开了一种计算机可读存储介质，其上存储有计算机程序，当所述计算机程序被处理器执行时，使得处理器实现如实施例1所述的方法。This embodiment discloses a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the processor is made to implement the method described in Embodiment 1.

上述实施例中所述方案同时提取rs-fMRI影像数据不同层次不同角度特征，考虑到这些不同类型特征能够从不同层面刻画rs-fMRI影像数据的特性，通过基于单层次特征的弱分类器集成学习方法最大限度挖掘这些不同层次特征之间潜在的互补信息，进一步提升模型分类性能。采用Lasso方法对不同层次特征分别做特征选择，得到不同层次特征的最优子集，并提出一个基于单层次特征的弱分类器集成学习方法用于最大限度利用这些不同层次特征最优子集的信息互补优势，可大幅度提升分类性能。The scheme described in the above embodiment simultaneously extracts features from different levels and angles of rs-fMRI image data. Considering that these different types of features can describe the characteristics of rs-fMRI image data from different levels, the ensemble learning based on weak classifiers based on single-level features The method maximizes the potential complementary information between these different levels of features, and further improves the classification performance of the model. The Lasso method is used to select the features at different levels, and the optimal subsets of the different levels of features are obtained, and an ensemble learning method of weak classifiers based on single-level features is proposed to maximize the use of the optimal subsets of these different levels of features. The complementary advantages of information can greatly improve the classification performance.

Claims

1. An image classification method based on multi-level feature representation and integrated learning is characterized by comprising the following steps:

step 1: acquiring a plurality of images and preprocessing the images;

step 2: respectively extracting feature vectors of three different levels of points, edges and networks of the preprocessed images;

and step 3: respectively selecting features of the feature vectors of three different levels, namely, a point, an edge and a network to obtain optimal feature vectors of different levels;

and 4, step 4: constructing three weak classifiers based on single-level features, and training by respectively utilizing optimal feature vectors of three levels of an image sample to obtain three trained weak classifiers;

and 5: firstly, respectively inputting the optimal feature vectors of three levels of the image to be detected into the corresponding trained three weak classifiers to obtain three classification results of the image to be detected; and then, fusing the classification results of the three weak classifiers through a classifier integration strategy to obtain a final classification label of the image to be detected.

2. The method for classifying images based on multi-level feature representation and integrated learning of claim 1, wherein in the step 2, the preprocessed image is divided into 246 regions of interest according to the brain network group atlas, and then the feature vectors of the regions of interest are extracted based on the regions of interest.

3. The method for classifying images based on multi-level feature representation and ensemble learning of claim 1, wherein the step 2 of extracting the point feature vector of the image specifically comprises the following steps:

firstly, calculating function activation spectrum characteristics of a whole brain voxel at three different angles on a preprocessed image, namely a ReHo value, a VMHC value and a fALFF value, wherein the three function activation spectrum characteristics can reflect local region activities of image data from different angles;

then, calculating the average ReHo value, VMHC value and fALFF value of all voxels in each region of interest on the image as the features of the region of interest to obtain a ReHo feature vector, a VMHC feature vector and an fALFF feature vector with the dimensions of D respectively; and cascading the three feature vectors into a feature vector of one dimension 3D, wherein D is the number of interested areas on the image and is used as a point feature vector of the image.

4. The method of claim 1, wherein the step 2 of extracting edge feature vectors of the image comprises the steps of first extracting an average time sequence of each region of interest on the image, calculating a Pearson correlation coefficient between the region of interest and the average time sequence, and then constructing a functional connection matrix D × D, wherein each element in the matrix is the Pearson correlation coefficient between one region of interest and the average time sequence, and flattening an upper triangular matrix element of the matrix into a vector as the edge feature vector of the image, wherein D is the number of the regions of interest on the image.

5. The method for classifying images based on multi-level feature representation and integrated learning of claim 1, wherein the step 2 of extracting the network feature vector of the image specifically comprises the following steps:

firstly, defining an interested area on an image as a node in a functional connection network, and defining the correlation between the interested areas as an edge in the functional connection network;

then, calculating network attribute characteristics of each node at five different angles in the functional connection network of the image by using a graph theory method, namely medium centrality BC, node local efficiency Eloc, node clustering coefficient NCC, participant coefficient PC and degree DG; according to the D interested areas on the image, namely the D nodes in the network are functionally connected, a BC eigenvector, an Eloc eigenvector, an NCC eigenvector, a PC eigenvector and a DG eigenvector with the dimension of D are obtained, and the 5 eigenvectors are cascaded into one eigenvector with the dimension of 5D and used as the network eigenvector of the image.

6. The method for classifying images based on multi-level feature representation and integrated learning of claim 1, wherein in the step 3, feature selection is performed on feature vectors of three different levels by using a Lasso method;

the objective function for feature selection by the Lasso method is as follows:

wherein, X represents a feature matrix of N × P, N is the number of image samples, P is the dimension of the feature vector, Y represents the classification label vector corresponding to the sample, and Y is (Y ═ P)₁,y₂,…,y_N)^T，y_iClass label representing the ith image sample, α representing a weight coefficient vector of P1, the position of the element α other than 0 indicating that the feature at that position is to be selected, and λ₁Is a regularization parameter for model sparsity.

7. The method for classifying images based on multi-level feature representation and ensemble learning of claim 1, wherein in the step 5, the classifier ensemble policy is defined as:

wherein y represents the classification label of the image to be measured, f_m(x) The probability value of the classification label of the image to be detected, which is obtained by predicting the m-th weak classifier based on the optimal feature vector of the m-th level of the image to be detected x, is 1,2 and 3; u. of_mIs the weight of the prediction probability of the optimal feature vector of the mth level.

8. The apparatus for classifying images based on multi-level feature representation and integrated learning of claim 1, comprising:

the data preprocessing module is used for acquiring and preprocessing images;

the feature extraction module is used for respectively extracting feature vectors of points, edges and networks of the images according to the preprocessed images:

the characteristic selection module is used for respectively selecting characteristics of the characteristic vectors of three different levels, namely points, edges and networks to obtain optimal characteristic vectors of different levels;

the weak classifier building and training module is used for building three weak classifiers based on single-level features, and training the weak classifiers by respectively utilizing the optimal feature vectors of three levels of the image sample to obtain three trained weak classifiers;

the classification module is used for respectively inputting the optimal feature vectors of the three layers of the image to be detected into the corresponding trained three weak classifiers to obtain three classification results of the image to be detected; then, fusing the classification results of the three weak classifiers through a classifier integration strategy to obtain a final classification label of the image to be detected;

the device realizes image classification by adopting the method of any one of claims 1 to 7.

9. An electronic device, comprising a processor and a memory, the memory having stored thereon a computer program which, when executed by the processor, causes the processor to carry out the method of any one of claims 1 to 7.

10. A computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, causes the processor to carry out the method according to any one of claims 1 to 7.