CN111666972A - Liver case image classification method and system based on deep neural network - Google Patents
Liver case image classification method and system based on deep neural network Download PDFInfo
- Publication number
- CN111666972A CN111666972A CN202010350199.6A CN202010350199A CN111666972A CN 111666972 A CN111666972 A CN 111666972A CN 202010350199 A CN202010350199 A CN 202010350199A CN 111666972 A CN111666972 A CN 111666972A
- Authority
- CN
- China
- Prior art keywords
- image
- case
- neural network
- deep neural
- liver
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Apparatus For Radiation Diagnosis (AREA)
- Image Analysis (AREA)
Abstract
本发明提供一种基于深度神经网络的肝部病例图像分类方法及系统,包括:获得属于同一用户设定时间段的多张病例图像;对多张病例图像进行重采样,形成单张三维病例图像;将上述单张三维病例图像输入深度神经网络模型提取图像特征,所述图像特征包括图像颜色通道和位置信息;采用注意力机制对上述图像特征赋予权重,与肝脏有关的图像特征的权重大于其他图像特征的权重;将赋予权重的图像特征输入分类器,获得单张三维病例图像的分类概率,所述分类包括病变和正常。上述方法及系统无需标注病例图像中的位置信息。
The invention provides a method and system for classifying liver case images based on a deep neural network, including: obtaining multiple case images belonging to the same user-set time period; resampling the multiple case images to form a single three-dimensional case image The above-mentioned single three-dimensional case image is input into the deep neural network model to extract image features, and the image features include image color channels and position information; the attention mechanism is used to give weights to the above-mentioned image features, and the weight of the image features related to the liver is greater than that of other images. The weight of the image features; the weighted image features are input into the classifier to obtain the classification probability of a single three-dimensional case image, and the classification includes lesions and normal. The above method and system do not need to label the position information in the case image.
Description
技术领域technical field
本发明涉及图像分析技术领域,更具体地,涉及一种基于深度神经网络的肝部病例图像分类方法及系统。The invention relates to the technical field of image analysis, and more particularly, to a method and system for classifying images of liver cases based on a deep neural network.
背景技术Background technique
现有技术的病例图像识别方法中,通常基于标注位置的肿瘤图像进行分类识别,为了保证分类的正确性,通常采用人工标注,标注的工作量大。In the case image recognition method in the prior art, classification and recognition are usually performed based on the tumor image of the labeled position. In order to ensure the correctness of the classification, manual labeling is usually used, and the workload of labeling is large.
发明内容SUMMARY OF THE INVENTION
本发明提供一种无需标注位置信息的基于深度神经网络的肝部病例图像分类方法及系统。The invention provides a method and system for classifying images of liver cases based on a deep neural network without labeling position information.
根据本发明的一个方面,提供一种基于深度神经网络的肝部病例图像分类方法包括:According to one aspect of the present invention, a method for classifying images of liver cases based on a deep neural network is provided, comprising:
获得属于同一用户设定时间段的多张病例图像;Obtain multiple case images belonging to the same user-set time period;
对多张病例图像进行重采样,形成单张三维病例图像;Resampling multiple case images to form a single 3D case image;
将上述单张三维病例图像输入深度神经网络模型提取图像特征,所述图像特征包括图像颜色通道、位置信息、纹理和图像像素点之间关联性;The above-mentioned single three-dimensional case image is input into the deep neural network model to extract image features, and the image features include the correlation between image color channels, position information, texture and image pixels;
采用注意力机制对上述图像特征赋予权重,与肝脏有关的图像特征的权重大于其他图像特征的权重;The attention mechanism is used to give weight to the above image features, and the weight of the image features related to the liver is greater than the weight of other image features;
将赋予权重的图像特征输入分类器,获得单张三维病例图像的分类概率,所述分类概率表示正常概率和非正常概率。The weighted image features are input into the classifier, and the classification probability of a single three-dimensional case image is obtained, and the classification probability represents the normal probability and the abnormal probability.
所述的基于深度神经网络的肝部病例图像分类方法,其中,还包括:The method for classifying images of liver cases based on a deep neural network, further comprising:
对重采样后的单张三维病例图像进行裁剪,获得多张尺寸统一的三维病例图像块,将多张三维图像块输入深度神经网络模型。The resampled single 3D case image is cropped to obtain multiple 3D case image blocks of uniform size, and the multiple 3D image blocks are input into the deep neural network model.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述深度神经网络模型和分类器的训练步骤包括:The described deep neural network-based liver case image classification method, wherein, the training steps of the deep neural network model and the classifier include:
获得已知分类的多个用户的病例图像,每一个用户具有多张肝部病例图像;Obtain case images of multiple users of known classification, each user has multiple liver case images;
将每个用户的多张病例图像进行重采样,形成每个用户的单张三维病例图像;Resampling multiple case images of each user to form a single 3D case image for each user;
对每个用户重采样后的单张三维病例图像进行裁剪,获得每个用户的多张尺寸统一的三维病例图像块,构成深度神经网络的第一训练集;Crop a single 3D case image resampled by each user to obtain multiple 3D case image blocks of uniform size for each user, forming the first training set of the deep neural network;
采用深度神经网络提取第一训练集中每个用户的图像特征;Use a deep neural network to extract the image features of each user in the first training set;
采用注意力机制对每个用户的图像特征赋予权重,形成分类器的第二训练集;The attention mechanism is used to give weights to the image features of each user to form the second training set of the classifier;
将第二训练集输入分类器,获得第二训练集中每个用户的病例图像的分类;Input the second training set into the classifier to obtain the classification of each user's case image in the second training set;
采用交叉熵损失函数迭代对深度神经网络和分类器进行迭代训练。Deep neural networks and classifiers are iteratively trained using a cross-entropy loss function.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述对多张病例图像进行重采样的步骤包括:The method for classifying liver case images based on a deep neural network, wherein the step of resampling multiple case images includes:
将属于同一用户的多张病例图像的二维图像按照从包含心脏的病例图像到包含腹部的病例图像堆叠为一个三维病例图像;stacking two-dimensional images of multiple case images belonging to the same user into a three-dimensional case image from the case image containing the heart to the case image containing the abdomen;
对三维病例图像进行重采样,将堆叠后的多张病例图像在厚度方向线性插值至设定厚度。The three-dimensional case images are resampled, and the stacked multiple case images are linearly interpolated to the set thickness in the thickness direction.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述对多张病例图像进行重采样的步骤还包括:The method for classifying images of liver cases based on a deep neural network, wherein the step of resampling multiple case images further includes:
对重采样后的三维病例图像进行阈值映射。Threshold mapping is performed on the resampled 3D case images.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述多张病例图像为弱标注的病例图像,所述弱标注是标注了病例图像中是否有病变,但是没有标注病变位置的病例图像。The method for classifying liver case images based on a deep neural network, wherein the plurality of case images are weakly labeled case images, and the weak labeling indicates whether there is a lesion in the case image, but does not label the location of the lesion. case image.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述深度神经网络为Bninception网络,包括多个卷积层、池化层、数据压平层和融合层,采用注意力机制对Bninception网络倒数3个卷积层输出的图像特征进行筛选,筛选出与肝脏有关的图像特征,将筛选出的图像特征输入池化层、数据压平层和融合层,进行与肝脏有关的图像特征的特征融合。The method for classifying images of liver cases based on a deep neural network, wherein the deep neural network is a Bninception network, which includes multiple convolutional layers, pooling layers, data flattening layers and fusion layers. The image features output by the last three convolutional layers of the Bninception network are screened, and the image features related to the liver are screened out. feature fusion.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述采用注意力机制对上述图像特征赋予权重的步骤包括:The method for classifying images of liver cases based on a deep neural network, wherein the step of using an attention mechanism to assign weights to the above image features includes:
采用压缩操作,按照空间维度来进行压缩图像特征,将二维的特征通道变成1个数;The compression operation is used to compress the image features according to the spatial dimension, and the two-dimensional feature channels are changed into 1 number;
采用激发操作,为每个特征通道配有一个权重,所述权重反应特征通道之间相关性;Using the excitation operation, each feature channel is assigned a weight, and the weight reflects the correlation between the feature channels;
采用重复值操作,将激励操作的输出加权到原来的图像特征上。Using a repeated value operation, the output of the excitation operation is weighted to the original image features.
所述的基于深度神经网络的肝部病例图像分类方法,其中,所述深度神经网络包括多个神经网络模型,将多个神经网络模型经分类器输出的多个分类概率的平均值作为分类概率的最终结果。The method for classifying images of liver cases based on a deep neural network, wherein the deep neural network includes multiple neural network models, and the average value of multiple classification probabilities output by the multiple neural network models through the classifier is used as the classification probability. the final result.
根据本发明的另一个方面,提供一种基于深度神经网络的肝部病例图像分类系统,包括:According to another aspect of the present invention, a deep neural network-based liver case image classification system is provided, comprising:
采集模块,采集属于同一用户设定时间段的多张病例图像;The acquisition module collects multiple case images belonging to the same user-set time period;
重采样模块,对多张病例图像进行重采样,形成单张三维病例图像;Resampling module, resampling multiple case images to form a single 3D case image;
图像特征提取模块,将上述单张三维病例图像输入深度神经网络模型提取图像特征,所述图像特征包括图像颜色通道、位置信息、纹理和图像像素点之间关联性;Image feature extraction module, input the above-mentioned single three-dimensional case image into the deep neural network model to extract image features, and the image features include image color channel, position information, texture and correlation between image pixels;
赋权模块,采用注意力机制对上述图像特征赋予权重,与肝脏有关的图像特征的权重大于其他图像特征的权重;The weighting module uses the attention mechanism to give weights to the above image features, and the weights of the image features related to the liver are greater than the weights of other image features;
分类模块,将赋予权重的图像特征输入分类器,获得单张三维病例图像的分类概率,所述分类概率表示正常概率和非正常概率。The classification module inputs the weighted image features into the classifier, and obtains the classification probability of a single three-dimensional case image, where the classification probability represents the normal probability and the abnormal probability.
上述基于深度神经网络的肝部病例图像分类方法及系统通过注意力机制来帮助深度神经网络模型自动寻找位置信息,无需标注位置信息,极大降低标注工作量,在较少标注工作量情况下通过深度深度神经网络、注意力机制和分类模型得到较好的识别效果。The above-mentioned deep neural network-based liver case image classification method and system use the attention mechanism to help the deep neural network model to automatically find location information, without the need to label location information, greatly reducing the workload of labeling, and passing the method with less labeling workload. Deep deep neural network, attention mechanism and classification model get better recognition effect.
附图说明Description of drawings
图1是本发明所述基于深度神经网络的肝部病例图像分类方法的流程图;Fig. 1 is the flow chart of the liver case image classification method based on the deep neural network of the present invention;
图2是本发明所述重采样的示意图;Fig. 2 is the schematic diagram of the resampling of the present invention;
图3是本发明所述采用注意力机制对上述图像特征赋予权重的方法的示意图;3 is a schematic diagram of a method for assigning weights to the above-mentioned image features using an attention mechanism according to the present invention;
图4是本发明所述基于深度神经网络的肝部病例图像分类系统的构成框图;4 is a block diagram of the structure of the deep neural network-based liver case image classification system according to the present invention;
图5是本发明所述基于深度神经网络的肝部病例图像分类系统一个优选实施例的构成框图。FIG. 5 is a block diagram of a preferred embodiment of the deep neural network-based liver case image classification system according to the present invention.
具体实施方式Detailed ways
在下面的描述中,出于说明的目的,为了提供对一个或多个实施例的全面理解,阐述了许多具体细节。然而,很明显,也可以在没有这些具体细节的情况下实现这些实施例。在其它例子中,为了便于描述一个或多个实施例,公知的结构和设备以方框图的形式示出。In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of one or more embodiments. It will be apparent, however, that the embodiments may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate describing one or more embodiments.
下面将参照附图来对根据本发明的各个实施例进行详细描述。Various embodiments according to the present invention will be described in detail below with reference to the accompanying drawings.
图1是本发明所述基于深度神经网络的肝部病例图像分类方法的流程图,如图1所示,所述部病例图像分类方法包括:Fig. 1 is the flow chart of the liver case image classification method based on the deep neural network according to the present invention, as shown in Fig. 1, the described case image classification method includes:
步骤S1,获得属于同一用户设定时间段的多张病例图像;Step S1, obtaining multiple case images belonging to the same user-set time period;
步骤S2,对多张病例图像进行重采样,形成单张三维病例图像;Step S2, resampling multiple case images to form a single three-dimensional case image;
步骤S4,将上述单张三维病例图像输入深度神经网络模型提取图像特征,所述图像特征包括图像颜色通道、位置信息、纹理和图像像素点之间关联性;Step S4, the above-mentioned single three-dimensional case image is input into the deep neural network model to extract image features, and the image features include image color channel, position information, texture and correlation between image pixels;
步骤S5,采用注意力机制对上述图像特征赋予权重,与肝脏有关的图像特征的权重大于其他图像特征的权重;Step S5, using the attention mechanism to give weights to the above-mentioned image features, and the weights of the image features related to the liver are greater than the weights of other image features;
步骤S6,将赋予权重的图像特征输入分类器,获得单张三维病例图像的分类概率,所述分类概率表示正常概率和非正常概率。In step S6, the weighted image features are input into the classifier to obtain the classification probability of a single three-dimensional case image, where the classification probability represents normal probability and abnormal probability.
在一个实施例中,所述深度神经网络模型和分类器的训练步骤包括:In one embodiment, the training steps of the deep neural network model and the classifier include:
获得已知分类的多个用户的病例图像,每一个用户具有多张肝部病例图像;Obtain case images of multiple users of known classification, each user has multiple liver case images;
将每个用户的多张病例图像进行重采样,形成每个用户的单张三维病例图像;Resampling multiple case images of each user to form a single 3D case image for each user;
对每个用户重采样后的单张三维病例图像进行裁剪,获得每个用户的多张尺寸统一的三维病例图像块,构成深度神经网络的第一训练集;Crop a single 3D case image resampled by each user to obtain multiple 3D case image blocks of uniform size for each user, forming the first training set of the deep neural network;
采用深度神经网络提取第一训练集中每个用户的图像特征;Use a deep neural network to extract the image features of each user in the first training set;
采用注意力机制对每个用户的图像特征赋予权重,形成分类器的第二训练集;The attention mechanism is used to give weights to the image features of each user to form the second training set of the classifier;
将第二训练集输入分类器,获得第二训练集中每个用户的病例图像的分类;Input the second training set into the classifier to obtain the classification of each user's case image in the second training set;
采用交叉熵损失函数迭代对深度神经网络和分类器进行迭代训练Iterative training of deep neural networks and classifiers using cross-entropy loss function iteration
其中是分类器的输出分类概率大的分类标签、y是真实的分类标签。in is the classification label with a large classification probability of the output of the classifier, and y is the real classification label.
在一个实施例中,在步骤S2中,所述对多张病例图像进行重采样的步骤包括:In one embodiment, in step S2, the step of resampling multiple case images includes:
将属于同一用户的多张病例图像的二维图像按照从包含心脏的病例图像到包含腹部的病例图像堆叠为一个三维病例图像;stacking two-dimensional images of multiple case images belonging to the same user into a three-dimensional case image from the case image containing the heart to the case image containing the abdomen;
对三维病例图像进行重采样,将堆叠后的多张病例图像在厚度方向线性插值至设定厚度。The three-dimensional case images are resampled, and the stacked multiple case images are linearly interpolated to the set thickness in the thickness direction.
优选地,所述对多张病例图像进行重采样的步骤还包括:Preferably, the step of resampling the multiple case images further includes:
对重采样后的三维病例图像进行阈值映射,优选地,采用HU映射,Threshold mapping is performed on the resampled three-dimensional case image, preferably, HU mapping is used,
HU=pixel_value*rescale_slope+rescale_interceptHU=pixel_value*rescale_slope+rescale_intercept
其中,pixel_value为像素值,rescale_slope为伸缩范围,rescale_intercept为伸缩截距,HU为映射值。Among them, pixel_value is the pixel value, rescale_slope is the scaling range, rescale_intercept is the scaling intercept, and HU is the mapping value.
优选地,在步骤S4之前还包括步骤S3:Preferably, before step S4, it also includes step S3:
对重采样后的单张三维病例图像进行裁剪,获得多张尺寸统一的三维病例图像块,将多张三维图像块输入深度神经网络模型。The resampled single 3D case image is cropped to obtain multiple 3D case image blocks of uniform size, and the multiple 3D image blocks are input into the deep neural network model.
在一个具体实施例中,如图2所示,不同病人一次拍摄的CT图像为不同数量的多张2D图,我们首先将每个病人的所有2D图像按照上至下堆叠为一个3D图像,然后进行重采样,即将所有样本在Z维度(高度维度)线性插值至统一厚度(1mm x 1mm x 1mm),之后通过HU映射,最终裁剪至(160,256,256)的同一大小。In a specific embodiment, as shown in FIG. 2 , the CT images captured by different patients at one time are multiple 2D images of different numbers. We first stack all the 2D images of each patient into a 3D image from top to bottom, and then Resampling is performed, that is, all samples are linearly interpolated in the Z dimension (height dimension) to a uniform thickness (1mm x 1mm x 1mm), and then mapped through HU, and finally cropped to the same size of (160, 256, 256).
在一个优选实施例中,所述深度神经网络为Bninception网络,包括多个卷积层、池化层、数据压平层和融合层,采用注意力机制对Bninception网络倒数3个卷积层输出的图像特征进行筛选,筛选出与肝脏有关的图像特征,将筛选出的图像特征输入池化层、数据压平层和融合层,进行与肝脏有关的图像特征的特征融合。In a preferred embodiment, the deep neural network is a Bninception network, including a plurality of convolutional layers, pooling layers, data flattening layers and fusion layers. The image features are screened to screen out the image features related to the liver, and the screened image features are input into the pooling layer, the data flattening layer and the fusion layer to perform feature fusion of the image features related to the liver.
在步骤S4中,如图3所示,所述采用注意力机制对上述图像特征赋予权重的步骤包括:In step S4, as shown in Figure 3, the step of using the attention mechanism to assign weights to the above image features includes:
采用压缩操作,按照空间维度来进行压缩图像特征,将二维的特征通道变成1个数,例如,深度神经网络输入c1(B,C,W,H经过1x1卷积操作Ftr,改变特征通道数为C2,B是图像张数,C是特征通道,W是图像宽度,H是图像高度,W,H可以取每个像素点的点累和再取平均;The compression operation is used to compress the image features according to the spatial dimension, and the two-dimensional feature channel is changed into a number. For example, the deep neural network input c1 (B, C, W, H undergoes a 1x1 convolution operation Ftr to change the feature channel. The number is C2, B is the number of images, C is the feature channel, W is the width of the image, H is the height of the image, W, H can take the cumulative sum of each pixel and then average;
采用激发操作,为每个特征通道配有一个权重,所述权重反应特征通道之间相关性,例如,特征通道数C2经过Fsq全局池化操作,经过1x1卷积操Ftr特征通道数C2;Using the excitation operation, each feature channel is equipped with a weight, and the weight reflects the correlation between the feature channels. For example, the number of feature channels C2 is subjected to the global pooling operation of Fsq, and the number of Ftr feature channels C2 is operated by 1x1 convolution;
采用重复值操作,将激励操作的输出加权到原来的图像特征上,例如,权重与原输入相乘操作Fscale,最后输出为加权后的图像特征x~。Using repeated value operation, the output of the excitation operation is weighted to the original image feature, for example, the weight is multiplied by the original input to operate Fscale, and the final output is the weighted image feature x~.
在一个实施例中,所述深度神经网络包括多个神经网络模型,将多个神经网络模型经分类器输出的多个分类概率的平均值作为分类概率的最终结果。In one embodiment, the deep neural network includes multiple neural network models, and the average value of multiple classification probabilities output by the multiple neural network models through the classifier is used as the final result of the classification probability.
本发明所述基于深度神经网络的肝部病理图像的分类方法首先,考虑到肝部CT图像为多张2D图像切片的特点,对CT图像进行组合成单张3D图像;其次,考虑到组织切片之间厚度不一、不同病人切片张数不同的特性,将3D图像重采样到统一尺度、HU阈值转换并且进行裁剪至相同大小;然后,因为肝部CT中显示的干扰器官多,在网络中加入了注意力机制,让模型网络更专注于肝部病变区域的分类。最终,我们从神经网络中得到了较好的结果,即直接对一个病人的所有CT片子输出病变或者正常的类别分类。The classification method of liver pathological images based on the deep neural network of the present invention First, considering the characteristics of liver CT images as multiple 2D image slices, the CT images are combined into a single 3D image; secondly, considering the tissue slices Due to the characteristics of different thicknesses and different number of slices for different patients, the 3D images are resampled to a uniform scale, HU threshold conversion and cropped to the same size; then, because there are many interfering organs displayed in the liver CT, in the network An attention mechanism is added to allow the model network to focus more on the classification of liver lesions. In the end, we got better results from the neural network, which is to directly classify all CT slices of a patient to output the lesion or normal category.
图4是本发明基于深度神经网络的肝部病例图像分类系统的构成框图,如图4所示,所述肝部病例图像分类系统包括:FIG. 4 is a block diagram showing the structure of the liver case image classification system based on the deep neural network of the present invention. As shown in FIG. 4 , the liver case image classification system includes:
采集模块1,采集属于同一用户设定时间段的多张病例图像;The acquisition module 1 collects multiple case images belonging to the same user-set time period;
重采样模块2,对多张病例图像进行重采样,形成单张三维病例图像;Resampling module 2, resampling multiple case images to form a single three-dimensional case image;
图像特征提取模块3,将上述单张三维病例图像输入深度神经网络模型提取图像特征,所述图像特征包括图像颜色通道、位置信息、纹理和图像像素点之间关联性;Image
赋权模块4,采用注意力机制对上述图像特征赋予权重,与肝脏有关的图像特征的权重大于其他图像特征的权重;The weighting module 4 uses the attention mechanism to give weights to the above-mentioned image features, and the weights of the image features related to the liver are greater than the weights of other image features;
分类模块5,将赋予权重的图像特征输入分类器,获得单张三维病例图像的分类概率,所述分类概率表示正常概率和非正常概率。The classification module 5 inputs the weighted image features into the classifier to obtain the classification probability of a single three-dimensional case image, where the classification probability represents normal probability and abnormal probability.
优选地,还包括裁剪模块6,对重采样后的单张三维病例图像进行裁剪,获得多张尺寸统一的三维病例图像块,将多张三维图像块输入深度神经网络模型。Preferably, a cropping module 6 is also included to crop the resampled single 3D case image to obtain multiple 3D case image blocks of uniform size, and input the multiple 3D image blocks into the deep neural network model.
优选地,还包括训练模块,对深度神经网络模型和分类器进行训练,包括:Preferably, a training module is also included to train the deep neural network model and the classifier, including:
第一训练集构建单元,获得已知分类的多个用户的病例图像,每一个用户具有多张肝部病例图像;将每个用户的多张病例图像进行重采样,形成每个用户的单张三维病例图像;对每个用户重采样后的单张三维病例图像进行裁剪,获得每个用户的多张尺寸统一的三维病例图像块,构成深度神经网络的第一训练集;The first training set construction unit obtains case images of multiple users of known classification, each user has multiple liver case images; resamples multiple case images of each user to form a single image of each user 3D case image; crop a single 3D case image resampled by each user to obtain multiple 3D case image blocks of uniform size for each user, forming the first training set of the deep neural network;
第二训练集构建单元,采用注意力机制对每个用户的图像特征赋予权重,形成分类器的第二训练集;The second training set construction unit adopts the attention mechanism to assign weights to the image features of each user to form the second training set of the classifier;
训练单元,采用深度神经网络提取第一训练集中每个用户的图像特征;将第二训练集输入分类器,获得第二训练集中每个用户的病例图像的分类;采用交叉熵损失函数迭代对深度神经网络和分类器进行迭代训练。The training unit adopts a deep neural network to extract the image features of each user in the first training set; the second training set is input into the classifier to obtain the classification of the case images of each user in the second training set; the cross entropy loss function is used to iteratively analyze the depth Neural networks and classifiers are iteratively trained.
在一个实施例中,重采样模块2包括:In one embodiment, the resampling module 2 includes:
堆叠单元,将属于同一用户的多张病例图像的二维图像按照从包含心脏的病例图像到包含腹部的病例图像堆叠为一个三维病例图像;a stacking unit, which stacks the two-dimensional images of the multiple case images belonging to the same user into a three-dimensional case image from the case image containing the heart to the case image containing the abdomen;
插值单元,对三维病例图像进行重采样,将堆叠后的多张病例图像在厚度方向线性插值至设定厚度。The interpolation unit resamples the three-dimensional case images, and linearly interpolates the stacked multiple case images to the set thickness in the thickness direction.
优选地,重采样模块2还包括阈值映射单元,对重采样后的三维病例图像进行阈值映射。Preferably, the resampling module 2 further includes a threshold mapping unit, which performs threshold mapping on the resampled three-dimensional case image.
在本发明的一个优选实施例中,如图5所示,所述深度神经网络为Bninception网络,包括多个卷积层、池化层、数据压平层和融合层,所述赋权模块4包括多个SE模块,所述分类模块5包括分类器和收敛模块,其中,In a preferred embodiment of the present invention, as shown in FIG. 5 , the deep neural network is a Bninception network, including multiple convolution layers, pooling layers, data flattening layers and fusion layers, and the weighting module 4 Including a plurality of SE modules, the classification module 5 includes a classifier and a convergence module, wherein,
所述SE模块包括压缩单元、激发单元和重赋值单元,所述压缩单元采用压缩操作(Squeeze操作)按照空间维度来进行压缩图像特征,将二维的特征通道变成1个数,例如,输入到全局平均池化,直接把图像颜色通道和位置信息降低到一个数;所述激发单元采用激发操作(Excitation操作)为每个特征通道配有一个权重,所述权重反应特征通道之间相关性,例如,先后经过1x1卷积、sigmoid函数、1x1卷积,获得权重值;所述重赋值单元采用重复值操作(Reweight操作)将激励操作的输出加权到原来的图像特征上,也就是说,将激发单元输出权重与网络输入到压缩单元之前的特征相乘,即为SE模块的输出;The SE module includes a compression unit, an excitation unit and a reassignment unit. The compression unit uses a compression operation (Squeeze operation) to compress the image features according to the spatial dimension, and changes the two-dimensional feature channel into a number, for example, input To global average pooling, the image color channel and position information are directly reduced to a number; the excitation unit adopts an excitation operation (Excitation operation) to assign a weight to each feature channel, and the weight reflects the correlation between feature channels. , for example, successively through 1x1 convolution, sigmoid function, and 1x1 convolution to obtain the weight value; the reassignment unit adopts the repeated value operation (Reweight operation) to weight the output of the excitation operation to the original image feature, that is, The output weight of the excitation unit is multiplied by the feature before the network input to the compression unit, which is the output of the SE module;
其中,所述SE模块采用注意力机制对Bninception网络倒数3个卷积层输出的图像特征进行筛选,筛选出与肝脏有关的图像特征,将筛选出的图像特征输入池化层、数据压平层和融合层,进行与肝脏有关的图像特征的特征融合。Among them, the SE module uses the attention mechanism to screen the image features output by the last three convolutional layers of the Bninception network, screen out the image features related to the liver, and input the screened image features into the pooling layer and the data flattening layer. and fusion layer to perform feature fusion of liver-related image features.
BNinception网络不同深浅的CNN输出,每个SE模块相同,目的是在不同CNN输出值做一个重要程度筛选,提高重要特征权重,降低其余权重值,CNN特征输入至全局平均池化->1x1卷积->sigmoid->1x1卷积之后再与原输入相乘,具体为:Squeeze操作是按照空间维度来进行压缩特征,将二维的特征通道变成1个数(全局平均池化),其在某种程度代表着全局的感受野。Excitation操作为神经网络中的门机制(1x1卷积,为每个通道配有一个权重w,来反应通道之间相关性。Reweight操作将Excitation操作的输出加权到原来的特征上,来得到最终的结果。The CNN output of different depths of BNinception network, each SE module is the same, the purpose is to filter the importance of different CNN output values, increase the weight of important features, reduce the rest of the weight values, CNN features are input to the global average pooling -> 1x1 convolution ->sigmoid->1x1 convolution and then multiplied with the original input, specifically: the Squeeze operation is to compress the features according to the spatial dimension, and change the two-dimensional feature channel into 1 number (global average pooling), which is in To some extent, it represents the global receptive field. The excitation operation is a gate mechanism in the neural network (1x1 convolution, with a weight w for each channel to reflect the correlation between channels. The Reweight operation weights the output of the excitation operation to the original features to obtain the final result.
Bninception包括多个卷积层,融合不同层特征有利于获得高低级信息,但是越小stage越低级的特征不利于分类,实验中3层最好,融合是直接拼接在一起,分别经过全局平均池化和最大池化,再拼接一起,经过全连接层输出最后是否癌变的概率值。Bninception includes multiple convolutional layers. The fusion of different layer features is beneficial to obtain high-level information, but the smaller the stage, the lower-level features are not conducive to classification. In the experiment, 3 layers are the best, and the fusion is directly spliced together. Pooling and max pooling, splicing together, and outputting the final probability value of canceration through the fully connected layer.
在一个实施例中,图像特征提取模块3的深度神经网络包括多个神经网络模型,分类模块5将多个神经网络模型经分类器输出的多个分类概率的平均值作为分类概率的最终结果,例如,深度神经网络还可以包括resnet50,也加上了SE模块,SE模块只是影响了resnet\bninception的内部多个CNN输出值,继而影响了最终输出概率值。In one embodiment, the deep neural network of the image
在上述各实施例中,所述多张病例图像为弱标注的病例图像,所述弱标注是标注了病例图像中是否有病变,但是没有标注病变位置的病例图像。因为数据是弱标注,并没有标注出病人的病变位置,只是简单给出了这个病人所有CT中有无病变信息(1代表有病变、0代表没有病变),因为人体器官较多,在没有标注病变位置的情况下模型较难分类,因此,将注意力机制融合至BNInception网络中,一是将2D网络改为3D、二是用SEModu1e注意力机制来筛选与肝脏有关的特征,让模型专注于肝部分类、三是融合倒数3个stages(卷积层)的输出特征。In each of the above embodiments, the multiple case images are weakly labeled case images, and the weakly labeled case images are case images that indicate whether there is a lesion in the case image, but do not label the location of the lesion. Because the data is weakly labeled, the location of the patient's lesion is not marked, but simply gives the information about whether there is any lesion in all CTs of the patient (1 means there is a lesion, 0 means no lesion), because there are many human organs, there is no labelling. The model is more difficult to classify in the case of the lesion location. Therefore, the attention mechanism is integrated into the BNInception network. One is to change the 2D network to 3D, and the other is to use the SEModu1e attention mechanism to screen features related to the liver, so that the model can focus on The third is to fuse the output features of the last three stages (convolutional layers).
尽管前面公开的内容示出了本发明的示例性实施例,但是应当注意,在不背离权利要求限定的范围的前提下,可以进行多种改变和修改。根据这里描述的发明实施例的方法权利要求的功能、步骤和/或动作不需以任何特定顺序执行。此外,尽管本发明的元素可以以个体形式描述或要求,但是也可以设想具有多个元素,除非明确限制为单个元素。Although the foregoing disclosure shows exemplary embodiments of the present invention, it should be noted that various changes and modifications can be made without departing from the scope as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the inventive embodiments described herein need not be performed in any particular order. Furthermore, although elements of the invention may be described or claimed in an individual form, it is also contemplated that there are multiple elements unless explicitly limited to a single element.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010350199.6A CN111666972A (en) | 2020-04-28 | 2020-04-28 | Liver case image classification method and system based on deep neural network |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010350199.6A CN111666972A (en) | 2020-04-28 | 2020-04-28 | Liver case image classification method and system based on deep neural network |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN111666972A true CN111666972A (en) | 2020-09-15 |
Family
ID=72382935
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010350199.6A Pending CN111666972A (en) | 2020-04-28 | 2020-04-28 | Liver case image classification method and system based on deep neural network |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111666972A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112183645A (en) * | 2020-09-30 | 2021-01-05 | 深圳龙岗智能视听研究院 | Image aesthetic quality evaluation method based on context-aware attention mechanism |
| CN114359637A (en) * | 2022-01-10 | 2022-04-15 | 华南师范大学 | Brain medical image classification method and device |
Citations (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105139030A (en) * | 2015-08-18 | 2015-12-09 | 青岛海信医疗设备股份有限公司 | Method for sorting hepatic vessels |
| CN108603922A (en) * | 2015-11-29 | 2018-09-28 | 阿特瑞斯公司 | Automatic cardiac volume is divided |
| CN109003672A (en) * | 2018-07-16 | 2018-12-14 | 北京睿客邦科技有限公司 | A kind of early stage of lung cancer detection classification integration apparatus and system based on deep learning |
| CN109035234A (en) * | 2018-07-25 | 2018-12-18 | 腾讯科技(深圳)有限公司 | A kind of nodule detection methods, device and storage medium |
| CN109102506A (en) * | 2018-08-20 | 2018-12-28 | 东北大学 | A kind of automatic division method carrying out abdominal CT hepatic disease image based on three-stage cascade network |
| CN109165667A (en) * | 2018-07-06 | 2019-01-08 | 中国科学院自动化研究所 | Based on the cerebral disease categorizing system from attention mechanism |
| CN109410167A (en) * | 2018-08-31 | 2019-03-01 | 深圳大学 | A kind of analysis method and Related product of 3D galactophore image |
| CN110288577A (en) * | 2019-06-20 | 2019-09-27 | 翼健(上海)信息科技有限公司 | A kind of 3D realizes the control method and control device of multiple organ lesion integration |
| CN110348515A (en) * | 2019-07-10 | 2019-10-18 | 腾讯科技(深圳)有限公司 | Image classification method, image classification model training method and device |
| CN110475505A (en) * | 2017-01-27 | 2019-11-19 | 阿特瑞斯公司 | Automatic segmentation using fully convolutional networks |
| CN110619947A (en) * | 2019-09-19 | 2019-12-27 | 南京工程学院 | Lung CT auxiliary screening system and method based on lightweight deep learning |
| CN110766713A (en) * | 2019-10-30 | 2020-02-07 | 上海微创医疗器械(集团)有限公司 | Lung image segmentation method and device and lung lesion region identification equipment |
| US20200085382A1 (en) * | 2017-05-30 | 2020-03-19 | Arterys Inc. | Automated lesion detection, segmentation, and longitudinal identification |
-
2020
- 2020-04-28 CN CN202010350199.6A patent/CN111666972A/en active Pending
Patent Citations (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105139030A (en) * | 2015-08-18 | 2015-12-09 | 青岛海信医疗设备股份有限公司 | Method for sorting hepatic vessels |
| CN108603922A (en) * | 2015-11-29 | 2018-09-28 | 阿特瑞斯公司 | Automatic cardiac volume is divided |
| CN110475505A (en) * | 2017-01-27 | 2019-11-19 | 阿特瑞斯公司 | Automatic segmentation using fully convolutional networks |
| US20200085382A1 (en) * | 2017-05-30 | 2020-03-19 | Arterys Inc. | Automated lesion detection, segmentation, and longitudinal identification |
| CN109165667A (en) * | 2018-07-06 | 2019-01-08 | 中国科学院自动化研究所 | Based on the cerebral disease categorizing system from attention mechanism |
| CN109003672A (en) * | 2018-07-16 | 2018-12-14 | 北京睿客邦科技有限公司 | A kind of early stage of lung cancer detection classification integration apparatus and system based on deep learning |
| CN109035234A (en) * | 2018-07-25 | 2018-12-18 | 腾讯科技(深圳)有限公司 | A kind of nodule detection methods, device and storage medium |
| CN109102506A (en) * | 2018-08-20 | 2018-12-28 | 东北大学 | A kind of automatic division method carrying out abdominal CT hepatic disease image based on three-stage cascade network |
| CN109410167A (en) * | 2018-08-31 | 2019-03-01 | 深圳大学 | A kind of analysis method and Related product of 3D galactophore image |
| CN110288577A (en) * | 2019-06-20 | 2019-09-27 | 翼健(上海)信息科技有限公司 | A kind of 3D realizes the control method and control device of multiple organ lesion integration |
| CN110348515A (en) * | 2019-07-10 | 2019-10-18 | 腾讯科技(深圳)有限公司 | Image classification method, image classification model training method and device |
| CN110619947A (en) * | 2019-09-19 | 2019-12-27 | 南京工程学院 | Lung CT auxiliary screening system and method based on lightweight deep learning |
| CN110766713A (en) * | 2019-10-30 | 2020-02-07 | 上海微创医疗器械(集团)有限公司 | Lung image segmentation method and device and lung lesion region identification equipment |
Non-Patent Citations (2)
| Title |
|---|
| PENGSHUAI YANG等: "Liver Histopathological Image Retrieval Based on Deep Metric Learning", 《2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE》 * |
| PENGSHUAI YANG等: "Liver Histopathological Image Retrieval Based on Deep Metric Learning", 《2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE》, 31 December 2019 (2019-12-31) * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112183645A (en) * | 2020-09-30 | 2021-01-05 | 深圳龙岗智能视听研究院 | Image aesthetic quality evaluation method based on context-aware attention mechanism |
| CN114359637A (en) * | 2022-01-10 | 2022-04-15 | 华南师范大学 | Brain medical image classification method and device |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112819910B (en) | Hyperspectral image reconstruction method based on double-ghost attention machine mechanism network | |
| CN116258933B (en) | Medical image segmentation device based on global information perception | |
| CN114004847B (en) | Medical image segmentation method based on graph reversible neural network | |
| CN113344110B (en) | Fuzzy image classification method based on super-resolution reconstruction | |
| CN110889852A (en) | Liver segmentation method based on residual-attention deep neural network | |
| CN110490262B (en) | Image processing model generation method, image processing method, device and electronic device | |
| CN110135574A (en) | Neural network training method, image generation method and computer storage medium | |
| CN112131959A (en) | 2D human body posture estimation method based on multi-scale feature reinforcement | |
| CN114399510A (en) | Skin lesion segmentation and classification method and system combining image and clinical metadata | |
| CN107220971A (en) | A kind of Lung neoplasm feature extracting method based on convolutional neural networks and PCA | |
| CN110929685A (en) | Pedestrian detection network structure based on mixed feature pyramid and mixed expansion convolution | |
| CN119478561A (en) | Breast image classification and evaluation method and system based on deep learning | |
| CN117975170B (en) | Medical information processing method and system based on big data | |
| CN114999637B (en) | Pathological image diagnosis method and system based on multi-angle coding and embedded mutual learning | |
| CN112149662A (en) | A Multimodal Fusion Saliency Detection Method Based on Dilated Convolution Blocks | |
| CN112419342B (en) | Image processing method, device, electronic device and computer readable medium | |
| CN119048351B (en) | Light field image space super-resolution method based on space-angle decoupling mechanism | |
| CN113076890A (en) | Facial expression recognition method and system based on improved channel attention mechanism | |
| CN114926734B (en) | Solid waste detection device and method based on feature aggregation and attention fusion | |
| CN118781420A (en) | A method for frontal classification of lesions in medical images based on deep learning network | |
| CN111666972A (en) | Liver case image classification method and system based on deep neural network | |
| CN112242002A (en) | Object recognition and panorama roaming method based on deep learning | |
| CN119360124B (en) | Brain age prediction method, system, device and medium based on multi-mode information fusion | |
| CN112862837B (en) | A method and system for image processing based on convolutional neural network | |
| CN119090831B (en) | A full-reference light field image quality assessment method from sparse to dense viewpoints |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200915 |
|
| RJ01 | Rejection of invention patent application after publication |