WO2025236925A1

WO2025236925A1 - Method and apparatus for detecting circulating tumor cells, device, and medium

Info

Publication number: WO2025236925A1
Application number: PCT/CN2025/087761
Authority: WO
Inventors: 印明柱; 胡春兰; 赖仁胜; 吴粤松; 张宗波
Original assignee: Chongqing University Three Gorges Hospital
Current assignee: Chongqing University Three Gorges Hospital
Priority date: 2024-05-17
Filing date: 2025-04-08
Publication date: 2025-11-20
Anticipated expiration: 2026-11-17
Also published as: CN118379732A

Abstract

The present application relates to the technical field of biomedicine, and discloses a method and an apparatus for detecting circulating tumor cells, a device, and a medium. The method comprises: acquiring a target optical image of an object under detection, and using a multi-scale feature map of the target optical image to obtain a region-of-interest feature map; inputting the region-of-interest feature map into a target multi-level mining detection model, wherein the target multi-level mining detection model comprises a plurality of detection heads; and grouping self-attention regions of interest in the region-of-interest feature map into initial positive samples and initial negative samples via a self-attention mechanism module in the target multi-level mining detection model, using the detection heads to screen the initial positive samples and the initial negative samples to obtain a target positive sample containing circulating tumor cells, and determining the target positive sample as a circulating tumor cell detection result for the object under detection. Using the model to perform automated circulating tumor cell detection can improve the efficiency and accuracy of detecting circulating tumor cells and reduce the time required for detection.

Description

Methods, devices, equipment and media for detecting circulating tumor cells

Technical Field

本发明涉及生物医药技术领域，特别涉及循环肿瘤细胞检测方法、装置、设备及介质。This invention relates to the field of biomedical technology, and in particular to methods, devices, equipment and media for detecting circulating tumor cells.

Background Technology

癌细胞侵袭人体，可能会出现在肿瘤发展的早期，癌细胞从原发肿瘤到远处重要器官的转移是导致癌症相关死亡的主要原因。循环肿瘤细胞(Circulating Tumor Cells，即CTCs)是肿瘤发展早期原发肿瘤脱落的癌细胞。并会传播到淋巴结、骨髓或外周血中。大量研究表明，CTCs可用于预测癌细胞转移性，甚至早期癌症患者的疾病进展和可存活时间，它可以作为临床试验中癌细胞的生物标志物，用于识别治疗靶点和耐药机制。Cancer cells invade the body, potentially in the early stages of tumor development. Metastasis from the primary tumor to distant vital organs is a leading cause of cancer-related death. Circulating tumor cells (CTCs) are cancer cells shed from the primary tumor in the early stages of development and can spread to lymph nodes, bone marrow, or peripheral blood. Numerous studies have shown that CTCs can be used to predict cancer cell metastasis, and even disease progression and survival time in early-stage cancer patients. They can serve as biomarkers for cancer cells in clinical trials, helping to identify therapeutic targets and drug resistance mechanisms.

然而CTCs的检测和鉴别由于其稀缺性和异质性而具有挑战性，它在外周血中的浓度非常低，在大多数癌症患者中每10毫升只有1到10个细胞。目前采用的检测方法包括免疫荧光、流式细胞术和基于核酸的方法，但是，这些方法所需检测时间较长，并且存在主观性，导致最终检测结果存在高假阴性。However, the detection and identification of CTCs is challenging due to their scarcity and heterogeneity; their concentration in peripheral blood is very low, with only 1 to 10 cells per 10 ml in most cancer patients. Current detection methods include immunofluorescence, flow cytometry, and nucleic acid-based methods; however, these methods are time-consuming and subjective, leading to a high false-negative rate.

综上可见，如何提高循环肿瘤细胞检测的效率和准确度是本领域有待解决的问题。In summary, improving the efficiency and accuracy of circulating tumor cell detection is a problem that needs to be solved in this field.

Summary of the Invention

有鉴于此，本发明的目的在于提供一种循环肿瘤细胞检测方法、装置、设备及介质，提高循环肿瘤细胞检测的效率和准确度。其具体方案如下：In view of this, the purpose of this invention is to provide a method, apparatus, device, and medium for detecting circulating tumor cells, thereby improving the efficiency and accuracy of circulating tumor cell detection. The specific solution is as follows:

第一方面，本申请公开了一种循环肿瘤细胞检测方法，包括：In a first aspect, this application discloses a method for detecting circulating tumor cells, comprising:

采集待检测对象的目标光学图像，利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图；Acquire a target optical image of the object to be detected, and use the multi-scale feature map of the target optical image to obtain the feature map of the region of interest.

将所述感兴趣区域特征图输入至目标多级挖掘检测模型；其中，所述目标多级挖掘检测模型包含多个检测头；The feature map of the region of interest is input into the multi-level target mining and detection model; wherein, the multi-level target mining and detection model contains multiple detection heads;

通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，并将所述目标正样本确定为所述待检测对象的循环肿瘤细胞检测结果。The self-attention mechanism module in the target multi-level mining detection model divides each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples. The detection heads are used to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and the target positive samples are determined as the circulating tumor cell detection results of the target object.

可选的，所述采集待检测对象的目标光学图像，包括：Optionally, acquiring the target optical image of the object to be detected includes:

获取经染色后的待检测对象的数字化玻片，并获取所述数字化玻片在不同拍摄状态下的各个初始光学图像；A stained digital slide of the object to be tested is acquired, and initial optical images of the digital slide under different shooting conditions are acquired.

调整各个所述初始光学图像的像素大小，以得到目标光学图像。Adjust the pixel size of each of the initial optical images to obtain the target optical image.

可选的，所述利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图，包括：Optionally, obtaining the region of interest feature map using the multi-scale feature map of the target optical image includes:

利用基于特征金字塔网络的骨干网络提取所述目标光学图像的卷积特征，以得到多尺度特征图；Convolutional features of the target optical image are extracted using a backbone network based on a feature pyramid network to obtain a multi-scale feature map;

将所述多尺度特征图输入至区域提案网络，以得到感兴趣区域特征图。The multi-scale feature map is input into the region proposal network to obtain the region of interest feature map.

可选的，所述将所述感兴趣区域特征图输入至目标多级挖掘检测模型之前，还包括：Optionally, before inputting the region of interest feature map into the target multi-level mining detection model, the method further includes:

利用训练数据集对初始多级挖掘检测模型中的各个检测头进行训练，以得到当前多级挖掘检测模型的各个检测头；The detection heads in the initial multi-level mining detection model are trained using the training dataset to obtain the detection heads in the current multi-level mining detection model.

基于最小损失函数对所述当前多级挖掘检测模型的各个检测头进行反向优化，以得到目标多级挖掘检测模型。The target multi-level mining detection model is obtained by back-optimizing each detection head of the current multi-level mining detection model based on the minimum loss function.

可选的，所述通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，包括：Optionally, the step of dividing each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples through the self-attention mechanism module in the multi-level target mining detection model includes:

通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块生成所述感兴趣区域特征图的各个自注意力感兴趣区域，根据各个所述自注意力感兴趣区域的交并比将各个所述自注意力感兴趣区域划分为初始正样本和初始负样本。The self-attention mechanism module of the initial detection head in the target multi-level mining detection model generates each self-attention region of interest in the feature map of the region of interest, and divides each self-attention region of interest into initial positive samples and initial negative samples according to the intersection-union ratio of each self-attention region of interest.

可选的，所述通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块生成所述感兴趣区域特征图的各个自注意力感兴趣区域，包括：Optionally, the step of generating each self-attention region of interest (ROI) of the ROI feature map through the self-attention mechanism module of the initial detection head in the target multi-level mining detection model includes:

通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块将所述感兴趣区域特征图进行矩阵转换，以得到目标矩阵，并将所述目标矩阵转化为查询向量、键向量和值向量，利用SoftMax函数将所述查询向量与所述键向量的乘积转换为自注意力得分，获取所述自注意力得分与所述值向量之间的矩阵乘积，对所述矩阵乘积进行卷积，以得到维度恢复后矩阵，将所述维度恢复后矩阵和所述感兴趣区域特征图进行逐点相加，以得到所述感兴趣区域特征图的各个自注意力感兴趣区域。The self-attention mechanism module of the initial detection head in the target multi-level mining detection model performs matrix transformation on the feature map of the region of interest to obtain the target matrix. The target matrix is then transformed into a query vector, a key vector, and a value vector. The SoftMax function is used to convert the product of the query vector and the key vector into a self-attention score. The matrix product between the self-attention score and the value vector is obtained. The matrix product is then convolved to obtain the dimension-restored matrix. The dimension-restored matrix and the feature map of the region of interest are then added point by point to obtain each self-attention region of interest in the feature map of the region of interest.

可选的，所述利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，包括：Optionally, the step of using each of the detection heads to screen for target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples includes:

基于预设正负样本比例，并利用采样器从所述初始正样本和所述初始负样本中筛选出当前初始样本，剔除所述当前初始样本中的难分负样本，以得到当前目标样本，然后将所述当前目标样本输入至当前检测头，以获取所述当前检测头从当前目标样本中筛选出的下一正样本；Based on a preset positive and negative sample ratio, and using a sampler to select the current initial sample from the initial positive sample and the initial negative sample, the difficult negative samples in the current initial sample are removed to obtain the current target sample. Then, the current target sample is input to the current detection head to obtain the next positive sample selected by the current detection head from the current target sample.

若所述当前检测头为最后一个检测头，则将所述下一正样本确定为含有循环肿瘤细胞的目标正样本；If the current detection head is the last detection head, then the next positive sample is determined as the target positive sample containing circulating tumor cells;

若所述当前检测头不为最后一个检测头，则基于所述预设正负样本比例，并利用所述采样器从所述下一正样本和所述初始负样本中筛选出下一初始样本，并将所述下一初始样本更新为当前初始样本，然后重新跳转至所述剔除所述当前初始样本中的难分负样本的步骤。If the current detection head is not the last detection head, then based on the preset positive and negative sample ratio, the sampler is used to select the next initial sample from the next positive sample and the initial negative sample, and the next initial sample is updated to the current initial sample. Then, the process jumps back to the step of removing the difficult-to-distinguish negative samples from the current initial sample.

第二方面，本申请公开了一种循环肿瘤细胞检测装置，包括：Secondly, this application discloses a circulating tumor cell detection device, comprising:

特征图获取模块，用于采集待检测对象的目标光学图像，利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图；The feature map acquisition module is used to acquire the target optical image of the object to be detected, and to acquire the region of interest feature map using the multi-scale feature map of the target optical image.

特征图输入模块，用于将所述感兴趣区域特征图输入至目标多级挖掘检测模型；其中，所述目标多级挖掘检测模型包含多个检测头；The feature map input module is used to input the feature map of the region of interest into the target multi-level mining detection model; wherein, the target multi-level mining detection model includes multiple detection heads;

检测结果确定模块，用于通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，并将所述目标正样本确定为所述待检测对象的循环肿瘤细胞检测结果。The detection result determination module is used to divide each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples through the self-attention mechanism module in the multi-level target mining detection model, and to use each of the detection heads to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and to determine the target positive samples as the circulating tumor cell detection result of the object to be detected.

第三方面，本申请公开了一种电子设备，包括：Thirdly, this application discloses an electronic device, including:

存储器，用于保存计算机程序；Memory, used to store computer programs;

处理器，用于执行所述计算机程序，以实现前述公开的循环肿瘤细胞检测方法的步骤。A processor is configured to execute the computer program to implement the steps of the aforementioned disclosed method for detecting circulating tumor cells.

第四方面，本申请公开了一种计算机可读存储介质，用于存储计算机程序；其中，所述计算机程序被处理器执行时实现前述公开的循环肿瘤细胞检测方法的步骤。Fourthly, this application discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, it implements the steps of the aforementioned disclosed circulating tumor cell detection method.

本申请有益效果为：本申请采集待检测对象的目标光学图像，利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图；将所述感兴趣区域特征图输入至目标多级挖掘检测模型；其中，所述目标多级挖掘检测模型包含多个检测头；通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，并将所述目标正样本确定为所述待检测对象的循环肿瘤细胞检测结果。由此可见，本申请将待检测对象的感兴趣区域特征图输入目标多级挖掘检测模型，以得到最终的含有循环肿瘤细胞的目标正样本，即得到待检测对象的循环肿瘤细胞检测结果，也就是说，利用模型进行自动化循环肿瘤细胞检测，能够提高循环肿瘤细胞检测的效率，降低检测所需时间，进一步的，目标多级挖掘检测模型中包含多个检测头，能够提升目标多级挖掘检测模型的分类能力，可以从不够准确的初始正样本和初始负样本中筛选出含有循环肿瘤细胞的目标正样本，提高循环肿瘤细胞检测的准确度。The beneficial effects of this application are as follows: This application acquires a target optical image of the object to be detected, and uses the multi-scale feature map of the target optical image to obtain a region of interest feature map; the region of interest feature map is input into a target multi-level mining detection model; wherein, the target multi-level mining detection model contains multiple detection heads; through the self-attention mechanism module in the target multi-level mining detection model, each self-attention region of interest in the region of interest feature map is divided into initial positive samples and initial negative samples, and each of the detection heads is used to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and the target positive samples are determined as the circulating tumor cell detection result of the object to be detected. Therefore, this application inputs the feature map of the region of interest of the object to be detected into the target multi-level mining detection model to obtain the final target positive sample containing circulating tumor cells, that is, to obtain the circulating tumor cell detection result of the object to be detected. In other words, using the model for automated circulating tumor cell detection can improve the efficiency of circulating tumor cell detection and reduce the detection time. Furthermore, the target multi-level mining detection model contains multiple detection heads, which can improve the classification ability of the target multi-level mining detection model. It can screen out the target positive sample containing circulating tumor cells from the initial positive sample and the initial negative sample that are not accurate enough, thereby improving the accuracy of circulating tumor cell detection.

Attached Figure Description

为了更清楚地说明本申请实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本发明的实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据提供的附图获得其他的附图。To more clearly illustrate the technical solutions in the embodiments of this application or the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, the drawings described below are only embodiments of the present invention. For those skilled in the art, other drawings can be obtained based on the provided drawings without creative effort.

图1为本申请公开的一种循环肿瘤细胞检测方法流程图；Figure 1 is a flowchart of a circulating tumor cell detection method disclosed in this application;

图2为本申请公开的一种具体的自注意力感兴趣区域获取示意图；Figure 2 is a schematic diagram of a specific self-attention region of interest acquisition disclosed in this application;

图3为本申请公开的一种具体的目标多级挖掘检测模型处理示意图；Figure 3 is a schematic diagram of a specific multi-level target mining and detection model disclosed in this application;

图4为本申请公开的一种具体的多级挖掘检测模型评估示意图；Figure 4 is a schematic diagram of the evaluation of a specific multi-level mining detection model disclosed in this application;

图5为本申请公开的一种循环肿瘤细胞检测装置结构示意图；Figure 5 is a schematic diagram of the structure of a circulating tumor cell detection device disclosed in this application;

图6为本申请公开的一种电子设备结构图。Figure 6 is a structural diagram of an electronic device disclosed in this application.

Detailed Implementation

下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions of the embodiments of this application will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are only some embodiments of the present invention, and not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative effort are within the scope of protection of the present invention.

癌细胞侵袭人体，可能会出现在肿瘤发展的早期，癌细胞从原发肿瘤到远处重要器官的转移是导致癌症相关死亡的主要原因。循环肿瘤细胞是肿瘤发展早期原发肿瘤脱落的癌细胞。并会传播到淋巴结、骨髓或外周血中。大量研究表明，CTCs可用于预测癌细胞转移性，甚至早期癌症患者的疾病进展和可存活时间，它可以作为临床试验中癌细胞的生物标志物，用于识别治疗靶点和耐药机制。Cancer cells invade the body, potentially in the early stages of tumor development. Metastasis from the primary tumor to distant vital organs is a leading cause of cancer-related deaths. Circulating tumor cells (CTCs) are cancer cells shed from the primary tumor in the early stages of cancer development and can spread to lymph nodes, bone marrow, or peripheral blood. Numerous studies have shown that CTCs can be used to predict cancer cell metastasis, and even disease progression and survival time in early-stage cancer patients. They can serve as biomarkers for cancer cells in clinical trials, identifying therapeutic targets and drug resistance mechanisms.

为此本申请相应的提供了一种循环肿瘤细胞检测方案，提高循环肿瘤细胞检测的效率和准确度。Therefore, this application provides a circulating tumor cell detection scheme to improve the efficiency and accuracy of circulating tumor cell detection.

参见图1所示，本申请实施例公开了一种循环肿瘤细胞检测方法，包括：Referring to Figure 1, this application discloses a method for detecting circulating tumor cells, including:

步骤S11：采集待检测对象的目标光学图像，利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图。Step S11: Acquire the target optical image of the object to be detected, and use the multi-scale feature map of the target optical image to obtain the region of interest feature map.

本实施例中，所述采集待检测对象的目标光学图像，包括：获取经染色后的待检测对象的数字化玻片，并获取所述数字化玻片在不同拍摄状态下的各个初始光学图像；调整各个所述初始光学图像的像素大小，以得到目标光学图像。待检测对象为患者待检测部位的外周血，该外周血中可能含有循环肿瘤细胞，将待检测对象进行H&E染色(Hematoxylin and eosinstains，即苏木精和伊红染色)或亚甲基蓝染色，获取染色后的待检测对象的数字化玻片，将数字化玻片以拍摄状态下不同在光学显微镜下以固定放大倍数拍摄，得到初始光学图像，例如数字化玻片进行水平翻转、垂直翻转和图像亮度调整后进行拍摄，得到多个初始光学图像，再将初始光学图像的短边统一调整为800像素，得到待检测对象的目标光学图像，以减轻计算负荷。In this embodiment, acquiring the target optical image of the object to be tested includes: acquiring a stained digital slide of the object to be tested, and acquiring initial optical images of the digital slide under different shooting conditions; adjusting the pixel size of each initial optical image to obtain the target optical image. The object to be tested is peripheral blood from the patient's testing site, which may contain circulating tumor cells. The object to be tested is stained with H&E (Hematoxylin and eosinstains) or methylene blue to obtain a stained digital slide. The digital slide is then photographed under an optical microscope at a fixed magnification under different shooting conditions to obtain initial optical images. For example, the digital slide is horizontally flipped, vertically flipped, and the image brightness is adjusted before photographing to obtain multiple initial optical images. The short side of each initial optical image is then uniformly adjusted to 800 pixels to obtain the target optical image of the object to be tested, thereby reducing the computational load.

本实施例中，所述利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图，包括：利用基于特征金字塔网络的骨干网络提取所述目标光学图像的卷积特征，以得到多尺度特征图；将所述多尺度特征图输入至区域提案网络，以得到感兴趣区域特征图。将目标光学图像输入至基于特征金字塔网络(Feature Pyramid Network，即FPN)的骨干网络，骨干网络提取输入图像的卷积特征得到多尺度特征图，然后将多尺度特征图输入到区域提案网络(Region Proposal Network，即RPN)，以得到感兴趣区域(Region of Interest，ROI)特征图，即ROIs，需要注意的是，感兴趣区域特征图中包含感兴趣区域和对应的感兴趣区域对齐。In this embodiment, obtaining the region of interest (ROI) feature map using the multi-scale feature map of the target optical image includes: extracting the convolutional features of the target optical image using a backbone network based on a feature pyramid network to obtain a multi-scale feature map; and inputting the multi-scale feature map into a region proposal network to obtain a region of interest (ROI) feature map. The target optical image is input into a backbone network based on a feature pyramid network (FPN), which extracts the convolutional features of the input image to obtain a multi-scale feature map. This multi-scale feature map is then input into a region proposal network (RPN) to obtain the region of interest (ROI) feature map, i.e., ROIs. It is important to note that the ROI feature map includes the ROI and its corresponding alignment.

步骤S12：将所述感兴趣区域特征图输入至目标多级挖掘检测模型；其中，所述目标多级挖掘检测模型包含多个检测头。Step S12: Input the feature map of the region of interest into the target multi-level mining detection model; wherein the target multi-level mining detection model contains multiple detection heads.

本实施例中，所述将所述感兴趣区域特征图输入至目标多级挖掘检测模型之前，还包括：利用训练数据集对初始多级挖掘检测模型中的各个检测头进行训练，以得到当前多级挖掘检测模型的各个检测头；基于最小损失函数对所述当前多级挖掘检测模型的各个检测头进行反向优化，以得到目标多级挖掘检测模型。In this embodiment, before inputting the region of interest feature map into the target multi-level mining detection model, the method further includes: training each detection head in the initial multi-level mining detection model using a training dataset to obtain each detection head of the current multi-level mining detection model; and performing reverse optimization on each detection head of the current multi-level mining detection model based on the minimum loss function to obtain the target multi-level mining detection model.

构建包含多个检测头的初始多级挖掘检测模型，其中，初始检测头(即第一级别检测头)中包含自注意力机制模块，剩余检测头中不包含自注意力机制模块，所有检测头中还包含采样器分类器h^t和一个定位参数回归器f_t，利用训练数据集对初始多级挖掘检测模型进行迭代训练的过程中，基于最小化损失函数来优化每个检测头的参数，将各个检测头的所有损失汇总在一起来训练模型，具体公式如下所示：
L(x^t,g)＝L_cls(h^t(x^t),y^t)+λ[y^t≥1]L_loc(f_t(x^t,b^t),g)；An initial multi-level mining detection model with multiple detection heads is constructed. The initial detection head (i.e., the first-level detection head) includes a self-attention mechanism module, while the remaining detection heads do not. All detection heads also include a sampler classifier h<sub>^t</sub> and a localization parameter regressor f<sub>_t</sub> . During the iterative training of the initial multi-level mining detection model using the training dataset, the parameters of each detection head are optimized by minimizing the loss function. The losses of all detection heads are then aggregated to train the model. The specific formula is shown below:
L(x ^t ,g)=L _cls (h ^t (x ^t ),y ^t )+λ[y ^t ≥1]L _loc (f _t (x ^t ,b ^t ),g);

式中，g表示赋给x^t的真值，λ＝1为权衡系数，[·]为指标函数，b^t为上一检测头的参数回归器f_t-1(x^t-1,b^t-1)的预测框坐标或RPN网络生成的建议框坐标，可以根据t确定采用的坐标种类。In the formula, g represents the true value assigned to x ^t , λ＝1 is the weighting coefficient, [·] is the index function, and b ^t is the predicted box coordinates of the parametric regressor f _t-1 (x ^t-1 , b ^t-1 ) of the previous detector or the suggestion box coordinates generated by the RPN network. The type of coordinates used can be determined based on t.

步骤S13：通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，并将所述目标正样本确定为所述待检测对象的循环肿瘤细胞检测结果。Step S13: The self-attention mechanism module in the target multi-level mining detection model divides each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples. The detection heads are used to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and the target positive samples are determined as the circulating tumor cell detection results of the object to be detected.

本实施例中，所述通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，包括：通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块生成所述感兴趣区域特征图的各个自注意力感兴趣区域，根据各个所述自注意力感兴趣区域的交并比将各个所述自注意力感兴趣区域划分为初始正样本和初始负样本。初始检测头的自注意力机制模块生成感兴趣区域特征图的各个自注意力感兴趣区域，并计算各个自注意力感兴趣区域的交并比(Intersection over Union，即IoU)，根据交并比，将各个自注意力感兴趣区域划分为初始正样本和初始负样本，具体划分如下所示：
In this embodiment, the step of dividing the self-attention regions of interest (ROIs) of the region of interest feature map into initial positive samples and initial negative samples through the self-attention mechanism module in the multi-level target mining detection model includes: generating each ROI of the region of interest feature map through the self-attention mechanism module of the initial detection head in the multi-level target mining detection model, and dividing each ROI into initial positive samples and initial negative samples according to the intersection-over-union (IoU) ratio of each ROI. The self-attention mechanism module of the initial detection head generates each ROI of the region of interest feature map and calculates the intersection-over-union (IoU) ratio of each ROI. Based on the IoU ratio, each ROI is divided into initial positive samples and initial negative samples, as detailed below:

式中，y为样本标签，g_y是真值g的类别标签，x是一个提案(即自注意力感兴趣区域)，u为IoU阈值，y^t为正负样本划分公式所给出的标号x^t。In the formula, y is the sample label, g _y is the class label of the ground value g, x is a proposal (i.e., the region of interest for self-attention), u is the IoU threshold, and y ^t is the label x ^t given by the formula for separating positive and negative samples.

交并比小于阈值u则为初始负样本，交并比不小于阈值u则为初始正样本，也就是说，将不小于IoU阈值的预测框分为正样本，对应的样本标签就是正样本，小于IoU阈值的为负样本，其对应的样本标签为负样本，可以理解的是，初始正样本中含有难分正样本(即错误标签正样本)、易分正样本(即正确标签正样本)，初始负样本中含有有难分负样本(即错误标签负样本)、易分负样本(即正确标签负样本)。If the Intersection over Union (IoU) ratio is less than the threshold u, it is considered an initial negative sample; if the IoU ratio is not less than the threshold u, it is considered an initial positive sample. In other words, predicted boxes with an IoU ratio not less than the threshold are classified as positive samples, and their corresponding labels are positive samples. Boxes with an IoU ratio less than the threshold are classified as negative samples, and their corresponding labels are negative samples. It can be understood that the initial positive samples contain difficult-to-classify positive samples (i.e., incorrectly labeled positive samples) and easy-to-classify positive samples (i.e., correctly labeled positive samples), and the initial negative samples contain difficult-to-classify negative samples (i.e., incorrectly labeled negative samples) and easy-to-classify negative samples (i.e., correctly labeled negative samples).

本实施例中，所述通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块生成所述感兴趣区域特征图的各个自注意力感兴趣区域，包括：通过所述目标多级挖掘检测模型中初始检测头的自注意力机制模块将所述感兴趣区域特征图进行矩阵转换，以得到目标矩阵，并将所述目标矩阵转化为查询向量、键向量和值向量，利用SoftMax函数将所述查询向量与所述键向量的乘积转换为自注意力得分，获取所述自注意力得分与所述值向量之间的矩阵乘积，对所述矩阵乘积进行卷积，以得到维度恢复后矩阵，将所述维度恢复后矩阵和所述感兴趣区域特征图进行逐点相加，以得到所述感兴趣区域特征图的各个自注意力感兴趣区域。例如图2所示的一种具体的自注意力感兴趣区域获取示意图，自注意力感兴趣区域获取具体过程如下：In this embodiment, the step of generating each self-attention region of interest (ROI) of the ROI feature map through the self-attention mechanism module of the initial detection head in the target multi-level mining detection model includes: performing matrix transformation on the ROI feature map through the self-attention mechanism module of the initial detection head in the target multi-level mining detection model to obtain a target matrix, and converting the target matrix into a query vector, a key vector, and a value vector; using the SoftMax function to convert the product of the query vector and the key vector into a self-attention score; obtaining the matrix product between the self-attention score and the value vector; performing convolution on the matrix product to obtain a dimension-restored matrix; and adding the dimension-restored matrix and the ROI feature map point by point to obtain each self-attention ROI of the ROI feature map. For example, Figure 2 shows a specific schematic diagram of self-attention ROI acquisition. The specific process of self-attention ROI acquisition is as follows:

1)自注意力机制模块将感兴趣区域特征图进行矩阵转换，得到目标矩阵，也就是将感兴趣区域特征图的个数和大小被表示为F_N和BN×C×h×w，其中，B是一次输入神经网络的图像数量，C是通道的数量，h特征的高度，w特征的宽度；1) The self-attention mechanism module performs matrix transformation on the feature map of the region of interest to obtain the target matrix. That is, the number and size of the feature map of the region of interest are represented as F _N and BN×C×h×w, where B is the number of images input to the neural network at one time, C is the number of channels, h is the height of the feature, and w is the width of the feature.

2)将目标矩阵转化为查询向量(query)、键向量(key)和值向量(value)，具体公式如下所示：
q＝Reshape(f_{Q：conv 1×1}(F_N))；
k＝Reshape(f_{K：conv 1×1}(F_N))；
v＝Reshape(f_{V：conv 1×1}(F_N));2) Transform the target matrix into a query vector, a key vector, and a value vector, as shown in the following formula:
q＝Reshape(f _{Q：conv 1×1} (F _N ));
k=Reshape(f _{K：conv 1×1} (F _N ));
v＝Reshape(f _{V：conv 1×1} (F _N ));

式中，q、k、v分别表示查询向量、键向量和值向量，f(·)表示卷积运算，Reshape(·)函数的操作是将特征矩阵展平为向量；In the formula, q, k, and v represent the query vector, key vector, and value vector, respectively; f(·) represents the convolution operation; and the Reshape(·) function flattens the feature matrix into a vector.

3)利用SoftMax函数(归一化函数)将查询向量与键向量的乘积转换为自注意力得分，具体公式如下所示：
score＝δ(q*k^T)；3) Use the SoftMax function (normalization function) to convert the product of the query vector and the key vector into a self-attention score. The specific formula is shown below:
score = δ(q* ^kT );

式中，δ(·)表示SoftMax函数，该函数为将得分映射到[0，1]范围内的概率分布；In the formula, δ(·) represents the SoftMax function, which is a probability distribution that maps the scores to the range [0, 1].

4)获取自注意力得分与值向量之间的矩阵乘积，对矩阵乘积进行卷积，以得到维度恢复后矩阵，将维度恢复后矩阵和感兴趣区域特征图进行逐点相加，以得到感兴趣区域特征图的各个自注意力感兴趣区域，具体公式如下所示：
F′_N＝f_{Re:conv 1×1}(Reshape^-1(score*v))+F_N；4) Obtain the matrix product between the self-attention score and the value vector. Perform convolution on the matrix product to obtain the dimension-restored matrix. Add the dimension-restored matrix and the region of interest feature map point by point to obtain the self-attention regions of interest in the region of interest feature map. The specific formula is as follows:
F′ _N ＝f _{Re:conv 1×1} (Reshape ^-1 (score*v))+F _N ;

式中，f_{Re：conv 1×1}是用于恢复特征空间原始通道维度的卷积运算，F′_N为自注意力感兴趣区域(Attention-based ROIs，即AttROIs)。In the formula, _{fRe:conv 1×1} is the convolution operation used to recover the original channel dimension of the feature space, and _F′N is the attention-based region of interest (i.e., AttROIs).

本实施例中，所述利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，包括：基于预设正负样本比例，并利用采样器从所述初始正样本和所述初始负样本中筛选出当前初始样本，剔除所述当前初始样本中的难分负样本，以得到当前目标样本，然后将所述当前目标样本输入至当前检测头，以获取所述当前检测头从当前目标样本中筛选出的下一正样本；若所述当前检测头为最后一个检测头，则将所述下一正样本确定为含有循环肿瘤细胞的目标正样本；若所述当前检测头不为最后一个检测头，则基于所述预设正负样本比例，并利用所述采样器从所述下一正样本和所述初始负样本中筛选出下一初始样本，并将所述下一初始样本更新为当前初始样本，然后重新跳转至所述剔除所述当前初始样本中的难分负样本的步骤。In this embodiment, the step of using each of the detection heads to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples includes: based on a preset positive-negative sample ratio, using a sampler to screen out the current initial sample from the initial positive samples and the initial negative samples, removing difficult-to-distinguish negative samples from the current initial sample to obtain the current target sample, and then inputting the current target sample into the current detection head to obtain the next positive sample screened out by the current detection head from the current target sample; if the current detection head is the last detection head, then the next positive sample is determined as the target positive sample containing circulating tumor cells; if the current detection head is not the last detection head, based on the preset positive-negative sample ratio, using the sampler to screen out the next initial sample from the next positive sample and the initial negative samples, updating the next initial sample to the current initial sample, and then jumping back to the step of removing difficult-to-distinguish negative samples from the current initial sample.

例如图3所示的一种具体的目标多级挖掘检测模型处理示意图，初始正样本的数量远小于初始负样本的数量，使用平衡正负采样器从所有提案中以指定的数量比例随机生成正负样本，即基于预设正负样本比例，并利用采样器从初始正样本和初始负样本中筛选出当前初始样本，其中，初始正样本和初始负样本中的难分正样本和难分负样本会被优先选择为当前初始样本，剔除当前初始样本中的难分负样本，以得到当前目标样本，这意味着后续阶段的检测头是对先前阶段预测的分类结果进行修正的，对难分样本进行多次的识别和修正以优化检测性能；进一步的，将当前目标样本输入至当前检测头，当前检测头从当前目标样本中筛选出的下一正样本，若当前检测头不为最后一个检测头，则再基于预设正负样本比例，并利用采样器从下一正样本和初始负样本中筛选出下一初始样本，并将下一初始样本更新为当前初始样本，然后重新跳转至所述剔除所述当前初始样本中的难分负样本的步骤，也就是说，前一阶段检测头的难分样本预测框优先被用作下一阶段检测头的提案输入，并通过正负样本采样器按比例提取正负样本，多阶段检测头对应不同的样本分布，并逐步改善提案的质量，公式具体为如下所示：
f(x,b)＝f_T*f_T-1*…*f_f(x,b)；For example, Figure 3 shows a schematic diagram of a specific multi-level target mining detection model. The number of initial positive samples is much smaller than the number of initial negative samples. A balanced positive-negative sampler is used to randomly generate positive and negative samples from all proposals at a specified ratio. Based on a preset positive-negative sample ratio, the sampler filters the current initial samples from the initial positive and negative samples. Difficult-to-distinguish positive and negative samples from the initial positive and negative samples are preferentially selected as the current initial samples. Difficult-to-distinguish negative samples are removed from the current initial samples to obtain the current target sample. This means that the detection head in subsequent stages corrects the classification results predicted in previous stages, performing multiple identifications and corrections on difficult-to-distinguish samples to optimize detection performance. In one step, the current target sample is input into the current detection head. The current detection head selects the next positive sample from the current target sample. If the current detection head is not the last detection head, then based on the preset positive and negative sample ratio, the next initial sample is selected from the next positive sample and the initial negative sample using a sampler. The next initial sample is then updated to the current initial sample. Then, the process jumps back to the step of removing the difficult negative samples from the current initial sample. In other words, the difficult sample prediction boxes of the previous stage detection head are preferentially used as the proposal input for the next stage detection head, and positive and negative samples are extracted proportionally by the positive and negative sample sampler. The multi-stage detection heads correspond to different sample distributions and gradually improve the quality of the proposals. The specific formula is as follows:
f(x,b)＝f _T *f _T-1 *…*f _f (x,b);

式中，T为梯级总数，T-1为梯级总数减1，x是一个提案，b为提案坐标。In the formula, T is the total number of ladder levels, T-1 is the total number of ladder levels minus 1, x is a proposal, and b is the coordinate of the proposal.

本实施例中，目标多级挖掘检测模型包含用于增强ROI的自注意力模块和难分样本挖掘采样器两个模块。在训练的第一阶段，通过自注意力机制模块来提高目标检测的性能和精确度，在非初始检测头中使用难分样本采样器对难分样本进行多次的识别和修正以优化检测性能。In this embodiment, the multi-level target mining detection model includes two modules: a self-attention module for enhancing the ROI and a hard sample mining sampler. In the first stage of training, the self-attention mechanism module is used to improve the performance and accuracy of target detection, while the hard sample sampler is used in non-initial detection heads to identify and correct hard samples multiple times to optimize detection performance.

例如图4所示的一种具体的多级挖掘检测模型评估示意图，为了定量评估检测性能，对多级挖掘检测模型准确度进行评估，包括平均精度、自由接受者操作特性曲线(Receiver Operating Characteristic Curve，即ROC)分析和消融实验，具体如下：For example, Figure 4 shows a specific evaluation diagram of a multi-level mining detection model. In order to quantitatively evaluate the detection performance, the accuracy of the multi-level mining detection model is evaluated, including average accuracy, receiver operating characteristic curve (ROC) analysis, and ablation experiments, as detailed below:

1)利用平均精度评估多级挖掘检测模型的性能：在考虑到细胞检测任务需要在降低假阳性并平衡细胞的召回率时，评估模型性能时，可以保留预测概率大于阈值τ＝0.6的边界框作为最终预测结果。平均精度(AP)指标可以同时反映每个类别的召回率和检测精度，其定义为：
1) Evaluating the performance of multi-level cell detection models using average precision: Considering the need to reduce false positives and balance cell recall in cell detection tasks, when evaluating model performance, bounding boxes with a predicted probability greater than the threshold τ = 0.6 can be retained as the final prediction result. Average precision (AP) can simultaneously reflect the recall and detection precision for each class, and is defined as follows:

式中，k表示已检测到的CTC细胞或CTC样细胞的数量，P(k)表示列表中截止到第k个元素的精确度(即Precision)，Δr(k)表示从第k-1到第k项的召回率(即Recall)变化，TP、FP、FN分别表示真阳性、假阳性和假阴性的预测框数量，对于多类别检测任务，还使用了mAP值作为指标，它是所有类别的平均AP值。In the formula, k represents the number of CTC cells or CTC-like cells detected, P(k) represents the precision up to the kth element in the list, Δr(k) represents the recall from the (k-1)th to the kth item, TP, FP, and FN represent the number of predicted boxes for true positives, false positives, and false negatives, respectively. For multi-class detection tasks, the mAP value is also used as an indicator, which is the average AP value for all classes.

2)利用自由接受者操作特性曲线分析评估多级挖掘检测模型的性能：在ROC曲线中，纵轴绘制了敏感性，横轴绘制了1-特异性，而FROC(Free-response ROC)曲线将1-特异性替换为每幅图像的假阳性数，这可以反映模型在允许的假阳性数量下进行检测的能力。2) Utilize the free-response operating characteristic curve (FROC) to evaluate the performance of the multi-level mining detection model: In the ROC curve, the vertical axis plots sensitivity and the horizontal axis plots 1-specificity. The FROC curve replaces 1-specificity with the number of false positives per image, which can reflect the model's ability to detect within the allowed number of false positives.

3)利用消融实验评估多级挖掘检测模型的性能：基于Swin-Transformer骨干网络，消融实验用来评估多级挖掘检测模型中超参数的影响。首先将Cascade R-CNN网络设定为基准方法，分别向基准模型添加了所提出的自注意力模块和难分样本挖掘采样器，并同时评估结合这两个模块的模型；进一步地，使用超参数设置进行消融实验，即设置多段IoU阈值。在原始的Cascade R-CNN，通过逐步提高训练样本各阶段的IoU阈值来选择质量更高的阳性样本，例如以下下三种阈值{0.5，0.5，0.5}、{0.5，0.6，0.7}和{0.6，0.7，0.8}。3) Evaluation of the performance of the multi-level mining detection model using ablation experiments: Based on the Swin-Transformer backbone network, ablation experiments were used to evaluate the impact of hyperparameters in the multi-level mining detection model. First, the Cascade R-CNN network was set as the baseline method. The proposed self-attention module and the hard sample mining sampler were added to the baseline model, and the model combining these two modules was evaluated simultaneously. Further, ablation experiments were conducted using hyperparameter settings, i.e., setting multiple IoU thresholds. In the original Cascade R-CNN, higher-quality positive samples were selected by progressively increasing the IoU thresholds at each stage of the training samples, for example, using the following three thresholds: {0.5, 0.5, 0.5}, {0.5, 0.6, 0.7}, and {0.6, 0.7, 0.8}.

通过平均精度、自由接受者操作特性曲线分析和消融实验评估目标多级挖掘检测模型，可以验证目标多级挖掘检测模型的准确性和可靠性。The accuracy and reliability of the multi-level target mining detection model can be verified by evaluating the average accuracy, free acceptor operation characteristic curve analysis, and ablation experiments.

参见图5所示，本申请实施例公开了一种循环肿瘤细胞检测装置，包括：Referring to Figure 5, this application discloses a circulating tumor cell detection device, comprising:

特征图获取模块11，用于采集待检测对象的目标光学图像，利用所述目标光学图像的多尺度特征图获取感兴趣区域特征图；Feature map acquisition module 11 is used to acquire the target optical image of the object to be detected, and to acquire the region of interest feature map using the multi-scale feature map of the target optical image;

特征图输入模块12，用于将所述感兴趣区域特征图输入至目标多级挖掘检测模型；其中，所述目标多级挖掘检测模型包含多个检测头；The feature map input module 12 is used to input the feature map of the region of interest into the target multi-level mining detection model; wherein, the target multi-level mining detection model includes multiple detection heads;

检测结果确定模块13，用于通过所述目标多级挖掘检测模型中自注意力机制模块将所述感兴趣区域特征图的各个自注意力感兴趣区域划分为初始正样本和初始负样本，利用各个所述检测头从所述初始正样本和所述初始负样本中筛选出含有循环肿瘤细胞的目标正样本，并将所述目标正样本确定为所述待检测对象的循环肿瘤细胞检测结果。The detection result determination module 13 is used to divide each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples through the self-attention mechanism module in the multi-level target mining detection model, and to use each of the detection heads to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and to determine the target positive samples as the circulating tumor cell detection result of the object to be detected.

进一步的，本申请实施例还提供了一种电子设备。图6是根据一示例性实施例示出的电子设备20结构图，图中的内容不能认为是对本申请的使用范围的任何限制。Furthermore, embodiments of this application also provide an electronic device. Figure 6 is a structural diagram of an electronic device 20 according to an exemplary embodiment; the content in the figure should not be construed as limiting the scope of this application.

图6为本申请实施例提供的一种电子设备的结构示意图。具体可以包括：至少一个处理器21、至少一个存储器22、电源23、通信接口24、输入输出接口25和通信总线26。其中，所述存储器22用于存储计算机程序，所述计算机程序由所述处理器21加载并执行，以实现前述任一实施例公开的由电子设备执行的循环肿瘤细胞检测方法中的相关步骤。Figure 6 is a schematic diagram of the structure of an electronic device provided in an embodiment of this application. Specifically, it may include: at least one processor 21, at least one memory 22, a power supply 23, a communication interface 24, an input/output interface 25, and a communication bus 26. The memory 22 stores a computer program, which is loaded and executed by the processor 21 to implement the relevant steps in the circulating tumor cell detection method performed by the electronic device disclosed in any of the foregoing embodiments.

本实施例中，电源23用于为电子设备上的各硬件设备提供工作电压；通信接口24能够为电子设备创建与外界设备之间的数据传输通道，其所遵循的通信协议是能够适用于本申请技术方案的任意通信协议，在此不对其进行具体限定；输入输出接口25，用于获取外界输入数据或向外界输出数据，其具体的接口类型可以根据具体应用需要进行选取，在此不进行具体限定。In this embodiment, the power supply 23 is used to provide operating voltage for various hardware devices on the electronic device; the communication interface 24 can create a data transmission channel between the electronic device and external devices, and the communication protocol it follows can be any communication protocol applicable to the technical solution of this application, and is not specifically limited here; the input/output interface 25 is used to acquire external input data or output data to the outside world, and its specific interface type can be selected according to specific application needs, and is not specifically limited here.

其中，处理器21可以包括一个或多个处理核心，比如4核心处理器、8核心处理器等。处理器21可以采用DSP(Digital Signal Processing，数字信号处理)、FPGA(Field－Programmable Gate Array，现场可编程门阵列)、PLA(Programmable Logic Array，可编程逻辑阵列)中的至少一种硬件形式来实现。处理器21也可以包括主处理器和协处理器，主处理器是用于对在唤醒状态下的数据进行处理的处理器，也称CPU(Central Processing Unit，中央处理器)；协处理器是用于对在待机状态下的数据进行处理的低功耗处理器。在一些实施例中，处理器21可以在集成有GPU(Graphics Processing Unit，图像处理器)，GPU用于负责显示屏所需要显示的内容的渲染和绘制。一些实施例中，处理器21还可以包括AI(Artificial Intelligence，人工智能)处理器，该AI处理器用于处理有关机器学习的计算操作。The processor 21 may include one or more processing cores, such as a quad-core processor or an octa-core processor. The processor 21 may be implemented using at least one hardware form selected from DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array), and PLA (Programmable Logic Array). The processor 21 may also include a main processor and a coprocessor. The main processor, also known as a CPU (Central Processing Unit), is used to process data in the wake-up state; the coprocessor is a low-power processor used to process data in the standby state. In some embodiments, the processor 21 may integrate a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content to be displayed on the screen. In some embodiments, the processor 21 may also include an AI (Artificial Intelligence) processor, which is used to handle computational operations related to machine learning.

另外，存储器22作为资源存储的载体，可以是只读存储器、随机存储器、磁盘或者光盘等，其上所存储的资源包括操作系统221、计算机程序222及数据223等，存储方式可以是短暂存储或者永久存储。In addition, the memory 22, as a carrier for resource storage, can be a read-only memory, random access memory, disk or optical disk, etc. The resources stored on it include operating system 221, computer program 222 and data 223, etc., and the storage method can be temporary storage or permanent storage.

其中，操作系统221用于管理与控制电子设备上的各硬件设备以及计算机程序222，以实现处理器21对存储器22中海量数据223的运算与处理，其可以是Windows、Unix、Linux等。计算机程序222除了包括能够用于完成前述任一实施例公开的由电子设备执行的循环肿瘤细胞检测方法的计算机程序之外，还可以进一步包括能够用于完成其他特定工作的计算机程序。数据223除了可以包括电子设备接收到的由外部设备传输进来的数据，也可以包括由自身输入输出接口25采集到的数据等。The operating system 221 manages and controls the various hardware devices and computer programs 222 on the electronic device to enable the processor 21 to perform calculations and processing on the massive amounts of data 223 in the memory 22. The operating system can be Windows, Unix, Linux, etc. The computer program 222, in addition to including a computer program capable of performing the circulating tumor cell detection method executed by the electronic device as disclosed in any of the foregoing embodiments, may further include computer programs capable of performing other specific tasks. The data 223 may include data received by the electronic device from external devices, as well as data collected by its own input/output interface 25.

进一步的，本申请还公开了一种计算机可读存储介质，用于存储计算机程序；其中，所述计算机程序被处理器执行时实现前述公开的循环肿瘤细胞检测方法。关于该方法的具体步骤可以参考前述实施例中公开的相应内容，在此不再进行赘述。Furthermore, this application also discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, it implements the aforementioned method for detecting circulating tumor cells. Specific steps of this method can be found in the corresponding content disclosed in the foregoing embodiments, and will not be repeated here.

本说明书中各个实施例采用递进的方式描述，每个实施例重点说明的都是与其它实施例的不同之处，各个实施例之间相同或相似部分互相参见即可。对于实施例公开的装置而言，由于其与实施例公开的方法相对应，所以描述的比较简单，相关之处参见方法部分说明即可。The various embodiments in this specification are described in a progressive manner, with each embodiment focusing on its differences from other embodiments. Similar or identical parts between embodiments can be referred to interchangeably. For the apparatus disclosed in the embodiments, since it corresponds to the method disclosed in the embodiments, the description is relatively simple; relevant parts can be referred to in the method section.

专业人员还可以进一步意识到，结合本文中所公开的实施例描述的各示例的单元及算法步骤，能够以电子硬件、计算机软件或者二者的结合来实现，为了清楚地说明硬件和软件的可互换性，在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行，取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应认为超出本申请的范围。结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块，或者二者的结合来实施。软件模块可以置于随机存储器(Random Access Memory，即RAM)、内存、只读存储器(Read-Only Memory，即ROM)、电可编程EPROM(Erasable Programmable Read Only Memory)、电可擦除可编程EEPROM(Electrically Erasable Programmable read only memory)、寄存器、硬盘、可移动磁盘、CD-ROM(Compact Disc Read-Only Memory，即紧凑型光盘只读储存器)、或技术领域内所公知的任意其它形式的存储介质中。Those skilled in the art will further recognize that the units and algorithm steps of the various examples described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of both. To clearly illustrate the interchangeability of hardware and software, the components and steps of the various examples have been generally described in terms of functionality in the foregoing description. Whether these functions are implemented in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of this application. The steps of the methods or algorithms described in conjunction with the embodiments disclosed herein can be implemented directly in hardware, software modules executed by a processor, or a combination of both. The software module may be located in random access memory (RAM), memory, read-only memory (ROM), electrically programmable EPROM (EPROM), electrically erasable programmable read-only memory (EEPROM), register, hard disk, removable disk, CD-ROM (Compact Disc Read-Only Memory), or any other form of storage medium known in the art.

最后，还需要说明的是，在本文中，诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来，而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should be noted that in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Furthermore, the terms "comprising," "including," or any other variations thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising one..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that includes said element.

以上对本发明所提供的一种循环肿瘤细胞检测方法、装置、设备及介质进行了详细介绍，本文中应用了具体个例对本发明的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本发明的方法及其核心思想；同时，对于本领域的一般技术人员，依据本发明的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本发明的限制。The foregoing has provided a detailed description of the circulating tumor cell detection method, apparatus, equipment, and medium provided by the present invention. Specific examples have been used to illustrate the principles and implementation methods of the present invention. The descriptions of the above embodiments are only for the purpose of helping to understand the method and core ideas of the present invention. At the same time, for those skilled in the art, there will be changes in the specific implementation methods and application scope based on the ideas of the present invention. Therefore, the content of this specification should not be construed as a limitation of the present invention.

Claims

A method for detecting circulating tumor cells, characterized in that it includes:

Acquire a target optical image of the object to be detected, and use the multi-scale feature map of the target optical image to obtain the feature map of the region of interest.

The feature map of the region of interest is input into the multi-level target mining and detection model; wherein, the multi-level target mining and detection model contains multiple detection heads;

The self-attention mechanism module in the target multi-level mining detection model divides each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples. The detection heads are used to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and the target positive samples are determined as the circulating tumor cell detection results of the target object.

The method for detecting circulating tumor cells according to claim 1, characterized in that acquiring the target optical image of the object to be detected includes:

A stained digital slide of the object to be tested is acquired, and initial optical images of the digital slide under different shooting conditions are acquired.

Adjust the pixel size of each of the initial optical images to obtain the target optical image.

The method for detecting circulating tumor cells according to claim 1, characterized in that, the step of obtaining the region of interest feature map using the multi-scale feature map of the target optical image includes:

Convolutional features of the target optical image are extracted using a backbone network based on a feature pyramid network to obtain a multi-scale feature map;

The multi-scale feature map is input into the region proposal network to obtain the region of interest feature map.

The method for detecting circulating tumor cells according to claim 1, characterized in that, before inputting the feature map of the region of interest into the target multi-level mining detection model, it further includes:

The detection heads in the initial multi-level mining detection model are trained using the training dataset to obtain the detection heads in the current multi-level mining detection model.

The target multi-level mining detection model is obtained by back-optimizing each detection head of the current multi-level mining detection model based on the minimum loss function.

The method for detecting circulating tumor cells according to claim 1, characterized in that, the step of dividing each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples by the self-attention mechanism module in the multi-level target mining detection model includes:

The self-attention mechanism module of the initial detection head in the target multi-level mining detection model generates each self-attention region of interest in the feature map of the region of interest, and divides each self-attention region of interest into initial positive samples and initial negative samples according to the intersection-union ratio of each self-attention region of interest.

The circulating tumor cell detection method according to claim 5, characterized in that, the step of generating each self-attention region of interest of the region of interest feature map through the self-attention mechanism module of the initial detection head in the target multi-level mining detection model includes:

The self-attention mechanism module of the initial detection head in the target multi-level mining detection model performs matrix transformation on the feature map of the region of interest to obtain the target matrix. The target matrix is then transformed into a query vector, a key vector, and a value vector. The SoftMax function is used to convert the product of the query vector and the key vector into a self-attention score. The matrix product between the self-attention score and the value vector is obtained. The matrix product is then convolved to obtain the dimension-restored matrix. The dimension-restored matrix and the feature map of the region of interest are then added point by point to obtain each self-attention region of interest in the feature map of the region of interest.

The method for detecting circulating tumor cells according to claim 6, characterized in that, the step of using each of the detection heads to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples includes:

Based on a preset positive and negative sample ratio, and using a sampler to select the current initial sample from the initial positive sample and the initial negative sample, the difficult negative samples in the current initial sample are removed to obtain the current target sample. Then, the current target sample is input to the current detection head to obtain the next positive sample selected by the current detection head from the current target sample.

If the current detection head is the last detection head, then the next positive sample is determined as the target positive sample containing circulating tumor cells;

If the current detection head is not the last detection head, then based on the preset positive and negative sample ratio, the sampler is used to select the next initial sample from the next positive sample and the initial negative sample, and the next initial sample is updated to the current initial sample. Then, the process jumps back to the step of removing the difficult-to-distinguish negative samples from the current initial sample.

A circulating tumor cell detection device, characterized in that it comprises:

The feature map acquisition module is used to acquire the target optical image of the object to be detected, and to acquire the region of interest feature map using the multi-scale feature map of the target optical image.

The feature map input module is used to input the feature map of the region of interest into the target multi-level mining detection model; wherein, the target multi-level mining detection model includes multiple detection heads;

The detection result determination module is used to divide each self-attention region of interest in the feature map of the region of interest into initial positive samples and initial negative samples through the self-attention mechanism module in the multi-level target mining detection model, and to use each of the detection heads to screen out target positive samples containing circulating tumor cells from the initial positive samples and the initial negative samples, and to determine the target positive samples as the circulating tumor cell detection result of the object to be detected.

An electronic device, characterized in that it comprises:

Memory, used to store computer programs;

A processor for executing the computer program to implement the steps of the circulating tumor cell detection method as described in any one of claims 1 to 7.

A computer-readable storage medium is characterized in that it is used to store a computer program; wherein, when the computer program is executed by a processor, it implements the steps of the circulating tumor cell detection method as described in any one of claims 1 to 7.