CN116403643A - Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system - Google Patents
Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system Download PDFInfo
- Publication number
- CN116403643A CN116403643A CN202310169105.9A CN202310169105A CN116403643A CN 116403643 A CN116403643 A CN 116403643A CN 202310169105 A CN202310169105 A CN 202310169105A CN 116403643 A CN116403643 A CN 116403643A
- Authority
- CN
- China
- Prior art keywords
- liver cancer
- copper
- genes
- death
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/30—Detection of binding sites or motifs
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Medical Informatics (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Public Health (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Theoretical Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Bioinformatics & Computational Biology (AREA)
- Genetics & Genomics (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Pathology (AREA)
- Chemical & Material Sciences (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
技术领域technical field
本发明涉及生物医学技术领域,尤其涉及肝癌预后标志物筛选系统及方法、肝癌预后风险评估系统。The invention relates to the field of biomedical technology, in particular to a liver cancer prognosis marker screening system and method, and a liver cancer prognosis risk assessment system.
背景技术Background technique
肝癌是全球癌症相关死亡率最高的肿瘤之一,肝脏是第六大常见的原发性癌症。目前诊断和治疗方法日益增多,但超过50%的HCC患者确诊时已处于晚期,并且70%的患者在治疗后5年内出现复发。由于肿瘤化疗耐药和复发,患者的5年生存率仍然较低。早期HCC通常是可切除的,但晚期HCC除了消融、经动脉化疗栓塞或外照射的局部治疗外,往往还需要索拉非尼进行系统治疗。早期筛查并及早诊断治疗是肝癌患者避免错过手术时机和后期化疗耐药、肿瘤复发的第一道防线。早期诊断同时可以识别并区分低危和高危患者人群,预测患者预后情况,实现精准个性化治疗。因此,及早诊断、治疗前评估患者的预后风险,准确分层是选择适合的治疗方法、提高临床治疗效益、改善患者预后的关键。但是目前并无有效及准确的肝癌预后风险评估工具。Liver cancer is one of the tumors with the highest cancer-related mortality worldwide, and the liver is the sixth most common primary cancer. Currently, diagnosis and treatment methods are increasing, but more than 50% of HCC patients are diagnosed at an advanced stage, and 70% of patients relapse within 5 years after treatment. Due to tumor chemotherapy resistance and recurrence, the 5-year survival rate of patients is still low. Early HCC is usually resectable, but advanced HCC often requires systemic treatment with sorafenib in addition to local treatment with ablation, transarterial chemoembolization, or external beam radiation. Early screening and early diagnosis and treatment are the first line of defense for patients with liver cancer to avoid missing the opportunity of surgery, chemotherapy resistance and tumor recurrence in the later stage. Early diagnosis can also identify and distinguish low-risk and high-risk patient populations, predict patient prognosis, and achieve precise and personalized treatment. Therefore, early diagnosis, assessment of patients' prognostic risk before treatment, and accurate stratification are the keys to selecting appropriate treatment methods, improving clinical treatment benefits, and improving patient prognosis. However, there is currently no effective and accurate tool for assessing the prognosis and risk of liver cancer.
铜死亡是细胞程序性死亡(Programmed Cell Death,PCD)方式之一,是最近发现一种的铜依赖的细胞死亡方式,是指细胞中出现过高或过低铜离子浓度后,为了维持内环境稳定而发生的一种主动性死亡过程。临床肿瘤治疗药物常是通过诱导癌细胞发生PCD达到暂时抑制肿瘤生长的治疗效果。由于铜死亡的发现,肿瘤细胞中发生铜死亡抑制可能是肿瘤进展的原因之一。近期诸多证据表明,铜死亡与肿瘤患者的预后密切相关,铜死亡相关基因具有重要的预后标志物潜能。Copper death is one of the ways of Programmed Cell Death (PCD). It is a recently discovered copper-dependent way of cell death. An active death process that occurs steadily. Clinical tumor therapy drugs usually achieve the therapeutic effect of temporarily inhibiting tumor growth by inducing PCD in cancer cells. Due to the discovery of copper death, suppression of copper death in tumor cells may be one of the reasons for tumor progression. Recent evidence has shown that copper death is closely related to the prognosis of cancer patients, and copper death-related genes have the potential to be important prognostic markers.
发明内容Contents of the invention
为了克服上述技术缺陷,本发明的第一个方面提供一种肝癌预后标志物筛选系统,其包括:In order to overcome the above-mentioned technical defects, the first aspect of the present invention provides a screening system for liver cancer prognostic markers, which includes:
数据采集分析模块,所述数据采集分析模块用于在TCGA和GEO数据库中下载肝癌与癌旁配对组织基因表达信息以及样本的总生存期数据,还用于从若干肿瘤铜死亡基因中进行差异表达分析,从而分析出在TCGA肝癌配对样本中具有差异性表达的若干肝癌铜死亡基因;Data acquisition and analysis module, the data acquisition and analysis module is used to download the gene expression information of liver cancer and adjacent tumor paired tissues and the overall survival data of samples in TCGA and GEO databases, and is also used to perform differential expression from several tumor copper death genes Analysis, so as to analyze several liver cancer copper death genes that are differentially expressed in TCGA liver cancer paired samples;
筛选模块,所述筛选模块用于在TCGA中肝癌与癌旁组织样本中做基因差异性表达分析,根据第一筛选标准构建差异性表达基因集合;另外在TCGA肝癌组织样本中将所有基因与肝癌铜死亡基因做相关性分析,根据第二筛选标准构建肝癌铜死亡基因相关基因集合;然后将差异性表达基因集合和肝癌铜死亡基因相关基因集合中的基因取交集,结合对应样本的预后数据,依次利用单因素分析、LASSO回归法联合逐步回归的多因素分析筛选得到肝癌铜死亡预后相关基因,即肝癌预后标志物。A screening module, the screening module is used to analyze the differential expression of genes in liver cancer and paracancerous tissue samples in TCGA, and construct a set of differentially expressed genes according to the first screening criteria; in addition, in TCGA liver cancer tissue samples, all genes and liver cancer Correlation analysis was performed on copper death genes, and a gene set related to copper death genes in liver cancer was constructed according to the second screening criteria; then, the differentially expressed gene set and the genes in the gene set related to copper death genes in liver cancer were intersected, combined with the prognosis data of the corresponding samples, Using single factor analysis, LASSO regression combined with multivariate analysis of stepwise regression to screen out the genes related to the prognosis of copper death in liver cancer, that is, the prognostic markers of liver cancer.
进一步地,肝癌预后标志物筛选系统进一步包括验证模块,所述验证模块用于在真实世界肝癌样本队列中,利用PCR技术检测肝癌预后标志物的mRNA表达量,以表达量的中位数为界值进行风险等级分组并进行风险因素相关分析,从而检测各基因表达和患者生存状态预后的相关性。Further, the screening system for liver cancer prognostic markers further includes a verification module, which is used to detect the mRNA expression levels of liver cancer prognostic markers using PCR technology in the real-world liver cancer sample cohort, with the median of the expression levels as the boundary Values were grouped by risk level and risk factor correlation analysis was performed to detect the correlation between the expression of each gene and the prognosis of the patient's survival status.
进一步地,所述第一筛选标准为|log2差异倍数|>1且P<0.05,所述第二筛选标准为|相关系数|>0.4且P<0.05。Further, the first screening criterion is |log 2 multiple of difference|>1 and P<0.05, and the second screening criterion is |correlation coefficient|>0.4 and P<0.05.
进一步地,所述肿瘤铜死亡基因和所述肝癌铜死亡基因均为FDX1、LIAS、LIPT1、DLD、DLAT、PDHA1、PDHB、MTF1、GLS和CDKN2A。Further, the tumor copper death gene and the liver cancer copper death gene are FDX1, LIAS, LIPT1, DLD, DLAT, PDHA1, PDHB, MTF1, GLS and CDKN2A.
进一步地,所述肝癌预后标志物为SPC24,GOT2,SRXN1,SAC3D1,CDCA8,NCAPD2,ATAD5,SYDE2,MCM10,N4BP3,NR2C2AO,BESP1,ACAT2和UNC119B。Further, the liver cancer prognostic markers are SPC24, GOT2, SRXN1, SAC3D1, CDCA8, NCAPD2, ATAD5, SYDE2, MCM10, N4BP3, NR2C2AO, BESP1, ACAT2 and UNC119B.
本申请的第二个方面提供一种肝癌预后风险评估系统,其包括:The second aspect of the present application provides a liver cancer prognosis risk assessment system, which includes:
训练模块,所述训练模块用于选用TCGA中肝癌组织样本的基因表达数据作为训练集,利用上述肝癌预后标志物筛选系统筛选得到的肝癌预后标志物的PCR检测的mRNA表达水平来构建用于计算风险值的预后风险评分模型,选取全部训练集样本的风险值的中位数作为归类风险等级的界值,所述风险评分模型为:A training module, the training module is used to select gene expression data of liver cancer tissue samples in TCGA as a training set, and use the mRNA expression levels of the liver cancer prognosis markers screened by the above-mentioned liver cancer prognosis marker screening system to construct and calculate The prognostic risk scoring model of the risk value, the median of the risk values of all training set samples is selected as the boundary value of the classification risk level, and the risk scoring model is:
其中,n为变量总数,Coef为系数,χ为变量,即肝癌预后标志物的PCR检测的mRNA表达量,Coefi为第i个变量的系数;不同变量对应的系数的调整方法为:通过交叉验证法,对给定的值,进行交叉验证,选取交叉验证误差最小的值,然后按照得到的值,用全部数据重新拟合模型;Among them, n is the total number of variables, Coef is the coefficient, χ is the variable, that is, the mRNA expression level detected by PCR of liver cancer prognostic markers, and Coef i is the coefficient of the ith variable; the adjustment method of the coefficients corresponding to different variables is: through crossover Validation method, for the given value, perform cross-validation, select the value with the smallest cross-validation error, and then re-fit the model with all the data according to the obtained value;
测试模块,所述测试模块用于在真实世界的肝癌样本中检测肝癌预后标志物的PCR检测的mRNA表达水平并计算风险值,利用训练集中所得模型的界值对所有真实样本进行分组,通过生存分析验证该界值用于评价肝癌患者预后不良的价值。The test module is used to detect the mRNA expression level of the PCR detection of liver cancer prognostic markers in real-world liver cancer samples and calculate the risk value, and use the cut-off value of the model obtained in the training set to group all real samples, and pass the survival The value of this cut-off value for evaluating the poor prognosis of liver cancer patients was verified by analysis.
本申请的第三个方面提供一种肝癌预后标志物筛选方法,其包括:The third aspect of the present application provides a method for screening liver cancer prognostic markers, which includes:
步骤S1:在TCGA和GEO数据库中下载肝癌与癌旁配对组织基因表达信息以及样本的总生存期数据,还用于从若干肿瘤铜死亡基因中进行差异表达分析,从而分析出在TCGA肝癌配对样本中具有差异性表达的若干肝癌铜死亡基因;Step S1: Download the gene expression information of liver cancer and paracancerous paired tissues and the overall survival data of samples in TCGA and GEO databases, and also use it for differential expression analysis from several tumor copper death genes, so as to analyze the paired samples of liver cancer in TCGA. Several liver cancer copper death genes differentially expressed in
步骤S2:在TCGA中肝癌与癌旁组织样本中做基因差异性表达分析,根据第一筛选标准构建差异性表达基因集合;另外在TCGA肝癌组织样本中将所有基因与肝癌铜死亡基因做相关性分析,根据第二筛选标准构建肝癌铜死亡基因相关基因集合;Step S2: Perform gene differential expression analysis in TCGA liver cancer and paracancerous tissue samples, and construct a differentially expressed gene set according to the first screening criteria; in addition, in TCGA liver cancer tissue samples, correlate all genes with liver cancer copper death genes Analysis, according to the second screening criteria to construct liver cancer copper death gene-related gene set;
步骤S3:将差异性表达基因集合和肝癌铜死亡基因相关基因集合中的基因取交集;Step S3: Intersect the genes in the differentially expressed gene set and the liver cancer copper death gene-related gene set;
步骤S4:结合对应样本的预后数据,依次利用单因素分析、LASSO回归法联合逐步回归的多因素分析筛选得到肝癌铜死亡预后相关基因,即肝癌预后标志物。Step S4: Combined with the prognosis data of the corresponding samples, the genes related to the prognosis of liver cancer copper death, namely the liver cancer prognostic markers, were screened by single factor analysis, LASSO regression method combined with stepwise regression multivariate analysis.
进一步地,肝癌预后标志物筛选方法进一步包括步骤S5:在真实世界肝癌样本队列中,利用PCR技术检测肝癌预后标志物的mRNA表达量,以表达量的中位数为界值进行风险等级分组并进行风险因素相关分析,从而检测各基因表达和患者生存状态预后的相关性。Further, the method for screening liver cancer prognostic markers further includes step S5: in the real-world liver cancer sample cohort, use PCR technology to detect the mRNA expression levels of liver cancer prognostic markers, and use the median of expression levels as the cut-off value to carry out risk level grouping and Risk factor correlation analysis was carried out to detect the correlation between the expression of each gene and the prognosis of the patient's survival status.
进一步地,所述第一筛选标准为|log2差异倍数|>1且P<0.05,所述第二筛选标准为|相关系数|>0.4且P<0.05。Further, the first screening criterion is |log 2 multiple of difference|>1 and P<0.05, and the second screening criterion is |correlation coefficient|>0.4 and P<0.05.
进一步地,所述肿瘤铜死亡基因和所述肝癌铜死亡基因均为FDX1、LIAS、LIPT1、DLD、DLAT、PDHA1、PDHB、MTF1、GLS和CDKN2A;所述肝癌预后标志物为SPC24,GOT2,SRXN1,SAC3D1,CDCA8,NCAPD2,ATAD5,SYDE2,MCM10,N4BP3,NR2C2AO,BESP1,ACAT2和UNC119B。Further, the tumor copper death gene and the liver cancer copper death gene are FDX1, LIAS, LIPT1, DLD, DLAT, PDHA1, PDHB, MTF1, GLS and CDKN2A; the liver cancer prognosis markers are SPC24, GOT2, SRXN1 , SAC3D1, CDCA8, NCAPD2, ATAD5, SYDE2, MCM10, N4BP3, NR2C2AO, BESP1, ACAT2 and UNC119B.
采用了上述技术方案后,与现有技术相比,具有以下有益效果:After adopting the above technical solution, compared with the prior art, it has the following beneficial effects:
本发明提供了一种铜死亡相关肝癌预后标志物的筛选方法,所述筛选和分类方法基于生物信息学和统计学,可提高预后预测和治疗选择的准确性。利用本发明所述筛选和分类方法筛选得到的肝癌预后风险评估工具riskScore,其判定结果与本单位肝癌队列中真实预后数据结果具有高度的一致性。The invention provides a screening method for copper death-related liver cancer prognosis markers. The screening and classification methods are based on bioinformatics and statistics, and can improve the accuracy of prognosis prediction and treatment selection. The riskScore, a liver cancer prognosis risk assessment tool screened by the screening and classification method of the present invention, has a high degree of consistency with the real prognosis data in the liver cancer cohort of our unit.
附图说明Description of drawings
图1为本发明肝癌预后标志物的筛选流程图。Fig. 1 is a flowchart of the screening of liver cancer prognostic markers of the present invention.
图2为本发明预后风险值riskScore在训练集中与肝癌与生存状态预后分析。Fig. 2 is the prognostic analysis of the present invention's prognostic risk value riskScore in the training set and liver cancer and survival status.
图3为本发明预后风险值riskScore在测试集中与肝癌生存分期分析。Fig. 3 is an analysis of the present invention's prognostic risk value riskScore in the test set and liver cancer survival staging.
具体实施方式Detailed ways
以下结合附图与具体实施例进一步阐述本发明的优点。本领域技术人员应当理解,下面所具体描述的内容是说明性的而非限制性的,不应以此限制本发明的保护范围。The advantages of the present invention will be further elaborated below in conjunction with the accompanying drawings and specific embodiments. Those skilled in the art should understand that the content specifically described below is illustrative rather than restrictive, and should not limit the protection scope of the present invention.
实施例1肝癌预后标志物筛选Example 1 Screening of liver cancer prognostic markers
本实施例提供一种肝癌预后标志物筛选系统,其包括数据采集分析模块、筛选模块和验证模块。This embodiment provides a screening system for liver cancer prognostic markers, which includes a data collection and analysis module, a screening module and a verification module.
数据采集分析模块用于在TCGA和GEO数据库中下载肝癌与癌旁配对组织基因表达信息以及样本的总生存期数据,还用于从若干肿瘤铜死亡基因中进行差异表达分析,从而分析出在TCGA肝癌配对样本中具有差异性表达的若干肝癌铜死亡基因。The data acquisition and analysis module is used to download the gene expression information of liver cancer and paracancerous paired tissues and the overall survival data of the samples in the TCGA and GEO databases. It is also used to perform differential expression analysis from several tumor copper death genes, thereby analyzing Several HCC copper death genes differentially expressed in HCC paired samples.
筛选模块用于在TCGA中肝癌与癌旁组织样本中做基因差异性表达分析,根据第一筛选标准构建差异性表达基因集合,第一筛选标准为|log2差异倍数|>1且P<0.05;另外在TCGA肝癌组织样本中将所有基因与肝癌铜死亡基因做相关性分析,根据第二筛选标准构建肝癌铜死亡基因相关基因集合,第二筛选标准为|相关系数|>0.4且P<0.05;然后将差异性表达基因集合和肝癌铜死亡基因相关基因集合中的基因取交集,结合对应样本的预后数据,依次利用单因素分析、LASSO回归法联合逐步回归的多因素分析筛选得到肝癌铜死亡预后相关基因,即肝癌预后标志物。The screening module is used for gene differential expression analysis in liver cancer and paracancerous tissue samples in TCGA, and constructs a set of differentially expressed genes according to the first screening standard, the first screening standard is |log2 difference multiple|>1 and P<0.05; In addition, in TCGA liver cancer tissue samples, the correlation analysis was performed between all genes and liver cancer copper death genes, and a gene set related to liver cancer copper death genes was constructed according to the second screening criteria. The second screening criteria was |correlation coefficient|>0.4 and P<0.05; Then, the genes in the differentially expressed gene set and the gene set related to copper death in liver cancer were intersected, and combined with the prognosis data of the corresponding samples, the prognosis of copper death in liver cancer was obtained by univariate analysis, LASSO regression combined with stepwise regression multivariate analysis and screening. Related genes, namely liver cancer prognostic markers.
验证模块用于在真实世界肝癌样本队列中,利用PCR技术检测肝癌预后标志物的mRNA表达量,以表达量的中位数为界值进行风险等级分组并进行风险因素相关分析,从而检测各基因表达和患者生存状态预后的相关性。The verification module is used to detect the mRNA expression levels of liver cancer prognostic markers using PCR technology in the real-world liver cancer sample cohort, and use the median expression level as the cut-off value to carry out risk level grouping and risk factor correlation analysis, so as to detect each gene Correlation between expression and prognosis of patient survival status.
如图1所示,采用肝癌预后标志物筛选系统进行肝癌预后标志物筛选的过程包括如下步骤:As shown in Figure 1, the process of screening liver cancer prognostic markers using the liver cancer prognostic marker screening system includes the following steps:
步骤S1:在TCGA和GEO数据库中下载肝癌与癌旁配对组织基因表达信息以及样本的总生存期数据,还用于从若干肿瘤铜死亡基因中进行差异表达分析,从而分析出在TCGA肝癌配对样本中具有差异性表达的若干肝癌铜死亡基因。Step S1: Download the gene expression information of liver cancer and paracancerous paired tissues and the overall survival data of samples in TCGA and GEO databases, and also use it for differential expression analysis from several tumor copper death genes, so as to analyze the paired samples of liver cancer in TCGA. Several hepatocellular carcinoma copper death genes differentially expressed in
数据采集分析模块用于下载癌症基因组图谱计划(The Cancer Genome Atlas,TCGA)及GEO(Gene Expression Omnibus,GEO)在线肝癌与癌旁配对组织基因表达信息以及样本的总生存期数据(共50对)。根据国际权威期刊Science(doi:10.1126/science.abf0529)的研究结果,定义10个铜死亡相关基因:FDX1、LIAS、LIPT1、DLD、DLAT、PDHA1、PDHB、MTF1、GLS和CDKN2A。将上述基因在TCGA肝癌配对样本中行差异表达分析,发现所有基因均具有差异表达。所述肿瘤铜死亡基因和所述肝癌铜死亡基因均为FDX1、LIAS、LIPT1、DLD、DLAT、PDHA1、PDHB、MTF1、GLS和CDKN2A。The data acquisition and analysis module is used to download the gene expression information of liver cancer and para-cancer paired tissues and the overall survival data of samples (50 pairs in total) from the Cancer Genome Atlas (TCGA) and GEO (Gene Expression Omnibus, GEO) online . According to the research results of the international authoritative journal Science (doi:10.1126/science.abf0529), 10 copper death-related genes were defined: FDX1, LIAS, LIPT1, DLD, DLAT, PDHA1, PDHB, MTF1, GLS and CDKN2A. The differential expression analysis of the above genes in TCGA liver cancer paired samples showed that all genes were differentially expressed. The tumor copper death gene and the liver cancer copper death gene are FDX1, LIAS, LIPT1, DLD, DLAT, PDHA1, PDHB, MTF1, GLS and CDKN2A.
步骤S2:在TCGA中肝癌与癌旁组织样本中做基因差异性表达分析,根据第一筛选标准构建差异性表达基因集合,所述第一筛选标准为|log2差异倍数|>1且P<0.05;另外在TCGA肝癌组织样本中将所有基因与肝癌铜死亡基因做相关性分析,根据第二筛选标准构建肝癌铜死亡基因相关基因集合,所述第二筛选标准为|相关系数|>0.4且P<0.05。Step S2: Perform differential expression analysis of genes in liver cancer and paracancerous tissue samples in TCGA, and construct a set of differentially expressed genes according to the first screening standard, which is |log2 difference multiple|>1 and P<0.05 ; In addition, in the TCGA liver cancer tissue samples, the correlation analysis was performed between all genes and liver cancer copper death genes, and a set of genes related to liver cancer copper death genes was constructed according to the second screening criteria. The second screening criteria was |correlation coefficient|>0.4 and P <0.05.
步骤S3:将差异性表达基因集合和肝癌铜死亡基因相关基因集合中的基因取交集。Step S3: Intersect the genes in the differentially expressed gene set and the liver cancer copper death gene-related gene set.
步骤S4:结合对应样本的预后数据,依次利用单因素分析、LASSO回归法联合逐步回归的多因素分析筛选得到肝癌铜死亡预后相关基因,即肝癌预后标志物。Step S4: Combined with the prognosis data of the corresponding samples, the genes related to the prognosis of liver cancer copper death, namely the liver cancer prognostic markers, were screened by single factor analysis, LASSO regression method combined with stepwise regression multivariate analysis.
筛选得到14个预后相关基因,分别是SPC24,GOT2,SRXN1,SAC3D1,CDCA8,NCAPD2,ATAD5,SYDE2,MCM10,N4BP3,NR2C2AO,BESP1,ACAT2和UNC119B。Fourteen prognosis-related genes were screened, namely SPC24, GOT2, SRXN1, SAC3D1, CDCA8, NCAPD2, ATAD5, SYDE2, MCM10, N4BP3, NR2C2AO, BESP1, ACAT2 and UNC119B.
步骤S5:在真实世界肝癌样本队列中,利用PCR技术检测肝癌预后标志物的mRNA表达量,以表达量的中位数为界值进行风险等级分组并进行风险因素相关分析,从而检测各基因表达和患者生存状态预后的相关性。Step S5: In the real-world liver cancer sample cohort, use PCR technology to detect the mRNA expression levels of liver cancer prognostic markers, use the median expression level as the cut-off value to carry out risk level grouping and risk factor correlation analysis, so as to detect the expression of each gene Correlation with patient survival status.
在50例肝癌组织样本队列中,利用PCR技术检测的SPC24,GOT2,SRXN1,SAC3D1,CDCA8,NCAPD2,ATAD5,SYDE2,MCM10,N4BP3,NR2C2AO,BESP1,ACAT2和UNC119B的mRNA表达水平,以表达值的中位数为界值进行分组(低风险组、高风险组)并行风险因素相关分析,检测各基因表达和患者生存状态预后的相关性,P<0.05认为差异有统计学意义。PCR所用引物如表1所示。In a cohort of 50 liver cancer tissue samples, the mRNA expression levels of SPC24, GOT2, SRXN1, SAC3D1, CDCA8, NCAPD2, ATAD5, SYDE2, MCM10, N4BP3, NR2C2AO, BESP1, ACAT2 and UNC119B detected by PCR technology were expressed in terms of expression values The median was used as the cut-off value for grouping (low-risk group, high-risk group) and risk factor correlation analysis to detect the correlation between the expression of each gene and the prognosis of the patient's survival status. P<0.05 considered the difference to be statistically significant. The primers used in PCR are listed in Table 1.
表1肝癌预测预后基因集引物序列Table 1 Primer sequences of liver cancer prediction prognostic gene set
实施例2肝癌预后风险评估Example 2 Liver cancer prognosis risk assessment
本实施例提供一种肝癌预后风险评估系统,其包括训练模块和测试模块。其是可供归类风险等级的评估工具和界值。This embodiment provides a liver cancer prognosis risk assessment system, which includes a training module and a testing module. It is an assessment tool and threshold for assigning risk levels.
训练模块用于选用TCGA中肝癌组织样本的基因表达数据作为训练集,利用上述肝癌预后标志物筛选系统筛选得到的14个肝癌预后标志物的PCR检测的mRNA表达水平来构建用于计算风险值的预后风险评分模型,选取全部训练集样本的风险值的中位数作为归类风险等级的界值。所述风险评分模型为:The training module is used to select the gene expression data of liver cancer tissue samples in TCGA as the training set, and use the mRNA expression levels of the 14 liver cancer prognostic markers screened by the above-mentioned liver cancer prognostic marker screening system to construct a risk value calculation model. For the prognostic risk scoring model, the median of the risk values of all samples in the training set is selected as the cutoff value of the classified risk level. The risk scoring model is:
其中,n为变量总数,Coef为系数,χ为变量,即肝癌预后标志物的PCR检测的mRNA表达量,Coefi为第i个变量的系数;不同变量对应的系数的调整方法为:通过交叉验证法,对给定的值,进行交叉验证,选取交叉验证误差最小的值,然后按照得到的值,用全部数据重新拟合模型;Among them, n is the total number of variables, Coef is the coefficient, χ is the variable, that is, the mRNA expression level detected by PCR of liver cancer prognostic markers, and Coef i is the coefficient of the ith variable; the adjustment method of the coefficients corresponding to different variables is: through crossover Validation method, for the given value, perform cross-validation, select the value with the smallest cross-validation error, and then re-fit the model with all the data according to the obtained value;
训练模块利用所述预后不良生物标志物对肝癌患者预后进行风险评分,得到风险值riskScore,风险值=TNM分期+(SPC24表达量×(-0.56))+(GOT2表达量×(-0.36))+(SRXN1表达量×0.25)+(SAC3D1表达量×0.66)+(CDCA8表达量×0.98)+(NCAPD2表达量×(-0.49))+(ATAD5表达量×(-0.03))+(SYDE2表达量×0.53)+(MCM10表达量×0.50)+(N4BP3表达量×0.34)+(NR2C2AP表达量×(-0.56))+(BFSP1表达量×0.45)+(ACAT2表达量×(-0.30))+(UNC119B表达量×(-0.24))。选取全部训练集样本的风险值的中位数作为归类风险等级的界值。示例地,本实施例中的肝癌TNM分期为2017AJCC第八版。风险值小于界值定义为预后不良低风险,风险值大于界值定义为预后不良高风险)。风险分数预后分析,评估风险值和预后的相关性,P<0.05认为差异有统计学意义。训练模块中预后不良高风险和低风险的临床分期差异如图2所示。The training module uses the poor prognosis biomarkers to perform a risk score on the prognosis of liver cancer patients to obtain the risk value riskScore, risk value = TNM stage + (SPC24 expression level × (-0.56)) + (GOT2 expression level × (-0.36)) +(SRXN1 expression×0.25)+(SAC3D1 expression×0.66)+(CDCA8 expression×0.98)+(NCAPD2 expression×(-0.49))+(ATAD5 expression×(-0.03))+(SYDE2 expression Amount×0.53)+(MCM10 expression amount×0.50)+(N4BP3 expression amount×0.34)+(NR2C2AP expression amount×(-0.56))+(BFSP1 expression amount×0.45)+(ACAT2 expression amount×(-0.30)) +(UNC119B expression level×(-0.24)). The median of the risk values of all training set samples is selected as the cut-off value of the classification risk level. Exemplarily, the TNM staging of liver cancer in this embodiment is the eighth edition of the 2017 AJCC. A risk value smaller than the cutoff value was defined as low risk of poor prognosis, and a risk value greater than the cutoff value was defined as high risk of poor prognosis). Risk score prognostic analysis, assessing the correlation between risk value and prognosis, P<0.05 considered the difference to be statistically significant. The difference in clinical stage between high risk and low risk of poor prognosis in the training module is shown in Figure 2.
测试模块用于在真实世界的肝癌样本中检测14种肝癌预后标志物的PCR检测的mRNA表达水平并计算风险值,利用训练集中所得模型的界值对所有真实样本进行分组,通过生存分析验证该界值后,用于评价肝癌患者预后不良的价值。The test module is used to detect the mRNA expression levels of 14 liver cancer prognostic markers detected by PCR in real-world liver cancer samples and calculate the risk value. All real samples are grouped by using the cut-off value of the model obtained in the training set, and the survival analysis is used to verify the After the cut-off value, it is used to evaluate the value of poor prognosis in patients with liver cancer.
所述测试模块用于在肝癌在线数据队列样本中验证风险评估系统界值在评价预后风险等级中的价值。该数据队列共包含50例肝癌样本基因表达数据。利用PCR检测14种预后相关基因的表达水平并计算风险值。利用界值进行风险分组后行生存状态及预后临床分期分析,测试模块中预后不良高风险和低风险的临床分期差异如图3所示,P<0.05认为有统计学差异。The test module is used to verify the value of the cutoff value of the risk assessment system in evaluating the prognostic risk level in the liver cancer online data cohort sample. The data cohort contains gene expression data of 50 liver cancer samples. The expression levels of 14 prognostic genes were detected by PCR and the risk value was calculated. Survival status and prognosis clinical staging were analyzed after risk grouping by cut-off value. The differences in clinical staging between high-risk and low-risk patients with poor prognosis in the test module are shown in Figure 3, and P<0.05 was considered statistically significant.
利用本发明所述筛选和分类方法筛选得到的肝癌预后风险评估工具riskScore,其判定结果与肝癌队列中真实预后数据结果具有高度的一致性,可提高预后预测和治疗选择的准确性。The liver cancer prognosis risk assessment tool riskScore screened by the screening and classification method of the present invention has a high degree of consistency with the real prognosis data results in the liver cancer cohort, which can improve the accuracy of prognosis prediction and treatment selection.
应当注意的是,本发明的实施例有较佳的实施性,且并非对本发明作任何形式的限制,任何熟悉该领域的技术人员可能利用上述揭示的技术内容变更或修饰为等同的有效实施例,但凡未脱离本发明技术方案的内容,依据本发明的技术实质对以上实施例所作的任何修改或等同变化及修饰,均仍属于本发明技术方案的范围内。It should be noted that the embodiments of the present invention have better implementability and are not intended to limit the present invention in any form. Any person skilled in the art may use the technical content disclosed above to change or modify equivalent effective embodiments However, any modifications or equivalent changes and modifications made to the above embodiments according to the technical essence of the present invention without departing from the content of the technical solution of the present invention still belong to the scope of the technical solution of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310169105.9A CN116403643A (en) | 2023-02-27 | 2023-02-27 | Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310169105.9A CN116403643A (en) | 2023-02-27 | 2023-02-27 | Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN116403643A true CN116403643A (en) | 2023-07-07 |
Family
ID=87011404
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202310169105.9A Pending CN116403643A (en) | 2023-02-27 | 2023-02-27 | Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN116403643A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117757936A (en) * | 2023-12-26 | 2024-03-26 | 郑州大学 | Application of copper death related gene in diagnosis, treatment and prognosis prediction of adrenocortical carcinoma |
| CN118380153A (en) * | 2024-05-06 | 2024-07-23 | 重庆市人口和计划生育科学技术研究院 | Liver cancer prognosis model based on copper death related gene, construction method and application thereof |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008107134A2 (en) * | 2007-03-02 | 2008-09-12 | Roche Diagnostics Gmbh | A method for detection of liver cancer, risk of liver cancer, risk of recurrence of liver cancer, malignancy of liver cancer and progression of liver cancer with time by using the methylated cytosine in basp1 gene and/or srd5a2 gene |
| CN101613748A (en) * | 2009-06-09 | 2009-12-30 | 中国人民解放军第二军医大学 | A method for detecting serum markers of pancreatic cancer |
| WO2013162502A1 (en) * | 2012-04-23 | 2013-10-31 | Empire Technology Development Llc | Tissue visualization for resection |
| CN114854863A (en) * | 2022-06-02 | 2022-08-05 | 中国人民解放军空军军医大学 | Application of model constructed based on biomarkers in predicting cancer immunotherapy effect |
| CN115019965A (en) * | 2022-05-20 | 2022-09-06 | 中山大学附属第一医院 | Method for constructing liver cancer patient survival prediction model based on cell death related gene |
-
2023
- 2023-02-27 CN CN202310169105.9A patent/CN116403643A/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008107134A2 (en) * | 2007-03-02 | 2008-09-12 | Roche Diagnostics Gmbh | A method for detection of liver cancer, risk of liver cancer, risk of recurrence of liver cancer, malignancy of liver cancer and progression of liver cancer with time by using the methylated cytosine in basp1 gene and/or srd5a2 gene |
| CN101613748A (en) * | 2009-06-09 | 2009-12-30 | 中国人民解放军第二军医大学 | A method for detecting serum markers of pancreatic cancer |
| WO2013162502A1 (en) * | 2012-04-23 | 2013-10-31 | Empire Technology Development Llc | Tissue visualization for resection |
| CN115019965A (en) * | 2022-05-20 | 2022-09-06 | 中山大学附属第一医院 | Method for constructing liver cancer patient survival prediction model based on cell death related gene |
| CN114854863A (en) * | 2022-06-02 | 2022-08-05 | 中国人民解放军空军军医大学 | Application of model constructed based on biomarkers in predicting cancer immunotherapy effect |
Non-Patent Citations (2)
| Title |
|---|
| 张诚胜;王朝英;何二霞;徐军女;卓曼云;林海锋;: "LncRNA与肝细胞癌关联TCGA数据库信息分析评价", 中华肿瘤防治杂志, no. 09, 14 May 2020 (2020-05-14) * |
| 赵小乐等: ""肝癌中铜死亡相关基因的表达及其预后价值"", 《肿瘤防治研究》, vol. 50, no. 2, 25 February 2023 (2023-02-25), pages 140 - 145 * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117757936A (en) * | 2023-12-26 | 2024-03-26 | 郑州大学 | Application of copper death related gene in diagnosis, treatment and prognosis prediction of adrenocortical carcinoma |
| CN118380153A (en) * | 2024-05-06 | 2024-07-23 | 重庆市人口和计划生育科学技术研究院 | Liver cancer prognosis model based on copper death related gene, construction method and application thereof |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113539376B (en) | Gene model for judging prognosis of liver cell liver cancer patient, construction method and application | |
| CN110580956B (en) | A group of liver cancer prognostic markers and their applications | |
| CN109897899B (en) | Marker for prognosis judgment of locally advanced esophageal squamous carcinoma and application thereof | |
| CN115019965A (en) | Method for constructing liver cancer patient survival prediction model based on cell death related gene | |
| CN113450873A (en) | Marker for predicting gastric cancer prognosis and immunotherapy applicability and application thereof | |
| CN111863159B (en) | A method for establishing a nomogram model for predicting the efficacy of tumor immunotherapy | |
| CN114203256B (en) | MIBC typing and prognosis prediction model construction method based on microbial abundance | |
| CN113066585A (en) | A method for efficient and rapid evaluation of prognosis of patients with stage II colorectal cancer based on immune gene expression profile | |
| CN116403643A (en) | Liver cancer prognostic marker screening system and method, liver cancer prognosis risk assessment system | |
| CN109929934B (en) | Application of immune-related genes in colorectal cancer prognosis kits and systems | |
| CN115505644A (en) | A kit for predicting the efficacy of chemotherapy for head and neck squamous cell carcinoma and its application | |
| CN113436741B (en) | Lung cancer recurrence prediction method based on tissue specific enhancer region DNA methylation | |
| CN116364179A (en) | Colorectal cancer prognostic marker screening system and method, colorectal cancer prognosis risk assessment system | |
| Zhan et al. | Development and validation of a prognostic gene signature in clear cell renal cell carcinoma | |
| CN117038092A (en) | Pancreatic cancer prognosis model construction method based on Cox regression analysis | |
| CN116386714A (en) | Processing method of non-small cell lung cancer data | |
| CN113416786A (en) | Biomarker combination for hepatocellular carcinoma prognosis evaluation and screening method and application thereof | |
| CN117809838A (en) | Non-small cell lung cancer patient prognosis survival rate prediction method based on multiple groups | |
| Zhu et al. | Comprehensive analysis of N6‐methyladenosine‐related long non‐coding RNAs for prognosis prediction in liver hepatocellular carcinoma | |
| Li et al. | Robust nonparametric integrative analysis to decipher heterogeneity and commonality across subgroups using sparse boosting | |
| CN111733252A (en) | A combination of characteristic miRNA expression profiles and a method for early prediction of gastric cancer | |
| CN116312814B (en) | Construction method, equipment, device and kit of lung adenocarcinoma molecular typing model | |
| Zhu et al. | Prognostic iron-metabolism signature robustly stratifies single-cell characteristics of hepatocellular carcinoma | |
| CN116246707A (en) | Evaluation of cross-resistance gene sets, scoring models and their construction methods and applications in HER2-positive gastric cancer | |
| CN115851952A (en) | Prognostic biomarkers for squamous cell carcinoma of the head and neck and their screening methods and applications |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |