[go: up one dir, main page]

WO2024192928A1 - Gene combination for liver cancer detection, and related reagent and application - Google Patents

Gene combination for liver cancer detection, and related reagent and application Download PDF

Info

Publication number
WO2024192928A1
WO2024192928A1 PCT/CN2023/105356 CN2023105356W WO2024192928A1 WO 2024192928 A1 WO2024192928 A1 WO 2024192928A1 CN 2023105356 W CN2023105356 W CN 2023105356W WO 2024192928 A1 WO2024192928 A1 WO 2024192928A1
Authority
WO
WIPO (PCT)
Prior art keywords
gene
reagent
liver cancer
seq
probe
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2023/105356
Other languages
French (fr)
Chinese (zh)
Inventor
李慧
郗丽英
张嫒媛
杨浩
郑璐
王寅
白健
吴�琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Berryoncology Jingzhan Medical Laboratory Co Ltd
Original Assignee
Berryoncology Jingzhan Medical Laboratory Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Berryoncology Jingzhan Medical Laboratory Co Ltd filed Critical Berryoncology Jingzhan Medical Laboratory Co Ltd
Publication of WO2024192928A1 publication Critical patent/WO2024192928A1/en
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/50ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for simulation or modelling of medical disorders
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/118Prognosis of disease development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/154Methylation markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/16Primer sets for multiplex assays
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/166Oligonucleotides used as internal standards, controls or normalisation probes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Definitions

  • the present disclosure relates to the field of biotechnology and disease detection, and in particular, to a gene combination and related reagents and applications for liver cancer detection.
  • Liver cancer mainly includes hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICC), of which 70%-90% are hepatocellular carcinoma.
  • HCC hepatocellular carcinoma
  • ICC intrahepatic cholangiocarcinoma
  • Liver cancer is one of the most common malignant tumors in the world, and its incidence is increasing year by year. It is estimated that by 2025, the incidence rate will exceed one million cases.
  • High-risk groups for HCC include those with HBV infection, HCV infection, alcoholic liver disease (ALD) caused by excessive drinking, nonalcoholic fatty liver disease (NAFLD) caused by diabetes or obesity-related metabolic causes, and cirrhosis caused by other causes, as well as people with a family history of liver cancer, especially men over 40 years old.
  • HBV infection is the main risk factor for HCC, accounting for about 85% of the number of liver cancer patients. Since 2002, the proportion of HCC caused by HBV and HCV infection has been declining due to the popularization of HBV vaccination for newborns and the improvement of HCV cure rate.
  • HCC cases caused by alcoholic liver disease and nonalcoholic fatty liver disease have been on the rise, accounting for 15% of the general population, becoming the second largest risk factor after HBV infection. More than 90% of HCC patients have chronic liver disease, and cirrhosis caused by any cause is an extremely high risk factor for HCC.
  • HCC human Crohn's disease .
  • the prognosis of HCC depends on the tumor stage.
  • the 5-year survival rate of early HCC is over 70%, while only 70% of patients with mid-stage HCC can survive 1 to 1.5 years after treatment.
  • Regular screening of high-risk groups such as those with cirrhosis and chronic HBV infection can reduce the mortality rate by 37%.
  • patients with early liver cancer only account for 10% to 20% of the number of liver cancer patients.
  • Most HCC patients are already in the middle and late stages when liver cancer is detected, and the survival rate is greatly reduced. Therefore, early detection of liver cancer and timely surgical treatment can enable patients to obtain the best survival rate.
  • Serum AFP is currently a common and important indicator for diagnosing liver cancer and monitoring therapeutic effects. Serum AFP ⁇ 400ng/ml is highly suggestive of liver cancer after excluding pregnancy, chronic or active liver disease, gonadal embryonal tumors, and digestive tract tumors. When the AFP threshold is 20ng/mL, the sensitivity and specificity of AFP for detecting HCC are 41% to 65% and 80% to 90%, respectively; however, nearly 50% of HCC patients have AFP levels below 20ng/mL, so AFP cannot be used alone for HCC screening. Total AFP is not specific for liver cancer, and 10% to 42% of elevated AFP levels also occur in other liver diseases besides HCC, such as viral hepatitis.
  • AFP and ultrasound screening are not high, and abdominal ultrasound depends on the operator's level, most liver cancers are discovered in the late stage clinically, which greatly reduces the patient's five-year survival rate. Therefore, it is urgent to develop a product with high sensitivity and specificity for early screening and early diagnosis of people at high risk of liver cancer.
  • the ideal detection product should be highly repeatable, independent of the operator's experience (such as abdominal ultrasound), highly accurate, and easy to implement in different clinical scenarios, and liquid biopsy meets these standards.
  • the purpose of the present disclosure is to provide a gene combination and related reagents and applications for liver cancer detection.
  • the present disclosure provides the use of a reagent or a reagent combination in preparing a product for any one or more of the following uses, including (1) early screening or early diagnosis of liver cancer, (2) diagnosis or auxiliary diagnosis of liver cancer, (3) assessment or auxiliary assessment of the prognostic risk of liver cancer and (4) dynamic monitoring of liver cancer; the reagent or reagent combination comprises: a first reagent for detecting the methylation content of the OSR2 gene and a second reagent for detecting the methylation content of the TSPYL5 gene.
  • the present disclosure provides a reagent or a reagent combination, which includes the reagent or the reagent combination described in the aforementioned embodiments.
  • the present disclosure provides a kit, which includes the reagent or reagent combination described in any of the aforementioned embodiments.
  • the present disclosure provides a method for detecting the methylation content of a target gene, which comprises: using the reagent or reagent combination described in the aforementioned embodiment to detect the methylation content of the target gene in a sample; the target gene comprises OSR2 and TSPYL5 genes.
  • the present disclosure provides a method for training a liver cancer prediction model, which comprises: obtaining the reagent or reagent combination described in the aforementioned embodiment to detect the methylation content of the target gene in the training sample and annotating the result; wherein the target gene is the target gene as described in the aforementioned embodiment, and the annotated result comprises: a label representing any one of the liver cancer risk, prognosis risk and disease progression of the sample; the methylation content of the target gene of the sample to be tested is input into a pre-constructed prediction model to obtain a prediction result; the prediction model is a machine learning model that can judge any one of the liver cancer risk, prognosis risk and disease progression of the sample according to the methylation content of the target gene.
  • an embodiment of the present disclosure provides an electronic device comprising: a processor and a memory; the memory is used to store a program, and when the program is executed by the processor, the processor implements the training method of the liver cancer prediction model or the liver cancer prediction method described in the aforementioned embodiment; the prediction method comprises: obtaining the methylation content of the target gene in the sample to be tested; the target gene is the target gene described in the aforementioned embodiment; the methylation content of the target gene of the sample to be tested is input into the model trained by the training method described in the aforementioned embodiment to obtain the prediction result of the sample to be tested.
  • the present disclosure provides a gene combination of OSR2 and TSPYL5 genes that can be used for liver cancer detection. By detecting the methylation content of this gene combination, it can be determined whether the subject has liver cancer, the risk of liver cancer and/or the disease progression of liver cancer. It has the advantages of high detection sensitivity and good specificity.
  • Figure 1 is a comparison of the principles of MethyLight and HeavyMethyl detection
  • Figure 2 shows the ROC curve of 21 candidate genes for distinguishing liver cancer from liver cirrhosis
  • Figure 3 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from all negative samples
  • Figure 4 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis samples
  • Figure 5 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from healthy samples
  • Figure 6 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from liver cirrhosis
  • Figure 7 shows the ROC curves of OSR2, TSPYL5 and AFP for distinguishing liver cancer from cirrhosis.
  • Methylation is a common epigenetic modification of human genomic DNA. It refers to the transfer of a methyl group (-CH 3 ) to the 5th carbon atom of the DNA cytosine base C under the action of DNA methyltransferase.
  • DNA methylation modification is involved in a variety of cell biological processes such as cell differentiation, genome stability, X chromosome inactivation and gene imprinting.
  • Common methyltransferases in mammals are DNMT3a, DNMT3b and DNMT1.
  • DNMT3a and DNMT3b realize de novo methylation of DNA, while DNMT1 is responsible for replicating the methylation pattern of the DNA template sequence.
  • DNA methylation mainly occurs on the base C of CpG dinucleotides. About 70% of human genes have multiple densely packed CpG dinucleotides in the promoter region upstream, forming a CpG island. Methylation of the CpG island leads to silencing of gene expression. In addition to the methylation of the CpG island in the promoter region of the gene, the first exon of the gene is similar to the promoter region and is also methylated, leading to silencing of gene expression. Gene DNA methylation also plays an important role in the occurrence and development of tumors. High methylation in the promoter region of tumor suppressor genes inhibits gene expression, causing the loss of tumor suppressor gene function, leading to uncontrolled proliferation and invasion and metastasis of tumor cells, and ultimately leading to the occurrence and development of cancer.
  • methylation modification occurs in the early stages of tumors, earlier than gene mutations. Methylation of CpG dinucleotides often appears in clusters, that is, in a linked state, which is suitable for early screening and diagnosis of tumors.
  • CpG islands are related to the expression of controlled genes, the methylation of CpG islands can present a tissue-specific pattern. Therefore, DNA methylation can be used to trace the tissue origin of cancer.
  • the detection methods of DNA methylation can be divided into chemical conversion method, enzyme conversion method, enzyme digestion method and methylated DNA immunoprecipitation method.
  • the chemical conversion method is to convert methylated cytosine (5-mC) into uracil U by bisulfite, and then perform PCR detection or sequencing.
  • Such methods include WGBS (Whole-Genome Bisulfite Sequencing), BSAS (Bisulfite Amplicon Sequencing), MSP (Methylation Specific PCR), BSP (Bisulfite Sequencing PCR), QMSP (Quantitative Methylation Specific PCR).
  • the enzymatic conversion method is to convert 5-mC into 5-caC by TET enzyme, and then convert 5-caC into uracil U by APOBEC enzyme or convert 5-caC into DHU by pyridine borane, such as NEB's NEBNext Enzymatic Methyl-seq Kit and Base Genomics' TAPS method.
  • Enzyme digestion is to cut methylated or non-methylated sites by methylation-sensitive or insensitive endonucleases, and selectively analyze DNA templates by PCR or sequencing, such as simplified genomic bisulfite sequencing RRBS.
  • Methylated DNA Immunoprecipitation is to separate methylated DNA fragments by anti-5-mC antibodies, and then perform relevant detection and analysis.
  • Enzyme conversion method has little damage to DNA, high recovery rate of converted DNA, and relatively complete fragments, but the conversion efficiency is unstable.
  • Enzyme digestion method has little damage to DNA and high enzyme digestion efficiency, but it needs to find specific endonucleases for one or a class of DNA sequences, which is not universal.
  • the capture efficiency of methylated DNA immunoprecipitation method is greatly affected by the system.
  • bisulfite conversion has great damage to DNA, the converted DNA is fragmented, and the conversion recovery rate is low, it has high conversion efficiency and stable conversion rate. It can identify single-base methylation and is the gold standard for DNA methylation detection. It is widely used in scientific research and clinical practice.
  • Fluorescence quantitative PCR based on bisulfite conversion can be divided into MethyLight and HeavyMethyl according to the principle of detecting methylation (see Figure 1).
  • the primer design of HeavyMethyl does not distinguish between methylated and non-methylated sequences, but blocks the primer extension of the non-methylated template by designing one or two methylation-specific oligonucleotide blockers, thereby selectively amplifying the methylated template, and at the same time designs a methylation-specific probe, and realizes the selective amplification and detection of the methylated template through the oligonucleotide blocker and the methylation-specific probe.
  • HeavyMethyl can detect methylated DNA as low as 30-60pg, while non-methylated DNA is not amplified.
  • MethyLight designs methylation-specific primers and probes for DNA sequences after bisulfite conversion, so that only methylated templates are amplified and detected, while non-methylated templates are not amplified and detected.
  • MethyLight is considered to be highly sensitive and is widely used for DNA methylation detection.
  • primers and probes with high sensitivity and specificity can be designed.
  • it is difficult to design suitable primers and probes due to insufficient density of CpG dinucleotides or uneven distribution of methylated C and non-methylated C, resulting in insufficient sensitivity or specificity of the MethyLight detection system, and false positive or false negative results.
  • the first step is to screen methylation markers (target genes), screen out methylation markers with high sensitivity and specificity, and then perform fluorescent quantitative PCR detection. It can be seen that it is particularly critical to screen out methylation markers with good performance.
  • the present disclosure provides a gene combination for liver cancer detection, and a set of reagent combinations and kits for liver cancer detection based on the gene combination. By detecting the methylation content of the target gene combination, it can be determined whether the subject has liver cancer or the risk of liver cancer.
  • the present disclosure provides the use of a reagent or a reagent combination in preparing a product for any one or more of the following uses, including (1) early screening or early diagnosis of liver cancer, (2) diagnosis or auxiliary diagnosis of liver cancer, (3) assessment or auxiliary assessment of prognostic risk of liver cancer, and (4) dynamic monitoring of liver cancer; the reagent or reagent combination comprises: a first reagent for detecting the methylation content of the OSR2 gene and a second reagent for detecting the methylation content of the TSPYL5 gene.
  • OSR2 is a protein-coding gene that encodes odd-skipped-related transcription factor 2.
  • the OSR2 gene has GeneID: 116039.
  • the detection region of the OSR2 gene is selected from: the full-length or partial sequence of the OSR2 gene (exon region and intron region) and the 5' promoter region.
  • the detection region of the OSR2 gene includes: region 1 whose nucleotide sequence is shown in SEQ ID NO.1. Region 1 is an optional detection region, and the C in the CpG dinucleotide in its sequence is highly methylated in liver cancer, while the methylation level is low in cirrhosis, benign liver lesions and healthy people, and there is a significant difference between the two.
  • the chromosome coordinates are chr8:99,955,670-99,961,095 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):
  • the first reagent includes: any one or more of a primer pair, a probe, and a chip.
  • the first reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO. 4 to 5 and/or a probe with a nucleotide sequence as shown in SEQ ID NO. 6.
  • TSPYL5 is a protein encoding gene predicted to activate chromatin binding activity and histone binding activity, and is involved in multiple physiological processes including cellular response to gamma radiation, positive regulation of protein kinase B signaling, and positive regulation of protein ubiquitination.
  • the TSPYL5 gene has GeneID: 85453.
  • the detection region of the TSPYL5 gene is selected from: the full-length or partial sequence of the TSPYL5 gene (exon region and intron region) and the 5' promoter region.
  • the detection region of the TSPYL5 gene includes: region 2 whose nucleotide sequence is shown as SEQ ID NO.2. Region 2 is an optional detection region, and the C in the CpG dinucleotide in its sequence is highly methylated in liver cancer, while the methylation level is low in cirrhosis, benign liver lesions and healthy people, and there is a significant difference between the two.
  • the chromosome coordinates are chr8:98,289,281-98,290,380 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):
  • the second reagent includes any one or more of a primer pair, a probe, and a chip.
  • the second reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO.16-17 and/or a probe with a nucleotide sequence as shown in SEQ ID NO.18.
  • the reagent or reagent combination also includes: a reagent or reagent combination for detecting the methylation content of any one or more genes in C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene.
  • the detection region of the methylation content of the 19 genes is selected from the full length or partial region of the corresponding gene and its promoter region.
  • the 19 genes are used as a gene combination with the OSR2 gene and the TSPYL5 gene for liver cancer detection, which has good detection sensitivity and specificity, and the AUC can reach 0.92.
  • the reagent or reagent combination further includes: a third reagent for detecting the content of an internal reference gene.
  • the internal reference gene includes the SDF4 gene.
  • SDF4 is a protein-coding gene that encodes a stromal cell-derived factor, which is a member of the CREC (reticulum calcium-binding protein) protein family, and the encoded protein contains six EF-hand motifs and a calcium-binding motif. This protein is located in the Golgi cavity and may be involved in regulating calcium-dependent cell activities.
  • the SDF4 gene has GeneID: 51150.
  • the detection region of the SDF4 gene includes: the full-length or partial sequence of the SDF4 gene.
  • the detection region of the SDF4 gene includes: region 3 whose nucleotide sequence is shown as SEQ ID NO.3. Region 3 is an optional detection region, and the GC content of the segment sequence is 64.7%, the CpG dinucleotide density is 5.88%, and the GC content after bisulfite conversion is 32.5%.
  • the chromosome coordinates are chr1: 1,159,192-1,159,483 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):
  • the third reagent includes: any one or more of a primer pair, a probe, and a chip.
  • the third reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO. 28 to 29 and/or a probe with a nucleotide sequence as shown in SEQ ID NO. 30.
  • OSR2 and TSPYL5 are tumor markers highly correlated with liver cancer, and the detection target is their methylated DNA sequence
  • SDF4 is an internal reference gene used to control the quality of DNA samples and as a reference for the methylation content of the target gene.
  • the difference ( ⁇ Ct) of the Ct value amplified by OSR2 and TSPYL5 minus the Ct value amplified by the internal reference gene SDF4 is used as the relative methylation content of the target gene, and after modeling with the ⁇ Ct value, it is determined whether the subject has liver cancer or the risk of liver cancer.
  • the product includes any one or more of a reagent and a kit.
  • embodiments of the present disclosure provide a reagent or a reagent combination, which comprises the reagent or the reagent combination described in any of the preceding embodiments.
  • an embodiment of the present disclosure provides a kit, which includes the reagent or reagent combination described in any of the preceding embodiments.
  • the reagent or reagent combination and the kit have any one or more of the following uses: (1) early screening or early diagnosis of liver cancer; (2) diagnosis or auxiliary diagnosis of liver cancer, (3) preliminary assessment or auxiliary assessment of the prognostic risk of liver cancer and (4) dynamic monitoring of liver cancer.
  • an embodiment of the present disclosure provides a method for detecting the methylation content of a target gene, comprising: using the reagent or reagent combination described in any of the preceding embodiments to detect the methylation content of the target gene in a sample; the target genes include the OSR2 gene and the TSPYL5 gene.
  • the use is not directly aimed at diagnosis or treatment of disease.
  • the direct purpose is not to diagnose or treat a disease.
  • the sample to be tested is an environmental sample containing a biological sample or an artificially produced standard
  • the direct purpose of the test is not to diagnose or treat a disease.
  • the method further comprises detecting an internal reference gene.
  • the target gene also includes: for detecting any one or more of C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene.
  • the internal reference gene includes: SDF4 gene.
  • the sample of the detection method includes: a DNA sample or an environmental sample containing DNA.
  • the type of the DNA sample can be any one of plasma cfDNA, serum cfDNA, tissue DNA, cell DNA and exosome DNA.
  • the detection of the methylation content of the target gene comprises: using the reagent or reagent combination to perform qPCR amplification on the sample after bisulfite conversion;
  • the qPCR reaction system comprises: the final concentration of MgCl2 is 4-8mM, the final concentration of dNTP Mix is 200-800 ⁇ M, the final concentration of Taq enzyme is 1.0-10U/reaction, the final concentration of the primer pair for OSR2 gene detection is 0.05-1.0 ⁇ M, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for TSPYL5 gene detection is 0.1-1.0 ⁇ M, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for SDF4 gene detection is 0.1-1.0 ⁇ M, and the final concentration of the probe is 50-500nM.
  • the qPCR reaction system includes: a final concentration of MgCl2 of 6 mM, a final concentration of dNTP Mix of 500 ⁇ M, a final concentration of Taq enzyme of 3 U/reaction, a final concentration of the primer pair for OSR2 gene detection of 0.1 ⁇ M, a final concentration of the probe of 200 nM, a final concentration of the primer pair for TSPYL5 gene detection of 0.2 ⁇ M, a final concentration of the probe of 200 nM, a final concentration of the primer pair for SDF4 gene detection of 0.2 ⁇ M, a final concentration of the probe of 100 nM, and a total reaction volume of 25 ⁇ L.
  • the qPCR reaction procedure includes the following:
  • the qPCR reaction procedure includes the following: pre-denaturation at 95°C for 5 minutes; denaturation at 95°C for 15 seconds, annealing and extension at 62°C for 30 seconds, 15 cycles, without collecting fluorescence; denaturation at 95°C for 15 seconds, annealing and extension at 62°C for 32 seconds, 30 cycles, collecting fluorescence.
  • the present disclosure also provides a method for training a liver cancer prediction model, which includes:
  • the methylation content of the target gene of the sample to be tested is input into a pre-constructed prediction model to obtain a prediction result;
  • the prediction model is a machine learning model that can judge any of the liver cancer risk, prognosis risk and disease progression of the sample based on the methylation content of the target gene.
  • the sample size of the training samples is greater than or equal to 10.
  • the sample size can specifically be any one of 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900 and 1000, or a range between any two of them.
  • the label may be a character, where a character refers to a glyph-like unit or symbol, including letters, numbers, operation symbols, punctuation marks and other symbols, as well as some functional symbols.
  • the machine learning model can be selected from any one or a combination of support vector machine, decision tree, random forest, logistic regression, Bayesian, K nearest neighbor, K means, Markov and regression ridge algorithms.
  • an embodiment of the present disclosure further provides an electronic device, comprising: a processor and a memory; the memory is used to store a program, and when the program is executed by the processor, the processor implements the training method of the liver cancer prediction model or the liver cancer prediction method described in any of the above embodiments;
  • the prediction method includes: obtaining the methylation content of the target gene in the sample to be tested; the target gene is the target gene described in any of the aforementioned embodiments; the methylation content of the target gene of the sample to be tested is input into the model trained by the training method described in any of the aforementioned embodiments to obtain the prediction result of the sample to be tested.
  • the electronic device may include a memory, a processor, a bus, and a communication interface, wherein the memory, the processor, and the communication interface are electrically connected to each other directly or indirectly to achieve data transmission or interaction.
  • these components may be electrically connected to each other via one or more buses or signal lines.
  • the processor may process information and/or data related to target identification to perform one or more functions described in the present application.
  • the memory can be, but is not limited to, random access memory (RAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable read-only memory (EEPROM), etc.
  • RAM random access memory
  • ROM read-only memory
  • PROM programmable read-only memory
  • EPROM erasable programmable read-only memory
  • EEPROM electrically erasable read-only memory
  • the processor can be an integrated circuit chip with signal processing capabilities.
  • the processor can be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it can also be a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • CPU central processing unit
  • NP network processor
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • the electronic device can be a server, a cloud platform, a mobile phone, a tablet computer, a laptop computer, an ultra-mobile personal computer (UMPC), a handheld computer, a netbook, a personal digital assistant (PDA), a wearable electronic device, a virtual reality device, etc. Therefore, the embodiments of the present application do not limit the type of electronic device.
  • UMPC ultra-mobile personal computer
  • PDA personal digital assistant
  • DNA was collected from 43 liver cancer tissues and 32 paired adjacent paracancerous tissues, and plasma was collected from 55 liver cancer tissues, 55 liver cirrhosis tissues, and 50 healthy subjects, with 2 mL of plasma from each tissue.
  • the cfDNA concentration was measured using Qubit dsDNA HS Assay Kit (Thermofisher Scientific, Cat. No. Q32854).
  • the converted DNA was used to construct the library using Hieff NGS Methyl-seq DNA Library Prep Kit for Illumina (YEASEN, catalog number: 12211ES24). The operating steps are shown in the kit manual.
  • the library was concentrated using Qubit dsDNA HS Assay Kit and the library fragments were quality checked using 4200 bioanalyzer (Agilent DNA 1000 Kit, Cat. No.: 5067-1504).
  • the library that passed the quality check was captured by liquid phase hybridization.
  • the captured library was concentrated using Qubit dsDNA HS Assay Kit and the library fragments were quality checked using 4200 bioanalyzer. After the captured library passed the quality check, it was sequenced on the Illumina sequencer NovaSeq with a read length of PE150.
  • bamUtil software to soft-clip the overlapping regions of shorter inserts to prevent duplicate statistics when detecting methylation levels
  • bamtools software to remove DNA fragments with low alignment quality, unaligned, or incompletely paired double-end reads
  • the sequencing data of tissue DNA and plasma cfDNA were analyzed, and 21 candidate genes were finally screened out.
  • the difference levels of the median methylation percentages of the 21 candidate genes in liver cancer and cirrhosis, liver cancer and cirrhosis, and healthy people are shown in Table 1.
  • the performance of the 21 candidate genes in distinguishing liver cancer from cirrhosis is shown in Figure 2.
  • the AUC for distinguishing liver cancer from cirrhosis reached 0.92, among which HIST1H1D, IRX5, NEFM, OSR2, OTX1, PITX1, and TSPYL5 could reach an AUC of more than 0.71 for distinguishing liver cancer from cirrhosis.
  • OSR2 and TSPYL5 Based on the difference in methylation percentage of candidate genes in liver cancer, liver cirrhosis, and healthy people and the AUC area of the ROC curve, combined with the characteristics of the gene sequence, two target genes, OSR2 and TSPYL5, were screened out and qPCR primer probes were designed. According to the optional sequences of OSR2 and TSPYL5 genes, the entire optional sequence was traversed to screen sequence fragments suitable for primer probe design, and several sets of primer probe combinations were carefully designed in combination with the design elements of MethyLight primer probes.
  • primer-probe combinations were designed in the optional DNA segments of OSR2 and TSPYL5, and the primer-probe combinations were screened.
  • the primer-probe combinations for OSR2 amplification are:
  • SEQ ID NO.4 5 ⁇ -GCGTCGGTTTCGTTTTTGTACGTC-3 ⁇ ;
  • SEQ ID NO.5 5 ⁇ -CCACGCGCTACGCTACCG-3 ⁇ ;
  • SEQ ID NO.7 5 ⁇ -GTTTTTCGGAGGTAAGATTATTTGCG-3 ⁇ ;
  • SEQ ID NO.8 5 ⁇ -ACTCGATATATCCAAAACAAATTCGC-3 ⁇ ;
  • SEQ ID NO.10 5 ⁇ -TTTTGCGTCGTCGTTATTTTCG-3 ⁇ ;
  • SEQ ID NO.11 5 ⁇ -AAAACCTATACGATCTCAACGCG-3 ⁇ ;
  • SEQ ID NO.12 5 ⁇ 6-FAM-ATGGTGGAGCGGGTGATTTCGTGTAT-3 ⁇ BHQ1;
  • SEQ ID NO.14 5 ⁇ -AAAACCTATACGATCTCAACGCGA-3 ⁇ ;
  • the primer-probe combinations for TSPYL5 amplification are:
  • SEQ ID NO.16 5 ⁇ -CGCTCATAATAACGACGAAAACAACT-3 ⁇ ;
  • SEQ ID NO.17 5 ⁇ -GGAGAGATTCGTTCGGTTTCGATC-3 ⁇ ;
  • SEQ ID NO.20 5 ⁇ -AAAACCGCAAAATCACAACGTATCTT-3 ⁇ ;
  • SEQ ID NO.22 5 ⁇ -CCGCGATACTACAAATTTCTAAAACCTT-3 ⁇ ;
  • SEQ ID NO.25 5 ⁇ -AATACTACAATCGCAACTACCCAAA-3 ⁇ ;
  • SEQ ID NO.26 5 ⁇ -GGAAGGTAGTATGGATACGTTGGA-3 ⁇ ;
  • the primer-probe combinations for SDF4 amplification are:
  • SEQ ID NO.30 5 ⁇ ROX-ACCGACGACATCCGTCTAACTATTCGA-3 ⁇ BHQ2;
  • SEQ ID NO.31 5 ⁇ -TGTCGTTGTTGGGTGGTAGGTTTA-3 ⁇ ;
  • SEQ ID NO.32 5 ⁇ -CCCCAAAACCCCGAACAAC-3 ⁇ ;
  • SEQ ID NO.33 5 ⁇ ROX-AACGAATACCGCCGACCCAAAACTCCA-3 ⁇ BHQ2;
  • SEQ ID NO.34 5 ⁇ -GGTCGGATTAACGGGATGAAGTTTA-3 ⁇ ;
  • SEQ ID NO.35 5 ⁇ -CCCACACACGATTCGTACCTC-3 ⁇ ;
  • SEQ ID NO.36 5 ⁇ ROX-CCCGATCCCGCGTTCCCTACTATCCT-3 ⁇ BHQ2.
  • the fluorescent group modification at the 5' end of the probe can be the fluorescent group used in various types of probes such as TaqMan probe, MGB probe, molecular beacon, BQ probe, such as FAM, TET, JOE, VIC, HEX, Cy3, NED, TAMRA, ROX, Cy5, AMCA, Pacific Blue, Atto 425, BODIPY FL, Oregon Green 488, R6G, Yakima Yellow, Quasar 570, AquaPhluor 593, Texas Red, Atto 590, Quasar 670, Cy5.5, Cy7 and IR Dye 750;
  • the fluorescence quenching group modification at the 3' end of the probe can be the quenching group used in various types of probes such as TaqMan probe, MGB probe, molecular beacon, BQ probe, such as BHQ1, BHQ2, BHQ3, BBQ650, MGB, Dabcyl and DBQ1.
  • the qPCR amplification product of the positive quality control with a starting amount of 100 copies was quality checked using a 4200 bioanalyzer and quality check reagents (Agilent DNA 1000 Kit, catalog number: 5067-1504).
  • the validation results of the primer-probe combinations of OSR2, TSPYL5 and SDF4 are shown in Table 2-3. Select a primer-probe combination with high sensitivity, high specificity, and minimal primer dimer and nonspecific amplification, and determine the optimal primer-probe combination in combination with the fluorescence amplification curve. As shown in Table 2-3, the optimal primer-probe combination for OSR2 gene amplification is combination 1, the optimal primer-probe combination for TSPYL5 gene amplification is combination 5, and the optimal primer-probe combination for SDF4 gene amplification is combination 9.
  • test result is Undetermined, the value is set to 30.
  • the Ct value is the test value + 15.
  • HCC hepatocellular carcinoma
  • 31 cases were plasma collected in 2018 and 22 cases were plasma collected in 2020
  • 52 cases of liver cirrhosis of which 4 cases were plasma collected in 2019, 42 cases were plasma collected in 2020, and 6 cases were plasma collected in 2021
  • benign liver lesions 6 cases of focal nodular hyperplasia, 2 cases of hepatic adenoma, and 12 cases of hemangioma.
  • Each sample was 2mL of plasma.
  • Primer probe combination 9 for SDF4 amplification
  • the qPCR reaction was performed as follows
  • TSPYL5 When cirrhosis is used as a negative population, TSPYL5 is better than OSR2 in distinguishing liver cancer from cirrhosis, with AUCs of 0.933 and 0.906, respectively. After integrating TSPYL5 and OSR2, the AUC for distinguishing liver cancer from cirrhosis can reach 0.95. ( Figure 4)
  • OSR2 performed better than TSPYL5 in distinguishing liver cancer from healthy people, with AUCs of 0.935 and 0.923, respectively.
  • the AUC for distinguishing liver cancer from healthy people could reach 0.961 ( Figure 5).
  • the plasma volume was reduced to 1 mL and a second batch of plasma samples was used for verification, some of which were also tested for AFP.
  • liver cancer liver cancer
  • liver cirrhosis liver cirrhosis
  • the primer probe for OSR2 amplification was combination 1;
  • the primer probe for TSPYL5 amplification was combination 5;
  • the primer probe for SDF4 amplification was combination 9;
  • AFP is a self-developed reagent, and 100 ⁇ L of plasma was taken for AFP testing.
  • NA means AFP was not detected.
  • the AUC of OSR2 for distinguishing liver cancer from cirrhosis reached 0.914
  • the AUC of TSPYL5 for distinguishing liver cancer from cirrhosis reached 0.915
  • the AUC for distinguishing liver cancer from cirrhosis reached 0.951
  • the AUCs of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis were 0.906 and 0.933, respectively.
  • the AUC for distinguishing liver cancer from cirrhosis can reach 0.95.
  • the AUC of OSR2 is higher than that of 2 mL of plasma at the starting amount of 1 mL of plasma, and the AUC of TSPYL5 is lower than that of 2 mL of plasma, after the integration of the two genes, the AUC of 1 mL of plasma starting amount is equivalent to that of 2 mL of plasma starting amount.
  • liver cancer and cirrhosis samples with AFP data were selected for ROC curve analysis.
  • the AUC of AFP for distinguishing liver cancer from cirrhosis is 0.741
  • the AUC of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis is above 0.9, which is significantly higher than AFP.
  • the AUC of integrating OSR2 and TSPYL5 to distinguish liver cancer from cirrhosis is 0.96. It can be seen that the performance of this target gene combination in detecting liver cancer is significantly better than AFP.
  • the present disclosure provides a tumor marker for liver cancer detection - OSR2 and TSPYL5 genes, the methylation content of their DNA sequences can be used to distinguish liver cancer from cirrhosis, liver cancer from healthy people, liver cancer from benign liver lesions, and the AUC is all above 0.9; the present disclosure also provides a gene combination and a kit for liver cancer detection, namely, the OSR2, TSPYL5 and SDF4 gene combination, wherein OSR2 and TSPYL5 are target genes for detecting liver cancer, and SDF4 is an internal reference gene for determining the relative methylation content.
  • the AUC for distinguishing liver cancer from cirrhosis, liver cancer from healthy people, and liver cancer from benign liver lesions can reach above 0.95, which is significantly better than the performance of the existing tumor marker protein AFP used for liver cancer screening.

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Primary Health Care (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

A gene combination for liver cancer detection, and a related reagent and application, relating to the fields of biotechnology and disease detection. Provided are OSR2, TSPYL5 and SDF4 genes that can be used for liver cancer detection. By measuring the methylation content in the gene combination, whether a subject suffers from liver cancer and/or the risk of the subject suffering from liver cancer can be determined. The present invention has the advantages of high detection sensitivity and good specificity.

Description

一种用于肝癌检测的基因组合与相关试剂和应用A gene combination for liver cancer detection and related reagents and applications

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本公开要求于2023年03月23日提交中国专利局的申请号为CN2023102952484、名称为“一种用于肝癌检测的基因组合与相关试剂和应用”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。The present disclosure claims the priority of the Chinese patent application with application number CN2023102952484 filed with the Chinese Patent Office on March 23, 2023 and entitled “A gene combination and related reagents and applications for liver cancer detection”, the entire contents of which are incorporated by reference in the present disclosure.

技术领域Technical Field

本公开涉及生物技术与疾病检测领域,具体而言,涉及一种用于肝癌检测的基因组合与相关试剂和应用。The present disclosure relates to the field of biotechnology and disease detection, and in particular, to a gene combination and related reagents and applications for liver cancer detection.

背景技术Background Art

肝癌主要包括肝细胞癌(Hepatocellular Carcinoma,HCC)与肝内胆管癌(Intrahepatic Cholangiocarcinoma,ICC),其中70%-90%的肝癌是肝细胞癌。肝癌是全球常见的恶性肿瘤之一,其发生率正在逐年增加,预计到2025年,发病率将超过一百万例。Liver cancer mainly includes hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICC), of which 70%-90% are hepatocellular carcinoma. Liver cancer is one of the most common malignant tumors in the world, and its incidence is increasing year by year. It is estimated that by 2025, the incidence rate will exceed one million cases.

HCC的高危人群包括HBV感染、HCV感染、过度饮酒引起的酒精肝(Alcoholic Liver Disease,ALD)、糖尿病或肥胖相关代谢原因导致的非酒精性脂肪肝(Nonalcoholic Fatty Liver Disease,NAFLD)及其他原因引起的肝硬化以及有肝癌家族史的人群,尤其是年龄大于40岁的男性。HBV感染是HCC最主要的风险因素,约占肝癌患者人数的85%。由于自2002年起,新生儿HBV疫苗接种的普及与HCV治愈率的提高,HBV、HCV感染导致的HCC比例呈下降趋势。但由于生活水平和饮食结构变化,酒精肝和非酒精性脂肪肝导致的HCC病例呈上升趋势,占到普通人群的15%,成为继HBV感染之后的第二大风险因素。超过90%的HCC患者都有慢性肝病,任何病因导致的肝硬化是HCC的极高危因素。High-risk groups for HCC include those with HBV infection, HCV infection, alcoholic liver disease (ALD) caused by excessive drinking, nonalcoholic fatty liver disease (NAFLD) caused by diabetes or obesity-related metabolic causes, and cirrhosis caused by other causes, as well as people with a family history of liver cancer, especially men over 40 years old. HBV infection is the main risk factor for HCC, accounting for about 85% of the number of liver cancer patients. Since 2002, the proportion of HCC caused by HBV and HCV infection has been declining due to the popularization of HBV vaccination for newborns and the improvement of HCV cure rate. However, due to changes in living standards and dietary structure, HCC cases caused by alcoholic liver disease and nonalcoholic fatty liver disease have been on the rise, accounting for 15% of the general population, becoming the second largest risk factor after HBV infection. More than 90% of HCC patients have chronic liver disease, and cirrhosis caused by any cause is an extremely high risk factor for HCC.

HCC的预后取决于肿瘤分期。早期HCC的5年生存率超过70%,而中期HCC病人治疗后,只有70%的患者能存活1~1.5年。对肝硬化与HBV慢性感染者等高危人群进行定期筛查能够将死亡率降低37%。目前,早期肝癌患者只占到肝癌人数的10%~20%,大部分HCC病人检测出肝癌时已为中晚期,生存率大大降低。因此,在早期发现肝癌并及时进行手术治疗,能够使患者获得最佳的生存率。The prognosis of HCC depends on the tumor stage. The 5-year survival rate of early HCC is over 70%, while only 70% of patients with mid-stage HCC can survive 1 to 1.5 years after treatment. Regular screening of high-risk groups such as those with cirrhosis and chronic HBV infection can reduce the mortality rate by 37%. At present, patients with early liver cancer only account for 10% to 20% of the number of liver cancer patients. Most HCC patients are already in the middle and late stages when liver cancer is detected, and the survival rate is greatly reduced. Therefore, early detection of liver cancer and timely surgical treatment can enable patients to obtain the best survival rate.

对HBV或HCV感染,或任何原因引起的肝硬化高风险人群进行筛查,能够提高早期肝癌的检出率和肝癌治愈率。每半年一次腹部超声或结合血清甲胎蛋白(α-fetoprotein,AFP)是推荐的HCC的筛查手段。超声筛查具有简单、费用低、非侵入性、能实时观察、无放射辐射等优点。但不断增加的数据强调腹部超声依赖于操作者的水平,而且用于肥胖和非酒精性脂肪肝患者筛查的性能不佳。一项meta分析发现腹部超声检测早期HCC的整体灵敏度只有45%,虽然结合AFP检测早期HCC的灵敏度能提高到63%,但特异性降低。且大多数直径<1cm的小肝癌不容易被腹部超声检测出,非常难以诊断,因此针对直径<1cm的病灶要进行定期随访,如3个月随访一次。Screening of high-risk groups for HBV or HCV infection, or cirrhosis caused by any reason, can improve the detection rate of early liver cancer and the cure rate of liver cancer. Abdominal ultrasound once every six months or combined with serum alpha-fetoprotein (AFP) is the recommended screening method for HCC. Ultrasound screening has the advantages of simplicity, low cost, non-invasiveness, real-time observation, and no radiation. However, increasing data emphasize that abdominal ultrasound depends on the level of the operator and has poor performance in screening patients with obesity and non-alcoholic fatty liver disease. A meta-analysis found that the overall sensitivity of abdominal ultrasound for detecting early HCC was only 45%. Although the sensitivity of combining AFP to detect early HCC can be increased to 63%, the specificity is reduced. In addition, most small liver cancers with a diameter of less than 1 cm are not easily detected by abdominal ultrasound and are very difficult to diagnose. Therefore, regular follow-up should be conducted for lesions with a diameter of less than 1 cm, such as once every 3 months.

血清AFP是当前诊断肝癌和疗效监测常用且重要的指标。血清AFP≥400ng/ml,在排除妊娠、慢性或活动性肝病、生殖腺胚胎原性肿瘤以及消化道肿瘤后,高度提示肝癌。当AFP阈值在20ng/mL时,AFP检测HCC的灵敏度和特异性分别在41%~65%和80%~90%;然而,将近50%的HCC患者的AFP水平低于20ng/mL,因此AFP不能单独用于HCC筛查。总AFP并不是肝癌特异性的,有10%~42%的AFP水平升高也在除HCC之外的其他肝脏疾病中出现,如病毒性肝炎。Serum AFP is currently a common and important indicator for diagnosing liver cancer and monitoring therapeutic effects. Serum AFP ≥ 400ng/ml is highly suggestive of liver cancer after excluding pregnancy, chronic or active liver disease, gonadal embryonal tumors, and digestive tract tumors. When the AFP threshold is 20ng/mL, the sensitivity and specificity of AFP for detecting HCC are 41% to 65% and 80% to 90%, respectively; however, nearly 50% of HCC patients have AFP levels below 20ng/mL, so AFP cannot be used alone for HCC screening. Total AFP is not specific for liver cancer, and 10% to 42% of elevated AFP levels also occur in other liver diseases besides HCC, such as viral hepatitis.

综上,由于AFP和超声筛查的灵敏度和特异性均不高,且腹部超声依赖于操作者的水平,临床上大部分肝癌发现时均处于晚期,大大降低了患者的五年生存率。因此,亟待开发一款灵敏度高、特异性高的针对肝癌高风险人群的早筛、早诊产品。理想的检测产品应当高度可重复,不依赖于操作者的经验(如腹部超声),高准确度,在不同的临床场景容易实施,而液体活检正好符合这些标准。In summary, since the sensitivity and specificity of AFP and ultrasound screening are not high, and abdominal ultrasound depends on the operator's level, most liver cancers are discovered in the late stage clinically, which greatly reduces the patient's five-year survival rate. Therefore, it is urgent to develop a product with high sensitivity and specificity for early screening and early diagnosis of people at high risk of liver cancer. The ideal detection product should be highly repeatable, independent of the operator's experience (such as abdominal ultrasound), highly accurate, and easy to implement in different clinical scenarios, and liquid biopsy meets these standards.

发明内容Summary of the invention

本公开的目的在于提供一种用于肝癌检测的基因组合与相关试剂和应用。The purpose of the present disclosure is to provide a gene combination and related reagents and applications for liver cancer detection.

本公开是这样实现的:The present disclosure is implemented as follows:

本公开提供了试剂或试剂组合在制备用于具有以下任意一种或多种用途的产品中的应用,所述用途包括(1)肝癌的早筛或早诊、(2)肝癌的诊断或辅助诊断、(3)肝癌预后风险的评估或辅助评估和(4)肝癌的动态监测;所述试剂或试剂组合包括:检测OSR2基因的甲基化含量的第一试剂和检测TSPYL5基因的甲基化含量的第二试剂。The present disclosure provides the use of a reagent or a reagent combination in preparing a product for any one or more of the following uses, including (1) early screening or early diagnosis of liver cancer, (2) diagnosis or auxiliary diagnosis of liver cancer, (3) assessment or auxiliary assessment of the prognostic risk of liver cancer and (4) dynamic monitoring of liver cancer; the reagent or reagent combination comprises: a first reagent for detecting the methylation content of the OSR2 gene and a second reagent for detecting the methylation content of the TSPYL5 gene.

本公开提供了一种试剂或试剂组合,其包括前述实施例所述的试剂或试剂组合。The present disclosure provides a reagent or a reagent combination, which includes the reagent or the reagent combination described in the aforementioned embodiments.

本公开提供了一种试剂盒,其包括前述任意实施例所述的试剂或试剂组合。The present disclosure provides a kit, which includes the reagent or reagent combination described in any of the aforementioned embodiments.

本公开提供了一种靶基因甲基化含量的检测方法,其包括:采用前述实施例所述的试剂或试剂组合对样本进行靶基因的甲基化含量的检测;所述靶基因包括OSR2和TSPYL5基因。The present disclosure provides a method for detecting the methylation content of a target gene, which comprises: using the reagent or reagent combination described in the aforementioned embodiment to detect the methylation content of the target gene in a sample; the target gene comprises OSR2 and TSPYL5 genes.

本公开提供了一种肝癌预测模型的训练方法,其包括:获取前述实施例所述试剂或试剂组合对训练样本中靶基因的甲基化含量进行检测并为其标注结果;其中,所述靶基因为如前述实施例所述的靶基因,所述标注结果包括:代表样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的标签;将待测样本的靶基因的甲基化含量输入预先构建的预测模型中,获得预测结果;所述预测模型为能够根据所述靶基因的甲基化含量判断样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的机器学习模型。The present disclosure provides a method for training a liver cancer prediction model, which comprises: obtaining the reagent or reagent combination described in the aforementioned embodiment to detect the methylation content of the target gene in the training sample and annotating the result; wherein the target gene is the target gene as described in the aforementioned embodiment, and the annotated result comprises: a label representing any one of the liver cancer risk, prognosis risk and disease progression of the sample; the methylation content of the target gene of the sample to be tested is input into a pre-constructed prediction model to obtain a prediction result; the prediction model is a machine learning model that can judge any one of the liver cancer risk, prognosis risk and disease progression of the sample according to the methylation content of the target gene.

第六方面,本公开实施例提供了一种电子设备,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现前述实施例所述的肝癌预测模型的训练方法或肝癌的预测方法;所述预测方法包括:获取待测样本中靶基因的甲基化含量;所述靶基因为前述实施例所述的靶基因;将待测样本靶基因的甲基化含量输入前述实施例所述的训练方法训练好的模型中,获得待测样本的预测结果。In a sixth aspect, an embodiment of the present disclosure provides an electronic device comprising: a processor and a memory; the memory is used to store a program, and when the program is executed by the processor, the processor implements the training method of the liver cancer prediction model or the liver cancer prediction method described in the aforementioned embodiment; the prediction method comprises: obtaining the methylation content of the target gene in the sample to be tested; the target gene is the target gene described in the aforementioned embodiment; the methylation content of the target gene of the sample to be tested is input into the model trained by the training method described in the aforementioned embodiment to obtain the prediction result of the sample to be tested.

本公开具有以下有益效果:The present disclosure has the following beneficial effects:

针对目前超声与AFP检测对早期肝癌检测灵敏度、特异性低的问题,本公开提供能用于肝癌检测的基因组合OSR2和TSPYL5基因,通过检测此基因组合的甲基化含量,可判断受检者是否患有肝癌、患有肝癌的风险高低和/或肝癌的疾病进程,具有检测灵敏度高,特异性好的优势。In response to the problem of low sensitivity and specificity of current ultrasound and AFP tests for early liver cancer detection, the present disclosure provides a gene combination of OSR2 and TSPYL5 genes that can be used for liver cancer detection. By detecting the methylation content of this gene combination, it can be determined whether the subject has liver cancer, the risk of liver cancer and/or the disease progression of liver cancer. It has the advantages of high detection sensitivity and good specificity.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本公开实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本公开的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings required for use in the embodiments will be briefly introduced below. It should be understood that the following drawings only show certain embodiments of the present disclosure and therefore should not be regarded as limiting the scope. For ordinary technicians in this field, other related drawings can be obtained based on these drawings without paying creative work.

图1为MethyLight与HeavyMethyl检测的原理对比;Figure 1 is a comparison of the principles of MethyLight and HeavyMethyl detection;

图2为21个候选基因区分肝癌与肝硬化的ROC曲线;Figure 2 shows the ROC curve of 21 candidate genes for distinguishing liver cancer from liver cirrhosis;

图3为OSR2与TSPYL5区分肝癌与所有阴性样本的ROC曲线;Figure 3 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from all negative samples;

图4为OSR2与TSPYL5区分肝癌与肝硬化样本的ROC曲线;Figure 4 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis samples;

图5为OSR2与TSPYL5区分肝癌与健康人样本的ROC曲线;Figure 5 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from healthy samples;

图6为OSR2与TSPYL5区分肝癌与肝硬化的ROC曲线;Figure 6 shows the ROC curves of OSR2 and TSPYL5 for distinguishing liver cancer from liver cirrhosis;

图7为OSR2、TSPYL5与AFP区分肝癌与肝硬化的ROC曲线。Figure 7 shows the ROC curves of OSR2, TSPYL5 and AFP for distinguishing liver cancer from cirrhosis.

具体实施方式DETAILED DESCRIPTION

为使本公开实施例的目的、技术方案和优点更加清楚,下面将对本公开实施例中的技术方案进行清楚、完整地描述。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市售购买获得的常规产品。In order to make the purpose, technical scheme and advantages of the embodiments of the present disclosure clearer, the technical scheme in the embodiments of the present disclosure will be described clearly and completely below. If the specific conditions are not specified in the embodiments, they are carried out according to conventional conditions or conditions recommended by the manufacturer. If the manufacturer of the reagents or instruments used is not specified, they are all conventional products that can be purchased commercially.

甲基化(Methylation)是人基因组DNA常见的一种表观遗传修饰,是指在DNA甲基化转移酶的作用下,将甲基(-CH3)转移到DNA胞嘧啶碱基C的第5号碳原子上。DNA甲基化修饰参与细胞分化、基因组稳定性、X染色体失活和基因印记等多种细胞生物学过程。在哺乳动物体内常见的甲基转移酶有DNMT3a、DNMT3b和DNMT1,其中DNMT3a和DNMT3b实现DNA的从头甲基化(De novo Methylation),而DNMT1负责复制DNA模板序列的甲基化模式。Methylation is a common epigenetic modification of human genomic DNA. It refers to the transfer of a methyl group (-CH 3 ) to the 5th carbon atom of the DNA cytosine base C under the action of DNA methyltransferase. DNA methylation modification is involved in a variety of cell biological processes such as cell differentiation, genome stability, X chromosome inactivation and gene imprinting. Common methyltransferases in mammals are DNMT3a, DNMT3b and DNMT1. DNMT3a and DNMT3b realize de novo methylation of DNA, while DNMT1 is responsible for replicating the methylation pattern of the DNA template sequence.

DNA的甲基化主要发生CpG二核苷酸的碱基C上。约70%的人类基因上游的启动子区上有多个密集的CpG二核苷酸,形成CpG岛(CpG Island)。CpG岛的甲基化导致基因表达沉默。除了基因启动子区CpG岛的甲基化,基因的第一号外显子与启动子区类似,也存在甲基化,而导致基因表达的沉默。基因的DNA甲基化在肿瘤发生发展中亦发挥重要作用。抑癌基因启动子区的高甲基化抑制基因的表达,使抑癌基因功能丧失,导致肿瘤细胞的增殖失控和侵袭转移,最终导致癌症的发生发展。 DNA methylation mainly occurs on the base C of CpG dinucleotides. About 70% of human genes have multiple densely packed CpG dinucleotides in the promoter region upstream, forming a CpG island. Methylation of the CpG island leads to silencing of gene expression. In addition to the methylation of the CpG island in the promoter region of the gene, the first exon of the gene is similar to the promoter region and is also methylated, leading to silencing of gene expression. Gene DNA methylation also plays an important role in the occurrence and development of tumors. High methylation in the promoter region of tumor suppressor genes inhibits gene expression, causing the loss of tumor suppressor gene function, leading to uncontrolled proliferation and invasion and metastasis of tumor cells, and ultimately leading to the occurrence and development of cancer.

有研究表明,甲基化修饰发生于肿瘤早期,早于基因突变的发生。CpG二核苷酸的甲基化常常成簇出现,即呈连锁状态,适合肿瘤的早筛、早诊。另外,由于CpG岛与控制基因的表达相关,CpG岛的甲基化可呈现出组织特异性模式,因此,可以用DNA甲基化进行癌症的组织溯源。Studies have shown that methylation modification occurs in the early stages of tumors, earlier than gene mutations. Methylation of CpG dinucleotides often appears in clusters, that is, in a linked state, which is suitable for early screening and diagnosis of tumors. In addition, since CpG islands are related to the expression of controlled genes, the methylation of CpG islands can present a tissue-specific pattern. Therefore, DNA methylation can be used to trace the tissue origin of cancer.

目前,DNA甲基化的检测方法可分为化学转化法、酶转化法、酶切法和甲基化DNA免疫沉淀法。化学转化法即通过重亚硫酸氢盐将甲基化胞嘧啶(5-mC)转化成尿嘧啶U,然后进行PCR检测或测序,此类方法有WGBS(Whole-Genome Bisulfite Sequencing)、BSAS(Bisulfite Amplicon Sequencing)、MSP(Methylation Specific PCR)、BSP(Bisulfite Sequencing PCR)、QMSP(Quantitative Methylation Specific PCR)。酶转化法即通过TET酶将5-mC转化为5-caC,再用APOBEC酶将5-caC转化为尿嘧啶U或者通过吡啶硼烷将5-caC转化为DHU,如NEB公司的NEBNext Enzymatic Methyl-seq Kit和Base Genomics公司的TAPS方法。酶切法即通过甲基化敏感或不敏感的内切酶切割甲基化或非甲基化位点,通过PCR或测序进行DNA模板的选择性分析,如简化基因组重亚硫酸盐测序RRBS。甲基化DNA免疫沉淀法(Methylated DNA Immunoprecipitation,MeDIP),通过抗5-mC的抗体分离出甲基化的DNA片段,然后进行相关检测分析。酶转化法对DNA的损伤小,转化后的DNA回收率高,片段较为完整,但转化效率不稳定。酶切法对DNA的损伤小,酶切效率高,但需要对一种或一类DNA序列寻找特异性的内切酶,不具有普适性。甲基化DNA免疫沉淀法的捕获效率受体系影响较大。重亚硫酸氢盐转化虽然对DNA的损伤大,转化后的DNA呈片段化,转化回收率低,但转化效率高,转化率稳定,可以识别单碱基甲基化,是DNA甲基化检测的金标准,在科研与临床上广泛应用。At present, the detection methods of DNA methylation can be divided into chemical conversion method, enzyme conversion method, enzyme digestion method and methylated DNA immunoprecipitation method. The chemical conversion method is to convert methylated cytosine (5-mC) into uracil U by bisulfite, and then perform PCR detection or sequencing. Such methods include WGBS (Whole-Genome Bisulfite Sequencing), BSAS (Bisulfite Amplicon Sequencing), MSP (Methylation Specific PCR), BSP (Bisulfite Sequencing PCR), QMSP (Quantitative Methylation Specific PCR). The enzymatic conversion method is to convert 5-mC into 5-caC by TET enzyme, and then convert 5-caC into uracil U by APOBEC enzyme or convert 5-caC into DHU by pyridine borane, such as NEB's NEBNext Enzymatic Methyl-seq Kit and Base Genomics' TAPS method. Enzyme digestion is to cut methylated or non-methylated sites by methylation-sensitive or insensitive endonucleases, and selectively analyze DNA templates by PCR or sequencing, such as simplified genomic bisulfite sequencing RRBS. Methylated DNA Immunoprecipitation (MeDIP) is to separate methylated DNA fragments by anti-5-mC antibodies, and then perform relevant detection and analysis. Enzyme conversion method has little damage to DNA, high recovery rate of converted DNA, and relatively complete fragments, but the conversion efficiency is unstable. Enzyme digestion method has little damage to DNA and high enzyme digestion efficiency, but it needs to find specific endonucleases for one or a class of DNA sequences, which is not universal. The capture efficiency of methylated DNA immunoprecipitation method is greatly affected by the system. Although bisulfite conversion has great damage to DNA, the converted DNA is fragmented, and the conversion recovery rate is low, it has high conversion efficiency and stable conversion rate. It can identify single-base methylation and is the gold standard for DNA methylation detection. It is widely used in scientific research and clinical practice.

基于重亚硫酸氢盐转化的荧光定量PCR按照检测甲基化的原理可分为MethyLight和HeavyMethyl两种(见附图1)。HeavyMethyl的引物设计不区分甲基化与非甲基化序列,而是通过设计一条或两条甲基化特异性的寡核苷酸Blocker,阻断引物延伸非甲基化模板,从而选择性扩增甲基化模板,同时设计一条甲基化特异性的探针,通过寡核苷酸Blocker和甲基化特异性探针,实现甲基化模板的选择性扩增与检测。经过对Blocker、探针的精心设计、检测体系的优化,HeavyMethyl能够检测到低至30~60pg的甲基化DNA,而非甲基化DNA不被扩增。Fluorescence quantitative PCR based on bisulfite conversion can be divided into MethyLight and HeavyMethyl according to the principle of detecting methylation (see Figure 1). The primer design of HeavyMethyl does not distinguish between methylated and non-methylated sequences, but blocks the primer extension of the non-methylated template by designing one or two methylation-specific oligonucleotide blockers, thereby selectively amplifying the methylated template, and at the same time designs a methylation-specific probe, and realizes the selective amplification and detection of the methylated template through the oligonucleotide blocker and the methylation-specific probe. After careful design of blockers and probes and optimization of the detection system, HeavyMethyl can detect methylated DNA as low as 30-60pg, while non-methylated DNA is not amplified.

MethyLight针对重亚硫酸氢盐转化后的DNA序列,设计甲基化特异性的引物与探针,使得只有甲基化的模板进行扩增与检测,而非甲基化的模板不会被进行扩增与检测。MethyLight被认为灵敏度高,广泛用于DNA的甲基化检测。对于大部分基因的甲基化序列,能够设计出灵敏度和特异性均高的引物和探针,然而对于个别基因的甲基化序列,由于CpG二核苷酸不够密集或者甲基化的C与非甲基化C分布不均匀等原因,难以设计出合适的引物探针,从而导致MethyLight检测体系的灵敏度或特异性不足,出现假阳性或假阴性结果。因此,在用MethyLight方法检测基因甲基化含量时,需要对基因序列进行充分分析,选择适合设计引物探针的序列,根据适合设计引物探针的序列特性,结合MethyLight引物探针设计要素,精心设计、筛选与充分验证可行的引物探针组合。MethyLight designs methylation-specific primers and probes for DNA sequences after bisulfite conversion, so that only methylated templates are amplified and detected, while non-methylated templates are not amplified and detected. MethyLight is considered to be highly sensitive and is widely used for DNA methylation detection. For the methylation sequences of most genes, primers and probes with high sensitivity and specificity can be designed. However, for the methylation sequences of individual genes, it is difficult to design suitable primers and probes due to insufficient density of CpG dinucleotides or uneven distribution of methylated C and non-methylated C, resulting in insufficient sensitivity or specificity of the MethyLight detection system, and false positive or false negative results. Therefore, when using the MethyLight method to detect the methylation content of genes, it is necessary to fully analyze the gene sequence, select sequences suitable for designing primer probes, and carefully design, screen and fully verify feasible primer probe combinations based on the sequence characteristics suitable for designing primer probes and combined with the design elements of MethyLight primer probes.

尽管基于重亚硫酸氢盐转化的荧光定量PCR有上述优势,但目前荧光定量PCR最多只有5个荧光通道,因此限制了基因甲基化靶点的检测个数。因此,针对特定癌种的检测,首选要进行甲基化Marker(靶基因)的筛选,筛选出灵敏度与特异性高的甲基化Marker,然后再进行荧光定量PCR检测。由此可见,筛选出性能好的甲基化Marker尤为关键。Although fluorescent quantitative PCR based on bisulfite conversion has the above advantages, it currently has only 5 fluorescent channels at most, thus limiting the number of gene methylation targets that can be detected. Therefore, for the detection of specific cancer types, the first step is to screen methylation markers (target genes), screen out methylation markers with high sensitivity and specificity, and then perform fluorescent quantitative PCR detection. It can be seen that it is particularly critical to screen out methylation markers with good performance.

本公开提供一种用于肝癌检测的基因组合,以及基于此基因组合的一组用于肝癌检测的试剂组合及试剂盒,通过检测靶基因组合的甲基化含量,可判断受检者是否患有肝癌或患有肝癌的风险高低。The present disclosure provides a gene combination for liver cancer detection, and a set of reagent combinations and kits for liver cancer detection based on the gene combination. By detecting the methylation content of the target gene combination, it can be determined whether the subject has liver cancer or the risk of liver cancer.

具体的技术方案Specific technical solutions

一方面,本公开提供了试剂或试剂组合在制备用于具有以下任意一种或多种用途的产品中的应用,所述用途包括(1)肝癌的早筛或早诊、(2)肝癌的诊断或辅助诊断、(3)肝癌预后风险的评估或辅助评估和(4)肝癌的动态监测;所述试剂或试剂组合包括:检测OSR2基因的甲基化含量的第一试剂和检测TSPYL5基因的甲基化含量的第二试剂。In one aspect, the present disclosure provides the use of a reagent or a reagent combination in preparing a product for any one or more of the following uses, including (1) early screening or early diagnosis of liver cancer, (2) diagnosis or auxiliary diagnosis of liver cancer, (3) assessment or auxiliary assessment of prognostic risk of liver cancer, and (4) dynamic monitoring of liver cancer; the reagent or reagent combination comprises: a first reagent for detecting the methylation content of the OSR2 gene and a second reagent for detecting the methylation content of the TSPYL5 gene.

OSR2是一种蛋白质编码基因,编码奇数跳跃(odd-skipped)相关转录因子2。OSR2 is a protein-coding gene that encodes odd-skipped-related transcription factor 2.

在一些实施例中,所述OSR2基因的GeneID:116039。In some embodiments, the OSR2 gene has GeneID: 116039.

在一些实施例中,所述OSR2基因的检测区域选自:OSR2基因(外显子区和内含子区)与5`启动子区的全长或部分序列。In some embodiments, the detection region of the OSR2 gene is selected from: the full-length or partial sequence of the OSR2 gene (exon region and intron region) and the 5' promoter region.

在一些实施例中,所述OSR2基因的检测区域包括:核苷酸序列如SEQIDNO.1所示的区域1。区域1为可选的检测区域,其序列中CpG二核苷酸中的C在肝癌中呈高度甲基化连锁状态,而在肝硬化、肝脏良性病变与健康人中甲基化水平低,二者有显著差异。染色体坐标为chr8:99,955,670-99,961,095(基因组版本号hg19),DNA序列如下(备注:加粗CG或所有CG中的C为甲基化的C):

In some embodiments, the detection region of the OSR2 gene includes: region 1 whose nucleotide sequence is shown in SEQ ID NO.1. Region 1 is an optional detection region, and the C in the CpG dinucleotide in its sequence is highly methylated in liver cancer, while the methylation level is low in cirrhosis, benign liver lesions and healthy people, and there is a significant difference between the two. The chromosome coordinates are chr8:99,955,670-99,961,095 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):

在一些实施例中,所述第一试剂包括:引物对、探针和芯片中的任意一种或多种。In some embodiments, the first reagent includes: any one or more of a primer pair, a probe, and a chip.

在一些实施例中,所述第一试剂包括:核苷酸序列如SEQIDNO.4~5所示的引物对和/或核苷酸序列如SEQIDNO.6所示的探针。In some embodiments, the first reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO. 4 to 5 and/or a probe with a nucleotide sequence as shown in SEQ ID NO. 6.

在一些实施例中,TSPYL5是一种蛋白编码基因,预测能激活染色质结合活性和组蛋白结合活性,涉及多个生理过程,包括细胞对伽马辐射的反应,蛋白激酶B信号的正调控和蛋白质泛素化的正调控。In some embodiments, TSPYL5 is a protein encoding gene predicted to activate chromatin binding activity and histone binding activity, and is involved in multiple physiological processes including cellular response to gamma radiation, positive regulation of protein kinase B signaling, and positive regulation of protein ubiquitination.

在一些实施例中,所述TSPYL5基因的GeneID:85453。In some embodiments, the TSPYL5 gene has GeneID: 85453.

在一些实施例中,所述TSPYL5基因的检测区域选自:TSPYL5基因(外显子区和内含子区)与5`启动子区的全长或部分序列。In some embodiments, the detection region of the TSPYL5 gene is selected from: the full-length or partial sequence of the TSPYL5 gene (exon region and intron region) and the 5' promoter region.

在一些实施例中,所述TSPYL5基因的检测区域包括:核苷酸序列如SEQIDNO.2所示的区域2。区域2为可选的检测区域,其序列中CpG二核苷酸中的C在肝癌中呈高度甲基化连锁状态,而在肝硬化、肝脏良性病变与健康人中甲基化水平低,二者有显著差异。染色体坐标为chr8:98,289,281-98,290,380(基因组版本号hg19),DNA序列如下(备注:加粗CG或所有CG中的C为甲基化的C):
In some embodiments, the detection region of the TSPYL5 gene includes: region 2 whose nucleotide sequence is shown as SEQ ID NO.2. Region 2 is an optional detection region, and the C in the CpG dinucleotide in its sequence is highly methylated in liver cancer, while the methylation level is low in cirrhosis, benign liver lesions and healthy people, and there is a significant difference between the two. The chromosome coordinates are chr8:98,289,281-98,290,380 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):

在一些实施例中,所述第二试剂包括:引物对、探针和芯片中的任意一种或多种。In some embodiments, the second reagent includes any one or more of a primer pair, a probe, and a chip.

在一些实施例中,所述第二试剂包括:核苷酸序列如SEQIDNO.16~17所示的引物对和/或核苷酸序列如SEQIDNO.18所示的探针。In some embodiments, the second reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO.16-17 and/or a probe with a nucleotide sequence as shown in SEQ ID NO.18.

在一些实施例中,所述试剂或试剂组合还包括:用于检测C1QL4基因、CR1L基因、CYP26C1基因、FOXG1基因、GHSR基因、HIST1H1D基因、IRX5基因、KCNG3基因、LHX2基因、MEX3A基因、NEFM基因、OTX1基因、OXTR基因、PCDHGB6基因、PCDHGB7基因、PITX1基因、PRLHR基因、PRRX1基因、ZIC4基因中的任意一种或多种基因的甲基化含量的试剂或试剂组合。该19个基因的甲基化含量的检测区域选自对应基因与其启动子区域的全长或部分区域,在限定了检测基因的情况下,本领域技术人员可结合常规技术手段获得对应的检测试剂。将19个基因与OSR2基因、TSPYL5基因作为基因组合,用于肝癌检测,具有很好的检测灵敏度和特异性,AUC可达到0.92。In some embodiments, the reagent or reagent combination also includes: a reagent or reagent combination for detecting the methylation content of any one or more genes in C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene. The detection region of the methylation content of the 19 genes is selected from the full length or partial region of the corresponding gene and its promoter region. When the detection gene is limited, those skilled in the art can obtain the corresponding detection reagent by combining conventional technical means. The 19 genes are used as a gene combination with the OSR2 gene and the TSPYL5 gene for liver cancer detection, which has good detection sensitivity and specificity, and the AUC can reach 0.92.

在一些实施例中,所述试剂或试剂组合还包括:检测内参基因的含量的第三试剂。In some embodiments, the reagent or reagent combination further includes: a third reagent for detecting the content of an internal reference gene.

在一些实施例中,所述内参基因包括SDF4基因。SDF4是一种蛋白质编码基因,编码一种基质细胞衍生因子,其为CREC(网腔钙结合蛋白)蛋白家族的成员之一,该编码蛋白质包含六个EF-hand基序和钙结合基序。这种蛋白质定位在高尔基体腔,可能参与调节钙依赖的细胞活动。In some embodiments, the internal reference gene includes the SDF4 gene. SDF4 is a protein-coding gene that encodes a stromal cell-derived factor, which is a member of the CREC (reticulum calcium-binding protein) protein family, and the encoded protein contains six EF-hand motifs and a calcium-binding motif. This protein is located in the Golgi cavity and may be involved in regulating calcium-dependent cell activities.

在一些实施例中,所述SDF4基因的GeneID:51150。In some embodiments, the SDF4 gene has GeneID: 51150.

在一些实施例中,所述SDF4基因的检测区域包括:SDF4基因的全长或部分序列。In some embodiments, the detection region of the SDF4 gene includes: the full-length or partial sequence of the SDF4 gene.

在一些实施例中,所述SDF4基因的检测区域包括:核苷酸序列如SEQIDNO.3所示的区域3。区域3为可选的检测区域,该区段序列的GC含量=64.7%,CpG二核苷酸密度=5.88%,重亚硫酸氢盐转化后的GC含量=32.5%。染色体坐标为chr1:1,159,192-1,159,483(基因组版本号hg19),DNA序列如下(备注:加粗CG或所有CG中的C为甲基化的C):
In some embodiments, the detection region of the SDF4 gene includes: region 3 whose nucleotide sequence is shown as SEQ ID NO.3. Region 3 is an optional detection region, and the GC content of the segment sequence is 64.7%, the CpG dinucleotide density is 5.88%, and the GC content after bisulfite conversion is 32.5%. The chromosome coordinates are chr1: 1,159,192-1,159,483 (genome version number hg19), and the DNA sequence is as follows (Note: C in bold CG or all CG is methylated C):

在一些实施例中,所述第三试剂包括:引物对、探针和芯片中的任意一种或多种。In some embodiments, the third reagent includes: any one or more of a primer pair, a probe, and a chip.

在一些实施例中,所述第三试剂包括:核苷酸序列如SEQIDNO.28~29所示的引物对和/或核苷酸序列如SEQIDNO.30所示的探针。In some embodiments, the third reagent includes: a primer pair with a nucleotide sequence as shown in SEQ ID NO. 28 to 29 and/or a probe with a nucleotide sequence as shown in SEQ ID NO. 30.

在本公开实施例中,OSR2与TSPYL5为肝癌高度相关的肿瘤标志物,检测靶点为其甲基化的DNA序列,SDF4为内参基因,用于质控DNA样本合格并作为靶基因甲基化含量的参照。具体为:通过检测样本中靶基因OSR2、TSPYL5的甲基化含量,分别以OSR2、TSPYL5扩增的Ct值减去内参基因SDF4扩增的Ct值的差值(△Ct)作为靶基因甲基化相对含量的高低,以△Ct值进行建模后,判断受检者是否患有肝癌或患有肝癌的风险高低。In the disclosed embodiment, OSR2 and TSPYL5 are tumor markers highly correlated with liver cancer, and the detection target is their methylated DNA sequence, and SDF4 is an internal reference gene used to control the quality of DNA samples and as a reference for the methylation content of the target gene. Specifically, by detecting the methylation content of the target genes OSR2 and TSPYL5 in the sample, the difference (△Ct) of the Ct value amplified by OSR2 and TSPYL5 minus the Ct value amplified by the internal reference gene SDF4 is used as the relative methylation content of the target gene, and after modeling with the △Ct value, it is determined whether the subject has liver cancer or the risk of liver cancer.

在一些实施例中,所述产品包括:试剂和试剂盒中的任意一种或多种。In some embodiments, the product includes any one or more of a reagent and a kit.

另一方面,本公开实施例提供了一种试剂或试剂组合,其包含前述任意实施例所述的试剂或试剂组合。On the other hand, embodiments of the present disclosure provide a reagent or a reagent combination, which comprises the reagent or the reagent combination described in any of the preceding embodiments.

另一方面,本公开实施例提供了一种试剂盒,其包括前述任意实施例所述的试剂或试剂组合。On the other hand, an embodiment of the present disclosure provides a kit, which includes the reagent or reagent combination described in any of the preceding embodiments.

所述试剂或试剂组合以及所述试剂盒具有以下任意一种或多种用途:(1)肝癌的早筛或早诊;(2)肝癌的诊断或辅助诊断、(3)肝癌预后风险的预评估或辅助评估和(4)肝癌的动态监测。The reagent or reagent combination and the kit have any one or more of the following uses: (1) early screening or early diagnosis of liver cancer; (2) diagnosis or auxiliary diagnosis of liver cancer, (3) preliminary assessment or auxiliary assessment of the prognostic risk of liver cancer and (4) dynamic monitoring of liver cancer.

另一方面,本公开实施例提供了一种靶基因甲基化含量的检测方法,其包括:采用前述任意实施例所述的试剂或试剂组合对样本进行靶基因的甲基化含量的检测;所述靶基因包括OSR2基因和TSPYL5基因。On the other hand, an embodiment of the present disclosure provides a method for detecting the methylation content of a target gene, comprising: using the reagent or reagent combination described in any of the preceding embodiments to detect the methylation content of the target gene in a sample; the target genes include the OSR2 gene and the TSPYL5 gene.

在一些实施例中,所述应用不以疾病的诊断或治疗为直接目的。In some embodiments, the use is not directly aimed at diagnosis or treatment of disease.

不以疾病的诊断或治疗为直接目的的情况有很多,比如待测样本为含生物样本的环境样本或人工制作的标准品时,检测的直接目的并非是以疾病的诊断或治疗为直接目的。There are many situations where the direct purpose is not to diagnose or treat a disease. For example, when the sample to be tested is an environmental sample containing a biological sample or an artificially produced standard, the direct purpose of the test is not to diagnose or treat a disease.

在一些实施例中,所述方法还包括对内参基因进行检测。In some embodiments, the method further comprises detecting an internal reference gene.

在一些实施例中,所述靶基因还包括:用于检测C1QL4基因、CR1L基因、CYP26C1基因、FOXG1基因、GHSR基因、HIST1H1D基因、IRX5基因、KCNG3基因、LHX2基因、MEX3A基因、NEFM基因、OTX1基因、OXTR基因、PCDHGB6基因、PCDHGB7基因、PITX1基因、PRLHR基因、PRRX1基因、ZIC4基因中的任意一种或多种。 In some embodiments, the target gene also includes: for detecting any one or more of C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene.

在一些实施例中,所述内参基因包括:SDF4基因。In some embodiments, the internal reference gene includes: SDF4 gene.

在一些实施例中,所述检测方法的样本包括:DNA样本或含DNA的环境样本。DNA样本的类型可以为血浆cfDNA、血清cfDNA、组织DNA、细胞DNA和外泌体DNA中的任意一种。In some embodiments, the sample of the detection method includes: a DNA sample or an environmental sample containing DNA. The type of the DNA sample can be any one of plasma cfDNA, serum cfDNA, tissue DNA, cell DNA and exosome DNA.

在一些实施例中,所述靶基因的甲基化含量的检测包括:采用所述试剂或试剂组合对经重亚硫酸氢盐转化后的样本进行qPCR扩增;In some embodiments, the detection of the methylation content of the target gene comprises: using the reagent or reagent combination to perform qPCR amplification on the sample after bisulfite conversion;

在一些实施例中,当所述试剂或试剂组合包含第一试剂~第三试剂时,所述qPCR的反应体系包括:MgCl2的终浓度为4~8mM,dNTP Mix的终浓度为200~800μM,Taq酶的终浓度为1.0~10U/反应,用于OSR2基因检测的引物对的终浓度为0.05~1.0μM、探针终浓度为50~500nM;用于TSPYL5基因检测的引物对的终浓度为0.1~1.0μM、探针的终浓度为50~500nM;用于SDF4基因检测的引物对的终浓度为0.1~1.0μM、探针的终浓度为50~500nM。In some embodiments, when the reagent or reagent combination comprises the first reagent to the third reagent, the qPCR reaction system comprises: the final concentration of MgCl2 is 4-8mM, the final concentration of dNTP Mix is 200-800μM, the final concentration of Taq enzyme is 1.0-10U/reaction, the final concentration of the primer pair for OSR2 gene detection is 0.05-1.0μM, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for TSPYL5 gene detection is 0.1-1.0μM, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for SDF4 gene detection is 0.1-1.0μM, and the final concentration of the probe is 50-500nM.

在一些实施例中,所述qPCR的反应体系包括:MgCl2的终浓度为6mM,dNTP Mix的终浓度为500μM,Taq酶的终浓度为3U/反应,用于OSR2基因检测的引物对的终浓度为0.1μM、探针的终浓度为200nM,用于TSPYL5基因检测的引物对终浓度为0.2μM、探针的终浓度为200nM,用于SDF4基因检测的引物对的终浓度为0.2μM、探针的终浓度为100nM,反应总体积25μL。In some embodiments, the qPCR reaction system includes: a final concentration of MgCl2 of 6 mM, a final concentration of dNTP Mix of 500 μM, a final concentration of Taq enzyme of 3 U/reaction, a final concentration of the primer pair for OSR2 gene detection of 0.1 μM, a final concentration of the probe of 200 nM, a final concentration of the primer pair for TSPYL5 gene detection of 0.2 μM, a final concentration of the probe of 200 nM, a final concentration of the primer pair for SDF4 gene detection of 0.2 μM, a final concentration of the probe of 100 nM, and a total reaction volume of 25 μL.

在一些实施例中,所述qPCR的反应程序包括如下:In some embodiments, the qPCR reaction procedure includes the following:

95~98℃预变性0.5~5分钟;95~98℃变性5~15秒,55~65℃退火与延伸30~60秒,15个循环,不收集荧光;95~98℃变性5~15秒,55~65℃退火与延伸30~60秒,30个循环,收集荧光。Pre-denaturation at 95-98°C for 0.5-5 min; denaturation at 95-98°C for 5-15 sec, annealing and extension at 55-65°C for 30-60 sec, 15 cycles without collecting fluorescence; denaturation at 95-98°C for 5-15 sec, annealing and extension at 55-65°C for 30-60 sec, 30 cycles, collecting fluorescence.

在一些实施例中,所述qPCR的反应程序包括如下:95℃预变性5分钟;95℃变性15秒,62℃退火与延伸30秒,15个循环,不收集荧光;95℃变性15秒,62℃退火与延伸32秒,30个循环,收集荧光。In some embodiments, the qPCR reaction procedure includes the following: pre-denaturation at 95°C for 5 minutes; denaturation at 95°C for 15 seconds, annealing and extension at 62°C for 30 seconds, 15 cycles, without collecting fluorescence; denaturation at 95°C for 15 seconds, annealing and extension at 62°C for 32 seconds, 30 cycles, collecting fluorescence.

另一方面,本公开实施例还提供了一种肝癌预测模型的训练方法,其包括:On the other hand, the present disclosure also provides a method for training a liver cancer prediction model, which includes:

获取前述任意实施例所述试剂或试剂组合对训练样本中靶基因的甲基化含量进行检测并为其标注结果;其中,所述靶基因为如前述任意实施例所述的靶基因,所述标注结果包括:代表样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的标签;Obtaining the reagent or reagent combination described in any of the foregoing embodiments to detect the methylation content of the target gene in the training sample and annotate the result; wherein the target gene is the target gene described in any of the foregoing embodiments, and the annotated result includes: a label representing any one of the liver cancer risk, prognosis risk and disease progression of the sample;

将待测样本的靶基因的甲基化含量输入预先构建的预测模型中,获得预测结果;所述预测模型为能够根据所述靶基因的甲基化含量判断样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的机器学习模型。The methylation content of the target gene of the sample to be tested is input into a pre-constructed prediction model to obtain a prediction result; the prediction model is a machine learning model that can judge any of the liver cancer risk, prognosis risk and disease progression of the sample based on the methylation content of the target gene.

在一些实施例中,训练样本的样本量为大于等于10。该样本量具体可以为10、20、30、40、50、60、70、80、90、100、200、300、400、500、600、700、800、900和1000中的任意一种或任意两种之间的范围。In some embodiments, the sample size of the training samples is greater than or equal to 10. The sample size can specifically be any one of 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900 and 1000, or a range between any two of them.

在一些实施例中,所述标签可以为字符,字符指类字形单位或符号,包括字母、数字、运算符号、标点符号和其他符号,以及一些功能性符号。In some embodiments, the label may be a character, where a character refers to a glyph-like unit or symbol, including letters, numbers, operation symbols, punctuation marks and other symbols, as well as some functional symbols.

在一些实施例中,所述机器学习模型可以选自:支持向量机、决策树、随机森林、逻辑回归、贝叶斯、K近邻、K均值、马尔可夫和回归岭算法中的任意一种或几种的组合。In some embodiments, the machine learning model can be selected from any one or a combination of support vector machine, decision tree, random forest, logistic regression, Bayesian, K nearest neighbor, K means, Markov and regression ridge algorithms.

此外,本公开实施例还提供了一种电子设备,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现前述任意实施例所述的肝癌预测模型的训练方法或肝癌的预测方法;In addition, an embodiment of the present disclosure further provides an electronic device, comprising: a processor and a memory; the memory is used to store a program, and when the program is executed by the processor, the processor implements the training method of the liver cancer prediction model or the liver cancer prediction method described in any of the above embodiments;

所述预测方法包括:获取待测样本中靶基因的甲基化含量;所述靶基因为前述任意实施例所述的靶基因;将待测样本靶基因的甲基化含量输入前述任意实施例所述的训练方法训练好的模型中,获得待测样本的预测结果。The prediction method includes: obtaining the methylation content of the target gene in the sample to be tested; the target gene is the target gene described in any of the aforementioned embodiments; the methylation content of the target gene of the sample to be tested is input into the model trained by the training method described in any of the aforementioned embodiments to obtain the prediction result of the sample to be tested.

所述电子设备可以包括存储器、处理器、总线和通信接口,该存储器、处理器和通信接口相互之间直接或间接地电性连接,以实现数据的传输或交互。例如,这些元件相互之间可通过一条或多条总线或信号线实现电性连接。处理器可以处理与目标识别有关的信息和/或数据,以执行本申请中描述的一个或多个功能。The electronic device may include a memory, a processor, a bus, and a communication interface, wherein the memory, the processor, and the communication interface are electrically connected to each other directly or indirectly to achieve data transmission or interaction. For example, these components may be electrically connected to each other via one or more buses or signal lines. The processor may process information and/or data related to target identification to perform one or more functions described in the present application.

存储器可以是但不限于,随机存取存储器(Random Access Memory,RAM),只读存储器(Read Only Memory,ROM),可编程只读存储器(Programmable Read-Only Memory,PROM),可擦除只读存储器(Erasable Programmable Read-Only Memory,EPROM),电可擦除只读存储器(Electric Erasable Programmable Read-Only Memory,EEPROM)等。The memory can be, but is not limited to, random access memory (RAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable read-only memory (EEPROM), etc.

处理器可以是一种集成电路芯片,具有信号处理能力。该处理器可以是通用处理器,包括中央处理器(Central Processing Unit,CPU)、网络处理器(Network Processor,NP)等;还可以是数字信号处理器(Digital Signal Processing,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。The processor can be an integrated circuit chip with signal processing capabilities. The processor can be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it can also be a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.

在实际应用中,该电子设备可以是服务器、云平台、手机、平板电脑、笔记本电脑、超级移动个人计算机(Ultra-mobile Personal Computer,UMPC)、手持计算机、上网本、个人数字助理(Personal Digital Assistant,PDA)、可穿戴电子设备、虚拟现实设备等设备,因此本申请实施例对电子设备的种类不做限制。In actual applications, the electronic device can be a server, a cloud platform, a mobile phone, a tablet computer, a laptop computer, an ultra-mobile personal computer (UMPC), a handheld computer, a netbook, a personal digital assistant (PDA), a wearable electronic device, a virtual reality device, etc. Therefore, the embodiments of the present application do not limit the type of electronic device.

以下结合实施例对本公开的特征和性能作进一步的详细描述。The features and performance of the present invention are further described in detail below in conjunction with the embodiments.

实施例1靶基因的筛选Example 1 Screening of target genes

1、甲基化捕获测序1. Methylation capture sequencing

(1)样本(1) Sample

收集43例肝癌组织与配对的32例癌旁组织DNA,55例肝癌、55例肝硬化、50例健康人血浆,每例血浆2mL。DNA was collected from 43 liver cancer tissues and 32 paired adjacent paracancerous tissues, and plasma was collected from 55 liver cancer tissues, 55 liver cirrhosis tissues, and 50 healthy subjects, with 2 mL of plasma from each tissue.

(2)血浆cfDNA提取(2) Plasma cfDNA extraction

a、将血浆于室温或37℃解冻后,于16,000g、4℃离心10分钟;a. Thaw the plasma at room temperature or 37°C and centrifuge at 16,000g and 4°C for 10 minutes;

b、向15mL离心管中加入80μL的Proteinase K(Apostle MiniMax游离DNA分离试剂盒,货号A17622CN-384),随后加入离心后的血浆,转移血浆时不要吸到离心沉淀或者絮状物,然后旋涡混匀;b. Add 80 μL of Proteinase K (Apostle MiniMax Free DNA Isolation Kit, Catalog No. A17622CN-384) to a 15 mL centrifuge tube, then add the centrifuged plasma. When transferring the plasma, do not aspirate the centrifugal precipitate or flocs, and then vortex to mix.

c、向上述15mL离心管中加入200μL的Sample Lysis Buffer,旋涡混匀,然后60℃水浴20分钟,每隔5分钟颠倒混匀数次;c. Add 200 μL of Sample Lysis Buffer to the above 15 mL centrifuge tube, vortex to mix, then place in a 60°C water bath for 20 minutes, inverting several times every 5 minutes to mix;

d、水浴结束后,将15mL离心管取出,室温静止10分钟,然后短暂离心,使得管壁上的血浆裂解液收集到管底;d. After the water bath is over, take out the 15 mL centrifuge tube, let it stand at room temperature for 10 minutes, and then centrifuge briefly to collect the plasma lysate on the tube wall to the bottom of the tube;

e、取6个深孔板,准备核酸的自动化提取。板1:加入2.5mL的cfDNA Lysis/Binding Solution、30μL的Magnetic Nanoparticles和上述全部的血浆裂解液,板2:加入1mL的cfDNA Wash Solution,板3:加入1mL的cfDNA Wash Sloution,板4:加入2mL稀释的2nd Wash Solution,板5:加入500μL稀释的2nd Wash Solution,板6:加入50μL的cfDNA Elution Solution;e. Take 6 deep-well plates and prepare for automated nucleic acid extraction. Plate 1: add 2.5 mL of cfDNA Lysis/Binding Solution, 30 μL of Magnetic Nanoparticles and all the above plasma lysate, Plate 2: add 1 mL of cfDNA Wash Solution, Plate 3: add 1 mL of cfDNA Wash Sloution, Plate 4: add 2 mL of diluted 2nd Wash Solution, Plate 5: add 500 μL of diluted 2nd Wash Solution, Plate 6: add 50 μL of cfDNA Elution Solution;

f、在第7号板位,放置新的Deep-Well Tip Comb板,开始仪器自检,自检合格后,选择核酸自动化提取程序,开始cfDNA的自动化提取,按照顺序依次放入板1-板6;f. Place a new Deep-Well Tip Comb plate in position 7 and start the instrument self-check. After the self-check is qualified, select the nucleic acid automatic extraction program and start the automatic extraction of cfDNA. Place plates 1 to 6 in order.

g、仪器提取完成后,取下板6吸取cfDNA。g. After the instrument extraction is completed, remove plate 6 and absorb cfDNA.

(3)甲基化捕获文库制备与上机测序(3) Methylation capture library preparation and sequencing

用Qubit dsDNA HS Assay Kit(Thermofisher Scientific,货号Q32854)测定cfDNA浓度。The cfDNA concentration was measured using Qubit dsDNA HS Assay Kit (Thermofisher Scientific, Cat. No. Q32854).

取40μL的组织DNA或cfDNA加入到160μL的重亚硫酸氢盐转化试剂(Zymo Research,EZ DNA Methylation Lightning Kits)中进行转化,98℃变性8分钟,54℃转化1小时,4℃保持。将转化后的产物进行纯化。纯化DNA用Qubit ssDNA Assay Kit(Thermo Fisher,货号:Q10212)进行定量。Take 40 μL of tissue DNA or cfDNA and add it to 160 μL of bisulfite conversion reagent (Zymo Research, EZ DNA Methylation Lightning Kits) for conversion, denature at 98°C for 8 minutes, convert at 54°C for 1 hour, and maintain at 4°C. The converted product is purified. The purified DNA is quantified using Qubit ssDNA Assay Kit (Thermo Fisher, Cat. No.: Q10212).

转化后的DNA采用Hieff NGS Methyl-seq DNA Library Prep Kit for Illumina试剂盒(YEASEN,货号:12211ES24)进行建库,操作步骤见试剂盒说明书。The converted DNA was used to construct the library using Hieff NGS Methyl-seq DNA Library Prep Kit for Illumina (YEASEN, catalog number: 12211ES24). The operating steps are shown in the kit manual.

文库用Qubit dsDNA HS Assay Kit进行浓度测定并用4200生物分析仪进行文库片段质检(Agilent DNA 1000Kit,货号:5067-1504)。质检合格后的文库进行液相杂交捕获。捕获后的文库用Qubit dsDNA HS Assay Kit进行浓度测定并用4200生物分析仪进行文库片段质检。捕获文库质检合格后在Illumina测序仪NovaSeq上进行测序,读长PE150。The library was concentrated using Qubit dsDNA HS Assay Kit and the library fragments were quality checked using 4200 bioanalyzer (Agilent DNA 1000 Kit, Cat. No.: 5067-1504). The library that passed the quality check was captured by liquid phase hybridization. The captured library was concentrated using Qubit dsDNA HS Assay Kit and the library fragments were quality checked using 4200 bioanalyzer. After the captured library passed the quality check, it was sequenced on the Illumina sequencer NovaSeq with a read length of PE150.

2、靶基因的筛选2. Screening of target genes

对下机后的捕获数据进行分析,按照如下条件进行靶基因的筛选:Analyze the captured data after leaving the machine and screen the target genes according to the following conditions:

(1)使用fastp软件对下机数据进行数据过滤,包括过滤测序接头序列,去除测序读长小于50bp的DNA片段,去除平均测序质量较低的DNA片段;(1) Use fastp software to filter the data from the machine, including filtering the sequencing adapter sequences, removing DNA fragments with sequencing read lengths less than 50 bp, and removing DNA fragments with low average sequencing quality;

(2)使用Bismark将过滤后的数据与hg19参考基因组(携带decoy诱饵序列)进行比对,得到每个DNA片段对应的具体基因组位置信息,并且得到每个CpG位点的甲基化状态信息;(2) Use Bismark to compare the filtered data with the hg19 reference genome (carrying the decoy bait sequence) to obtain the specific genomic location information corresponding to each DNA fragment and the methylation status information of each CpG site;

(3)使用picard软件对top链和bottom链分别去除PCR扩增过程中引入的冗余数据,然后合并数据;(3) Use Picard software to remove redundant data introduced during PCR amplification from the top and bottom chains, respectively, and then merge the data;

(4)使用bamUtil软件对较短的插入片段重叠区域进行soft-clipped标记,防止甲基化水平检测时发生重复统计; (4) Use bamUtil software to soft-clip the overlapping regions of shorter inserts to prevent duplicate statistics when detecting methylation levels;

(5)使用bamtools软件去除比对质量较低、未比对上、双端reads未能完全配对的DNA片段;(5) Use bamtools software to remove DNA fragments with low alignment quality, unaligned, or incompletely paired double-end reads;

(6)筛选在肝癌组织DNA与血浆cfDNA中高甲基化且连锁,而在癌旁组织DNA与肝硬化、健康人血浆cfDNA中低甲基化的基因及染色体区段。(6) Screening for genes and chromosome segments that are highly methylated and linked in HCC tissue DNA and plasma cfDNA, but lowly methylated in paracancerous tissue DNA and plasma cfDNA of cirrhotic and healthy subjects.

3、筛选结果3. Filter results

对组织DNA与血浆cfDNA的测序数据进行分析,最终筛选出21个候选基因。21个候选基因在肝癌与肝硬化、肝癌与肝硬化及健康人中的中位甲基化百分比的差异水平见表1。21个候选基因区分肝癌与肝硬化性能见图2。从图2可见,整合21个基因后,区分肝癌与肝硬化的AUC达到0.92,其中HIST1H1D、IRX5、NEFM、OSR2、OTX1、PITX1、TSPYL5区分肝癌与肝硬化的AUC能达到0.71以上。The sequencing data of tissue DNA and plasma cfDNA were analyzed, and 21 candidate genes were finally screened out. The difference levels of the median methylation percentages of the 21 candidate genes in liver cancer and cirrhosis, liver cancer and cirrhosis, and healthy people are shown in Table 1. The performance of the 21 candidate genes in distinguishing liver cancer from cirrhosis is shown in Figure 2. As can be seen from Figure 2, after integrating the 21 genes, the AUC for distinguishing liver cancer from cirrhosis reached 0.92, among which HIST1H1D, IRX5, NEFM, OSR2, OTX1, PITX1, and TSPYL5 could reach an AUC of more than 0.71 for distinguishing liver cancer from cirrhosis.

表1候选基因在肝癌、肝硬化与健康人中的中位甲基化百分比的差异
Table 1 Differences in median methylation percentages of candidate genes in liver cancer, liver cirrhosis and healthy subjects

实施例2OSR2、TSPYL5与SDF4扩增引物探针的筛选Example 2 Screening of primer probes for OSR2, TSPYL5 and SDF4 amplification

根据候选基因在肝癌、肝硬化、健康人中的甲基化百分比的差异水平与ROC曲线的AUC面积,并结合基因序列的特征,筛选出OSR2与TSPYL5两个靶基因,设计qPCR引物探针。根据OSR2与TSPYL5基因的可选序列,遍历整个可选序列,筛选适合引物探针设计的序列片段,结合MethyLight引物探针设计要素,精心设计了几套引物探针组合。Based on the difference in methylation percentage of candidate genes in liver cancer, liver cirrhosis, and healthy people and the AUC area of the ROC curve, combined with the characteristics of the gene sequence, two target genes, OSR2 and TSPYL5, were screened out and qPCR primer probes were designed. According to the optional sequences of OSR2 and TSPYL5 genes, the entire optional sequence was traversed to screen sequence fragments suitable for primer probe design, and several sets of primer probe combinations were carefully designed in combination with the design elements of MethyLight primer probes.

(1)引物探针设计(1) Primer and probe design

在OSR2与TSPYL5可选的DNA区段各设计几套引物探针组合,进行引物探针组合的筛选。Several sets of primer-probe combinations were designed in the optional DNA segments of OSR2 and TSPYL5, and the primer-probe combinations were screened.

OSR2扩增的引物探针组合有:The primer-probe combinations for OSR2 amplification are:

引物探针组合1:Primer probe combination 1:

SEQ ID NO.4:5`-GCGTCGGTTTCGTTTTTGTACGTC-3`;SEQ ID NO.4: 5`-GCGTCGGTTTCGTTTTTGTACGTC-3`;

SEQ ID NO.5:5`-CCACGCGCTACGCTACCG-3`;SEQ ID NO.5: 5`-CCACGCGCTACGCTACCG-3`;

SEQ ID NO.6:SEQ ID NO.6:

5`6-FAM-ACCGCCTTTCCCGCGCTTACCCGA-3`BHQ1;5`6-FAM-ACCGCCTTTCCCGCGCTTACCCGA-3`BHQ1;

引物探针组合2:Primer probe combination 2:

SEQ ID NO.7:5`-GTTTTTCGGAGGTAAGATTATTTGCG-3`;SEQ ID NO.7: 5`-GTTTTTCGGAGGTAAGATTATTTGCG-3`;

SEQ ID NO.8:5`-ACTCGATATATCCAAAACAAATTCGC-3`;SEQ ID NO.8: 5`-ACTCGATATATCCAAAACAAATTCGC-3`;

SEQ ID NO.9:SEQ ID NO.9:

5`6-FAM-CAACCATCCTCCTTACCCGCCTCACCTATAATC-3`BHQ1;5`6-FAM-CAACCATCCTCCTTACCCGCCTCACCTATAATC-3`BHQ1;

引物探针组合3:Primer probe combination 3:

SEQ ID NO.10:5`-TTTTGCGTCGTCGTTATTTTCG-3`;SEQ ID NO.10: 5`-TTTTGCGTCGTCGTTATTTTCG-3`;

SEQ ID NO.11:5`-AAAACCTATACGATCTCAACGCG-3`;SEQ ID NO.11:5`-AAAACCTATACGATCTCAACGCG-3`;

SEQ ID NO.12:5`6-FAM-ATGGTGGAGCGGGTGATTTCGTGTAT-3`BHQ1;SEQ ID NO.12: 5`6-FAM-ATGGTGGAGCGGGTGATTTCGTGTAT-3`BHQ1;

引物探针组合4:Primer probe combination 4:

SEQ ID NO.13:5`-GGTTTTGCGTCGTCGTTATTTTC-3`;SEQ ID NO.13:5`-GGTTTTGCGTCGTCGTTATTTTC-3`;

SEQ ID NO.14:5`-AAAACCTATACGATCTCAACGCGA-3`;SEQ ID NO.14: 5`-AAAACCTATACGATCTCAACGCGA-3`;

SEQ ID NO.15:5`6-FAM-ATGGTGGAGCGGGTGATTTCGTGTAT-3`BHQ1;SEQ ID NO.15:5`6-FAM-ATGGTGGAGCGGGTGATTTCGTGTAT-3`BHQ1;

TSPYL5扩增的引物探针组合有:The primer-probe combinations for TSPYL5 amplification are:

引物探针组合5:Primer probe combination 5:

SEQ ID NO.16:5`-CGCTCATAATAACGACGAAAACAACT-3`; SEQ ID NO.16: 5`-CGCTCATAATAACGACGAAAACAACT-3`;

SEQ ID NO.17:5`-GGAGAGATTCGTTCGGTTTCGATC-3`;SEQ ID NO.17:5`-GGAGAGATTCGTTCGGTTTCGATC-3`;

SEQ ID NO.18:SEQ ID NO.18:

5`Cy5-CACGCTATAACCCTACGACTCCTAACGCCA-3`BHQ-3;5`Cy5-CACGCTATAACCCTACGACTCCTAACGCCA-3`BHQ-3;

引物探针组合6:Primer probe combination 6:

SEQ ID NO.19:5`-GCGGGAGGATTTTCGATTTCGA-3`;SEQ ID NO.19:5`-GCGGGAGGATTTTCGATTTCGA-3`;

SEQ ID NO.20:5`-AAAACCGCAAAATCACAACGTATCTT-3`;SEQ ID NO.20: 5`-AAAACCGCAAAATCACAACGTATCTT-3`;

SEQ ID NO.21:SEQ ID NO.21:

5`Cy5-AAACTACCTCCGCCGCCACCATAAACGAC-3`BHQ-3;5`Cy5-AAACTACCTCCGCCGCCACCATAAACGAC-3`BHQ-3;

引物探针组合7:Primer probe combination 7:

SEQ ID NO.22:5`-CCGCGATACTACAAATTTCTAAAACCTT-3`;SEQ ID NO.22: 5`-CCGCGATACTACAAATTTCTAAAACCTT-3`;

SEQ ID NO.23:5`-GTAGATATTGTTTTCGTGGGAATAGCG-3`;SEQ ID NO.23:5`-GTAGATATTGTTTTCGTGGGAATAGCG-3`;

SEQ ID NO.24:SEQ ID NO.24:

5`Cy5-CCCGACAAAACCACGCCGATTTCCAACGC-3`BHQ-3;5`Cy5-CCCGACAAAACCACGCCGATTTCCAACGC-3`BHQ-3;

引物探针组合8:Primer probe combination 8:

SEQ ID NO.25:5`-AATACTACAATCGCAACTACCCAAA-3`;SEQ ID NO.25: 5`-AATACTACAATCGCAACTACCCAAA-3`;

SEQ ID NO.26:5`-GGAAGGTAGTATGGATACGTTGGA-3`;SEQ ID NO.26:5`-GGAAGGTAGTATGGATACGTTGGA-3`;

SEQ ID NO.27:SEQ ID NO.27:

5`Cy5-AAAACCGAAAATAAACCCTATCCGCCTAAACG-3`BHQ-3;5`Cy5-AAAACCGAAAATAAACCCTATCCGCCTAAACG-3`BHQ-3;

SDF4扩增的引物探针组合有:The primer-probe combinations for SDF4 amplification are:

引物探针组合9:Primer probe combination 9:

SEQ ID NO.28:5`-GTAGGTGGTCGTTAGATTGGG-3`;SEQ ID NO.28:5`-GTAGGTGGTCGTTAGATTGGG-3`;

SEQ ID NO.29:5`-TCGCTATTAATAACAAATTCACGCA-3`;SEQ ID NO.29:5`-TCGCTATTAATAACAAATTCACGCA-3`;

SEQ ID NO.30:5`ROX-ACCGACGACATCCGTCTAACTATTCGA-3`BHQ2;SEQ ID NO.30: 5`ROX-ACCGACGACATCCGTCTAACTATTCGA-3`BHQ2;

引物探针组合10:Primer probe combination 10:

SEQ ID NO.31:5`-TGTCGTTGTTGGTGGTAGGTTTA-3`;SEQ ID NO.31:5`-TGTCGTTGTTGGGTGGTAGGTTTA-3`;

SEQ ID NO.32:5`-CCCCAAAACCCCGAACAAC-3`;SEQ ID NO.32: 5`-CCCCAAAACCCCGAACAAC-3`;

SEQ ID NO.33:5`ROX-AACGAATACCGCCGACCCAAAACTCCA-3`BHQ2;SEQ ID NO.33:5`ROX-AACGAATACCGCCGACCCAAAACTCCA-3`BHQ2;

引物探针组合11:Primer probe combination 11:

SEQ ID NO.34:5`-GGTCGGATTAACGGGATGAAGTTTA-3`;SEQ ID NO.34:5`-GGTCGGATTAACGGGATGAAGTTTA-3`;

SEQ ID NO.35:5`-CCCACACACGATTCGTACCTC-3`;SEQ ID NO.35: 5`-CCCACACACGATTCGTACCTC-3`;

SEQ ID NO.36:5`ROX-CCCGATCCCGCGTTCCCTACTATCCT-3`BHQ2。SEQ ID NO.36: 5`ROX-CCCGATCCCGCGTTCCCTACTATCCT-3`BHQ2.

需要说明的是,探针5`端的荧光基团修饰可以是TaqMan探针、MGB探针、分子信标、BQ探针等各种类型探针使用的荧光基团,如FAM、TET、JOE、VIC、HEX、Cy3、NED、TAMRA、ROX、Cy5、AMCA、Pacific Blue、Atto 425、BODIPY FL、Oregon Green 488、R6G、Yakima Yellow、Quasar 570、AquaPhluor 593、Texas Red、Atto 590、Quasar 670、Cy5.5、Cy7和IR Dye 750等;探针3`端的荧淬灭基团修饰可以是TaqMan探针、MGB探针、分子信标、BQ探针等各种类型探针使用的淬灭基团,如BHQ1、BHQ2、BHQ3、BBQ650、MGB、Dabcyl和DBQ1等。It should be noted that the fluorescent group modification at the 5' end of the probe can be the fluorescent group used in various types of probes such as TaqMan probe, MGB probe, molecular beacon, BQ probe, such as FAM, TET, JOE, VIC, HEX, Cy3, NED, TAMRA, ROX, Cy5, AMCA, Pacific Blue, Atto 425, BODIPY FL, Oregon Green 488, R6G, Yakima Yellow, Quasar 570, AquaPhluor 593, Texas Red, Atto 590, Quasar 670, Cy5.5, Cy7 and IR Dye 750; the fluorescence quenching group modification at the 3' end of the probe can be the quenching group used in various types of probes such as TaqMan probe, MGB probe, molecular beacon, BQ probe, such as BHQ1, BHQ2, BHQ3, BBQ650, MGB, Dabcyl and DBQ1.

(2)样本(2) Sample

甲基化含量为95%~100%的阳性质控品DNA与甲基化含量为0%的阴性质控品DNA(自制)。Positive quality control DNA with a methylation content of 95% to 100% and negative quality control DNA with a methylation content of 0% (self-made).

(3)重亚硫酸氢盐转化(3) Bisulfite conversion

试剂与操作步骤见实施例1。See Example 1 for reagents and operating procedures.

(4)甲基化特异性qPCR验证(4) Methylation-specific qPCR verification

取转化后的30ng阴性质控品进行引物探针的特异性验证,设置3个技术重复;将转化后的阳性质控品DNA用无核酸酶水进行梯度稀释,分别取100copies、10copies进行灵敏度验证,分别设置1个与3个技术重复。30 ng of negative quality control product after conversion was used to verify the specificity of primers and probes, and three technical replicates were set; the converted positive quality control product DNA was gradient diluted with nuclease-free water, and 100 copies and 10 copies were taken for sensitivity verification, and 1 and 3 technical replicates were set respectively.

按照表2-1配制qPCR反应体系。Prepare the qPCR reaction system according to Table 2-1.

表2-1 qPCR反应体系
Table 2-1 qPCR reaction system

按照表2-2程序进行qPCR扩增。Perform qPCR amplification according to the procedure in Table 2-2.

表2-2 qPCR反应程序

Table 2-2 qPCR reaction procedure

(5)4200质检(5) 4200 Quality Inspection

将起始量为100copies的阳性质控品的qPCR扩增产物用4200生物分析仪进行质检,质检试剂(Agilent DNA 1000Kit,货号:5067-1504)。The qPCR amplification product of the positive quality control with a starting amount of 100 copies was quality checked using a 4200 bioanalyzer and quality check reagents (Agilent DNA 1000 Kit, catalog number: 5067-1504).

(6)qPCR验证结果(6) qPCR verification results

OSR2、TSPYL5与SDF4的引物探针组合的验证结果如表2-3。选择灵敏度高、特异性高、尽量没有引物二聚体与非特异性扩增的引物探针组合,并结合荧光扩增曲线,确定最优的引物探针组合。由表2-3可见,OSR2基因扩增的最优引物探针组合是组合1,TSPYL5基因扩增的最优引物探针组合是组合5,SDF4基因扩增的最优引物探针组合是组合9。The validation results of the primer-probe combinations of OSR2, TSPYL5 and SDF4 are shown in Table 2-3. Select a primer-probe combination with high sensitivity, high specificity, and minimal primer dimer and nonspecific amplification, and determine the optimal primer-probe combination in combination with the fluorescence amplification curve. As shown in Table 2-3, the optimal primer-probe combination for OSR2 gene amplification is combination 1, the optimal primer-probe combination for TSPYL5 gene amplification is combination 5, and the optimal primer-probe combination for SDF4 gene amplification is combination 9.

表2-3靶基因与内参基因的引物探针筛选结果
Table 2-3 Primer probe screening results of target genes and internal reference genes

备注:检测结果为Undetermined时,设值为30。Ct值为检测的数值+15。Note: When the test result is Undetermined, the value is set to 30. The Ct value is the test value + 15.

实施例3基因组合的血浆样本验证Example 3 Plasma sample validation of gene combination

(1)样本(1) Sample

肝细胞癌(HCC)53例,其中31例为2018年采集的血浆,22例为2020年采集的血浆;肝硬化(Liver Cirrhosis)52例,其中4例为2019年采集的血浆,42例为2020年采集的血浆,6例为2021年采集的血浆;健康人(Health)50例,均为2021年采集的血浆;肝脏良性病变:6例局灶性结节性增生(Focal Nodular Hyperplasia),2例肝腺瘤(Hepatic Adenoma),12例肝血管瘤(Hemangioma)。每例样本2mL血浆。53 cases of hepatocellular carcinoma (HCC), of which 31 cases were plasma collected in 2018 and 22 cases were plasma collected in 2020; 52 cases of liver cirrhosis, of which 4 cases were plasma collected in 2019, 42 cases were plasma collected in 2020, and 6 cases were plasma collected in 2021; 50 healthy people (Health), all of which were plasma collected in 2021; benign liver lesions: 6 cases of focal nodular hyperplasia, 2 cases of hepatic adenoma, and 12 cases of hemangioma. Each sample was 2mL of plasma.

(2)血浆cfDNA提取(2) Plasma cfDNA extraction

试剂与操作步骤见实施例1。See Example 1 for reagents and operating procedures.

(3)重亚硫酸氢盐转化(3) Bisulfite conversion

试剂与操作步骤见实施例1。See Example 1 for reagents and operating procedures.

(4)qPCR(4) qPCR

用于OSR2扩增的引物探针组合1;Primer probe combination 1 for OSR2 amplification;

用于TSPYL5扩增的引物探针组合5;Primer probe combination 5 for TSPYL5 amplification;

用于SDF4扩增的引物探针组合9。Primer probe combination 9 for SDF4 amplification.

按照表3-1配制qPCR反应体系。Prepare the qPCR reaction system according to Table 3-1.

表3-1 qPCR反应体系

Table 3-1 qPCR reaction system

按照如下程序进行qPCR反应The qPCR reaction was performed as follows

表3-2 qPCR反应程序
Table 3-2 qPCR reaction procedure

(5)qPCR结果(5) qPCR results

表3-3 qPCR结果



Table 3-3 qPCR results



由图3可见,将肝硬化、肝局灶性结节增生、肝腺瘤、肝血管瘤作为阴性人群,OSR2与TSPYL5区分肝癌与阴性人群的性能相当,ROC曲线的AUC面积均为0.927,将OSR2与TSPYL5整合后,区分肝癌与阴性人群的AUC能达到0.955。As can be seen from Figure 3, taking cirrhosis, focal nodular hyperplasia, hepatic adenoma, and hepatic hemangioma as the negative population, the performance of OSR2 and TSPYL5 in distinguishing liver cancer from the negative population is equivalent, and the AUC area of the ROC curve is 0.927. After integrating OSR2 and TSPYL5, the AUC for distinguishing liver cancer from the negative population can reach 0.955.

当将肝硬化作为阴性人群,TSPYL5区分肝癌与肝硬化的性能比OSR2更优,AUC分别是0.933与0.906。将TSPYL5与OSR2整合后,区分肝癌与肝硬化的AUC能达到0.95。(图4)。When cirrhosis is used as a negative population, TSPYL5 is better than OSR2 in distinguishing liver cancer from cirrhosis, with AUCs of 0.933 and 0.906, respectively. After integrating TSPYL5 and OSR2, the AUC for distinguishing liver cancer from cirrhosis can reach 0.95. (Figure 4)

当将健康人作为阴性人群时,OSR2区分肝癌与健康人的性能比TSPYL5更优,AUC分别为0.935与0.923。将OSR2与TSPYL5整合后,区分肝癌与健康人的AUC能达到0.961(图5)。When healthy people were used as negative subjects, OSR2 performed better than TSPYL5 in distinguishing liver cancer from healthy people, with AUCs of 0.935 and 0.923, respectively. After integrating OSR2 and TSPYL5, the AUC for distinguishing liver cancer from healthy people could reach 0.961 (Figure 5).

由此可见,OSR2与TSPYL5检测肝癌的性能均佳,且二者有较好的互补性。 It can be seen that both OSR2 and TSPYL5 have good performance in detecting liver cancer, and the two have good complementarity.

实施例4试剂盒在肝癌检测中的应用Example 4 Application of the kit in liver cancer detection

为了进一步验证OSR2与TSPYL5基因组合检测肝癌的性能,将血浆体积降至1mL,进行第二批血浆样本验证,其中部分样本同时进行AFP的检测。In order to further verify the performance of OSR2 and TSPYL5 gene combination in detecting liver cancer, the plasma volume was reduced to 1 mL and a second batch of plasma samples was used for verification, some of which were also tested for AFP.

(1)样本(1) Sample

肝癌54例,肝硬化61例,每例血浆体积1mL。There were 54 cases of liver cancer and 61 cases of liver cirrhosis, and the plasma volume of each case was 1 mL.

(2)血浆cfDNA提取(2) Plasma cfDNA extraction

血浆cfDNA提取试剂与方法见实施例1。See Example 1 for the plasma cfDNA extraction reagent and method.

(3)重亚硫酸氢盐转化(3) Bisulfite conversion

重亚硫酸氢盐转化试剂与方法见实施例1。See Example 1 for bisulfite conversion reagents and methods.

(4)qPCR扩增(4) qPCR amplification

OSR2扩增的引物探针为组合1;The primer probe for OSR2 amplification was combination 1;

TSPYL5扩增的引物探针为组合5;The primer probe for TSPYL5 amplification was combination 5;

SDF4扩增的引物探针为组合9;The primer probe for SDF4 amplification was combination 9;

对qPCR反应体系的各组分浓度进行了优化,最优的反应体系如表4-1。The concentration of each component of the qPCR reaction system was optimized, and the optimal reaction system is shown in Table 4-1.

按照表4-1配制qPCR反应体系。Prepare the qPCR reaction system according to Table 4-1.

表4-1qPCR反应体系
Table 4-1qPCR reaction system

对qPCR反应的温度、时间进行了优化,最优的反应程序如表4-2。The temperature and time of the qPCR reaction were optimized, and the optimal reaction program is shown in Table 4-2.

按照表4-2设置qPCR反应程序。Set up the qPCR reaction program according to Table 4-2.

表4-2 qPCR反应程序
Table 4-2 qPCR reaction procedure

(5)AFP检测(5) AFP detection

为了与AFP的性能进行比较,部分血浆样本同时进行AFP检测。AFP为自研试剂,取100μL血浆进行AFP检测。In order to compare the performance with AFP, some plasma samples were tested for AFP at the same time. AFP is a self-developed reagent, and 100 μL of plasma was taken for AFP testing.

(6)检测结果(6) Test results

表4-3 qPCR结果



Table 4-3 qPCR results



备注:NA表示没有检测AFP。Note: NA means AFP was not detected.

由图6可见,对于1mL血浆,OSR2区分肝癌与肝硬化的AUC达到0.914,TSPYL5区分肝癌与肝硬化的AUC能达到0.915,两个基因整合后,区分肝癌与肝硬化的AUC能达到0.951。对比2mL血浆,OSR2、TSPYL5区分肝癌与肝硬化的AUC分别是0.906与0.933,将OSR2与TSPYL5整合,区分肝癌与肝硬化的AUC能达到0.95。由此可见,虽然1mL血浆起始量下,OSR2的AUC比2mL血浆起始量高,而TSPYL5的AUC比2mL血浆起始量低,但两个基因整合后,1mL血浆起始量与2mL血浆起始量的AUC相当。As shown in Figure 6, for 1 mL of plasma, the AUC of OSR2 for distinguishing liver cancer from cirrhosis reached 0.914, and the AUC of TSPYL5 for distinguishing liver cancer from cirrhosis reached 0.915. After the integration of the two genes, the AUC for distinguishing liver cancer from cirrhosis reached 0.951. Compared with 2 mL of plasma, the AUCs of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis were 0.906 and 0.933, respectively. After integrating OSR2 and TSPYL5, the AUC for distinguishing liver cancer from cirrhosis can reach 0.95. It can be seen that although the AUC of OSR2 is higher than that of 2 mL of plasma at the starting amount of 1 mL of plasma, and the AUC of TSPYL5 is lower than that of 2 mL of plasma, after the integration of the two genes, the AUC of 1 mL of plasma starting amount is equivalent to that of 2 mL of plasma starting amount.

为了比较基因甲基化与AFP的性能,选取有AFP数据的肝癌与肝硬化样本,进行ROC曲线分析。由图7可见,AFP区分肝癌与肝硬化的AUC为0.741,OSR2与TSPYL5区分肝癌与肝硬化的AUC均在0.9以上,明显比高于AFP,将OSR2与TSPYL5整合区分肝癌与肝硬化的AUC为0.96,可见,此靶基因组合检测肝癌的性能明显优于AFP。In order to compare the performance of gene methylation and AFP, liver cancer and cirrhosis samples with AFP data were selected for ROC curve analysis. As shown in Figure 7, the AUC of AFP for distinguishing liver cancer from cirrhosis is 0.741, and the AUC of OSR2 and TSPYL5 for distinguishing liver cancer from cirrhosis is above 0.9, which is significantly higher than AFP. The AUC of integrating OSR2 and TSPYL5 to distinguish liver cancer from cirrhosis is 0.96. It can be seen that the performance of this target gene combination in detecting liver cancer is significantly better than AFP.

综上,本公开提供一种用于肝癌检测的肿瘤标志物—OSR2与TSPYL5基因,其DNA序列的甲基化含量可以用于区分肝癌与肝硬化,肝癌与健康人、肝癌与肝脏良性病变,AUC均在0.9以上;本公开还提供一种用于肝癌检测的基因组合及试剂盒,即OSR2、TSPYL5与SDF4基因组合,其中OSR2、TSPYL5为检测肝癌的靶基因,SDF4为用于甲基化相对含量确定的内参基因,检测OSR2、TSPYL5二者的甲基化含量,区分肝癌与肝硬化、肝癌与健康人、肝癌与肝脏良性病变的AUC均能达到0.95以上,明显优于现有用于肝癌筛查的肿瘤标志蛋白AFP的性能。In summary, the present disclosure provides a tumor marker for liver cancer detection - OSR2 and TSPYL5 genes, the methylation content of their DNA sequences can be used to distinguish liver cancer from cirrhosis, liver cancer from healthy people, liver cancer from benign liver lesions, and the AUC is all above 0.9; the present disclosure also provides a gene combination and a kit for liver cancer detection, namely, the OSR2, TSPYL5 and SDF4 gene combination, wherein OSR2 and TSPYL5 are target genes for detecting liver cancer, and SDF4 is an internal reference gene for determining the relative methylation content. By detecting the methylation content of OSR2 and TSPYL5, the AUC for distinguishing liver cancer from cirrhosis, liver cancer from healthy people, and liver cancer from benign liver lesions can reach above 0.95, which is significantly better than the performance of the existing tumor marker protein AFP used for liver cancer screening.

以上所述仅为本公开可选的实施例而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。 The above description is only an optional embodiment of the present disclosure and is not intended to limit the present disclosure. For those skilled in the art, the present disclosure may have various modifications and variations. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present disclosure shall be included in the protection scope of the present disclosure.

Claims (10)

试剂或试剂组合在制备用于具有以下任意一种或多种用途的产品中的应用,所述用途包括(1)肝癌的早筛或早诊、(2)肝癌的诊断或辅助诊断、(3)肝癌预后风险的评估或辅助评估和(4)肝癌的动态监测;所述试剂或试剂组合包括:检测OSR2基因的甲基化含量的第一试剂和检测TSPYL5基因的甲基化含量的第二试剂。Use of a reagent or a reagent combination in the preparation of a product for any one or more of the following uses, the uses including (1) early screening or early diagnosis of liver cancer, (2) diagnosis or auxiliary diagnosis of liver cancer, (3) assessment or auxiliary assessment of prognostic risk of liver cancer and (4) dynamic monitoring of liver cancer; the reagent or reagent combination comprises: a first reagent for detecting the methylation content of the OSR2 gene and a second reagent for detecting the methylation content of the TSPYL5 gene. 根据权利要求1所述的应用,其特征在于,所述OSR2基因的GeneID:116039;The use according to claim 1, characterized in that the GeneID of the OSR2 gene is: 116039; 优选地,所述OSR2基因的检测区域选自:OSR2基因与其5`启动子区序列的全长或部分序列;Preferably, the detection region of the OSR2 gene is selected from: the full-length or partial sequence of the OSR2 gene and its 5' promoter region sequence; 优选地,所述OSR2基因的检测区域包括:核苷酸序列如SEQIDNO.1所示的区域1;Preferably, the detection region of the OSR2 gene includes: region 1 having a nucleotide sequence as shown in SEQ ID NO.1; 优选地,所述第一试剂包括:引物对、探针和芯片中的任意一种或多种;Preferably, the first reagent comprises: any one or more of a primer pair, a probe and a chip; 优选地,所述第一试剂包括:核苷酸序列如SEQIDNO.4~5所示的引物对和/或核苷酸序列如SEQIDNO.6所示的探针。Preferably, the first reagent comprises: a primer pair having a nucleotide sequence as shown in SEQ ID NO. 4 to 5 and/or a probe having a nucleotide sequence as shown in SEQ ID NO. 6. 根据权利要求1或2所述的应用,其特征在于,所述TSPYL5基因的GeneID:85453;The use according to claim 1 or 2, characterized in that the GeneID of the TSPYL5 gene is: 85453; 优选地,所述TSPYL5基因的检测区域选自:TSPYL5基因与其5`启动子区的全长或部分序列;Preferably, the detection region of the TSPYL5 gene is selected from: the full-length or partial sequence of the TSPYL5 gene and its 5' promoter region; 优选地,所述TSPYL5基因的检测区域包括:核苷酸序列如SEQIDNO.2所示的区域2;Preferably, the detection region of the TSPYL5 gene includes: region 2 whose nucleotide sequence is shown as SEQ ID NO.2; 优选地,所述第二试剂包括:引物对、探针和芯片中的任意一种或多种;Preferably, the second reagent comprises: any one or more of a primer pair, a probe and a chip; 优选地,所述第二试剂包括:核苷酸序列如SEQIDNO.16~17所示的引物对和/或核苷酸序列如SEQIDNO.18所示的探针。Preferably, the second reagent comprises: a primer pair having a nucleotide sequence as shown in SEQ ID NO.16-17 and/or a probe having a nucleotide sequence as shown in SEQ ID NO.18. 根据权利要求3所述的应用,其特征在于,所述试剂或试剂组合还包括:用于检测C1QL4基因、CR1L基因、CYP26C1基因、FOXG1基因、GHSR基因、HIST1H1D基因、IRX5基因、KCNG3基因、LHX2基因、MEX3A基因、NEFM基因、OTX1基因、OXTR基因、PCDHGB6基因、PCDHGB7基因、PITX1基因、PRLHR基因、PRRX1基因、ZIC4基因中的任意一种或多种基因的甲基化含量的试剂或试剂组合;The use according to claim 3, characterized in that the reagent or reagent combination further comprises: a reagent or reagent combination for detecting the methylation content of any one or more genes among C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene; 优选地,所述试剂或试剂组合还包括:检测内参基因的含量的第三试剂;Preferably, the reagent or reagent combination further comprises: a third reagent for detecting the content of an internal reference gene; 优选地,所述内参基因包括SDF4基因;Preferably, the internal reference gene includes the SDF4 gene; 优选地,所述SDF4基因的GeneID:51150;Preferably, the GeneID of the SDF4 gene is: 51150; 优选地,所述SDF4基因的检测区域包括:SDF4基因的全长或部分序列;Preferably, the detection region of the SDF4 gene includes: the full-length or partial sequence of the SDF4 gene; 优选地,所述SDF4基因的检测区域包括:核苷酸序列如SEQIDNO.3所示的区域3;Preferably, the detection region of the SDF4 gene includes: region 3 whose nucleotide sequence is shown as SEQ ID NO.3; 优选地,所述第三试剂包括:引物对、探针和芯片中的任意一种或多种;Preferably, the third reagent includes: any one or more of a primer pair, a probe and a chip; 优选地,所述第三试剂包括:核苷酸序列如SEQIDNO.28~29所示的引物对和/或核苷酸序列如SEQIDNO.30所示的探针。Preferably, the third reagent comprises: a primer pair having a nucleotide sequence as shown in SEQ ID NO. 28 to 29 and/or a probe having a nucleotide sequence as shown in SEQ ID NO. 30. 根据权利要求1或2所述的应用,其特征在于,所述试剂或试剂组合还包括:MgCl2、dNTP、Taq酶、缓冲液Buffer中的至少一种。The use according to claim 1 or 2, characterized in that the reagent or reagent combination further comprises: at least one of MgCl 2 , dNTP, Taq enzyme, and buffer Buffer. 一种试剂或试剂组合,其特征在于,其包括权利要求1~5任一项所述的试剂或试剂组合。A reagent or a reagent combination, characterized in that it comprises the reagent or the reagent combination according to any one of claims 1 to 5. 一种试剂盒,其特征在于,其包括权利要求1~5任一项所述的试剂或试剂组合。A kit, characterized in that it comprises the reagent or reagent combination according to any one of claims 1 to 5. 一种靶基因甲基化含量的检测方法,其特征在于,其包括:采用权利要求1~5任一项所述的试剂或试剂组合对样本进行靶基因的甲基化含量的检测;所述靶基因包括OSR2基因和TSPYL5基因;A method for detecting the methylation content of a target gene, characterized in that it comprises: using the reagent or reagent combination described in any one of claims 1 to 5 to detect the methylation content of the target gene in a sample; the target gene comprises the OSR2 gene and the TSPYL5 gene; 优选地,所述应用不以疾病的诊断或治疗为直接目的;Preferably, the application is not directly intended for diagnosis or treatment of a disease; 优选地,所述靶基因还包括:用于检测C1QL4基因、CR1L基因、CYP26C1基因、FOXG1基因、GHSR基因、HIST1H1D基因、IRX5基因、KCNG3基因、LHX2基因、MEX3A基因、NEFM基因、OTX1基因、OXTR基因、PCDHGB6基因、PCDHGB7基因、PITX1基因、PRLHR基因、PRRX1基因、ZIC4基因中的任意一种或多种;Preferably, the target gene also includes: for detecting any one or more of C1QL4 gene, CR1L gene, CYP26C1 gene, FOXG1 gene, GHSR gene, HIST1H1D gene, IRX5 gene, KCNG3 gene, LHX2 gene, MEX3A gene, NEFM gene, OTX1 gene, OXTR gene, PCDHGB6 gene, PCDHGB7 gene, PITX1 gene, PRLHR gene, PRRX1 gene, and ZIC4 gene; 优选地,所述方法还包括对内参基因进行检测;Preferably, the method further comprises detecting an internal reference gene; 优选地,所述内参基因包括:SDF4基因;Preferably, the internal reference gene includes: SDF4 gene; 优选地,所述检测方法的样本包括:DNA样本或含DNA的环境样本;Preferably, the samples of the detection method include: DNA samples or environmental samples containing DNA; 优选地,所述靶基因的甲基化含量的检测包括:采用所述试剂或试剂组合对经重亚硫酸氢盐转化处理后的样本进行qPCR扩增;Preferably, the detection of the methylation content of the target gene comprises: using the reagent or the reagent combination to perform qPCR amplification on the sample after bisulfite conversion treatment; 优选地,当所述试剂或试剂组合包含第一试剂~第三试剂时,所述qPCR的反应体系包括:MgCl2的终浓度为4~8mM,dNTP Mix的终浓度为200~800μM,Taq酶的终浓度为1.0~10U/反应,用于OSR2基因检测的引物对的终浓度为0.05~1.0μM、探针终浓度为50~500nM;用于TSPYL5基因检测的引物对的终浓度为0.1~1.0μM、探针的终浓度为50~500nM;用于SDF4基因检测的引物对的终浓度为0.1~1.0μM、探针的终浓度为50~500nM;Preferably, when the reagent or reagent combination comprises the first reagent to the third reagent, the qPCR reaction system comprises: the final concentration of MgCl2 is 4-8mM, the final concentration of dNTP Mix is 200-800μM, the final concentration of Taq enzyme is 1.0-10U/reaction, the final concentration of the primer pair for OSR2 gene detection is 0.05-1.0μM, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for TSPYL5 gene detection is 0.1-1.0μM, and the final concentration of the probe is 50-500nM; the final concentration of the primer pair for SDF4 gene detection is 0.1-1.0μM, and the final concentration of the probe is 50-500nM; 优选地,所述qPCR的反应程序包括:95~98℃预变性0.5~5分钟;95~98℃变性5~15秒,55~65℃退火与延伸30~60秒,15个循环,不收集荧光;95~98℃变性5~15秒,55~65℃退火与延伸30~60秒,30个循环,收集荧光。Preferably, the reaction procedure of the qPCR includes: pre-denaturation at 95-98°C for 0.5-5 minutes; denaturation at 95-98°C for 5-15 seconds, annealing and extension at 55-65°C for 30-60 seconds, 15 cycles, without collecting fluorescence; denaturation at 95-98°C for 5-15 seconds, annealing and extension at 55-65°C for 30-60 seconds, 30 cycles, collecting fluorescence. 一种肝癌预测模型的训练方法,其特征在于,其包括:A method for training a liver cancer prediction model, characterized in that it comprises: 获取权利要求1~5任一项所述试剂或试剂组合对训练样本中靶基因的甲基化含量进行检测并为其标注结果;其中,所述靶基因为权利要求8所述的靶基因,所述标注结果包括:代表样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的标签;Obtain the reagent or reagent combination described in any one of claims 1 to 5 to detect the methylation content of the target gene in the training sample and annotate the result; wherein the target gene is the target gene described in claim 8, and the annotation result includes: a label representing any one of the liver cancer risk, prognosis risk and disease progression of the sample; 将待测样本的靶基因的甲基化含量输入预先构建的预测模型中,获得预测结果;所述预测模型为能够根据所述靶基因的甲基化含量判断样本的肝癌患病风险、预后风险和疾病进程中任意一种情况的机器学习模型。The methylation content of the target gene of the sample to be tested is input into a pre-constructed prediction model to obtain a prediction result; the prediction model is a machine learning model that can judge any of the liver cancer risk, prognosis risk and disease progression of the sample based on the methylation content of the target gene. 一种电子设备,其特征在于,其包括:处理器和存储器;所述存储器用于存储程序,当所述程序被所述处理器执行时,使得所述处理器实现权利要求9所述的肝癌预测模型的训练方法或肝癌的预测方法;An electronic device, characterized in that it comprises: a processor and a memory; the memory is used to store a program, and when the program is executed by the processor, the processor implements the training method of the liver cancer prediction model or the liver cancer prediction method according to claim 9; 所述预测方法包括:获取待测样本中靶基因的甲基化含量;所述靶基因为权利要求8中所述的靶基因;将待测样本靶基因的甲基化含量输入权利要求9所述的训练方法训练好的模型中,获得待测样本的预测结果。 The prediction method comprises: obtaining the methylation content of the target gene in the sample to be tested; the target gene is the target gene described in claim 8; inputting the methylation content of the target gene of the sample to be tested into the model trained by the training method described in claim 9 to obtain the prediction result of the sample to be tested.
PCT/CN2023/105356 2023-03-23 2023-06-30 Gene combination for liver cancer detection, and related reagent and application Pending WO2024192928A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202310295248.4A CN116200499B (en) 2023-03-23 2023-03-23 Gene combination for liver cancer detection, related reagent and application
CN202310295248.4 2023-03-23

Publications (1)

Publication Number Publication Date
WO2024192928A1 true WO2024192928A1 (en) 2024-09-26

Family

ID=86511265

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/105356 Pending WO2024192928A1 (en) 2023-03-23 2023-06-30 Gene combination for liver cancer detection, and related reagent and application

Country Status (2)

Country Link
CN (1) CN116200499B (en)
WO (1) WO2024192928A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116200499B (en) * 2023-03-23 2024-01-19 北京和瑞精湛医学检验实验室有限公司 Gene combination for liver cancer detection, related reagent and application
CN118147309B (en) * 2024-04-15 2024-10-22 广州市基准医疗有限责任公司 Methylation biomarkers or combinations for diagnosing bladder cancer lymph node metastasis and uses thereof
CN118425502B (en) * 2024-07-05 2024-11-05 深圳市博卡生物技术有限公司 An immunochromatographic detection card for detecting a new gastric cancer marker SDF4 and a preparation method thereof

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100167940A1 (en) * 2008-11-26 2010-07-01 The Johns Hopkins University Methods for identifying cancer risk
CN104762301A (en) * 2015-04-24 2015-07-08 武汉大学 Kit and method for detecting methylation level of liver cancer risk gene TSPYL5
KR20170071724A (en) * 2015-12-16 2017-06-26 연세대학교 산학협력단 Method for diagnosing or predicting hepatocellular carcinoma using dna methylation changes of intragenic cpg island involved in hepatocellular carcinoma specific gene expression
CN109790198A (en) * 2016-09-02 2019-05-21 梅约医学教育与研究基金会 Detection of hepatocellular carcinoma
CN110257511A (en) * 2018-12-29 2019-09-20 武汉生命科技股份有限公司 A method of for detecting the quantitative fluorescent PCR of TSPYL5 gene methylation
WO2022032429A1 (en) * 2020-08-10 2022-02-17 华大数极生物科技(深圳)有限公司 Methylation markers for liver cancer detection and diagnosis
US20220243278A1 (en) * 2021-01-29 2022-08-04 Mayo Foundation For Medical Education And Research Detecting the presence or absence of multiple types of cancer
CN116200499A (en) * 2023-03-23 2023-06-02 北京和瑞精湛医学检验实验室有限公司 Gene combination for liver cancer detection, related reagent and application
CN116219020A (en) * 2023-03-02 2023-06-06 北京和瑞精湛医学检验实验室有限公司 Methylation reference gene and application thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3481954A4 (en) * 2016-07-06 2020-04-15 Youhealth Biotech, Limited METHYLIZATION MARKERS FOR LUNG CANCER AND USES THEREOF
CN114164275B (en) * 2021-12-22 2022-12-09 武汉艾米森生命科技有限公司 Application of liver cancer marker in preparation of liver cancer detection product and detection kit
CN113897434A (en) * 2021-11-02 2022-01-07 北京艾克伦医疗科技有限公司 Method and kit for identifying liver cancer status

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100167940A1 (en) * 2008-11-26 2010-07-01 The Johns Hopkins University Methods for identifying cancer risk
CN104762301A (en) * 2015-04-24 2015-07-08 武汉大学 Kit and method for detecting methylation level of liver cancer risk gene TSPYL5
KR20170071724A (en) * 2015-12-16 2017-06-26 연세대학교 산학협력단 Method for diagnosing or predicting hepatocellular carcinoma using dna methylation changes of intragenic cpg island involved in hepatocellular carcinoma specific gene expression
CN109790198A (en) * 2016-09-02 2019-05-21 梅约医学教育与研究基金会 Detection of hepatocellular carcinoma
CN110257511A (en) * 2018-12-29 2019-09-20 武汉生命科技股份有限公司 A method of for detecting the quantitative fluorescent PCR of TSPYL5 gene methylation
WO2022032429A1 (en) * 2020-08-10 2022-02-17 华大数极生物科技(深圳)有限公司 Methylation markers for liver cancer detection and diagnosis
US20220243278A1 (en) * 2021-01-29 2022-08-04 Mayo Foundation For Medical Education And Research Detecting the presence or absence of multiple types of cancer
CN116219020A (en) * 2023-03-02 2023-06-06 北京和瑞精湛医学检验实验室有限公司 Methylation reference gene and application thereof
CN116200499A (en) * 2023-03-23 2023-06-02 北京和瑞精湛医学检验实验室有限公司 Gene combination for liver cancer detection, related reagent and application

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
FU XIAO, YANG YIXUAN, ZHANG DAZHI: "Molecular mechanism of albumin in suppressing invasion and metastasis of hepatocellular carcinoma", LIVER INTERNATIONAL, BLACKWELL MUNKSGAARD, OXFORD, GB, vol. 42, no. 3, 1 March 2022 (2022-03-01), GB , pages 696 - 709, XP093212766, ISSN: 1478-3223, DOI: 10.1111/liv.15115 *
HERNANDEZ-MEZA, G. ET AL.: "DNA Methylation Profiling of Human Hepatocarcinogenesis", HEPATOLOGY, vol. 74, no. 1, 31 July 2021 (2021-07-31), pages 183 - 199, XP071565797, DOI: 10.1002/hep.31659 *

Also Published As

Publication number Publication date
CN116200499A (en) 2023-06-02
CN116200499B (en) 2024-01-19

Similar Documents

Publication Publication Date Title
TWI874916B (en) Methods and systems for tumor detection
CN107771221B (en) Mutation Detection for Cancer Screening and Fetal Analysis
EP2891720B1 (en) Method for screening cancer
WO2021128519A1 (en) Combination of dna methylation biomarkers, and detection method therefor and kit thereof
WO2024192928A1 (en) Gene combination for liver cancer detection, and related reagent and application
WO2020020071A1 (en) Methylation modification-based tumor marker stamp-ep1
CN114072527A (en) Determination of Linear and circular forms of circulating nucleic acids
CN105555965B (en) Method for determining nucleic acid composition in nucleic acid mixtures
WO2010118559A1 (en) A method for screening cancer
WO2020020072A1 (en) Methylation modification-based tumor marker stamp-ep2
WO2024178886A1 (en) Methylation reference gene and use thereof
CN113355415B (en) Detection reagents and kits for esophageal cancer diagnosis or auxiliary diagnosis
WO2023082142A1 (en) Otx1 methylation marker for detecting liver cancer
CN109837344B (en) Methylated EphA7 nucleotide fragment, detection method and application thereof
CN116144782A (en) A combination marker for lung cancer detection and its application
CN113355414B (en) Esophageal cancer detection kit and application thereof
CN120945048A (en) Urothelial carcinoma methylation marker kit
CN116463417A (en) Application of the reagent for detecting the methylation level of the target region in the preparation of liver cancer diagnostic products
WO2020221314A1 (en) Tumor marker stamp-ep7 based on methylated modification and use thereof
Cheruba Development of a Novel Non-Invasive Cancer Detection Method Through Enrichment of CpG-Rich Regions in Cell-Free DNA
CN116814790A (en) Application of PITX2 gene as a marker in detecting lung cancer
TW202540440A (en) Enrichment of clinically-relevant nucleic acids
CN120624647A (en) Methylation marker combination and application, kit and computer-readable storage medium for diagnosing or predicting nasopharyngeal carcinoma
CN116516005A (en) A nucleic acid product, kit and application for detecting head and neck squamous cell carcinoma
CN116042839A (en) Detection markers and their applications

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23928233

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE