[go: up one dir, main page]

WO2020064006A2 - Gene group used for breast cancer molecular typing and distant metastasis risk, diagnostic product, and application - Google Patents

Gene group used for breast cancer molecular typing and distant metastasis risk, diagnostic product, and application Download PDF

Info

Publication number
WO2020064006A2
WO2020064006A2 PCT/CN2019/109459 CN2019109459W WO2020064006A2 WO 2020064006 A2 WO2020064006 A2 WO 2020064006A2 CN 2019109459 W CN2019109459 W CN 2019109459W WO 2020064006 A2 WO2020064006 A2 WO 2020064006A2
Authority
WO
WIPO (PCT)
Prior art keywords
risk
breast cancer
gene
reagent
distant metastasis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2019/109459
Other languages
French (fr)
Chinese (zh)
Other versions
WO2020064006A3 (en
Inventor
周彤
胡志元
周伟庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Precision Diagnostics Co Ltd
Original Assignee
Shanghai Precision Diagnostics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Precision Diagnostics Co Ltd filed Critical Shanghai Precision Diagnostics Co Ltd
Priority to CN201980006647.6A priority Critical patent/CN111566229B/en
Publication of WO2020064006A2 publication Critical patent/WO2020064006A2/en
Publication of WO2020064006A3 publication Critical patent/WO2020064006A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6851Quantitative amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/112Disease subtyping, staging or classification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Definitions

  • the invention belongs to the field of biotechnology, and particularly relates to breast cancer subtypes and distant metastasis risk gene groups, and in vitro diagnostic products and applications thereof.
  • Breast cancer accounts for the first incidence of malignant tumors in women in China, and it is increasing year by year at a rate of about 4%.
  • Breast cancer is a highly heterogeneous tumor, and its pathological molecular mechanism is very complicated, often involving mutations of multiple genes in many types of somatic cells.
  • Breast cancer classification according to the pathological characteristics of the tumor, improving the accuracy of clinical diagnosis and prognosis prediction of breast cancer, is the basis of accurate treatment after breast cancer surgery, and also the key to improving the survival rate of breast cancer patients.
  • the accurate treatment of breast cancer after surgery at home and abroad still faces great challenges.
  • the internationally recognized breast cancer molecular typing product based on tumor gene expression is the PAM50 (Prosigna) jointly developed by Professor Perou of the University of North Carolina and NanoString Corporation, which has been approved by the US FDA.
  • PAM50 divides breast cancer into four subtypes, lumenal A (Luminal A), lumenal B (Luminal B), and HER2 enriched type ( HER2-enriched (HER2 for short) and Basal-like (Basal), and the risk of distant metastasis in breast cancer within 10 years was evaluated based on the subtype and tumor proliferation index.
  • the present invention relates to a group of gene groups, including 66 molecular typing and genes related to distant metastasis risk assessment, and the use of the gene group in molecular diagnosis of breast cancer.
  • the gene population can be used for molecular typing of breast cancer and / or assessing its risk of distant metastasis.
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the gene population further includes a housekeeping gene.
  • the housekeeping genes include at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably all 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.
  • the present invention relates to a gene group (also referred to herein as Precitype) for molecular typing of breast cancer and / or assessing its risk of distant metastasis, which includes 72 Genes, that is, 66 molecular typing and distant metastasis risk assessment related genes and 6 housekeeping genes, of which
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the six housekeeping genes include: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.
  • the present invention relates to the use of the genetic population in performing breast cancer molecular typing and / or assessing the risk of distant metastases.
  • the invention also relates to the use of the gene population in the preparation of a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastases.
  • the invention also relates to a method of molecularly typing breast cancer and / or assessing the risk of distant metastases, said method being performed by using a diagnostic product of the invention.
  • the present invention also relates to a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastases, which comprises a reagent for detecting the expression level of a gene in the gene group of the present invention.
  • the diagnostic product is in the form of an in vitro diagnostic product, preferably in the form of a diagnostic kit.
  • the diagnostic product further comprises a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.
  • the reagent is a probe or a primer.
  • the breast cancer includes luminal type A, luminal type B, HER2-enriched type, basal cell type, and immune-enhanced type (Immune-enhanced, which may also be referred to herein as Immuno ).
  • Figure 1 shows that according to the expression level of 66 genes in breast cancer tissue, breast cancer is divided into luminal A (Lum A), luminal B (Lum B), basal cell type (Basal), and HER2 enriched type (HER2 ) And immune-enhanced (Imm).
  • Lum A luminal A
  • Lum B luminal B
  • Basal basal cell type
  • HER2 HER2
  • Imm immune-enhanced
  • A 1951 European and American cases
  • B 824 Chinese cases.
  • Invasion-related genes ie, CTSL2 and MMP11
  • CTSL2 and MMP11 have fewer genes and have fewer gene expression regions in the heat map, so they are not shown in Figure 1.
  • Figure 2 shows that the risk of distant metastasis is different for each subtype of breast cancer.
  • the risk of distant metastasis of luminal type A was significantly lower than the other four subtypes, and the risk of distant metastasis of immune-enhanced type was significantly lower than that of luminal type B, basal cell type and HER2 enriched type.
  • Lumen B is less than basal cell type (Basal), while basal cell type (Basal) is less than HER2 enriched type (HER2); 10 years after surgery There is no significant difference between the three.
  • the lumen B type was smaller than the basal cell type and the HER2 enriched type within 3 years of surgery. After 10 years of surgery, the HER2 enriched type had the worst prognosis.
  • A 1951 European and American cases
  • B 824 Chinese cases.
  • Figure 3 shows that the immune index has an important effect on the prognosis of breast cancer.
  • strong immune function can significantly reduce the risk of distant metastasis of breast cancer in patients ;
  • low-risk breast cancer subtype lumen A the strength of immune function has no significant effect on the prognosis.
  • A 1951 European and American cases; B, 824 Chinese cases.
  • Figure 4 shows that according to the calculated recurrence risk index, the risk of distant tumor metastasis is divided into three groups, low risk (0-32), medium risk (33-49), and high risk (50-100).
  • A 1951 European and American cases; B, 824 Chinese cases.
  • FIG. 5 shows the second-generation sequencing (NGS) -based gene expression detection of the present invention, and its sensitivity and detection throughput are better than the existing gene chip, quantitative PCR, and Nanostring technologies.
  • NGS second-generation sequencing
  • Figure 6 shows the results of reproducible experiments, which show that the RNA (left) extracted from paraffin sections and the RNA (right) extracted from fresh tissue have excellent correlation.
  • the expression “at least one (species)” or similar expression “one (species) or multiple (species)” means 1 (species), 2 (species), 3 (species), 4 ( Species), 5 (species), 6 (species), 7 (species), 8 (species), 9 (species) or more (species).
  • breast cancer originates from ducts and acinar epithelium at all levels of the breast, and gradually develops from glandular epithelium to atypical hyperplasia. It is divided into carcinoma in situ (non-invasive cancer), early invasive cancer to invasive cancer. In a preferred embodiment, the breast cancer is invasive breast cancer.
  • risk of recurrence refers to the likelihood of a breast cancer patient's tumor recurrence within a specified number of years, including, but not limited to, localized breast cancer recurrence, regional breast cancer recurrence, and distant metastasis. As used herein, "risk of relapse” preferably indicates the possibility of distant metastasis. Therefore, in this article, the assessment of "Risk of Recurrence Score” (RRS) can be expressed as “distant recurrence risk score” or "distant metastasis risk score”.
  • breast cancer recurrence risk assessment can be used to assess the possibility of local recurrence or distant metastases in breast cancer patients.
  • breast cancer can be classified into low risk, medium risk, and high risk according to the recurrence risk score.
  • Chemotherapy is generally considered to be low risk, while chemotherapy at high risk reduces the risk of recurrence or distant metastases.
  • the medium risk requires the clinician to make judgments based on the actual clinical situation. Such a "medium risk" has less guiding significance in clinical treatment. Therefore, the fewer cases of medium risk, the more efficient the assessment will be.
  • the risk of breast cancer recurrence is preferably assessed for the possibility of distant metastases.
  • RNA transcript refers to total RNA, that is, coding or non-coding RNA, including RNA directly from tissue or peripheral blood samples, and also indirectly from tissue or blood samples after cell lysis.
  • Total RNA includes tRNA, mRNA, and rRNA, where mRNA includes mRNA transcribed from the target gene, and mRNA from other non-target genes.
  • polypeptide refers to a compound consisting of amino acids connected by peptide bonds, including the full length or amino acid fragments of a polypeptide.
  • the amount of the polypeptide encoded by the gene can be standardized with respect to the amount of total protein in the sample or the amount of the polypeptide encoded by the housekeeping gene.
  • RNA transcripts can be detected and quantified, for example, by methods of hybridization, amplification or sequencing. For example, hybridize RNA transcripts with probes or primers.
  • hybridization refers to the process under which the two nucleic acid fragments are combined by stable and specific hydrogen bonds to form a double helix complex under appropriate conditions.
  • amplification primer or “primer” refers to a nucleic acid fragment containing 5 to 100 nucleotides, preferably, 15 to 30 cores capable of initiating an enzymatic reaction (eg, an enzymatic amplification reaction). Glycylic acid.
  • hybridization probe refers to a nucleic acid sequence including at least 5 nucleotides, for example, containing 5 to 100 nucleotides, which can interact with the expression product of the target gene or the expansion of the expression product under specified conditions. The gain products hybridize to form a complex.
  • the hybridization probe may further include a label for detection.
  • the label includes, but is not limited to, a label for fluorescent quantitative PCR or fluorescent in situ hybridization.
  • the label may be FAM, HEX, VIC, Cy5, or the like.
  • the label may be biotin, digoxin and the like.
  • the detection of the expression level of a gene described herein may employ assay methods known in the art, including, but not limited to, a method of detecting the amount of an RNA transcript of the gene or the amount of a polypeptide encoded by the gene.
  • RNA transcript of a gene can be converted into cDNA complementary thereto by methods known in the art, and the amount of RNA transcript can be obtained by measuring the amount of complementary cDNA.
  • the amount of a gene's RNA transcript or its complementary cDNA can be normalized to the amount of total RNA or total cDNA in a sample or to a set of housekeeping genes' RNA transcripts or its complementary cDNA.
  • RNA transcripts can be detected and quantified by methods such as hybridization, amplification or sequencing, including but not limited to methods for hybridizing RNA transcripts with probes or primers, by polymerase chain reaction (PCR) -based A method of detecting the amount of RNA transcript or its corresponding cDNA product by various quantitative PCR techniques or sequencing techniques.
  • the quantitative PCR technology includes, but is not limited to, fluorescent quantitative PCR, real-time PCR, or semi-quantitative PCR technology.
  • the sequencing technology includes, but is not limited to, Sanger sequencing, second-generation sequencing, third-generation sequencing, and single-cell sequencing.
  • the sequencing technology is second-generation sequencing, more preferably targeted RNA-seq technology.
  • the amount of polypeptide can be detected by, for example, proteomics or reagents.
  • the reagent is an antibody, an antibody fragment or an affinity protein.
  • the invention relates to a group of gene groups, which includes 66 genes for molecular typing and distant metastasis risk assessment.
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the gene population of the present invention further includes a housekeeping gene.
  • the housekeeping gene includes at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.
  • a set of gene groups is provided, which includes 72 genes, that is, 66 molecular typing and distant metastasis risk assessment related genes, and 6 housekeeping genes.
  • This gene group is also referred to herein as Precitype.
  • the 66 molecular typing and distant metastasis risk assessment related genes and 6 housekeeping genes are as described above.
  • the gene group of the present invention is shown in Table 1.
  • Housekeeping genes are used to normalize and correct target gene expression levels. Inclusion criteria for housekeeping genes that can be considered are: 1. Stable expression in tissues, whose expression level is not affected or less affected by pathological conditions or drug treatment; 2. Expression levels should not be too high to avoid expression data ( (For example, obtained through second-generation sequencing), the proportion of data obtained is too high, which affects the accuracy of data detection and interpretation of other genes. Therefore, housekeeping genes may be included in the gene group of the present invention.
  • the housekeeping genes considered include at least one of GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC, such as 1, 2, 3, 4, 5, or 6, preferably at least 3, and most preferably all 6 .
  • the housekeeping genes considered include GAPDH, GUSB, MRPL19, PSMC4, SF3A1, TFRC, or any combination thereof.
  • the present invention also relates to a group of immune-related genes comprising 17 genes: APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ, and PTPRC.
  • the genes in the immune-related gene group are also shown in the related information in Table 1.
  • the gene population of the present invention can be used for molecular typing (subtyping) of breast cancer and / or assessing its risk of distant metastasis.
  • the subtypes of breast cancer include luminal A, luminal B, HER2-enriched, basal cell-like, and immune-enhanced.
  • the invention relates to a diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis, which comprises an agent for detecting the expression level of genes in the gene group of the invention.
  • the gene population is as described above.
  • the gene group of the present invention may include 66 genes related to molecular typing and distant metastasis risk assessment.
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the gene population of the present invention further includes a housekeeping gene.
  • the housekeeping gene includes at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1 and TFRC.
  • the gene group is shown in Table 1.
  • the diagnostic product is an in vitro diagnostic product.
  • the diagnostic product is a diagnostic kit.
  • the diagnostic product is used for breast cancer subtype and / or distant metastasis risk assessment.
  • the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene. In yet another embodiment, the reagent is a reagent that detects the amount of cDNA complementary to the mRNA.
  • the diagnostic product further includes a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.
  • the total RNA extraction reagent may be a conventional total RNA extraction reagent in the art. Examples include, but are not limited to, Qiagen 73504, RNA Storm CD201, Invitrogen and ABI AM1975.
  • the reverse transcription reagent may be a conventional reverse transcription reagent in the art, and preferably includes a dNTP solution and / or an RNA reverse transcriptase.
  • Examples of reverse transcription reagents include, but are not limited to, NEB M0368L, Thermo K1622, ABI 4366596.
  • the second-generation sequencing reagent may be a reagent commonly used in the art, as long as it can meet the requirements for performing second-generation sequencing on the obtained sequence.
  • Second-generation sequencing reagents can be commercially available products, examples of which include, but are not limited to, Illumina Reagent Kit v3 (150cycle) (MS-102-3001), Targeted RNA Index Kit A-96 Indicators (384 Samples) (RT-402-1001).
  • Next-generation sequencing is conventional second-generation sequencing in the art, such as targeted RNA-seq technology. Therefore, next-generation sequencing reagents can also include Illumina custom-made reagents that can be used to build libraries that target RNA-seq, such as Targeted RNA Custom Panel Kit (96 Samples) (RT-102-1001).
  • the reagent is a probe or a primer.
  • the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.
  • the reagent is a primer.
  • the primer is a synthetic oligonucleotide fragment (preferably having high specificity) as long as it is complementary to a partial sequence of a gene in the gene group of the present invention, and the gene therein is amplified.
  • Primers can be synthetic. More preferably, the sequences of the primers are as shown in SEQ ID No. 1-SEQ ID No. 132 and SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can be seen in Table 2.
  • an embodiment of the present invention also relates to a diagnostic product (preferably an in vitro diagnostic product, especially a kit), which comprises a primer whose sequence is as described in SEQ ID NO.1-SEQ ID NO.132.
  • the diagnostic product further comprises a primer, and the sequence of the primer is as described in SEQ ID NO. 133-SEQ ID NO. 144.
  • the diagnostic product of the present invention (preferably an in vitro diagnostic product, especially a kit) comprises a primer whose sequence is shown in SEQ ID NO.1-SEQ ID NO.144 (see also Table 2 ).
  • the diagnostic product (preferably in the form of a kit) of the present invention also preferably includes a device for extracting a test sample from a subject; for example, a device for extracting tissue or blood from a subject, preferably any blood that can be used for blood collection Needles, syringes, etc.
  • the subject is a mammal, preferably a human, especially a female with breast cancer.
  • the primers can be used to detect the expression level of the gene population.
  • the present invention also relates to a method for determining a breast cancer molecular typing and / or distant metastatic risk in a subject, the method comprising
  • the method of the invention can be used for diagnostic or non-diagnostic purposes.
  • the subject for use in the method of the invention is a mammal, preferably a human, especially a female with breast cancer.
  • the method of providing a sample in step (1) and the sample used are not particularly limited, as long as the expression level of genes in a gene group can be obtained therefrom, for example, total RNA of a subject can be extracted from the sample.
  • the sample is preferably a sample of tissue, blood, plasma, body fluid or a combination thereof, preferably a tissue sample, especially a paraffin tissue sample.
  • the sample is a tumor tissue sample or a tissue sample containing tumor cells, particularly a breast tumor tissue sample or a tissue sample containing breast tumor cells.
  • step (2) the expression level of a gene in the gene group of the present invention is determined.
  • This gene group is as described above.
  • the gene group can also be described in Table 1.
  • the method of the present invention can be performed by a reagent for detecting a gene expression level in the gene group.
  • the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene.
  • the reagent is a reagent that detects the amount of cDNA complementary to the mRNA.
  • the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.
  • the reagent is a probe or a primer, especially a primer. More preferably, the sequence of the primer is as described in SEQ ID NO.1-SEQ ID NO.132. In one embodiment, a primer is also used, and the sequence of the primer is as described in SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can also be shown in Table 2.
  • the method of the present invention can be performed by a diagnostic product (especially a diagnostic kit) of the present invention.
  • step (2) may include
  • the extraction in step (2-1) can be performed by a conventional method in the art, and preferably, a total RNA of fresh frozen tissue or paraffin-embedded tissue of a subject is detected by using an RNA extraction kit.
  • extraction can be performed using Roche's RNA extraction kit (product number: RocheCatalogNumber # 3270289001) or Qiagen's RNA extraction kit (Qiagen, Rnease, FFPE, Kitalog ##, 504). .
  • a method for constructing a library may include the following steps:
  • the extracted total RNA was reverse transcribed to generate cDNAs of the genes described in Table 1.
  • the ends were filled and phosphorylated at the 5 ′ end.
  • 30 ⁇ l of DNA, 45 ⁇ l of purified water, 10 ⁇ l of T4 DNA ligase buffer with 10 mM ATP, 4 ⁇ l of 10 mM dNTP Mix, 5 ⁇ l of T4 DNA polymerase, 1 ⁇ l of Klenow enzyme, and 5 ⁇ l of T4 ligase were mixed. Incubate at 20 ° C for 30 minutes (reagent is Illumina sample preparation kit PE-102-1001). After incubation, use QIAGEN QIAquick PCR purification kit (part # 28104) to purify DNA.
  • End suspension A Dissolve the product from the previous step in 32 ⁇ l of buffer, add 5 ⁇ l of Klenow buffer, 10 ⁇ l of 1 mM dATP, and 3 ⁇ l of Klenow ⁇ , and keep at 37 ° C for 30 minutes (reagent is Illumina sample preparation kit).
  • the products were ligated by QIAGEN MinElute PCR purification kit (part # 28004): DNA was dissolved in 10 ⁇ l buffer, DNA ligase buffer 2 ⁇ 25 ⁇ l, PE adapter Oligo Mix 10 ⁇ l, DNA ligase 5 ⁇ l, and kept at 20 ° C for 15 minutes ( The reagent was Illumina sample preparation kit PE-102-1001). After warming, the DNA was purified using QIAGEN QIAquick PCR purification kit (part # 28104) to obtain a library.
  • Step (2-3) can be completed by RNA sequencing.
  • the sequencing method may be a conventional RNA-seq sequencing method for determining a gene expression level in the art.
  • Illumina NextSeq / MiSeq / MiniSeq / iSeq series sequencers are preferably used for next-generation sequencing.
  • the primers in the kit were used to amplify the genes shown in Table 1.
  • the obtained gene sequence could be sequenced twice.
  • the secondary sequencing is a targeted RNA-seq technology, and double-ended sequencing is performed with an Illumina NextSeq / MiSeq / MiniSeq / iSeq sequencer. Such a process can be done automatically by the instrument itself.
  • step (3) may be completed by performing statistical analysis on the obtained sequencing results.
  • Breast cancer typing and risk prediction can optionally be performed according to the single sample prediction method (SSP (Single Sample Predictor) and Parker) pioneered by Hu et al.
  • SSP Single Sample Predictor
  • the gene expression data of the obtained sequencing results are analyzed to obtain a subtype of a single sample, and the risk of distant metastasis can be calculated.
  • the self-developed analysis software EPIG was used to calculate the close relationship with breast cancer metastasis through unsupervised clustering and paired correlation analysis. Relevant and statistically significant gene expression profiles, and based on this to obtain meaningful genes in the expression profile. Among them, the two most relevant genes are the cell cycle and immune response genes, respectively.
  • the specific method is to randomly extract 1017 chip data (50%) from 2034 gene chip expression data, calculate expression profiles and related genes, and repeat the extraction and calculation 1000 times.
  • the detection method of the present invention can be used for diagnostic or non-diagnostic purposes.
  • the present invention also provides the application of the gene population of the present invention to molecular typing of breast cancer and / or assessing the risk of distant metastasis.
  • the invention also provides the application of the gene group of the invention in the preparation of a product for molecularly typing breast cancer and / or assessing its risk of distant metastasis.
  • the product is in the form of a detection kit.
  • the invention further provides the application of the reagent for detecting the gene expression level in the gene group of the invention in preparing an in vitro diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastasis.
  • the product is in the form of a detection kit.
  • the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene.
  • the reagent is a reagent that detects the amount of cDNA complementary to the mRNA.
  • the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.
  • the reagent is a probe or a primer, especially a primer. More preferably, the sequences of the primers are as shown in SEQ ID No. 1-SEQ ID No. 132 and SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can also be shown in Table 2.
  • the subtypes of breast cancer include lumen A, lumen B, HER2 enriched, basal cell, and immune-enhanced.
  • the detection sample used in the present invention is preferably a tissue from a test object (subject object), as long as the total RNA of the test object can be extracted from the test sample.
  • the test sample is preferably one or more of a tissue sample, blood, plasma, and body fluid, and more preferably a tissue sample, such as a paraffin tissue sample.
  • the test sample is a tissue with a high content of tumor cells.
  • the present invention also relates to a group of proliferation-related genes including ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C, and ZWINT ( (See also the relevant information in Table 1).
  • the present invention also relates to a group of immune-related genes, including APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC ( (See also the relevant information in Table 1).
  • the present invention also relates to a combination of a proliferation-related gene and an immune-related gene, wherein the proliferation-related gene and the immune-related gene are as described above.
  • the present invention also relates to the immune-related genes (APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC) or all The proliferation-related genes (ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C, and ZWINT) or a combination of the two, Or the use of related reagents for detecting the immune-related genes, the proliferation-related genes, or a combination thereof in the preparation of in vitro diagnostic products for molecular typing of breast cancer and / or assessment of the risk of distant metastasis of breast cancer.
  • the proliferation-related genes (ASPM, AURKA, BI
  • the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene.
  • the reagent is a reagent that detects the amount of cDNA complementary to the mRNA.
  • the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.
  • the reagent is a probe or a primer, especially a primer. More preferably, the sequences of the primers can be shown in Table 2.
  • the present invention also relates to a method for determining a breast cancer molecular typing and / or distant metastatic risk in a subject.
  • the steps of the method are described above, wherein the corresponding gene group is the immune-related gene, the proliferation-related gene, or a combination thereof.
  • a new breast cancer subtype (immunoenhancement type) can be determined, and different effects and effects on distant metastasis of breast cancer can be shown in different subtypes.
  • a gene group for molecular typing of breast cancer and / or assessing the risk of distant metastasis characterized in that the gene group comprises 66 genes related to molecular typing and risk assessment for distant metastasis, wherein:
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the gene group also includes a housekeeping gene
  • the housekeeping gene includes at least one of the following (such as 1, 2, 3, 4, 5, 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.
  • a gene group for molecular typing of breast cancer and / or assessing its risk of distant metastasis characterized by:
  • the gene group includes 66 genes related to molecular typing and distant metastasis risk assessment, and 6 housekeeping genes, of which,
  • the 66 molecular typing and distant metastasis risk assessment related genes include:
  • the six housekeeping genes include: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.
  • a diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis comprising a reagent for detecting the expression level of a gene in the gene group according to any one of items 1-3.
  • reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene.
  • reagent is a reagent that detects an amount of cDNA complementary to the mRNA.
  • the diagnostic product further comprises a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.
  • the reagent is a reagent for detecting the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.
  • a set of primers for molecular typing of breast cancer and / or assessing the risk of distant metastasis wherein the sequence of said primers is shown in SEQ ID NO.1-SEQ ID NO.132.
  • the primer set of item 15 further comprising a primer having a sequence shown in SEQ ID NO. 133-SEQ ID NO. 144.
  • a set of primers for molecular typing of breast cancer and / or assessing the risk of distant metastasis wherein the sequence of said primers is shown in SEQ ID NO.1-SEQ ID NO.144.
  • the scheme of the present invention introduces immune regulation genes into the molecular typing of breast cancer for the first time, and can further strengthen the rationality of molecular typing of breast cancer by considering the expression levels of other genes And improve the ability to guide the clinical treatment of breast cancer, and molecularly type breast cancer into lumen A, lumen B, HER2 enriched, basal cell, and immune-enhanced.
  • a more accurate distinction can also be made between these types of transfer risks.
  • the risk of distant metastasis of luminal type A is significantly lower than the other four subtypes, while the risk of distant metastasis of immune-enhanced type is significantly lower than that of luminal type B, basal cell type and HER2 enriched type.
  • the prognosis of breast cancer with different types can be judged more accurately.
  • strong immune function can significantly reduce the risk of distant breast cancer metastasis.
  • the immune genome can be further subdivided.
  • the second-generation sequencing is used to detect gene expression and is used for molecular typing.
  • the gene group and corresponding products of the present invention can increase detection sensitivity, improve detection capability and efficiency, and greatly reduce detection costs.
  • the present invention is more accurate for molecular typing of breast cancer, can calculate the risk of distant metastasis, predict the risk of distant metastasis, and can guide clinical treatment more accurately.
  • the products and methods of the present invention have high reliability, high sensitivity, and high repeatability.
  • the products and methods of the present invention can significantly reduce the proportion of types that cannot be typed, thereby providing more accurate and efficient typing methods and products.
  • the products and methods of the present invention can also significantly reduce the so-called "medium risk” ratio. And this type of "medium risk” is difficult to judge in clinical treatment, and the guidance is poor.
  • Example 1 Screening of genes associated with breast cancer subtypes and distant metastasis risk
  • Results A total of 66 genes and 6 housekeeping genes related to breast cancer subtype and distant metastasis risk were screened out, that is, 72 gene test combinations. The gene list is shown in Table 1.
  • Example 2 Using the screened breast cancer subtypes and distant metastasis risk-related gene groups to perform molecular typing and distant metastasis risk assessment of breast cancer
  • the 66 breast cancer molecular genotyping groups shown in Table 1 were used to molecularly type the above breast cancer cases to divide breast cancer tumors into 5 subtypes (Figure 1), and their treatment can be given Different suggestions:
  • Patients with luminal A tumors have a low mutation rate of p53 gene and a good prognosis. They are not sensitive to chemotherapy and are suitable for endocrine therapy. Therefore, it has guiding significance for clinical endocrine therapy.
  • Lumen B is a tumor that is sensitive to endocrine therapy, but for HER2-positive patients, tamoxifen treatment is less effective than lumen A, and aromatase inhibitors are better. For patients with HER2-positive lumen type B tumors, molecular targeted therapy can be performed.
  • HER2-enriched tumors have a high mutation rate of p53 gene and relatively poor tumor differentiation. This type is relatively sensitive to targeted molecular therapy but has a poor prognosis.
  • HER2-enriched tumors are widely treated with Herceptin combined with systemic chemotherapy.
  • Basal cell tumors are the most invasive tumors, namely triple negative tumors (ER-, HER2-, PR-). Basal cell tumor patients are not sensitive to current breast cancer treatment options and generally have a poor clinical prognosis; however, if the tumor tissues have strong immune-related gene expression, the prognosis is relatively good.
  • the Kaplan-Meier survival curve can be drawn to obtain the corresponding metastatic risk.
  • the distant transfer risks of the five subtypes are different.
  • Luminous type A has the lowest risk of distant metastasis, which is a low-risk subtype.
  • luminal type B is lower than the HER2 enriched type and basal cell type. It is said that the lumen B type, HER2 enriched type and basal cell type are not significantly different, and they are all high-risk subtypes, while the immune-enhancing type is between the two groups and belongs to intermediate-risk subtypes.
  • Each subtype can be further divided into two groups based on the immune index, that is, the strong immune group and the weak immune group, and the difference in metastatic risk between the two groups is observed.
  • the strong immune group and the weak immune group the difference in metastatic risk between the two groups is observed.
  • Lumenal B Basal-like and HER2-enriched subtypes, the risk of distant metastasis was significantly lower in patients with strong immunity (P ⁇ 0.05). Immunely weak cases; Lumenal A, immunologically weak cases and distantly metastatic cases showed no significant difference in distant metastatic risk (Figure 3).
  • the immune index can be obtained by averaging the expression levels of the 17 immune genes:
  • the Cox model is used to calculate the risk of distant metastasis of the tumor.
  • the occurrence and time of distant metastasis are used as the observation endpoint.
  • Corresponding coefficients are determined based on the relative risk of the tumor subtype, immune index, and proliferation index for distant metastasis. Methods as below:
  • RRS 0.2x Basal + 0.4x HER2-0.2x Immune-0.1x LumA + 0.2x LumB + 0.3x proliferation index-0.1x immune index
  • Basal represents the pearson correlation coefficient between the tumor and the basal cell tumor
  • HER2 represents the pearson correlation coefficient between the tumor and the HER2-enriched tumor
  • “LumA” represents the pearson correlation coefficient between this tumor and lumen type A tumor
  • “LumB” represents the pearson correlation coefficient between the tumor and luminal type B tumor
  • Immuno represents the pearson correlation coefficient between the tumor and the immune-enhanced tumor.
  • the risk of distant metastasis of the tumor can be divided into three groups, low risk (0-32), medium risk (33-49), and high risk ( 50-100).
  • RNA-seq technology based on Illumina mid-throughput second-generation sequencing platform MiSeq was used to detect fresh tumor tissue or paraffin-embedded tumor tissue of 300 Chinese breast cancer patients.
  • the expression levels of 72 genes including genes were used as the basis for further breast cancer typing.
  • Second-generation sequencing database The raw data of all next-generation sequencing of fresh tumor tissue or paraffin-embedded tumor tissue of 300 Chinese breast cancer patients described in Example 3 will be uploaded to a web-based data storage and analysis database.
  • This method uses Java software to develop and apply many J2EE (Java Enterprise Edition) components and modes. It can 1) directly input data from Illumina NextSeq / MiSeq / MiniSeq / iSeq instruments; 2) it can display the entered data in a flexible manner.
  • J2EE Java Enterprise Edition
  • Data can be indexed according to different needs, such as gene correlation, samples or experimental groups; 3) calculation of gene expression profiles after normalization of housekeeping genes; 4) analysis of specific element details; 5) , Output data in different formats, such as XML, excel and text formats; 6), can be safely managed to ensure data privacy protection.
  • RNA sequencing Obtain postoperative paraffin samples from Chinese breast cancer patients through partner hospitals, extract RNA in accordance with the previous steps, perform quality inspection, build a library by reverse transcription, use the Illumina NextSeq / MiSeq / MiniSeq / iSeq platform for RNA sequencing, and detect 72 gene expression levels
  • PRS Precitype Risk Score
  • Example 4 Analysis method of second-generation sequencing test kit for assessing breast cancer molecular typing and distant metastasis risk gene groups
  • Step 1 Take the tumor or paraffin-embedded tissue of the test object, and use the method in the test kit to obtain the region of the test object that contains high tumor cells as the original material.
  • Step 2 Extract total RNA from the tissue.
  • Step 3 The obtained RNA is made into a library for sequencing.
  • the RNA of the obtained tissue is made into a library that can be used for targeted RNA-seq second-generation sequencing.
  • the method for preparing the library includes the following steps:
  • RNA of the extracted tissue was generated under the guidance of a specific primer using reverse transcriptase to generate cDNAs of a variety of genes of interest (72 genes described in Table 1). The ends were filled and phosphorylated at the 5 ′ end.
  • 30 ⁇ l of DNA 45 ⁇ l of purified water, 10 ⁇ l of T4 DNA ligase buffer with 10 mM ATP, 4 ⁇ l of 10 mM dNTP Mix, 5 ⁇ l of T4 DNA polymerase, 1 ⁇ l of Klenow enzyme, 5 ⁇ l of ⁇ 4 After mixing, incubate at 20 ° C for 30 minutes (reagents from Illumina sample preparation kit PE-102-1001).
  • End suspension A Dissolve the product from the previous step in 32 ⁇ l of buffer, add 5 ⁇ l of Klenow buffer, 10 ⁇ l of 1 mM dATP, and 3 ⁇ l of Klenow ⁇ -3 ⁇ l, and keep at 37 ° C for 30 minutes (reagent from Illumina sample preparation kit).
  • MinElute PCR purification kit (part # 28004) ligation: DNA was dissolved in 10 ⁇ l buffer, DNA ligase buffer 2 ⁇ 25 ⁇ l, PE adapter Adapter Oligo Mix 10 ⁇ l, DNA ligase 5 ⁇ l, and kept at 20 ° C for 15 minutes (reagent is Illumina sample Prepare the kit PE-102-1001). After warming, use the QIAGEN QIAquick PCR purification kit (part # 28104) to purify the DNA to obtain the library.
  • Step 4 Use the primer sequences in Table 2 of Example 3 to perform the next-generation sequencing on the resulting DNA library using NextSeq / MiSeq / MiniSeq / iSeq. Double-ended sequencing was performed with an Illumina NextSeq / MiSeq / MiniSeq / iSeq sequencer. This process is done automatically by the instrument itself (Illumina).
  • Step 5 Statistical analysis of results.
  • the obtained sequencing results were analyzed statistically, and breast cancer typing and risk prediction were performed according to the single sample prediction method (SSP (Single Sample Predictor) or Parker) proposed by Hu et al.
  • SSP Single Sample Predictor
  • the gene expression data of the obtained sequencing results were analyzed.
  • the detection method of the present invention is the best in both sensitivity and copy number detection capabilities.
  • the experimental research of the present invention shows that the sensitivity of the second-generation sequencing to determine the gene expression profile is much higher than that of the gene chip method, and at the same time, it is better than the quantitative PCR and Nanostring method in the detection throughput (Figure 5).
  • the detection method of the present invention has high repeatability. Correlation coefficients were higher than 0.97 in 7 repeated experiments with paraffin tissue RNA. With 15 replicates of fresh frozen tissue, the correlation coefficient was higher than 0.99 ( Figure 6).
  • CN 104293910A discloses a group of gene groups, which are composed of 60 genes (Comparative I). It can be seen from the comparison between the scheme of the gene group of the present invention and the scheme of the present invention that a new immune-enhancing type is introduced, so that the subtype of breast cancer can be classified more accurately. Moreover, these types of transfer risks can be more accurately distinguished. As mentioned above, the risk of distant metastasis of luminal type A is significantly lower than the other four subtypes, while the risk of distant metastasis of immune-enhanced type is significantly lower than that of luminal type B, basal cell type and HER2 enriched type. Furthermore, the prognosis of breast cancer with different types can be judged more accurately. For lumen B, basal cell, and HER2-enriched types in high-risk breast cancer subtypes, strong immune function can significantly reduce the risk of distant metastasis of breast cancer.
  • Example 2 of the present invention 1951 breast cancer tumors with complete clinical information in 2034 cases were subjected to typing analysis and distant metastasis risk assessment.
  • the scheme of the present invention (such as 66 in Table 1) Molecular typing and risk of distant metastasis and 6 housekeeping genes) were compared with comparison I. The results are shown in Tables 4 and 5. For a detailed description of the results, please refer to Comparative Example 2.
  • the scheme of the present invention can significantly reduce the proportion of Mixed cases.
  • the solution of the present invention provides a more complete analysis system, for example, the types of typing are increased. At the same time, more accurate typing can be achieved. It also significantly reduced the number of Mixed cases.
  • the effect of the scheme of the present invention is obviously better than that of the prior art, and the proportion of medium risk is significantly reduced.
  • Comparison II (59 genes): The 20 proliferation-related genes in the above comparison I were replaced with 19 proliferation-related genes used in the scheme of the present invention.
  • Comparison III (77 genes): The above comparison I was added to the 17 immune-related genes used in the present invention.
  • Contrast I 60 genes
  • Contrast II 59 genes
  • 12.70% Contrast III 77 genes
  • the scheme of the present invention can significantly reduce the proportion of Mixed cases. From the differences between the protocol of the present invention and Comparative II or the difference between Comparative I and Comparative III shown in the table above, it can be known that the protocol of the present invention introduces immune regulatory genes into the molecular typing of breast cancer for the first time, making the mixed case The proportion is significantly reduced.
  • the protocol of the present invention (66 molecular typing and distant metastatic risk and 6 housekeeping genes described in Table 1) is compared with Comparative I, Comparative II, and Comparative III below.
  • the distant metastasis risk score was calculated based on the gene expression level in the 1951 breast cancers described above, and the risk of distant metastasis was divided into three groups: low risk, medium risk, and high risk. The results are shown in Table 7, where Shows the proportion of "medium risk" types that are less instructive in clinical treatment in the total sample.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Wood Science & Technology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • Hospice & Palliative Care (AREA)
  • Oncology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Disclosed are a gene group used for breast cancer subtyping and distant metastasis risk evaluation, an in vitro diagnostic product, and an application.

Description

乳腺癌分子分型及远处转移风险基因群及诊断产品和应用Breast cancer molecular typing and distant metastasis risk gene group and diagnostic products and applications

本申请要求2018年9月30日提交的,题为“乳腺癌分子分型及远处转移风险基因群及诊断产品和应用”的第201811166148.7号中国专利申请的优先权,该申请的内容整体援引加入本文。This application claims the priority of Chinese Patent Application No. 201811166148.7, entitled "Molecular Typing and Breast Cancer Risk Genomes and Diagnostic Products and Applications", filed on September 30, 2018, the content of which is incorporated by reference in its entirety Join this article.

技术领域Technical field

本发明属于生物技术领域,具体涉及乳腺癌亚型分型及评估远处转移风险基因群及其体外诊断产品和应用。The invention belongs to the field of biotechnology, and particularly relates to breast cancer subtypes and distant metastasis risk gene groups, and in vitro diagnostic products and applications thereof.

背景技术Background technique

乳腺癌占我国妇女恶性肿瘤发病的第一位,并以每年4%左右的速度逐年增加。乳腺癌是高度异质性的肿瘤,其病理分子机制非常复杂,往往涉及多种类型的体细胞中多个基因的突变。根据肿瘤的病理特征进行乳腺癌分型,提高乳腺癌的临床诊断和预后预测的准确度,是乳腺癌手术后精准治疗的基础,也是提高乳腺癌患者生存率的关键。目前国内外对于手术后的乳腺癌的精准治疗仍面临很大挑战。对于术后后继治疗的患者,如何选择具有针对性的治疗方案,提高治疗效率,需要新的技术手段来帮助医生和病人进行准确的判断。如何对乳腺癌患者进行精确分类,进而指导临床进行有效治疗,并同时避免无效或有害治疗是本发明的主要目标。Breast cancer accounts for the first incidence of malignant tumors in women in China, and it is increasing year by year at a rate of about 4%. Breast cancer is a highly heterogeneous tumor, and its pathological molecular mechanism is very complicated, often involving mutations of multiple genes in many types of somatic cells. Breast cancer classification according to the pathological characteristics of the tumor, improving the accuracy of clinical diagnosis and prognosis prediction of breast cancer, is the basis of accurate treatment after breast cancer surgery, and also the key to improving the survival rate of breast cancer patients. At present, the accurate treatment of breast cancer after surgery at home and abroad still faces great challenges. For patients undergoing subsequent treatment, how to choose a targeted treatment plan and improve treatment efficiency requires new technical means to help doctors and patients make accurate judgments. How to accurately classify breast cancer patients, and then guide clinical treatment for effective treatment, while avoiding ineffective or harmful treatment is the main objective of the present invention.

当前国际上公认的基于肿瘤基因表达的乳腺癌分子分型产品为北卡罗来纳大学Perou教授与NanoString公司合作共同开发的PAM50(Prosigna),其已得到美国FDA批准。PAM50通过检测55个基因表达水平,将乳腺癌分为4种亚型,管腔A型(Luminal A,简称Lum A)、管腔B型(Luminal B,简称Lum B)、HER2富集型(HER2-enriched,简称HER2)及基底细胞型(Basal-like,简称Basal),并根据亚型及肿瘤增殖指数评估乳腺癌10年内远处转移的危险性。The internationally recognized breast cancer molecular typing product based on tumor gene expression is the PAM50 (Prosigna) jointly developed by Professor Perou of the University of North Carolina and NanoString Corporation, which has been approved by the US FDA. By detecting the expression levels of 55 genes, PAM50 divides breast cancer into four subtypes, lumenal A (Luminal A), lumenal B (Luminal B), and HER2 enriched type ( HER2-enriched (HER2 for short) and Basal-like (Basal), and the risk of distant metastasis in breast cancer within 10 years was evaluated based on the subtype and tumor proliferation index.

发明内容Summary of the Invention

在一方面,本发明涉及一组基因群,其包括66个分子分型及远处转移风险评估相关基因,以及该基因群在乳腺癌分子诊断中的应用。所述基因群可以用于乳腺癌分子分型和/或评估其远处转移风险。In one aspect, the present invention relates to a group of gene groups, including 66 molecular typing and genes related to distant metastasis risk assessment, and the use of the gene group in molecular diagnosis of breast cancer. The gene population can be used for molecular typing of breast cancer and / or assessing its risk of distant metastasis.

所述66个分子分型及远处转移风险评估相关基因包括:The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11。(6) Invasion related genes CTSL2 and MMP11.

在一实施方案中,所述基因群还包括看家基因。优选地,看家基因包括以下中的至少1个(例如1、2、3、4、5或6个)、优选至少3个、最优选全部6个:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。In one embodiment, the gene population further includes a housekeeping gene. Preferably, the housekeeping genes include at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably all 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.

在一具体的方面,本发明涉及一种用于乳腺癌分子分型和/或评估其远处转移风险的基因群(在本文中又称为为普瑞泰(Precitype)),其包括72个基因,即66个分子分型及远处转移风险评估相关基因以及6个看家基因,其中In a specific aspect, the present invention relates to a gene group (also referred to herein as Precitype) for molecular typing of breast cancer and / or assessing its risk of distant metastasis, which includes 72 Genes, that is, 66 molecular typing and distant metastasis risk assessment related genes and 6 housekeeping genes, of which

所述66个分子分型及远处转移风险评估相关基因包括:The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11;(6) Invasion related genes CTSL2 and MMP11;

所述6个看家基因包括:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。The six housekeeping genes include: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.

在一实施方案中,本发明的基因群中的基因的信息还参见表1。For information on genes in the gene group of the present invention, see Table 1 in one embodiment.

在另一方面,本发明涉及所述基因群在进行乳腺癌分子分型和/或评估远处转移风险中的应用。In another aspect, the present invention relates to the use of the genetic population in performing breast cancer molecular typing and / or assessing the risk of distant metastases.

在又一方面,本发明还涉及所述基因群在制备对乳腺癌进行分子分型和/或评估远处转移风险的诊断产品中的应用。在另外的方面,本发明还涉及一种对乳腺癌进行分子分型和/或评估远处转移风险的方法,所述方法通过使用本发明的诊断产 品完成。In yet another aspect, the invention also relates to the use of the gene population in the preparation of a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastases. In a further aspect, the invention also relates to a method of molecularly typing breast cancer and / or assessing the risk of distant metastases, said method being performed by using a diagnostic product of the invention.

在进一步的方面,本发明还涉及一种对乳腺癌进行分子分型和/或评估远处转移风险的诊断产品,其包含检测本发明的基因群中基因的表达水平的相关试剂。在优选的实施方案中,所述诊断产品为体外诊断产品的形式,优选诊断试剂盒的形式。在进一步优选的实施方案中,所述诊断产品还包括总RNA抽提试剂、逆转录试剂和/或二代测序试剂。In a further aspect, the present invention also relates to a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastases, which comprises a reagent for detecting the expression level of a gene in the gene group of the present invention. In a preferred embodiment, the diagnostic product is in the form of an in vitro diagnostic product, preferably in the form of a diagnostic kit. In a further preferred embodiment, the diagnostic product further comprises a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.

在进一步的实施方案中,所述试剂为探针或引物。In a further embodiment, the reagent is a probe or a primer.

在本发明的一优选实施方案中,所述乳腺癌包括管腔A型、管腔B型、HER2富集型、基底细胞型及免疫增强型(Immune-enhanced,在本文中又可称为Immuno)。In a preferred embodiment of the present invention, the breast cancer includes luminal type A, luminal type B, HER2-enriched type, basal cell type, and immune-enhanced type (Immune-enhanced, which may also be referred to herein as Immuno ).

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1表示根据乳腺癌组织中66个基因表达水平,将乳腺癌分为管腔A型(Lum A)、管腔B型(Lum B)、基底细胞型(Basal)、HER2富集型(HER2)和免疫增强型(Imm)。A、1951欧美病例;B、824中国病例。侵袭相关基因(即CTSL2和MMP11)因基因数少,在热图中基因表现区域较少,故在图1中未标出。Figure 1 shows that according to the expression level of 66 genes in breast cancer tissue, breast cancer is divided into luminal A (Lum A), luminal B (Lum B), basal cell type (Basal), and HER2 enriched type (HER2 ) And immune-enhanced (Imm). A, 1951 European and American cases; B, 824 Chinese cases. Invasion-related genes (ie, CTSL2 and MMP11) have fewer genes and have fewer gene expression regions in the heat map, so they are not shown in Figure 1.

图2表示乳腺癌每种亚型远处转移风险不同。其中,管腔A型远处转移风险显著低于其他四种亚型,免疫增强型远处转移风险显著低于管腔B型、基底细胞型和HER2富集型。欧美病例手术后5年之内远处转移风险,管腔B型(Lum B)小于基底细胞型(Basal),而基底细胞型(Basal)小于HER2富集型(HER2);手术后10年后,三者没有显著差别。中国病例手术3年内,管腔B型小于基底细胞型和HER2富集型,手术10年后,HER2富集型预后最差。A、1951欧美病例;B、824中国病例。Figure 2 shows that the risk of distant metastasis is different for each subtype of breast cancer. Among them, the risk of distant metastasis of luminal type A was significantly lower than the other four subtypes, and the risk of distant metastasis of immune-enhanced type was significantly lower than that of luminal type B, basal cell type and HER2 enriched type. The risk of distant metastasis within 5 years after surgery in European and American cases. Lumen B is less than basal cell type (Basal), while basal cell type (Basal) is less than HER2 enriched type (HER2); 10 years after surgery There is no significant difference between the three. In Chinese patients, the lumen B type was smaller than the basal cell type and the HER2 enriched type within 3 years of surgery. After 10 years of surgery, the HER2 enriched type had the worst prognosis. A, 1951 European and American cases; B, 824 Chinese cases.

图3表示免疫指数对乳腺癌的预后有重要影响。其中,对于高风险型的乳腺癌亚型中的管腔B型(Lum B)、基底细胞型(Basal)和HER2富集型(HER2),免疫功能强可显著降低病人乳腺癌远处转移风险;对于低风险乳腺癌亚型管腔A型(Lum A),免疫功能的强弱对于预后无明显影响。A、1951欧美病例;B、824中国病例。Figure 3 shows that the immune index has an important effect on the prognosis of breast cancer. Among them, for the high-risk breast cancer subtypes of lumen B (Lum B), basal cell type (Basal) and HER2 enriched type (HER2), strong immune function can significantly reduce the risk of distant metastasis of breast cancer in patients ; For low-risk breast cancer subtype lumen A, the strength of immune function has no significant effect on the prognosis. A, 1951 European and American cases; B, 824 Chinese cases.

图4表示根据所计算出的复发风险指数,肿瘤远处转移的风险分为三组,低风险(0-32)、中风险(33-49)和高风险(50-100)。A、1951欧美病例;B、824中国病例。Figure 4 shows that according to the calculated recurrence risk index, the risk of distant tumor metastasis is divided into three groups, low risk (0-32), medium risk (33-49), and high risk (50-100). A, 1951 European and American cases; B, 824 Chinese cases.

图5表示基于本发明的二代测序(NGS)的基因表达检测,其敏感度和检测通量要好于现有的基因芯片、定量PCR及Nanostring技术。FIG. 5 shows the second-generation sequencing (NGS) -based gene expression detection of the present invention, and its sensitivity and detection throughput are better than the existing gene chip, quantitative PCR, and Nanostring technologies.

图6为重复性实验结果,其显示石蜡切片提取的RNA(左)及新鲜组织提取的RNA(右)检测结果均有极好的相关性。Figure 6 shows the results of reproducible experiments, which show that the RNA (left) extracted from paraffin sections and the RNA (right) extracted from fresh tissue have excellent correlation.

具体实施方式detailed description

一般定义和术语General definitions and terms

以下将对本发明进一步详细说明,应理解,所述用语旨在描述目的,而非限制本发明。The present invention will be described in further detail below, it being understood that the terms are intended to describe the purpose, not to limit the present invention.

除非另有说明,本文使用的所述技术和科学术语具有与本发明所属领域技术人员通常所理解的相同的含义。若存在矛盾,则以本申请提供的定义为准。当以范围、优选范围、或者优选的数值上限以及优选的数值下限的形式表述某个量、浓度或其他值或参数的时候,应当理解相当于具体揭示了通过将任意一对范围上限或优选数值与任意范围下限或优选数值结合起来的任何范围,而不考虑该范围是否具体揭示。除非另有说明,本文所列出的数值范围旨在包括范围的端点和该范围内的所有整数和分数(小数)。Unless stated otherwise, the technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. In case of conflict, the definition provided in this application shall prevail. When a certain amount, concentration, or other value or parameter is expressed in the form of a range, a preferred range, or a preferred upper numerical value and a preferred lower numerical value, it should be understood that it is equivalent to specifically revealing that the Any range combined with any lower range limit or preferred value, regardless of whether the range is specifically disclosed. Unless otherwise stated, the numerical ranges listed herein are intended to include the endpoints of the range and all integers and fractions (decimals) within the range.

术语“约”、“大约”当与数值变量并用时,通常指该变量的数值和该变量的所有数值在实验误差内(例如对于平均值95%的置信区间内)或在指定数值的±10%内,或更宽范围内。The terms "about" and "approximately" when used in conjunction with a numerical variable generally mean that the value of the variable and all values of the variable are within experimental error (for example, within a 95% confidence interval for the mean) or within ± 10 of a specified value %, Or a wider range.

术语“任选”或“任选存在”是指随后描述的事件或情况可能发生或可能不发生,该描述包括发生所述事件或情况和不发生所述事件或情况。例如,当某个基团描述为任选地被取代时,其可以是未被取代的,也可以是被取代的,例如被一个或多个独立地选自本文所述的那些取代基取代。本领域技术人员应当理解,任选还涵盖取代基的种类和数目可以任意选择和组合的含义,只要形成的化合物是稳定的。The term "optional" or "optionally present" means that the event or situation described later may or may not occur, and the description includes the occurrence or non-occurrence of the event or situation. For example, when a group is described as being optionally substituted, it may be unsubstituted or substituted, such as with one or more substituents independently selected from those described herein. It should be understood by those skilled in the art that the meaning of the kind and number of substituents may be optionally selected and combined, so long as the formed compound is stable.

表述“包含”或与其同义的类似表述“包括”、“含有”和“具有”等是开放性的,不排除额外的未列举的元素、步骤或成分。表述“由…组成”排除未指明的任何元素、步骤或成分。表述“基本上由…组成”指范围限制在指定的元素、步骤或成分,加上任选存在的不会实质上影响所要求保护的主题的基本和新的特征的元素、步骤或成分。应当理解,表述“包含”涵盖,表述“基本上由…组成”和“由…组成”。The expression "comprising" or similar expressions "including," "containing," and "having" are open-ended, and do not exclude additional unrecited elements, steps, or ingredients. The expression "consisting of" excludes any element, step, or ingredient not specified. The expression "consisting essentially of" means that the scope is limited to the specified elements, steps or ingredients, plus elements, steps or ingredients that are optionally present and do not substantially affect the basic and new characteristics of the claimed subject matter. It should be understood that the expression "comprising" encompasses the expressions "consisting essentially of" and "consisting of".

在本文中,表述“至少一个(种)”或者类似的表述“一个(种)或多个(种)”表示1个(种)、2个(种)、3个(种)、4个(种)、5个(种)、6个(种)、7个(种)、8个(种)、9个(种)或者更多个(种)。In this text, the expression "at least one (species)" or similar expression "one (species) or multiple (species)" means 1 (species), 2 (species), 3 (species), 4 ( Species), 5 (species), 6 (species), 7 (species), 8 (species), 9 (species) or more (species).

术语“乳腺癌”起源于乳腺各级导管和腺泡上皮,由腺上皮增生到不典型增生而逐步发展形成,分为原位癌(非浸润性癌)、早期浸润癌至浸润性癌。在一优选的实施方案中,乳腺癌为浸润性乳腺癌。The term "breast cancer" originates from ducts and acinar epithelium at all levels of the breast, and gradually develops from glandular epithelium to atypical hyperplasia. It is divided into carcinoma in situ (non-invasive cancer), early invasive cancer to invasive cancer. In a preferred embodiment, the breast cancer is invasive breast cancer.

术语“复发风险”是指乳腺癌患者在指定年限内肿瘤复发的可能性,包含但不限于发生乳腺癌局部复发、乳腺癌区域复发和远处转移。在本文中,“复发风险”优选表示发生远处转移的可能性。因此,在本文中,“复发风险”(Risk of Recurrence Score,RRS)的评估可以体现为“远处复发风险评分”或“远处转移风险评分”。The term "risk of recurrence" refers to the likelihood of a breast cancer patient's tumor recurrence within a specified number of years, including, but not limited to, localized breast cancer recurrence, regional breast cancer recurrence, and distant metastasis. As used herein, "risk of relapse" preferably indicates the possibility of distant metastasis. Therefore, in this article, the assessment of "Risk of Recurrence Score" (RRS) can be expressed as "distant recurrence risk score" or "distant metastasis risk score".

乳腺癌复发风险评估可以用于评估乳腺癌患者发生局部复发或远处转移的可 能性。本文中,根据复发风险评分,可以将乳腺癌分为低风险、中风险、高风险。通常认为低风险不需使用化疗,而高风险使用化疗可以降低发生复发或远处转移的风险。相比之下,中风险需要临床医生自行结合临床实际情况进行判断,这样的“中风险”在临床治疗上指导意义较差。因此,中风险的情况越少,评估的效率就越高。在本文中实施例中,乳腺癌复发风险评估优选发生远处转移的可能性。Breast cancer recurrence risk assessment can be used to assess the possibility of local recurrence or distant metastases in breast cancer patients. In this article, breast cancer can be classified into low risk, medium risk, and high risk according to the recurrence risk score. Chemotherapy is generally considered to be low risk, while chemotherapy at high risk reduces the risk of recurrence or distant metastases. In contrast, the medium risk requires the clinician to make judgments based on the actual clinical situation. Such a "medium risk" has less guiding significance in clinical treatment. Therefore, the fewer cases of medium risk, the more efficient the assessment will be. In the examples herein, the risk of breast cancer recurrence is preferably assessed for the possibility of distant metastases.

在对乳腺癌进行分子分型时,还可能遇到这样的情况,其特征与正常乳腺组织相似,因而称为正常细胞(样)型(Normal-like)。不为任何理论所束缚,认为其可能是由于在肿瘤组织样本中混入的正常乳腺组织的“污染”引起,因此在通常的实践中,例如在进行分子分型或评估风险时,会将其排除。When molecularly typing breast cancer, you may also encounter situations that have characteristics similar to normal breast tissue and are therefore called Normal-like. Without being bound by any theory, it is thought that it may be caused by "contamination" of normal breast tissue mixed in tumor tissue samples, so it is usually excluded in common practice, such as when performing molecular typing or assessing risk .

Mixed为目前使用的分子分型方法尚不能区分的乳腺癌病例,并非特定肿瘤类型。因其特征复杂,这样的病例无法根据其分型特征指导治疗方案的选择,因此在对乳腺癌进行分型时,这样的病例越少则分型效率越高。Mixed is a breast cancer case that cannot be distinguished by the current molecular typing method, and it is not a specific tumor type. Due to its complex characteristics, such cases cannot guide the choice of treatment options based on their typing characteristics. Therefore, when typing breast cancer, the fewer such cases, the higher the typing efficiency.

本文所述的基因表达水平的检测可以例如通过检测RNA转录物来实现,也可以例如通过蛋白组学方法检测蛋白表达水平来实现。术语“RNA转录物”是指总RNA,即编码或者非编码RNA,包括直接来自于组织或外周血样本中,也包括间接来自于细胞裂解后的组织或血液样本中的RNA。总RNA包含tRNA、mRNA和rRNA,其中,mRNA包括目标基因转录的mRNA,也包括来自于其他非目标基因的mRNA。Detection of gene expression levels described herein can be achieved, for example, by detecting RNA transcripts, or by proteomics methods, for example. The term "RNA transcript" refers to total RNA, that is, coding or non-coding RNA, including RNA directly from tissue or peripheral blood samples, and also indirectly from tissue or blood samples after cell lysis. Total RNA includes tRNA, mRNA, and rRNA, where mRNA includes mRNA transcribed from the target gene, and mRNA from other non-target genes.

术语“多肽”是指由氨基酸以肽键连接组成的化合物,包括多肽的全长或氨基酸片段。所述基因编码的多肽的量,可以针对样本中总蛋白的量或管家基因所编码的多肽的量来标准化。The term "polypeptide" refers to a compound consisting of amino acids connected by peptide bonds, including the full length or amino acid fragments of a polypeptide. The amount of the polypeptide encoded by the gene can be standardized with respect to the amount of total protein in the sample or the amount of the polypeptide encoded by the housekeeping gene.

在本文中,RNA转录物可以例如通过杂交、扩增或者测序的方法来检测和量化。比如,将RNA转录物与探针或者引物杂交。Herein, RNA transcripts can be detected and quantified, for example, by methods of hybridization, amplification or sequencing. For example, hybridize RNA transcripts with probes or primers.

术语“杂交”是指在适当条件下,两个核酸片段通过稳定且特异的氢键结合,形成双螺旋复合物的过程。The term "hybridization" refers to the process under which the two nucleic acid fragments are combined by stable and specific hydrogen bonds to form a double helix complex under appropriate conditions.

术语“扩增引物”或“引物”,是指包含5~100个核苷酸的核酸片段,优选地,包含能起始酶促反应(如,酶促扩增反应)的15~30个核苷酸。The term "amplification primer" or "primer" refers to a nucleic acid fragment containing 5 to 100 nucleotides, preferably, 15 to 30 cores capable of initiating an enzymatic reaction (eg, an enzymatic amplification reaction). Glycylic acid.

术语“(杂交)探针”是指包括至少5个核苷酸的核酸序列,比如,包含5~100个核苷酸,其能在指定条件下与目标基因的表达产物或者该表达产物的扩增产物杂交形成复合物。杂交探针上还可以包括用于检测的标记物。所述标记物包括但不限于用于荧光定量PCR或荧光原位杂交的标记物。在一优选方案中,所述标记物可以为FAM、HEX、VIC、Cy5等。在另一优选方案中,所述标记物可以为生物素、地高辛等。The term "(hybridization) probe" refers to a nucleic acid sequence including at least 5 nucleotides, for example, containing 5 to 100 nucleotides, which can interact with the expression product of the target gene or the expansion of the expression product under specified conditions. The gain products hybridize to form a complex. The hybridization probe may further include a label for detection. The label includes, but is not limited to, a label for fluorescent quantitative PCR or fluorescent in situ hybridization. In a preferred embodiment, the label may be FAM, HEX, VIC, Cy5, or the like. In another preferred embodiment, the label may be biotin, digoxin and the like.

本文所述的基因表达水平的检测可以采用本领域已知的测定方法,包括但不限于检测所述基因的RNA转录物的量或所述基因编码的多肽的量的方法。The detection of the expression level of a gene described herein may employ assay methods known in the art, including, but not limited to, a method of detecting the amount of an RNA transcript of the gene or the amount of a polypeptide encoded by the gene.

基因的RNA转录物可通过本领域已知的方法转化为与其互补的cDNA,通过测定互补cDNA的量可以获得RNA转录物的量。基因的RNA转录物或与其互补的cDNA的量,可以针对样本中总RNA或总cDNA的量或者针对一组看家基因的RNA转录物或与其互补的cDNA的量来标准化。The RNA transcript of a gene can be converted into cDNA complementary thereto by methods known in the art, and the amount of RNA transcript can be obtained by measuring the amount of complementary cDNA. The amount of a gene's RNA transcript or its complementary cDNA can be normalized to the amount of total RNA or total cDNA in a sample or to a set of housekeeping genes' RNA transcripts or its complementary cDNA.

在本文中,RNA转录物可以通过例如杂交、扩增或者测序的方法来检测和量化,包括但不限于将RNA转录物与探针或者引物杂交的方法,通过基于聚合酶链式反应(PCR)的各种定量PCR技术或测序技术检测RNA转录物或其对应cDNA产物的量的方法。所述定量PCR技术包括但不限于荧光定量PCR、实时PCR或半定量PCR技术。所述测序技术包括但不限于Sanger测序、二代测序、三代测序和单细胞测序等。优选地,所述测序技术为二代测序,更优选地为靶向RNA-seq技术。Herein, RNA transcripts can be detected and quantified by methods such as hybridization, amplification or sequencing, including but not limited to methods for hybridizing RNA transcripts with probes or primers, by polymerase chain reaction (PCR) -based A method of detecting the amount of RNA transcript or its corresponding cDNA product by various quantitative PCR techniques or sequencing techniques. The quantitative PCR technology includes, but is not limited to, fluorescent quantitative PCR, real-time PCR, or semi-quantitative PCR technology. The sequencing technology includes, but is not limited to, Sanger sequencing, second-generation sequencing, third-generation sequencing, and single-cell sequencing. Preferably, the sequencing technology is second-generation sequencing, more preferably targeted RNA-seq technology.

在本文中,多肽的量可以通过例如蛋白组学或试剂来检测。优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。Herein, the amount of polypeptide can be detected by, for example, proteomics or reagents. Preferably, the reagent is an antibody, an antibody fragment or an affinity protein.

本发明的基因群Gene group of the present invention

本发明涉及一组基因群,其包括66个用于分子分型及远处转移风险评估相关基因。所述66个分子分型及远处转移风险评估相关基因包括:The invention relates to a group of gene groups, which includes 66 genes for molecular typing and distant metastasis risk assessment. The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11。(6) Invasion related genes CTSL2 and MMP11.

在一实施方案中,本发明的基因群还包括看家基因。优选地,所述看家基因包括以下中的至少一个(例如1、2、3、4、5或6个)、优选至少3个、最优选6个:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。In one embodiment, the gene population of the present invention further includes a housekeeping gene. Preferably, the housekeeping gene includes at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.

在本发明的一具体实施方案中,提供了一组基因群,其包括72个基因,即66个分子分型及远处转移风险评估相关基因以及6个看家基因。该基因群在本文中又称为普瑞泰(Precitype)。所述66个分子分型及远处转移风险评估相关基因以及6个看家基因如上文所述。In a specific embodiment of the present invention, a set of gene groups is provided, which includes 72 genes, that is, 66 molecular typing and distant metastasis risk assessment related genes, and 6 housekeeping genes. This gene group is also referred to herein as Precitype. The 66 molecular typing and distant metastasis risk assessment related genes and 6 housekeeping genes are as described above.

在一实施方案中,本发明的基因群如表1所示。In one embodiment, the gene group of the present invention is shown in Table 1.

表1乳腺癌分子分型和/或远处转移风险的基因群Table 1 Genetic groups of breast cancer molecular typing and / or distant metastasis risk

Figure PCTCN2019109459-appb-000001
Figure PCTCN2019109459-appb-000001

Figure PCTCN2019109459-appb-000002
Figure PCTCN2019109459-appb-000002

Figure PCTCN2019109459-appb-000003
Figure PCTCN2019109459-appb-000003

看家基因用于对靶基因表达水平进行标准化和校正。可以考虑的看家基因的纳入标准有:1、在组织中稳定表达,其表达水平不受病理状况或药物治疗影响或者影响较小;2、表达水平也不宜过高,以避免在表达数据(如通过二代测序获得)获取的数据中占比过高,影响其他基因的数据检测和解读的准确性。因此,在本发明的基因群中可以包括看家基因。优选地,所考虑的看家基因包括GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC中的至少一个,例如1、2、3、4、5或6个,优选至少3个,最优选全部6个。在一实施方案中,所考虑的看家基因包括GAPDH、GUSB、MRPL19、PSMC4、SF3A1、TFRC或其任意组合。Housekeeping genes are used to normalize and correct target gene expression levels. Inclusion criteria for housekeeping genes that can be considered are: 1. Stable expression in tissues, whose expression level is not affected or less affected by pathological conditions or drug treatment; 2. Expression levels should not be too high to avoid expression data ( (For example, obtained through second-generation sequencing), the proportion of data obtained is too high, which affects the accuracy of data detection and interpretation of other genes. Therefore, housekeeping genes may be included in the gene group of the present invention. Preferably, the housekeeping genes considered include at least one of GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC, such as 1, 2, 3, 4, 5, or 6, preferably at least 3, and most preferably all 6 . In one embodiment, the housekeeping genes considered include GAPDH, GUSB, MRPL19, PSMC4, SF3A1, TFRC, or any combination thereof.

在另一实施方案中,本发明还涉及一组免疫相关基因群,其包括17个基因:APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC。In another embodiment, the present invention also relates to a group of immune-related genes comprising 17 genes: APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ, and PTPRC.

该免疫相关基因群中的基因还参见表1中的相关信息所示。The genes in the immune-related gene group are also shown in the related information in Table 1.

在一具体的实施方案中,本发明的基因群可用于乳腺癌分子分型(亚型分型)和/或评估其远处转移风险。In a specific embodiment, the gene population of the present invention can be used for molecular typing (subtyping) of breast cancer and / or assessing its risk of distant metastasis.

在一特定的实施方案中,乳腺癌的亚型包括管腔A型、管腔B型、HER2富集型、基底细胞样型及免疫增强型。In a specific embodiment, the subtypes of breast cancer include luminal A, luminal B, HER2-enriched, basal cell-like, and immune-enhanced.

本发明的诊断产品Diagnostic product of the invention

本发明涉及一种对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品,其包含检测本发明的基因群中基因的表达水平的试剂。所述基因群如上文所述。The invention relates to a diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis, which comprises an agent for detecting the expression level of genes in the gene group of the invention. The gene population is as described above.

具体地,本发明的基因群可以包括66个分子分型及远处转移风险评估相关基因。所述66个分子分型及远处转移风险评估相关基因包括:Specifically, the gene group of the present invention may include 66 genes related to molecular typing and distant metastasis risk assessment. The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11。(6) Invasion related genes CTSL2 and MMP11.

在一实施方案中,本发明的基因群还包括看家基因。优选地,所述看家基因包括以下中的至少一个(例如1、2、3、4、5或6个)、优选至少3个,最优选6个:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。In one embodiment, the gene population of the present invention further includes a housekeeping gene. Preferably, the housekeeping gene includes at least one of the following (eg, 1, 2, 3, 4, 5, or 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1 and TFRC.

在一实施方案中,所述基因群如表1所示。In one embodiment, the gene group is shown in Table 1.

在一实施方案中,所述诊断产品为体外诊断产品。在一具体的实施方案中,所述诊断产品为诊断试剂盒。In one embodiment, the diagnostic product is an in vitro diagnostic product. In a specific embodiment, the diagnostic product is a diagnostic kit.

在一实施方案中,所述诊断产品用于乳腺癌亚型分型和/或其远处转移风险评估。In one embodiment, the diagnostic product is used for breast cancer subtype and / or distant metastasis risk assessment.

在一实施方案中,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。在又一实施方案中,所述试剂为检测与所述mRNA互补的cDNA的量的试剂。In one embodiment, the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene. In yet another embodiment, the reagent is a reagent that detects the amount of cDNA complementary to the mRNA.

在一优选的实施方案中,所述诊断产品还包括总RNA抽提试剂、逆转录试剂和/或二代测序试剂。In a preferred embodiment, the diagnostic product further includes a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.

所述总RNA抽提试剂可以为本领域常规的总RNA抽提试剂。其实例包括但不限于Qiagen 73504、RNA storm CD201、Invitrogen and ABI AM1975。The total RNA extraction reagent may be a conventional total RNA extraction reagent in the art. Examples include, but are not limited to, Qiagen 73504, RNA Storm CD201, Invitrogen and ABI AM1975.

所述逆转录试剂可以为本领域常规的逆转录试剂,并且优选地包括dNTP溶液和/或RNA逆转录酶。逆转录试剂的实例包括但不限于NEB M0368L、Thermo K1622、ABI 4366596。The reverse transcription reagent may be a conventional reverse transcription reagent in the art, and preferably includes a dNTP solution and / or an RNA reverse transcriptase. Examples of reverse transcription reagents include, but are not limited to, NEB M0368L, Thermo K1622, ABI 4366596.

所述二代测序试剂可以为本领域常规使用的试剂,只要能够满足对所得序列进行二代测序的要求即可。二代测序试剂可以为市售产品,其实例包括但不限于Illumina公司

Figure PCTCN2019109459-appb-000004
Reagent Kit v3(150cycle)(MS-102-3001)、
Figure PCTCN2019109459-appb-000005
Targeted RNA Index Kit A-96Indices(384Samples)(RT-402-1001)。二代测序为本领域常规的二代测序,例如为靶向RNA-seq技术。因此,二代测序试剂还可以包括可供构建靶向RNA-seq的文库Illumina定制的试剂,例如
Figure PCTCN2019109459-appb-000006
Targeted RNA Custom Panel Kit(96 Samples)(RT-102-1001)。 The second-generation sequencing reagent may be a reagent commonly used in the art, as long as it can meet the requirements for performing second-generation sequencing on the obtained sequence. Second-generation sequencing reagents can be commercially available products, examples of which include, but are not limited to, Illumina
Figure PCTCN2019109459-appb-000004
Reagent Kit v3 (150cycle) (MS-102-3001),
Figure PCTCN2019109459-appb-000005
Targeted RNA Index Kit A-96 Indicators (384 Samples) (RT-402-1001). Next-generation sequencing is conventional second-generation sequencing in the art, such as targeted RNA-seq technology. Therefore, next-generation sequencing reagents can also include Illumina custom-made reagents that can be used to build libraries that target RNA-seq, such as
Figure PCTCN2019109459-appb-000006
Targeted RNA Custom Panel Kit (96 Samples) (RT-102-1001).

在优选的实施方案中,所述试剂为探针或引物。In a preferred embodiment, the reagent is a probe or a primer.

在可选的实施方案中,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。In an alternative embodiment, the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.

在优选的实施方案中,所述试剂为引物。在一实施方案中,引物为合成寡聚核苷酸片段(优选具有高特异性),只要能与本发明的基因群中的基因部分序列互补,并扩增出其中的基因即可。引物可以是人工合成的。更优选地,所述引物的序列如SEQ ID NO.1-SEQ ID NO.132和SEQ ID NO.133-SEQ ID NO.144所示。引物的具体方案可以参见表2所示。In a preferred embodiment, the reagent is a primer. In one embodiment, the primer is a synthetic oligonucleotide fragment (preferably having high specificity) as long as it is complementary to a partial sequence of a gene in the gene group of the present invention, and the gene therein is amplified. Primers can be synthetic. More preferably, the sequences of the primers are as shown in SEQ ID No. 1-SEQ ID No. 132 and SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can be seen in Table 2.

因此,本发明的一实施方案还涉及一种诊断产品(优选体外诊断产品,特别是试剂盒),其包含引物,所述引物的序列如SEQ ID NO.1-SEQ ID NO.132所述。在一实施方案中,所述诊断产品还包含引物,所述引物的序列如SEQ ID NO.133-SEQ ID NO.144所述。Therefore, an embodiment of the present invention also relates to a diagnostic product (preferably an in vitro diagnostic product, especially a kit), which comprises a primer whose sequence is as described in SEQ ID NO.1-SEQ ID NO.132. In one embodiment, the diagnostic product further comprises a primer, and the sequence of the primer is as described in SEQ ID NO. 133-SEQ ID NO. 144.

在优选的实施方案中,本发明的诊断产品(优选体外诊断产品,特别是试剂盒)包含引物,所述引物的序列如SEQ ID NO.1-SEQ ID NO.144所示(还参见表2)。In a preferred embodiment, the diagnostic product of the present invention (preferably an in vitro diagnostic product, especially a kit) comprises a primer whose sequence is shown in SEQ ID NO.1-SEQ ID NO.144 (see also Table 2 ).

本发明的诊断产品(优选试剂盒的形式)还优选地包括从受试者对象提取检测样本的器械;例如从受试者体内提取组织或血液的器械,优选任何能用于取血的釆血针、注射器等。所述受试者对象为哺乳动物,优选为人,特别是患有乳腺癌的女性。The diagnostic product (preferably in the form of a kit) of the present invention also preferably includes a device for extracting a test sample from a subject; for example, a device for extracting tissue or blood from a subject, preferably any blood that can be used for blood collection Needles, syringes, etc. The subject is a mammal, preferably a human, especially a female with breast cancer.

在一实施方案中,所述引物可以用来检测所述基因群的表达水平。In one embodiment, the primers can be used to detect the expression level of the gene population.

本发明的方法和应用Method and application of the invention

在又一方面,本发明还涉及一种用于确定受试者对象的乳腺癌分子分型和/或远处转移风险的方法,所述方法包括In yet another aspect, the present invention also relates to a method for determining a breast cancer molecular typing and / or distant metastatic risk in a subject, the method comprising

(1)提供受试者对象的样本,(1) provide a sample of the subject,

(2)测定所述样本中本发明的基因群中基因的表达水平,(2) determining the expression level of a gene in the gene group of the present invention in said sample,

(3)确定所述受试者的乳腺癌分子分型和/或远处转移的风险。(3) determining the risk of breast cancer molecular typing and / or distant metastasis in said subject.

本发明的方法可以用于诊断或非诊断目的。The method of the invention can be used for diagnostic or non-diagnostic purposes.

用于本发明的方法的受试者对象为哺乳动物,优选为人,特别是患有乳腺癌的女性。The subject for use in the method of the invention is a mammal, preferably a human, especially a female with breast cancer.

在步骤(1)中提供样本的方法和使用的样本没有特别的限制,只要能从其中获得基因群中的基因的表达水平即可,例如可以从所述样本提取受试者对象的总RNA。所述样本优选地为组织、血液、血浆、体液或其组合的样本,优选为组织样本,特别是石蜡组织样本。在优选的实施方案中,样本为肿瘤组织样本或包含肿瘤细胞的组织样本,特别是乳腺肿瘤组织样本或包含乳腺肿瘤细胞的组织样本。The method of providing a sample in step (1) and the sample used are not particularly limited, as long as the expression level of genes in a gene group can be obtained therefrom, for example, total RNA of a subject can be extracted from the sample. The sample is preferably a sample of tissue, blood, plasma, body fluid or a combination thereof, preferably a tissue sample, especially a paraffin tissue sample. In a preferred embodiment, the sample is a tumor tissue sample or a tissue sample containing tumor cells, particularly a breast tumor tissue sample or a tissue sample containing breast tumor cells.

在步骤(2)中,测定本发明的基因群中基因的表达水平。该基因群如上文所述。该基因群还可以参见表1的描述。本发明的方法,例如步骤(2),可以通过检测所 述基因群中的基因表达水平的试剂完成。在一实施方案中,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。在又一实施方案中,所述试剂为检测与所述mRNA互补的cDNA的量的试剂。在可选的实施方案中,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。在另一实施方案中,所述试剂为探针或引物,特别是引物。更优选地,引物的序列如SEQ ID NO.1-SEQ ID NO.132所述。在一实施方案中,还使用引物,所述引物的序列如SEQ ID NO.133-SEQ ID NO.144所述。引物的具体方案还可以参见表2所示。In step (2), the expression level of a gene in the gene group of the present invention is determined. This gene group is as described above. The gene group can also be described in Table 1. The method of the present invention, for example, step (2), can be performed by a reagent for detecting a gene expression level in the gene group. In one embodiment, the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene. In yet another embodiment, the reagent is a reagent that detects the amount of cDNA complementary to the mRNA. In an alternative embodiment, the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein. In another embodiment, the reagent is a probe or a primer, especially a primer. More preferably, the sequence of the primer is as described in SEQ ID NO.1-SEQ ID NO.132. In one embodiment, a primer is also used, and the sequence of the primer is as described in SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can also be shown in Table 2.

本发明的方法,例如步骤(2),可以通过本发明的诊断产品(特别是诊断试剂盒)完成。The method of the present invention, such as step (2), can be performed by a diagnostic product (especially a diagnostic kit) of the present invention.

在一优选的实施方案中,步骤(2)可以包括In a preferred embodiment, step (2) may include

(2-1)提取样本中的总RNA;(2-1) extracting total RNA from the sample;

(2-2)将任选地进行纯化的总RNA转化为cDNA,然后将其制备成可用于二次测序的文库;(2-2) converting the optionally purified total RNA into cDNA, and then preparing it into a library that can be used for secondary sequencing;

(2-3)对步骤(2-2)获得的文库进行测序。(2-3) Sequencing the library obtained in step (2-2).

步骤(2-1)的提取可以通过本领域常规方法进行,优选地利用RNA提取试剂盒提取检测受试者的新鲜冷冻组织或石蜡包埋组织的总RNA。在更优选的实施方案中,可以使用Roche公司的RNA抽提试剂盒(产品号为Roche Catalog Number#3270289001)或者Qiagen公司的RNA抽提试剂盒(Qiagen RNease FFPE kit,Catalog Number#73504)进行提取。The extraction in step (2-1) can be performed by a conventional method in the art, and preferably, a total RNA of fresh frozen tissue or paraffin-embedded tissue of a subject is detected by using an RNA extraction kit. In a more preferred embodiment, extraction can be performed using Roche's RNA extraction kit (product number: RocheCatalogNumber # 3270289001) or Qiagen's RNA extraction kit (Qiagen, Rnease, FFPE, Kitalog ##, 504). .

在一示例性实施方案中,文库的构建方法可以包括以下步骤:In an exemplary embodiment, a method for constructing a library may include the following steps:

将提取的总RNA反转录生成如表1所述的基因的cDNA。末端补平并进行5’端磷酸化,将30μlDNA、45μl纯水、10μl具有10mM ATP的T4 DNA连接酶缓冲液、4μl包含10mM dNTP Mix、5μlT4 DNA聚合酶、1μl Klenow酶、5μlΤ4连接酶混合后,在20℃下温浴30分钟(试剂为Illumina样本准备试剂盒PE-102-1001),温浴后釆用QIAGEN QIAquick PCR纯化试剂盒(part#28104)纯化DNA。末端悬液A:将上步的产物溶解在32μl缓冲液中,加入Klenow缓冲液5μl,1mM dATP 10μl、Klenow Εχο-3μl,在37℃下保持30分钟(试剂为Illumina样本准备试剂盒)。产物由QIAGEN MinElute PCR纯化试剂盒(part#28004)连接:DNA溶解在10μl缓冲液中,加入DNA连接酶缓冲液2χ25μl、PE Adapter Oligo Mix 10μl,DNA连接酶5μl,在20℃下保持15分钟(试剂为Illumina样本准备试剂盒PE-102-1001),温浴后釆用QIAGEN QIAquick PCR纯化试剂盒(part#28104)纯化DNA,即得文库。The extracted total RNA was reverse transcribed to generate cDNAs of the genes described in Table 1. The ends were filled and phosphorylated at the 5 ′ end. 30 μl of DNA, 45 μl of purified water, 10 μl of T4 DNA ligase buffer with 10 mM ATP, 4 μl of 10 mM dNTP Mix, 5 μl of T4 DNA polymerase, 1 μl of Klenow enzyme, and 5 μl of T4 ligase were mixed. Incubate at 20 ° C for 30 minutes (reagent is Illumina sample preparation kit PE-102-1001). After incubation, use QIAGEN QIAquick PCR purification kit (part # 28104) to purify DNA. End suspension A: Dissolve the product from the previous step in 32 μl of buffer, add 5 μl of Klenow buffer, 10 μl of 1 mM dATP, and 3 μl of Klenow Εχο, and keep at 37 ° C for 30 minutes (reagent is Illumina sample preparation kit). The products were ligated by QIAGEN MinElute PCR purification kit (part # 28004): DNA was dissolved in 10 μl buffer, DNA ligase buffer 2 × 25 μl, PE adapter Oligo Mix 10 μl, DNA ligase 5 μl, and kept at 20 ° C for 15 minutes ( The reagent was Illumina sample preparation kit PE-102-1001). After warming, the DNA was purified using QIAGEN QIAquick PCR purification kit (part # 28104) to obtain a library.

步骤(2-3)可以通过RNA测序完成。所述的测序的方法可以为本领域常规的用于确定基因表达水平的RNA-seq测序方法。优选地利用Illumina  NextSeq/MiSeq/MiniSeq/iSeq系列测序仪进行二代测序。利用试剂盒中的引物对表1所示的基因进行扩增,根据步骤(2-2)所制备的文库的不同,可以对所得基因序列进行二次测序。优选地,二次测序为靶向RNA-seq技术,用Illumina NextSeq/MiSeq/MiniSeq/iSeq测序仪进行双端测序。这样的过程可以由仪器本身自动完成。Step (2-3) can be completed by RNA sequencing. The sequencing method may be a conventional RNA-seq sequencing method for determining a gene expression level in the art. Illumina NextSeq / MiSeq / MiniSeq / iSeq series sequencers are preferably used for next-generation sequencing. The primers in the kit were used to amplify the genes shown in Table 1. Depending on the library prepared in step (2-2), the obtained gene sequence could be sequenced twice. Preferably, the secondary sequencing is a targeted RNA-seq technology, and double-ended sequencing is performed with an Illumina NextSeq / MiSeq / MiniSeq / iSeq sequencer. Such a process can be done automatically by the instrument itself.

在本发明的一实施方案中,步骤(3)可以通过将所得测序结果进行统计分析完成。可以任选地根据Hu等开创的单一样品预测法SSP(Single Sample Predictor)和Parker等优化的方法来进行乳腺癌分型和风险预测。对将所得测序结果基因表达数据进行分析获得单一样品的亚型分型,并可以计算远处转移风险。In an embodiment of the present invention, step (3) may be completed by performing statistical analysis on the obtained sequencing results. Breast cancer typing and risk prediction can optionally be performed according to the single sample prediction method (SSP (Single Sample Predictor) and Parker) pioneered by Hu et al. The gene expression data of the obtained sequencing results are analyzed to obtain a subtype of a single sample, and the risk of distant metastasis can be calculated.

对于本发明的方法、用途和产品,作为示例性实施方案,可以采用如下程序。For the method, use, and product of the present invention, as an exemplary embodiment, the following procedures can be adopted.

首先收集14个乳腺癌队列研究中2034个Affymetrix U133基因芯片数据(GSE3494,GSE6532,GSE1456,GSE9195,GSE2034,GSE5327,GSE7390,GSE11121,GSE2603,GSE7378,GSE8193,GSE12093和E-TABM-158)进行基因筛选及优化用于乳腺癌分子分型及预后评估的基因组合。除基因表达数据外,每个对应病例的临床信息,包括病例分级、淋巴结转移、ER/PR检测、远处转移及发生时间以及治疗方案等均加以收集分析。First collected 2034 Affymetrix U133 gene chip data (GSE3494, GSE6532, GSE1456, GSE9195, GSE2034, GSE5327, GSE7390, GSE11121, GSE2603, GSE7378, GSE8193, GSE12093 and E-TABM-158) in 14 breast cancer cohort studies for gene screening And optimization of gene combinations for breast cancer molecular typing and prognosis assessment. In addition to the gene expression data, the clinical information of each corresponding case, including case classification, lymph node metastasis, ER / PR detection, distant metastasis and time of occurrence, and treatment plan were collected and analyzed.

所有的基因芯片数据在合并进行分析前,都需要经过标化(RMA)、批校正(Combat)及相同基因合并,以消除由于每个研究队列的技术平台、操作流程存在差异而造成的基因表达数据差异,为下一步基因筛选打好基础。All gene chip data need to undergo standardization (RMA), batch correction (Combat), and the same gene combination before analysis for analysis, in order to eliminate gene expression caused by differences in technical platforms and operating procedures of each research cohort. The difference in data lays the foundation for the next genetic screening.

对于2034个经过标化、批校正和同类基因合并的乳腺癌病例的基因表达数据,采用自主开发的分析软件EPIG,通过非监督性的聚类及配对相关性分析,计算出与乳腺癌转移密切相关且有统计学意义的基因表达谱,并以此为基础得到表达谱内的有意义基因。其中,最为相关的两组基因分别与细胞周期和免疫应答基因。For the gene expression data of 2034 breast cancer cases that have been standardized, batch-corrected and merged with similar genes, the self-developed analysis software EPIG was used to calculate the close relationship with breast cancer metastasis through unsupervised clustering and paired correlation analysis. Relevant and statistically significant gene expression profiles, and based on this to obtain meaningful genes in the expression profile. Among them, the two most relevant genes are the cell cycle and immune response genes, respectively.

为了确认所发现的细胞周期和免疫应答相关基因在预测疾病复发的稳定性,进一步进行了稳定性分析。具体做法为,从2034个基因芯片表达数据中随机抽取1017个芯片数据(50%),进行表达谱和相关基因计算,并重复抽取额和计算1000次。To confirm the stability of the discovered cell cycle and immune response-related genes in predicting disease relapse, stability analysis was further performed. The specific method is to randomly extract 1017 chip data (50%) from 2034 gene chip expression data, calculate expression profiles and related genes, and repeat the extraction and calculation 1000 times.

结果显示,63个细胞周期相关基因和121免疫应答基因始终稳定地出现在1000次的计算结果当中。将其中与无复发生存期最为密切相关的19个细胞周期相关基因和17个免疫应答基因,结合ER、PR、HER2及基底细胞特征基因可以确定66基因的组合。在此基础上考虑看家基因,可以确定一个72基因的组合(参见表1)。The results showed that 63 cell cycle-related genes and 121 immune response genes consistently appeared in the calculation results 1000 times. Combining the 19 cell cycle-related genes and 17 immune response genes that are most closely related to relapse-free survival, combined with ER, PR, HER2, and basal cell characteristic genes, a combination of 66 genes can be determined. Considering the housekeeping genes on this basis, a combination of 72 genes can be determined (see Table 1).

本发明的检测方法可用于诊断目的或非诊断目的。The detection method of the present invention can be used for diagnostic or non-diagnostic purposes.

相应地,本发明还提供了本发明的基因群在对乳腺癌进行分子分型和/或评估其远处转移风险中的应用。Accordingly, the present invention also provides the application of the gene population of the present invention to molecular typing of breast cancer and / or assessing the risk of distant metastasis.

本发明还提供了本发明的基因群在制备对乳腺癌进行分子分型和/或评估其远处转移风险的产品中的应用。在优选的实施方案中,所述产品为检测试剂盒的形式。The invention also provides the application of the gene group of the invention in the preparation of a product for molecularly typing breast cancer and / or assessing its risk of distant metastasis. In a preferred embodiment, the product is in the form of a detection kit.

本发明还进一步提供了检测本发明的基因群中的基因表达水平的试剂在制备对乳腺癌进行分子分型和/或评估其远处转移风险的体外诊断产品中的应用。在优选的实施方案中,所述产品为检测试剂盒的形式。在一实施方案中,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。在又一实施方案中,所述试剂为检测与所述mRNA互补的cDNA的量的试剂。在可选的实施方案中,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。在另一实施方案中,所述试剂为探针或引物,特别是引物。更优选地,所述引物的序列如SEQ ID NO.1-SEQ ID NO.132和SEQ ID NO.133-SEQ ID NO.144所示。引物的具体方案还可以参见表2所示。The invention further provides the application of the reagent for detecting the gene expression level in the gene group of the invention in preparing an in vitro diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastasis. In a preferred embodiment, the product is in the form of a detection kit. In one embodiment, the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene. In yet another embodiment, the reagent is a reagent that detects the amount of cDNA complementary to the mRNA. In an alternative embodiment, the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein. In another embodiment, the reagent is a probe or a primer, especially a primer. More preferably, the sequences of the primers are as shown in SEQ ID No. 1-SEQ ID No. 132 and SEQ ID NO. 133-SEQ ID NO. 144. The specific protocol of the primers can also be shown in Table 2.

所述乳腺癌的亚型包括管腔A型、管腔B型、HER2富集型、基底细胞型及免疫增强型。The subtypes of breast cancer include lumen A, lumen B, HER2 enriched, basal cell, and immune-enhanced.

本发明所用的检测样本优选为来自检测对象(受试者对象)的组织,只要能从检测样本中抽提检测对象的总RNA即可。所述检测样本优选地为组织样本、血液、血浆和体液中的一种或几种,更优选地为组织样本,例如石蜡组织样本。在优选的实施方案中,检测样本为肿瘤细胞含量高的组织。The detection sample used in the present invention is preferably a tissue from a test object (subject object), as long as the total RNA of the test object can be extracted from the test sample. The test sample is preferably one or more of a tissue sample, blood, plasma, and body fluid, and more preferably a tissue sample, such as a paraffin tissue sample. In a preferred embodiment, the test sample is a tissue with a high content of tumor cells.

本发明还涉及一组增殖相关基因,其包括ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT(还可参见表1中的相关信息)。The present invention also relates to a group of proliferation-related genes including ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C, and ZWINT ( (See also the relevant information in Table 1).

本发明还涉及一组免疫相关基因,其包括APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC(还可参见表1中的相关信息)。The present invention also relates to a group of immune-related genes, including APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC ( (See also the relevant information in Table 1).

相应地,本发明还涉及增殖相关基因和免疫相关基因的组合,其中所述增殖相关基因和所述免疫相关基因如上文所述。Accordingly, the present invention also relates to a combination of a proliferation-related gene and an immune-related gene, wherein the proliferation-related gene and the immune-related gene are as described above.

本发明还涉及所述免疫相关基因(APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC)或所述增殖相关基因(ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT)或这两者的组合,或用于检测所述免疫相关基因、所述增殖相关基因或其组合的相关试剂在制备用于乳腺癌分子分型和/或评估乳腺癌远处转移风险的体外诊断产品中的应用。The present invention also relates to the immune-related genes (APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC) or all The proliferation-related genes (ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C, and ZWINT) or a combination of the two, Or the use of related reagents for detecting the immune-related genes, the proliferation-related genes, or a combination thereof in the preparation of in vitro diagnostic products for molecular typing of breast cancer and / or assessment of the risk of distant metastasis of breast cancer.

本发明还涉及所述免疫相关基因或所述增殖相关基因或其组合在进行乳腺癌分子分型和/或评估远处转移风险中的应用。在一实施方案中,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。在又一实施方案中,所述试剂为 检测与所述mRNA互补的cDNA的量的试剂。在可选的实施方案中,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。在另一实施方案中,所述试剂为探针或引物,特别是引物。更优选地,所述引物的序列可以参见表2所示。The invention also relates to the use of the immune-related genes or the proliferation-related genes or a combination thereof in performing molecular typing of breast cancer and / or assessing the risk of distant metastases. In one embodiment, the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene. In yet another embodiment, the reagent is a reagent that detects the amount of cDNA complementary to the mRNA. In an alternative embodiment, the reagent is a reagent that detects the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein. In another embodiment, the reagent is a probe or a primer, especially a primer. More preferably, the sequences of the primers can be shown in Table 2.

相应地,本发明还涉及一种用于确定受试者对象的乳腺癌分子分型和/或远处转移风险的方法。所述方法的步骤描述如上文所述,其中对应的基因群为上述所述免疫相关基因、所述增殖相关基因或其组合。Accordingly, the present invention also relates to a method for determining a breast cancer molecular typing and / or distant metastatic risk in a subject. The steps of the method are described above, wherein the corresponding gene group is the immune-related gene, the proliferation-related gene, or a combination thereof.

在本申请的方案中,可以确定一种新的乳腺癌亚型分型(免疫增强型),并且还能够在不同亚型中显示对乳腺癌远处转移的不同作用和影响。In the scheme of the present application, a new breast cancer subtype (immunoenhancement type) can be determined, and different effects and effects on distant metastasis of breast cancer can be shown in different subtypes.

本申请的方案还可以列举如下。The scheme of the present application can also be enumerated as follows.

1.一种用于乳腺癌分子分型和/或评估其远处转移风险的基因群,其特征在于,所述基因群包括66个分子分型及远处转移风险评估相关基因,其中,A gene group for molecular typing of breast cancer and / or assessing the risk of distant metastasis, characterized in that the gene group comprises 66 genes related to molecular typing and risk assessment for distant metastasis, wherein:

所述66个分子分型及远处转移风险评估相关基因包括:The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11。(6) Invasion related genes CTSL2 and MMP11.

2.如第1项所述的基因群,其特征在于,2. The gene group according to item 1, wherein

所述基因群还包括看家基因;The gene group also includes a housekeeping gene;

优选地,所述看家基因包括以下中的至少一个(如1、2、3、4、5、6个)、优选至少3个、最优选6个:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。Preferably, the housekeeping gene includes at least one of the following (such as 1, 2, 3, 4, 5, 6), preferably at least 3, and most preferably 6: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.

3.一种用于乳腺癌分子分型和/或评估其远处转移风险的基因群,其特征在于,3. A gene group for molecular typing of breast cancer and / or assessing its risk of distant metastasis, characterized by:

所述基因群包括66个分子分型及远处转移风险评估相关基因以及6个看家基因,其中,The gene group includes 66 genes related to molecular typing and distant metastasis risk assessment, and 6 housekeeping genes, of which,

所述66个分子分型及远处转移风险评估相关基因包括:The 66 molecular typing and distant metastasis risk assessment related genes include:

(1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、 UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C, and ZWINT,

(2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC,

(3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1,

(4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6,

(5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7,

(6)侵袭相关基因CTSL2和MMP11;(6) Invasion related genes CTSL2 and MMP11;

所述6个看家基因包括:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。The six housekeeping genes include: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC.

4.第1-3中任一项的基因群在进行乳腺癌分子分型和/或评估其远处转移风险中的应用。4. Use of the genetic group according to any one of 1-3 in performing molecular typing of breast cancer and / or assessing its risk of distant metastasis.

5.第1-3中任一项的基因群在制备对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品中的应用。5. Use of the genetic group of any one of items 1-3 in the preparation of a diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis.

6.检测第1-3中任一项的基因群中的基因的表达水平的试剂在制备对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品中的应用。6. Use of a reagent for detecting the expression level of a gene in the gene group according to any one of items 1-3 in the preparation of a diagnostic product for molecularly typing breast cancer and / or assessing the risk of its distant metastasis.

7.一种对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品,其包含检测第1-3中任一项的基因群中基因的表达水平的相关试剂。7. A diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis, comprising a reagent for detecting the expression level of a gene in the gene group according to any one of items 1-3.

8.第6或7项的应用或诊断产品,其特征在于,所述诊断产品为体外诊断产品的形式,优选诊断试剂盒的形式。8. The application or diagnostic product of item 6 or 7, characterized in that the diagnostic product is in the form of an in vitro diagnostic product, preferably in the form of a diagnostic kit.

9.第6-8项之一的应用或诊断产品,其特征在于,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。9. The application or diagnostic product according to any one of items 6 to 8, wherein the reagent is a reagent that detects the amount of RNA, particularly mRNA, transcribed by the gene.

10.第6-9项之一的应用或诊断产品,其特征在于,所述试剂为检测与所述mRNA互补的cDNA的量的试剂。10. The application or diagnostic product according to any one of items 6 to 9, wherein the reagent is a reagent that detects an amount of cDNA complementary to the mRNA.

11.第6-10项之一的应用或诊断产品,其特征在于,所述诊断产品还包括总RNA抽提试剂、逆转录试剂和/或二代测序试剂。11. The application or diagnostic product according to any one of items 6 to 10, wherein the diagnostic product further comprises a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent.

12.第6-11项之一的应用或诊断产品,其特征在于,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。12. The application or diagnostic product according to any one of items 6 to 11, wherein the reagent is a reagent for detecting the amount of the polypeptide encoded by the gene, and preferably, the reagent is an antibody, an antibody fragment, or an affinity protein.

13.第6-12项之一的应用或诊断产品,其特征在于,所述试剂为探针或引物,优选引物。13. The application or diagnostic product according to any one of items 6 to 12, wherein the reagent is a probe or a primer, preferably a primer.

14.第13项的应用或诊断产品,其特征在于,所述引物的序列如SEQ ID NO.1-SEQ ID NO.144所示。14. The application or diagnostic product of item 13, characterized in that the sequence of the primer is as shown in SEQ ID NO.1-SEQ ID NO.144.

15.一组用于进行乳腺癌分子分型和/或评估其远处转移风险的引物,其中所述引物的序列如SEQ ID NO.1-SEQ ID NO.132所示。15. A set of primers for molecular typing of breast cancer and / or assessing the risk of distant metastasis, wherein the sequence of said primers is shown in SEQ ID NO.1-SEQ ID NO.132.

16.第15项的引物组,其还包括序列如SEQ ID NO.133-SEQ ID NO.144所示的引物。16. The primer set of item 15, further comprising a primer having a sequence shown in SEQ ID NO. 133-SEQ ID NO. 144.

17.一组用于进行乳腺癌分子分型和/或评估其远处转移风险的引物,其中所述引物的序列如SEQ ID NO.1-SEQ ID NO.144所示。17. A set of primers for molecular typing of breast cancer and / or assessing the risk of distant metastasis, wherein the sequence of said primers is shown in SEQ ID NO.1-SEQ ID NO.144.

18.第15-17中任一项的引物组在用于制备对乳腺癌进行分子分型和/或评估其远处转移风险的产品中的应用。18. Use of the primer set of any of 15-17 in the production of a product for molecularly typing breast cancer and / or assessing its risk of distant metastasis.

19.第1-18项之一的基因群、应用、诊断产品或引物组,其特征在于,所述乳腺癌包括管腔A型、管腔B型、HER2富集型、基底细胞型及免疫增强型。19. The gene group, application, diagnostic product or primer set according to any one of items 1 to 18, wherein the breast cancer includes lumen A type, lumen B type, HER2 enriched type, basal cell type, and immunity Enhanced.

有益效果Beneficial effect

与现有技术(如PAM50技术)相比,本发明的方案首次将免疫调控基因引入乳腺癌分子分型当中,而且还可以通过考虑其他基因的表达水平,进一步加强了乳腺癌分子分型的合理性和准确性,提升指导乳腺癌的临床治疗的能力,将乳腺癌分子分型为管腔A型、管腔B型、HER2富集型、基底细胞型及免疫增强型。还可以对这些分型的转移风险进行更为准确的区分。例如,管腔A型远处转移风险显著低于其他四种亚型,而免疫增强型远处转移风险显著低于管腔B型、基底细胞型和HER2富集型。进一步地,还可以更准确地判断不同分型的乳腺癌的预后情况。例如,对于高风险型的乳腺癌亚型中的管腔B型、基底细胞型和HER2富集型,免疫功能强可显著降低乳腺癌远处转移风险。Compared with the existing technology (such as PAM50 technology), the scheme of the present invention introduces immune regulation genes into the molecular typing of breast cancer for the first time, and can further strengthen the rationality of molecular typing of breast cancer by considering the expression levels of other genes And improve the ability to guide the clinical treatment of breast cancer, and molecularly type breast cancer into lumen A, lumen B, HER2 enriched, basal cell, and immune-enhanced. A more accurate distinction can also be made between these types of transfer risks. For example, the risk of distant metastasis of luminal type A is significantly lower than the other four subtypes, while the risk of distant metastasis of immune-enhanced type is significantly lower than that of luminal type B, basal cell type and HER2 enriched type. Furthermore, the prognosis of breast cancer with different types can be judged more accurately. For example, for luminal B, basal cell, and HER2-enriched types in high-risk breast cancer subtypes, strong immune function can significantly reduce the risk of distant breast cancer metastasis.

对于除了免疫增强型之外的其他亚型,通过免疫基因组还可以进行进一步细分。而且,采用二代测序的检测基因表达并用于分子分型,通过本发明的基因群及其相应产品,可以增加检测的敏感度,提高了检测能力和效率,同时极大降低了检测成本。与目前常用的IHC技术相比,本发明对乳腺癌的分子分型更加精确,并可计算出远处转移风险,对远处转移风险进行预测,可更加精准地指导临床治疗。而且,本发明的产品和方法具有高可靠性、高灵敏度和高可重复性。例如,在一方面,本发明的产品和方法可以显著降低无法分型的类型的比例,从而提供了更为准确、效率更高的分型方法和产品。在另一方面,本发明的产品和方法也可以显著降低所谓的“中风险”的比率。而这样的“中风险”类型在临床治疗上难以进行判断,指导意义较差。For subtypes other than immune-enhanced, the immune genome can be further subdivided. In addition, the second-generation sequencing is used to detect gene expression and is used for molecular typing. The gene group and corresponding products of the present invention can increase detection sensitivity, improve detection capability and efficiency, and greatly reduce detection costs. Compared with the currently commonly used IHC technology, the present invention is more accurate for molecular typing of breast cancer, can calculate the risk of distant metastasis, predict the risk of distant metastasis, and can guide clinical treatment more accurately. Moreover, the products and methods of the present invention have high reliability, high sensitivity, and high repeatability. For example, in one aspect, the products and methods of the present invention can significantly reduce the proportion of types that cannot be typed, thereby providing more accurate and efficient typing methods and products. In another aspect, the products and methods of the present invention can also significantly reduce the so-called "medium risk" ratio. And this type of "medium risk" is difficult to judge in clinical treatment, and the guidance is poor.

实施例Examples

下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。本文的实施例中所用的试剂和仪器均是可商购的。The present invention is further described below by way of examples, but the present invention is not limited to the scope of the examples. The experimental methods without specific conditions specified in the following examples were selected according to conventional methods and conditions, or according to product specifications. The reagents and instruments used in the examples herein are all commercially available.

实施例1:评估乳腺癌亚型分型及远处转移风险相关基因群的筛选Example 1: Screening of genes associated with breast cancer subtypes and distant metastasis risk

方法:通过EPIG基因表达谱分析程序(参见Zhou,Chou et al,2006.Environ Health Perspect 114(4),553-559;Chou,Zhou et al,2007.BMC Bioinformatics 8,427)分析14项乳腺癌队列研究2034病例中的具有完整临床信息的1951例乳腺癌肿瘤基因表达量(GSE3494,GSE6532,GSE1456,GSE9195,GSE2034,GSE5327,GSE7390,GSE11121,GSE2603,GSE7378,GSE8193,GSE12093和E-TABM-158),筛选出与乳腺癌远处转移风险密切相关的免疫相关基因与细胞周期基因,结合已报导的ER、PR及亚型分型相关基因,在每组基因中计算并优选对分型及远处转移风险贡献率大的基因。Methods: 14 breast cancer cohort studies were analyzed by the EPIG gene expression profiling program (see Zhou, Zhou et al., 2006. Health, Perspective 114 (4), 553-559; Zhou, Zhou et al., 2007. BMC Bioinformatics 8,427). Gene expression of 1951 breast cancer tumors with complete clinical information in 2034 cases (GSE3494, GSE6532, GSE1456, GSE9195, GSE2034, GSE5327, GSE7390, GSE11121, GSE2603, GSE7378, GSE8193, GSE12093, and E-TABM-158), screened Immune-related genes and cell cycle genes that are closely related to the risk of distant metastasis of breast cancer, combined with the reported ER, PR and subtype typing related genes, calculate and optimize the risk of typing and distant metastasis in each group of genes Genes with a high contribution rate.

结果:共筛选获得了与乳腺癌亚型分型及远处转移风险相关的66个基因及6个看家基因,即72个基因测试组合。基因列表见表1。Results: A total of 66 genes and 6 housekeeping genes related to breast cancer subtype and distant metastasis risk were screened out, that is, 72 gene test combinations. The gene list is shown in Table 1.

实施例2:采用所筛选的乳腺癌亚型分型及远处转移风险相关基因群对乳腺癌进行分子分型及远处转移风险评估Example 2: Using the screened breast cancer subtypes and distant metastasis risk-related gene groups to perform molecular typing and distant metastasis risk assessment of breast cancer

实验方法:采用72基因测试组合。其中66个乳腺癌亚型分型及远处转移风险相关基因群(增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,HER2相关基因ERBB2、FGFR4和GRB7,侵袭相关基因CTSL2和MMP11)用于分子分型。6个内参基因(包括GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC)可以作为内标。计算远处转移风险指数时采用表1中除6个内参基因外的其他所有66个基因。对欧美乳腺癌公共数据库中1951例欧美乳腺癌肿瘤患者的数据进行分子分型(亚型分型)和远处转移风险评估(图1A),并对824例中国乳腺癌肿瘤样品进行分子分型(亚型分型)和远处转移风险评估(图1B),并将两者进行比较。Experimental method: 72 gene test combinations were used. Among them, 66 breast cancer subtypes and distant metastasis risk-related genes (proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A , TPX2, TYMS, UBE2C and ZWINT, immune related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC, Basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1, estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MPT2, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6, HER2-related genes ERBB2, FGFR4 and GRB7, and invasion-related genes CTSL2 and MMP11) were used for molecular typing. Six internal reference genes (including GAPDH, GUSB, MRPL19, PSMC4, SF3A1 and TFRC) can be used as internal standards. In calculating the distant metastasis risk index, all 66 genes except Table 6 reference genes in Table 1 were used. Molecular typing (subtyping) and distant metastatic risk assessment of data from 1951 European and American breast cancer patients in the European and American breast cancer public databases (Figure 1A), and molecular typing of 824 Chinese breast cancer tumor samples (Subtype typing) and distant metastatic risk assessment (Figure 1B), and compare the two.

实验结果:Experimental results:

1、乳腺癌分子分型1. Molecular typing of breast cancer

如上所述,利用表1所示66个乳腺癌分子分型基因群对以上乳腺癌病例进行分子分型,将乳腺癌肿瘤分为5个亚型(图1),并可以对其治疗给出不同建议:As mentioned above, the 66 breast cancer molecular genotyping groups shown in Table 1 were used to molecularly type the above breast cancer cases to divide breast cancer tumors into 5 subtypes (Figure 1), and their treatment can be given Different suggestions:

1)管腔A型(Luminal A)1) Lumenal A

管腔A型肿瘤患者的p53基因突变率很低,预后较好,其对化疗不敏感,而 适合内分泌治疗,故对临床内分泌治疗有指导意义。Patients with luminal A tumors have a low mutation rate of p53 gene and a good prognosis. They are not sensitive to chemotherapy and are suitable for endocrine therapy. Therefore, it has guiding significance for clinical endocrine therapy.

2)管腔B型(Luminal B)2) Lumenal B

管腔B型属于内分泌治疗敏感的肿瘤,但对于HER2阳性患者来说,采用三苯氧胺治疗的疗效较管腔A型差,而采用芳香化酶抑制剂的效果较好。对于HER2阳性的管腔B型肿瘤患者,可进行分子靶向治疗。Lumen B is a tumor that is sensitive to endocrine therapy, but for HER2-positive patients, tamoxifen treatment is less effective than lumen A, and aromatase inhibitors are better. For patients with HER2-positive lumen type B tumors, molecular targeted therapy can be performed.

3)HER2富集型(HER2-enriched)3) HER2-enriched

HER2富集型肿瘤患者的p53基因的突变率很高,肿瘤分化相对较差,此型对靶向分子治疗相对敏感,但预后较差。HER2富集型的肿瘤广泛采用赫赛汀联合系统化疗的治疗方案。Patients with HER2-enriched tumors have a high mutation rate of p53 gene and relatively poor tumor differentiation. This type is relatively sensitive to targeted molecular therapy but has a poor prognosis. HER2-enriched tumors are widely treated with Herceptin combined with systemic chemotherapy.

4)基底细胞型(Basal-like)4) Basal-like

基底细胞型肿瘤是侵入性最强的肿瘤,即三阴性肿瘤(ER-,HER2-,PR-)。基底细胞型肿瘤患者对目前的乳腺癌治疗方案都不敏感,一般临床预后较差;但如果肿瘤组织免疫相关基因表达强,预后则相对好。Basal cell tumors are the most invasive tumors, namely triple negative tumors (ER-, HER2-, PR-). Basal cell tumor patients are not sensitive to current breast cancer treatment options and generally have a poor clinical prognosis; however, if the tumor tissues have strong immune-related gene expression, the prognosis is relatively good.

5)免疫增强型(Immune-enhanced)5) Immune-enhanced

此型较管腔A型预后稍差,但好于其它三种亚型。免疫基因高表达,因而对各种辅助治疗相对敏感。This type has a slightly worse prognosis than the lumen type A, but is better than the other three subtypes. Immune genes are highly expressed and therefore relatively sensitive to a variety of adjuvant therapies.

通过计算不同亚型发生远处转移的数量和时间,绘制Kaplan-Meier生存曲线可以获得相应的转移风险。如图2所示,上述五个亚型的远处转移风险不同。管腔A型的远处转移风险最低,属低危亚型;对于5年远处转移风险来说,管腔B型低于HER2富集型和基底细胞型;对于10年远处转移风险来说,管腔B型、HER2富集型和基底细胞型三者无显著差异,均属高风险亚型,而免疫增强型则介于两组之间,属中危亚型。By calculating the number and time of distant metastases in different subtypes, the Kaplan-Meier survival curve can be drawn to obtain the corresponding metastatic risk. As shown in Figure 2, the distant transfer risks of the five subtypes are different. Luminous type A has the lowest risk of distant metastasis, which is a low-risk subtype. For 5-year distant metastatic risk, luminal type B is lower than the HER2 enriched type and basal cell type. It is said that the lumen B type, HER2 enriched type and basal cell type are not significantly different, and they are all high-risk subtypes, while the immune-enhancing type is between the two groups and belongs to intermediate-risk subtypes.

2、免疫指数对不同亚型远处转移风险的影响2.Impact of immune index on risk of distant metastasis in different subtypes

根据17个免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC的表达水平计算免疫指数,根据免疫指数可将每个亚型进一步分为两组,即,免疫强的组和免疫弱的组,并观察两组之间的转移风险差异。其中,管腔B型(Luminal B)、基底细胞型(Basal-like)及HER2富集型(HER2-enriched)亚型中,免疫强的病例的远处转移风险显著低于(P<0.05)免疫弱的病例;而管腔A型(Luminal A),免疫强的病例和免疫弱的病例的远处转移风险差异不显著(图3)。Calculate the immune index based on the expression levels of 17 immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC. Each subtype can be further divided into two groups based on the immune index, that is, the strong immune group and the weak immune group, and the difference in metastatic risk between the two groups is observed. Among them, Lumenal B, Basal-like and HER2-enriched subtypes, the risk of distant metastasis was significantly lower in patients with strong immunity (P <0.05). Immunely weak cases; Lumenal A, immunologically weak cases and distantly metastatic cases showed no significant difference in distant metastatic risk (Figure 3).

通过17个免疫基因标化后的表达水平平均值可以获得免疫指数:The immune index can be obtained by averaging the expression levels of the 17 immune genes:

Figure PCTCN2019109459-appb-000007
Figure PCTCN2019109459-appb-000007

3、远处转移风险评估3.Distant risk assessment

肿瘤远处转移风险的计算采用Cox模型,以远处转移是否发生及发生时间作为观察终点,根据肿瘤的亚型、免疫指数和增殖指数对于远处转移发生影响的相对危险度确定相应系数,计算方法如下:The Cox model is used to calculate the risk of distant metastasis of the tumor. The occurrence and time of distant metastasis are used as the observation endpoint. Corresponding coefficients are determined based on the relative risk of the tumor subtype, immune index, and proliferation index for distant metastasis. Methods as below:

远处转移风险评分(Risk of Recurrence Score,RRS)的计算:0-100Calculation of distant metastasis risk score (Risk of Recurrence Score, RRS): 0-100

0-32,低风险;33-49,中风险;50-100,高风险;0-32, low risk; 33-49, medium risk; 50-100, high risk;

Figure PCTCN2019109459-appb-000008
Figure PCTCN2019109459-appb-000008

RRS=0.2x Basal+0.4x HER2–0.2x Immune-0.1x LumA+0.2x LumB+0.3x增殖指数-0.1x免疫指数RRS = 0.2x Basal + 0.4x HER2-0.2x Immune-0.1x LumA + 0.2x LumB + 0.3x proliferation index-0.1x immune index

其中,“Basal”代表该肿瘤与基底细胞型肿瘤的pearson相关系数;Among them, "Basal" represents the pearson correlation coefficient between the tumor and the basal cell tumor;

“HER2”代表该肿瘤与HER2富集型肿瘤的pearson相关系数;"HER2" represents the pearson correlation coefficient between the tumor and the HER2-enriched tumor;

“LumA”代表该肿瘤与管腔A型肿瘤的pearson相关系数;"LumA" represents the pearson correlation coefficient between this tumor and lumen type A tumor;

“LumB”代表该肿瘤与管腔B型肿瘤的pearson相关系数;"LumB" represents the pearson correlation coefficient between the tumor and luminal type B tumor;

“Immune”代表该肿瘤与免疫增强型肿瘤的pearson相关系数。"Immune" represents the pearson correlation coefficient between the tumor and the immune-enhanced tumor.

如图4所示,根据所计算得出的远处转移风险评分,可将肿瘤远处转移的风险分为三组,低风险(0-32)、中风险(33-49)和高风险(50-100)。As shown in Figure 4, according to the calculated distant metastasis risk score, the risk of distant metastasis of the tumor can be divided into three groups, low risk (0-32), medium risk (33-49), and high risk ( 50-100).

对于低风险组病例,如果淋巴结检查阴性,则不建议进行辅助化疗。For patients in the low-risk group, adjuvant chemotherapy is not recommended if the lymph node test is negative.

实施例3:评估高危乳腺癌亚型免疫相关基因的引物和探针设计及实验室验证Example 3: Primer and Probe Design and Laboratory Validation to Evaluate Immune-Related Genes in High-risk Breast Cancer Subtypes

实验方法:采用基于Illumina中通量二代测序平台MiSeq的靶向RNA-seq技术,检测300例中国乳腺癌病人新鲜肿瘤组织或石蜡包埋的肿瘤组织中实施例1所筛选得到的包含免疫相关基因在内的72个基因的表达水平,作为进一步进行乳腺癌分型的依据。Experimental method: Targeted RNA-seq technology based on Illumina mid-throughput second-generation sequencing platform MiSeq was used to detect fresh tumor tissue or paraffin-embedded tumor tissue of 300 Chinese breast cancer patients. The expression levels of 72 genes including genes were used as the basis for further breast cancer typing.

实验结果:Experimental results:

1、检测实施例1所筛选得到的72个基因的72对引物见表2。1. 72 pairs of primers for detecting 72 genes screened in Example 1 are shown in Table 2.

表2:本发明的基因群的基因及其扩增引物Table 2: Genes of the gene group of the present invention and their amplification primers

Figure PCTCN2019109459-appb-000009
Figure PCTCN2019109459-appb-000009

Figure PCTCN2019109459-appb-000010
Figure PCTCN2019109459-appb-000010

Figure PCTCN2019109459-appb-000011
Figure PCTCN2019109459-appb-000011

Figure PCTCN2019109459-appb-000012
Figure PCTCN2019109459-appb-000012

Figure PCTCN2019109459-appb-000013
Figure PCTCN2019109459-appb-000013

Figure PCTCN2019109459-appb-000014
Figure PCTCN2019109459-appb-000014

2、二代测序数据库的建立。所有实施例3中所述的300例中国乳腺癌病人的新鲜肿瘤组织或石蜡包埋的肿瘤组织进行二代测序的原始数据都会上传到一个基于网络的数据储存和分析库。这一方法采用Java软件开发并应用了许多J2EE(Java Enterprise Edition)组分和模式,能够1)、直接从Illumina NextSeq/MiSeq/MiniSeq/iSeq仪器输入数据;2)、可以灵活的方式显示录入的数据,并可按不同的需求,如基因相关性、样本或实验组别,进行索引;3)、计算以看家基因标化后的基因表达谱;4)、分析特定元素的细节;5)、以不同的格式输出数据,如XML、excel和文本格式;6)、可安全管理、确保数据隐私保护。2. Establishment of second-generation sequencing database. The raw data of all next-generation sequencing of fresh tumor tissue or paraffin-embedded tumor tissue of 300 Chinese breast cancer patients described in Example 3 will be uploaded to a web-based data storage and analysis database. This method uses Java software to develop and apply many J2EE (Java Enterprise Edition) components and modes. It can 1) directly input data from Illumina NextSeq / MiSeq / MiniSeq / iSeq instruments; 2) it can display the entered data in a flexible manner. Data, and can be indexed according to different needs, such as gene correlation, samples or experimental groups; 3) calculation of gene expression profiles after normalization of housekeeping genes; 4) analysis of specific element details; 5) , Output data in different formats, such as XML, excel and text formats; 6), can be safely managed to ensure data privacy protection.

3、300例中国乳腺癌的亚型、增殖指数及免疫指数结果(表3)。3. Subtype, proliferation index and immune index results of 300 cases of Chinese breast cancer (Table 3).

通过合作医院获得中国乳腺癌患者手术后石蜡样本,按前述步骤提取RNA,进行质检,反转录建库,采用Illumina NextSeq/MiSeq/MiniSeq/iSeq平台进行RNA测序,检测72基因表达水平,并进行亚型分型及远处转移风险(RRS,在本文中,又可称为远处复发风险或Precitype Risk Score,简称PRS)计算,最终获得300例样本检测结果。如上文所述,Mixed在本实施例的统计结果中并不纳入。Obtain postoperative paraffin samples from Chinese breast cancer patients through partner hospitals, extract RNA in accordance with the previous steps, perform quality inspection, build a library by reverse transcription, use the Illumina NextSeq / MiSeq / MiniSeq / iSeq platform for RNA sequencing, and detect 72 gene expression levels The subtype classification and distant metastasis risk (RRS, also referred to as distant recurrence risk or Precitype Risk Score (PRS) in this paper) were calculated, and 300 samples were finally obtained. As mentioned above, Mixed is not included in the statistical results of this embodiment.

表3. 300例中国乳腺癌病例的72基因分子分型及远处转移风险检测结果Table 3. Molecular genotyping of 72 genes and detection of distant metastatic risk in 300 Chinese breast cancer cases

Figure PCTCN2019109459-appb-000015
Figure PCTCN2019109459-appb-000015

实施例4:评估乳腺癌分子分型及远处转移风险基因群的二代测序检测试剂盒分析方法Example 4: Analysis method of second-generation sequencing test kit for assessing breast cancer molecular typing and distant metastasis risk gene groups

步骤1:取检测对象肿瘤或石蜡包埋组织,利用检测试剂盒中的方法获取检测对象含肿瘤细胞高的区域为原始材料。Step 1: Take the tumor or paraffin-embedded tissue of the test object, and use the method in the test kit to obtain the region of the test object that contains high tumor cells as the original material.

步骤2:提取组织中总RNA。可以使用Roche公司生产的RNA抽提试剂盒来提取组织中的RNA(产品号为Roche Catalog Number#3270289001)或者Qiagen公司的RNA抽提试剂盒来提取(Qiagen RNease FFPE kit,Catalog Number#73504)。Step 2: Extract total RNA from the tissue. You can use RNA extraction kits produced by Roche to extract RNA from tissues (product number is RocheCatalogNumber # 3270289001) or Qiagen's RNA extraction kits to extract (QiagenRNeaseFFFF kit, CatalogNumber # 73504).

步骤3:将所得RNA制成可供测序的文库。将所得组织的RNA制成可供靶向RNA-seq技术二代测序的文库,文库的制备方法包括以下步骤:Step 3: The obtained RNA is made into a library for sequencing. The RNA of the obtained tissue is made into a library that can be used for targeted RNA-seq second-generation sequencing. The method for preparing the library includes the following steps:

将提取组织的RNA在专一引物的指导下用反转录酶生成感兴趣的多种基因(如表1所述的72个基因)cDNA。末端补平并进行5’端磷酸化,将30μlDNA、45μl纯水、10μl具有10mM ATP的T4 DNA连接酶缓冲液、4μl包含10mM dNTP Mix、5μl T4 DNA聚合酶、1μl Klenow酶、5μl Τ4连接酶混合后,在20℃温浴30分钟(试剂来自Illumina样本准备试剂盒PE-102-1001),温浴后釆用QIAGEN QIAquick PCR纯化试剂盒(part#28104)纯化DNA。末端悬A:将上步的产物溶解在32μl缓冲液中,加入Klenow缓冲液5μl,1mM dATP 10μl、Klenow Εχο-3μl,在37℃保持30分钟(试剂来自Illumina样本准备试剂盒),产 物由QIAGEN MinElute PCR纯化试剂盒(part#28004)连接:DNA溶解在10μl缓冲液中,加入DNA连接酶缓冲液2χ25μl、PE Adapter Oligo Mix 10μl,DNA连接酶5μl,在20℃保持15分钟(试剂为Illumina样本准备试剂盒PE-102-1001),温浴后釆用QIAGEN QIAquick PCR纯化试剂盒(part#28104)纯化DNA即得文库。The RNA of the extracted tissue was generated under the guidance of a specific primer using reverse transcriptase to generate cDNAs of a variety of genes of interest (72 genes described in Table 1). The ends were filled and phosphorylated at the 5 ′ end. 30 μl of DNA, 45 μl of purified water, 10 μl of T4 DNA ligase buffer with 10 mM ATP, 4 μl of 10 mM dNTP Mix, 5 μl of T4 DNA polymerase, 1 μl of Klenow enzyme, 5 μl of Τ4 After mixing, incubate at 20 ° C for 30 minutes (reagents from Illumina sample preparation kit PE-102-1001). After incubation, the DNA was purified using QIAGEN QIAquick PCR purification kit (part # 28104). End suspension A: Dissolve the product from the previous step in 32 μl of buffer, add 5 μl of Klenow buffer, 10 μl of 1 mM dATP, and 3 μl of Klenow Εχο-3 μl, and keep at 37 ° C for 30 minutes (reagent from Illumina sample preparation kit). MinElute PCR purification kit (part # 28004) ligation: DNA was dissolved in 10 μl buffer, DNA ligase buffer 2 × 25 μl, PE adapter Adapter Oligo Mix 10 μl, DNA ligase 5 μl, and kept at 20 ° C for 15 minutes (reagent is Illumina sample Prepare the kit PE-102-1001). After warming, use the QIAGEN QIAquick PCR purification kit (part # 28104) to purify the DNA to obtain the library.

步骤4:利用实施例3表2中的引物序列对所得DNA文库进行用NextSeq/MiSeq/MiniSeq/iSeq进行二代测序。用Illumina NextSeq/MiSeq/MiniSeq/iSeq测序仪进行双端测序。此过程均由仪器本身自动完成(Illumina公司)。Step 4: Use the primer sequences in Table 2 of Example 3 to perform the next-generation sequencing on the resulting DNA library using NextSeq / MiSeq / MiniSeq / iSeq. Double-ended sequencing was performed with an Illumina NextSeq / MiSeq / MiniSeq / iSeq sequencer. This process is done automatically by the instrument itself (Illumina).

步骤5:结果统计分析。将所得测序结果进行统计分析,根据Hu等提出的单一样品预测法SSP(Single Sample Predictor)或Parker等优化的方法来进行乳腺癌分型和风险预测。对所得测序结果基因表达数据进行分析。Step 5: Statistical analysis of results. The obtained sequencing results were analyzed statistically, and breast cancer typing and risk prediction were performed according to the single sample prediction method (SSP (Single Sample Predictor) or Parker) proposed by Hu et al. The gene expression data of the obtained sequencing results were analyzed.

实施例5:敏感度Example 5: Sensitivity

本发明的检测方法无论是敏感度还是拷贝数检测能力都是最好。本发明实验研究表明用二代测序来测定基因表达谱,其敏感度远高于基因芯片法,同时在检测通量上优于定量PCR和Nanostring法(图5)。The detection method of the present invention is the best in both sensitivity and copy number detection capabilities. The experimental research of the present invention shows that the sensitivity of the second-generation sequencing to determine the gene expression profile is much higher than that of the gene chip method, and at the same time, it is better than the quantitative PCR and Nanostring method in the detection throughput (Figure 5).

实施例6:重复性Example 6: Repeatability

本发明的检测方法重复性高。用石蜡组织RNA做的7个重复实验,相关系数高于0.97。用新鲜冷冻组织的15个重复,相关系数高于0.99(图6)。The detection method of the present invention has high repeatability. Correlation coefficients were higher than 0.97 in 7 repeated experiments with paraffin tissue RNA. With 15 replicates of fresh frozen tissue, the correlation coefficient was higher than 0.99 (Figure 6).

对比例Comparative example

在以下对比例中,将本发明所用的基因群的方案与部分现有技术进行对比。如上文所述,进行分型比较时,并不考虑正常细胞(样)型。In the following comparative examples, the scheme of the gene group used in the present invention is compared with part of the prior art. As described above, when performing typing comparisons, normal cell (like) types are not considered.

对比例1:本发明的方案相对于现有技术的比较Comparative Example 1: Comparison of the scheme of the present invention with respect to the prior art

CN 104293910 A公开了一组基因群,其由60个基因组成(对比I)。将本发明的基因群的方案与其进行对比可见,本发明的方案引入新的免疫增强型,从而可以对乳腺癌的亚型分类更为准确。而且,还可以对这些分型的转移风险进行更为准确的区分。如上文所述,管腔A型远处转移风险显著低于其他四种亚型,而免疫增强型远处转移风险显著低于管腔B型、基底细胞型和HER2富集型。进一步地,还可以更准确地判断不同分型的乳腺癌的预后情况。对于高风险型的乳腺癌亚型中的管腔B型、基底细胞型和HER2富集型,免疫功能强可显著降低乳腺癌远处转移风险。CN 104293910A discloses a group of gene groups, which are composed of 60 genes (Comparative I). It can be seen from the comparison between the scheme of the gene group of the present invention and the scheme of the present invention that a new immune-enhancing type is introduced, so that the subtype of breast cancer can be classified more accurately. Moreover, these types of transfer risks can be more accurately distinguished. As mentioned above, the risk of distant metastasis of luminal type A is significantly lower than the other four subtypes, while the risk of distant metastasis of immune-enhanced type is significantly lower than that of luminal type B, basal cell type and HER2 enriched type. Furthermore, the prognosis of breast cancer with different types can be judged more accurately. For lumen B, basal cell, and HER2-enriched types in high-risk breast cancer subtypes, strong immune function can significantly reduce the risk of distant metastasis of breast cancer.

另外,采用本发明实施例2所述方法,对2034病例中的具有完整临床信息的 1951例乳腺癌肿瘤进行分型分析和远处转移风险评估,本发明的方案(如表1所述的66个分子分型及远处转移风险和6个看家基因)与对比I进行比较,结果如表4、表5,对结果的具体说明可以参见对比例2。In addition, using the method described in Example 2 of the present invention, 1951 breast cancer tumors with complete clinical information in 2034 cases were subjected to typing analysis and distant metastasis risk assessment. The scheme of the present invention (such as 66 in Table 1) Molecular typing and risk of distant metastasis and 6 housekeeping genes) were compared with comparison I. The results are shown in Tables 4 and 5. For a detailed description of the results, please refer to Comparative Example 2.

表4Table 4

Figure PCTCN2019109459-appb-000016
Figure PCTCN2019109459-appb-000016

表5table 5

Figure PCTCN2019109459-appb-000017
Figure PCTCN2019109459-appb-000017

由上表可见,除了上述优势之外,与现有技术相比,本发明的方案能够显著降低Mixed病例的比例。本发明的方案相对于现有技术的方案,提供了更完备的分析系统,例如增加了分型的种类。同时,可以实现更为准确的分型。此外还大大减少了Mixed病例的量。采用本发明的方案效果明显优于现有技术的方案,中风险的比例显著降低。As can be seen from the above table, in addition to the above advantages, compared with the prior art, the scheme of the present invention can significantly reduce the proportion of Mixed cases. Compared with the solution of the prior art, the solution of the present invention provides a more complete analysis system, for example, the types of typing are increased. At the same time, more accurate typing can be achieved. It also significantly reduced the number of Mixed cases. The effect of the scheme of the present invention is obviously better than that of the prior art, and the proportion of medium risk is significantly reduced.

对比例2:相关基因选择的影响Comparative Example 2: Impact of related gene selection

1.对于分子分型的影响1. Impact on molecular typing

为了更清楚地比较基因群选择的作用,以下将本发明的方案(如表1所述的66个分子分型及远处转移风险和6个看家基因)与对比II和对比III进行比较,以对基因的选择进行更详细的分析。方法参见对比例1。结果参见表6。In order to compare the role of gene group selection more clearly, the scheme of the present invention (66 molecular typing and distant metastatic risk and 6 housekeeping genes described in Table 1) is compared with comparison II and comparison III below. For a more detailed analysis of gene selection. For the method, see Comparative Example 1. The results are shown in Table 6.

对比II(59个基因):将上述对比I中的20个增殖相关基因替换为本发明方案中所用的19个增殖相关基因。Comparison II (59 genes): The 20 proliferation-related genes in the above comparison I were replaced with 19 proliferation-related genes used in the scheme of the present invention.

对比III(77个基因):将上述对比I加上本发明中所用的17个免疫相关基因。Comparison III (77 genes): The above comparison I was added to the 17 immune-related genes used in the present invention.

表6Table 6

方案Program Mixed的比例Mixed ratio 本发明的方案Solution of the invention 3.73%3.73% 对比I(60基因)Contrast I (60 genes) 14.65%14.65% 对比II(59基因)Contrast II (59 genes) 12.70%12.70% 对比III(77基因)Contrast III (77 genes) 9.43%9.43%

由上表可见,与现有技术相比,本发明的方案能够显著降低Mixed病例的比 例。由上表所示的本发明的方案和对比II之间的差异或者对比I和对比III之间的差异可知,本发明的方案首次将免疫调控基因引入乳腺癌分子分型当中,使得Mixed病例的比例显著降低。As can be seen from the above table, compared with the prior art, the scheme of the present invention can significantly reduce the proportion of Mixed cases. From the differences between the protocol of the present invention and Comparative II or the difference between Comparative I and Comparative III shown in the table above, it can be known that the protocol of the present invention introduces immune regulatory genes into the molecular typing of breast cancer for the first time, making the mixed case The proportion is significantly reduced.

由对比I和对比II之间的差异可知,仅改变增殖相关基因(将现有技术方案中的增殖相关基因替换为本发明方案中的增殖相关基因)能够使Mixed病例的比例降低,分型方法的效率提高。由本发明方案和对比III之间的差异可知,在引入免疫相关基因之后,增殖相关基因的改变对于分型方法的效率的影响尤为显著。本发明方案中包含的增殖相关基因与免疫相关基因的组合,其效果优于现有技术。From the difference between comparison I and comparison II, it can be known that only changing the proliferation-related genes (replacement of the proliferation-related genes in the prior art scheme with the proliferation-related genes in the scheme of the present invention) can reduce the proportion of Mixed cases, and the typing method Improved efficiency. It can be known from the difference between the scheme of the present invention and the comparison III that after the introduction of immune-related genes, the change of the proliferation-related genes has a particularly significant effect on the efficiency of the typing method. The combination of the proliferation-related genes and the immune-related genes contained in the scheme of the present invention has better effect than the prior art.

2.对于远处转移风险评估的影响2. Impact on distant transfer risk assessment

按照实施例2的方法,以下将本发明的方案(如表1所述的66个分子分型及远处转移风险和6个看家基因)与对比I、对比II和对比III进行比较。根据基因在上文所述1951例乳腺癌中的表达水平计算远处转移风险评分,并将肿瘤远处转移的风险分为三组:低风险、中风险和高风险,结果参见表7,其中示出在临床治疗上指导意义较差的“中风险”类型在总样本中所占的比例。In accordance with the method of Example 2, the protocol of the present invention (66 molecular typing and distant metastatic risk and 6 housekeeping genes described in Table 1) is compared with Comparative I, Comparative II, and Comparative III below. The distant metastasis risk score was calculated based on the gene expression level in the 1951 breast cancers described above, and the risk of distant metastasis was divided into three groups: low risk, medium risk, and high risk. The results are shown in Table 7, where Shows the proportion of "medium risk" types that are less instructive in clinical treatment in the total sample.

表7Table 7

方案Program 中风险的比例Risk ratio 本发明的方案Solution of the invention 16.50%16.50% 对比I(60基因)Contrast I (60 genes) 25.83%25.83% 对比II(59基因)Contrast II (59 genes) 23.58%23.58% 对比III(77基因)Contrast III (77 genes) 21.22%21.22%

由上表可见,采用本发明的方案效果明显优于现有技术的方案,中风险的比例显著降低。As can be seen from the above table, the effect of the scheme of the present invention is significantly better than that of the prior art, and the proportion of risks is significantly reduced.

虽然本发明已阐述并描述了典型的实施方案,但本发明并不限于所述细节。由于各种可能的修改和替换没有背离本发明的精神,本领域技术人员可使用常规试验能够想到的本发明的变型和等同物,因此所有这些变型和等同物都落入由所附权利要求书所定义的本发明的精神和范围内。Although the invention has been illustrated and described with typical embodiments, the invention is not limited to the details. Since various possible modifications and substitutions do not depart from the spirit of the present invention, those skilled in the art can use the variations and equivalents of the present invention that can be conceived by routine experimentation, so all these variations and equivalents fall within the scope of the appended claims. Within the spirit and scope of the invention as defined.

Claims (16)

一种用于乳腺癌分子分型和/或评估其远处转移风险的基因群,其特征在于,A genetic group for molecular typing of breast cancer and / or assessing its risk of distant metastasis, characterized in that: 所述基因群包括66个分子分型及远处转移风险评估相关基因,其中,The gene group includes 66 genes related to molecular typing and distant metastasis risk assessment, of which, 所述66个分子分型及远处转移风险评估相关基因包括:The 66 molecular typing and distant metastasis risk assessment related genes include: (1)增殖相关基因ASPM、AURKA、BIRC5、CCNB1、CDC20、CDK1、CENPU、CEP55、MELK、MKI67、NEK2、PRC1、PTTG1、RRM2、TOP2A、TPX2、TYMS、UBE2C和ZWINT,(1) Proliferation-related genes ASPM, AURKA, BIRC5, CCNB1, CDC20, CDK1, CENPU, CEP55, MELK, MKI67, NEK2, PRC1, PTTG1, RRM2, TOP2A, TPX2, TYMS, UBE2C and ZWINT, (2)免疫相关基因APOBEC3G、CCL5、CCR2、CD2、CD3D、CD52、CD53、CORO1A、CXCL9、GZMA、GZMK、HLA-DMA、HLA-DQA1、IL2RG、LCK、LYZ和PTPRC,(2) Immune-related genes APOBEC3G, CCL5, CCR2, CD2, CD3D, CD52, CD53, CORO1A, CXCL9, GZMA, GZMK, HLA-DMA, HLA-DQA1, IL2RG, LCK, LYZ and PTPRC, (3)基底细胞相关基因ACTR3B、CDH3、EGFR、FOXC1、KRT14、KRT17、KRT5、MIA、MYC、PHGDH和SFRP1,(3) basal cell related genes ACTR3B, CDH3, EGFR, FOXC1, KRT14, KRT17, KRT5, MIA, MYC, PHGDH and SFRP1, (4)雌激素受体相关基因BAG1、BCL2、BLVRA、CD68、ESR1、FOXA1、GSTM1、MAPT、MDM2、MLPH、NAT1、PGR、SCUBE2和SLC39A6,(4) estrogen receptor related genes BAG1, BCL2, BLVRA, CD68, ESR1, FOXA1, GSTM1, MAPT, MDM2, MLPH, NAT1, PGR, SCUBE2 and SLC39A6, (5)HER2相关基因ERBB2、FGFR4和GRB7,(5) HER2 related genes ERBB2, FGFR4 and GRB7, (6)侵袭相关基因CTSL2和MMP11。(6) Invasion related genes CTSL2 and MMP11. 如权利要求1所述的基因群,其特征在于,The gene group according to claim 1, wherein: 所述基因群还包括看家基因;The gene group also includes a housekeeping gene; 优选地,所述看家基因包括以下中的至少一个、优选至少3个、最优选6个:GAPDH、GUSB、MRPL19、PSMC4、SF3A1和TFRC。Preferably, the housekeeping gene includes at least one, preferably at least 3, and most preferably 6 of the following: GAPDH, GUSB, MRPL19, PSMC4, SF3A1, and TFRC. 如权利要求1或2所述的基因群在进行乳腺癌分子分型和/或评估其远处转移风险中的应用。The use of the gene group according to claim 1 or 2 for performing breast cancer molecular typing and / or assessing the risk of distant metastasis. 如权利要求1或2所述的基因群在制备对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品中的应用。The use of the gene group according to claim 1 or 2 in the preparation of a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastasis. 检测权利要求1或2所述的基因群中的基因的表达水平的试剂在制备对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品中的应用。Use of an agent for detecting the expression level of a gene in the gene group according to claim 1 or 2 for preparing a diagnostic product for molecularly typing breast cancer and / or assessing the risk of distant metastasis. 一种对乳腺癌进行分子分型和/或评估其远处转移风险的诊断产品,其包含检测权利要求1或2的基因群中基因的表达水平的相关试剂。A diagnostic product for molecularly typing breast cancer and / or assessing its risk of distant metastasis, comprising a reagent for detecting the expression level of a gene in the gene group according to claim 1 or 2. 如权利要求5或6所述的应用或诊断产品,其特征在于,所述诊断产品为体外诊断产品的形式,优选诊断试剂盒的形式。The application or diagnostic product according to claim 5 or 6, wherein the diagnostic product is in the form of an in vitro diagnostic product, preferably in the form of a diagnostic kit. 如权利要求5-7之一所述的应用或诊断产品,其特征在于,所述试剂为检测所述基因转录的RNA,特别是mRNA的量的试剂。The application or diagnostic product according to any one of claims 5 to 7, wherein the reagent is a reagent for detecting the amount of RNA, particularly mRNA, transcribed by the gene. 如权利要求5-8之一所述的应用或诊断产品,其特征在于,所述试剂为检测与所述mRNA互补的cDNA的量的试剂。The application or diagnostic product according to any one of claims 5 to 8, wherein the reagent is a reagent for detecting the amount of cDNA complementary to the mRNA. 如权利要求5-9之一所述的应用或诊断产品,其特征在于,所述诊断产品还包括总RNA抽提试剂、逆转录试剂和/或二代测序试剂。The application or diagnostic product according to any one of claims 5-9, wherein the diagnostic product further comprises a total RNA extraction reagent, a reverse transcription reagent, and / or a second-generation sequencing reagent. 如权利要求5-10之一所述的应用或诊断产品,其特征在于,所述试剂为检测所述基因编码的多肽的量的试剂,优选地,所述试剂为抗体、抗体片段或者亲和性蛋白。The application or diagnostic product according to any one of claims 5 to 10, wherein the reagent is a reagent for detecting an amount of a polypeptide encoded by the gene, and preferably the reagent is an antibody, an antibody fragment, or an affinity Sexual protein. 如权利要求5-11之一所述的应用或诊断产品,其特征在于,所述试剂为探针或引物,优选引物。The application or diagnostic product according to any one of claims 5 to 11, wherein the reagent is a probe or a primer, preferably a primer. 如权利要求12所述的应用或诊断产品,其特征在于,所述引物的序列如SEQ ID NO.1-SEQ ID NO.144所示。The application or diagnostic product according to claim 12, characterized in that the sequence of the primer is shown as SEQ ID NO.1-SEQ ID NO.144. 一组用于进行乳腺癌分子分型和/或评估其远处转移风险的引物,其中所述引物的序列如SEQ ID NO.1-SEQ ID NO.144所示。A set of primers for molecular typing of breast cancer and / or assessing the risk of distant metastasis, wherein the sequence of said primers is shown in SEQ ID NO.1-SEQ ID NO.144. 如权利要求14所述的引物组在用于制备对乳腺癌进行分子分型和/或评估其远处转移风险的产品中的应用。Use of a primer set according to claim 14 for the production of a product for molecularly typing breast cancer and / or assessing its risk of distant metastasis. 如权利要求1-15之一所述的基因群、应用、诊断产品或引物组,其特征在于,所述乳腺癌的分型包括管腔A型、管腔B型、HER2富集型、基底细胞型及免疫增强型。The gene group, application, diagnostic product or primer set according to any one of claims 1 to 15, wherein the typing of breast cancer comprises a lumen A type, a lumen B type, a HER2 enriched type, and a basal Cellular and immune-enhanced.
PCT/CN2019/109459 2018-09-30 2019-09-30 Gene group used for breast cancer molecular typing and distant metastasis risk, diagnostic product, and application Ceased WO2020064006A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201980006647.6A CN111566229B (en) 2018-09-30 2019-09-30 Breast cancer molecular typing and distant metastasis risk gene groups and diagnostic products and applications

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811166148.7 2018-09-30
CN201811166148.7A CN110964815A (en) 2018-09-30 2018-09-30 Breast cancer molecular typing and distant metastasis risk gene group, diagnosis product and application

Publications (2)

Publication Number Publication Date
WO2020064006A2 true WO2020064006A2 (en) 2020-04-02
WO2020064006A3 WO2020064006A3 (en) 2020-04-30

Family

ID=69950303

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/109459 Ceased WO2020064006A2 (en) 2018-09-30 2019-09-30 Gene group used for breast cancer molecular typing and distant metastasis risk, diagnostic product, and application

Country Status (2)

Country Link
CN (2) CN110964815A (en)
WO (1) WO2020064006A2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114672556A (en) * 2020-12-25 2022-06-28 上海善准生物科技有限公司 Colorectal cancer molecular typing and survival risk gene group, diagnosis product and application
CN114480652A (en) * 2022-02-21 2022-05-13 深圳市陆为生物技术有限公司 Products to evaluate responsiveness of breast cancer patients to adjuvant endocrine therapy

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK3301446T3 (en) * 2009-02-11 2020-06-29 Caris Mpi Inc Molecular tumor profiling
CN102586410A (en) * 2011-01-18 2012-07-18 苏州科贝生物技术有限公司 Reagent kit for quantitatively assessing long-term recurrence risks of breast cancer
HK1222888A1 (en) * 2013-05-13 2017-07-14 Nanostring Technologies, Inc. Methods to predict risk of recurrence in node-positive early breast cancer
CN105849278A (en) * 2014-01-08 2016-08-10 李铮 Lectin chip for identifying liver series of disease based on salivary glycoprotein sugar chain and application thereof
CN104293910B (en) * 2014-07-23 2016-07-06 周彤 One group of assessment breast cancer molecular parting gene group and detection kit thereof

Also Published As

Publication number Publication date
WO2020064006A3 (en) 2020-04-30
CN111566229A (en) 2020-08-21
CN110964815A (en) 2020-04-07
CN111566229B (en) 2021-03-26

Similar Documents

Publication Publication Date Title
CA3081061C (en) Method for using expression of klk2 to determine prognosis of prostate cancer
US20230366034A1 (en) Compositions and methods for diagnosing lung cancers using gene expression profiles
EP2504451B1 (en) Methods to predict clinical outcome of cancer
JP5420247B2 (en) Methods for classification and prognosis of hepatocellular carcinoma
CN102575295B (en) Can be used for the method for detection BRAF sudden change, primer, probe and test kit
Vazquez-Mena et al. Amplified genes may be overexpressed, unchanged, or downregulated in cervical cancer cell lines
Colacino et al. Comprehensive analysis of DNA methylation in head and neck squamous cell carcinoma indicates differences by survival and clinicopathologic characteristics
CN110964814B (en) Primers, compositions and methods for nucleic acid sequence variation detection
CN116761899B (en) Colorectal cancer molecular typing and survival risk gene groups and diagnostic products and applications
WO2021036620A1 (en) Application of a group of genes related to ovarian cancer prognosis
CN105683391A (en) Tert promoter mutations in gliomas and subset of tumors
JP2016515800A (en) Gene signatures for prognosis and treatment selection of lung cancer
WO2017112738A1 (en) Methods for measuring microsatellite instability
US20180044736A1 (en) Biomarker based prognostic model for predicting overall survival in patients with metastatic clear cell kidney cancer
WO2021175236A1 (en) Interferon signaling pathway-related gene cluster, diagnostic product, and application
WO2020064006A2 (en) Gene group used for breast cancer molecular typing and distant metastasis risk, diagnostic product, and application
CN106498028B (en) Diagnostic method and kit for T790M mutation of EGFR
CN108456730B (en) Application of recurrence risk gene group as marker in preparation of product for evaluating recurrence risk at distant place in breast cancer molecular typing
US9708666B2 (en) Prognostic molecular signature of sarcomas, and uses thereof
CN110295218B (en) Method for quantifying mutant allele burden of target gene
WO2015127103A1 (en) Methods for treating hepatocellular carcinoma
WO2012045019A2 (en) Brca deficiency and methods of use
CN110964833B (en) A kit for detecting KRAS and BRAF gene mutations in plasma cell-free DNA in one tube
US20140024028A1 (en) Brca deficiency and methods of use
HK40107615A (en) Methods to predict clinical outcome of cancer

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19865959

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19865959

Country of ref document: EP

Kind code of ref document: A2