WO2024074558A1 - Improved methods for detecting cancer - Google Patents
Improved methods for detecting cancer Download PDFInfo
- Publication number
- WO2024074558A1 WO2024074558A1 PCT/EP2023/077458 EP2023077458W WO2024074558A1 WO 2024074558 A1 WO2024074558 A1 WO 2024074558A1 EP 2023077458 W EP2023077458 W EP 2023077458W WO 2024074558 A1 WO2024074558 A1 WO 2024074558A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- dna
- crc
- methylation
- seq
- control
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
Definitions
- the present invention relates to the field of pharmacogenomics and in particular to detecting the presence or absence of circulating tumor DNA in blood or blood-derived samples or in other body fluids that contain DNA released from a tumor and is determined by tumor specific DNA methylation markers, in particular methylated ANKRD13B and/or SEPTIN9 DNA derived from a tumor. Based upon the presence and/or amount of DNA methylation and the age of the subject, the presence of cancer or a tumor is detected by a multivariate model, capable of discriminating subjects suffering from cancer, from non-cancer subjects, in particular colorectal cancer (CRC), from non-CRC subjects. This detection is useful for a minimally or non-invasive as well as early and reliable diagnosis of colorectal cancer.
- the invention provides methods and oligonucleotides suitable for this purpose.
- Colorectal cancer encompasses tumors originating from the colon and rectum. It is the third most common cancer worldwide, but the second most common cancer killer. When colorectal cancer is found at an early stage, the 5-year relative survival rate is about 90%. At advanced stages, however, colorectal cancer survival rates are very low.
- Conventional CRC screening involved either visual exams or stool-based tests. Visual exams look at the structure of the colon and rectum for abnormal areas using a scope put into the rectum (e.g. colonoscopy or sigmoidoscopy) or non-invasive imaging techniques (e.g. x-ray or CR coIonography (virtual colonoscopy)).
- Stool tests such as FIT (Fecal immunochemical test) or gFOBT (Guaiac-based fecal occult blood test) usually detect blood or polyps in stool samples. Stool tests have relatively low sensitivity and specificity and are also problematic with regard to participants’ compliance, satisfaction and intention to be re-screened. Invasive visual exams are uncomfortable and incur a risk of bleeding, tears and infection. Therefore, they are often avoided by at-risk subjects. Non-invasive imaging techniques expose the subjects to radiation and often miss small polyps.
- DNA methylation patterns are largely modified in cancer cells and can therefore be used to distinguish cancer cells from normal tissues. As such, DNA methylation patterns are being used to diagnose all sorts of cancers.
- One of the challenges is diagnosing a cancer as early as possible, because a less advanced cancer, which has smaller tumor size and less cancer cells, releases less free circulating tumor DNA that is available for methylation detection.
- This problem concerns for example elderly people, as it is known that the risk of cancer increases with age, or subjects at risk of diseases, to be monitored, or patients to be monitored for remission of disease or for relapse.
- Methylated ANKRD13B and SEPTIN9 DNA has been used previously for CRC detection (WO 2017/129716 and WO 2006/113466, respectively).
- the present invention provides improved methods for detecting CRC using mANKRD13B (methylated ANKRD13B DNA) and mSEPTIN9 (methylated SEPTIN9 DNA) as markers, by a multivariate model, capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject.
- the present invention relates to a method for method for compensating age-related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides of said genomic region in a subpopulation of methylated DNA molecules, from diseased or disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- the present invention relates to a method for compensating age- related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides in said genomic region in at least a subpopulation of DNA molecules derived from disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- the present invention relates to a method for detecting colorectal cancer (CRC) in a subject, comprising the steps: (1) detecting DNA methylation at one or more CpG dinucleotides (i) in the genomic region of or in a sequence in operative contact with the ANKRD13B gene (mANKRD13B) and/or (ii) in the genomic region of or in a sequence in operative contact with the gene SEPTIN9 gene (mSeptin9), by the method of the first and/or second aspect, and (2) entering the presence and/or the amount of DNA methylation and the age of the subject into a multivariate model, wherein the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject; and (3) determining the presence or absence of CRC with the multivariate model of (2) with a pre-determined specificity and/or a pre-determined sensitivity.
- CRC colorectal cancer
- the present invention relates to a method for monitoring a subject suspected of having CRC, having an increased risk of developing colorectal cancer (CRC), or who has had CRC, comprising the steps: (1) detecting DNA methylation at one or more CpG dinucleotides (i) in the genomic region of or in a sequence in operative contact with the ANKRD13B gene (mANKRD13B) and/or (ii) in the genomic region of or in a sequence in operative contact with the gene SEPTIN9 gene (mSeptin9), by the method according to the first and/or second aspect one or more times, and (2) entering the presence and/or the amount of DNA methylation and the age of the subject into a multivariate model, wherein the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject; and (3) determining the presence or absence of CRC with the multivariate model of (2) with a
- the present invention relates to the use of the method of any one of first, second, third, or fourth aspect, for the detection of colorectal cancer (CRC), or for monitoring a subject having an increased risk of developing a proliferative disorder, in particular CRC, suspected of having a proliferative disorder, in particular CRC or that has had a proliferative disorder, in particular CRC.
- CRC colorectal cancer
- the present invention relates to a method of treating CRC in a subject, said method comprising (i) detecting the presence of CRC according to the method of the first, second or third aspect in the subject, or monitoring the subject according to the method of the fourth aspect, and (ii) treating CRC with a treatment regimen suitable for treating CRC.
- Figure 1 Map of preferred target regions in ANKRD13B. See Table 3 for an explanation of the SEQ ID NOs.
- FIG. 1 Map of preferred target regions in SEPTIN9. See Table 3 for an explanation of the SEQ ID NOs.
- Figure 3 Assay examples.
- the figures show primer pairs (listed as A/B, DZE and G/H) and probe oligomer (C, F, I) either directly or - if matching the antisense strand - as reverse complement, their SEQ ID NO, their oligomer type, their name, their start and end position within the assay, and information for the strand they match on in relation to the genomic reference sequence of the amplified assay.
- Bisulfite specific base pairing (T on C for converted sense strand, G on A for converted antisense strand) to the genomic reference is displayed by ‘+’_ Listed from top to bottom are all three methylation specific assays ANKRD13BoBl (also termed herein “ANB1”), ANKRD13BoB2 (also termed herein “ANB2”), SEPTIN9oB2 (also termed herein “S9B2)”.
- FIG. 4 to Figure 7 Receiver operating characteristic (ROC) curves from blood plasma samples based on assessment of three single markers and age in the same order as follows: ANB1, ANB2, S9B2, age. The Sensitivity at a Specificity of 0.9 is shown and the values are provided in the plot. Numbers in the right lower comer show area under the curve (AUC) of ROC curves.
- AUC area under the curve
- FIG 8 and Figure 9 ROC curves from blood plasma samples based on assessment of the marker combination of all three DNA-methylation markers ANB 1+ANB2+S9B2 without (Fig. 8) and with (Fig. 9) age as single plots and in a comparison.
- the Sensitivity at a Specificity of 0.9 is shown and the values are provided in the plot.
- Numbers in the right lower comer show area under the curve (AUC) of ROC curves.
- FIG. 10 and Figure 11 ROC curves from blood plasma samples based on assessment of the marker combination of all three DNA-methylation markers ANB1, ANB2, S9B2 measured in duplicate aggregated to the sum of positive calls without (Fig. 10) and with (Fig. 11) age as single plots and in a comparison.
- the Sensitivity at a Specificity of 0.9 is shown and the values are provided in the plot. Numbers in the right lower comer show area under the curve (AUC) of ROC curves.
- Figure 12 ROC curves of Fig. 8 and Fig. 9.
- Figure 13 ROC curves of Fig. 10 and Fig. 11.
- Figure 14 Simulation of ROC curves from blood plasma samples based on assessment of the marker combination of all three DNA-methylation markers ANB1, ANB2, S9B2 with and without age based on Sensitivity/Specificity pairs from decision trees calculated with different weights for CRC and control group (examples can be found in Figure 15-24). The Sensitivity at a Specificity of 0.9 is shown and the values are provided in the plot.
- FIG 15 to Figure 24 Simulated ROC curves from blood plasma samples based on assessment of the combination of the three markers ANB1, ANB2, S9B2 (Figure 15-18) and additionally in combination with age (Figure 16-24), showing different decision trees for different Sensitivity/Specificity pairs, as highlighted on the ROC curves, that were the base for the points of the curves.
- the terms used herein are defined as described in “A multilingual glossary of biotechnological terms: (IUPAC Recommendations)”, Leuenberger, H.G.W, Nagel, B. and Kolbl, H. eds. (1995), Helvetica Chimica Acta, CH-4010 Basel, Switzerland).
- the inventors discovered that the diagnosis of a disease based on DNA methylation patterns can be improved by performing diagnosis based on a model that combines data on DNA methylation and age in diseased patients and non-diseased patients.
- the invertors discovered that a suitable method is multivariate analysis of data on DNA methylation and age. Specifically, by considering age in the multivariate analysis, specificity and sensitivity of diagnosis by DNA methylation pattern can be improved, compared to an analysis without considering age.
- the inventors have found that the improvement is independent of the multivariate method chosen.
- the example of the invention shows improvement in three independent methods: logistic regression, sum of positive calls and decision trees.
- Example demonstrates that discriminatory power of the biomarkers mANKRD13B and mSeptin9 for diagnosis of colorectal cancer by a DNA methylation pattern can be significantly improved if additionally the age of the subjects is considered in a multivariate model (cf. Figs 12 and 13).
- the present invention relates to a method for compensating age-related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides of said genomic region in a subpopulation of methylated DNA molecules, from diseased or disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- more than one population of methylated DNA may be detected and/or quantified.
- a first population may comprise the sense strand of a genome region
- a second population may comprise the anti-sense strand of the genomic region.
- the present invention relates to a method for compensating age-related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides in said genomic region in at least a subpopulation of DNA molecules derived from disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- treating age-related effects refers to the improvement of the specificity and sensitivity of a diagnostic method by including the age as an additional parameter in a multivariate model, compared to a model without the age of the patient/ subject.
- sensitivity of a diagnostic test refers to the probability of a positive diagnostic test, conditioned on truly being positive (true positive rate).
- specificity of a diagnostic test refers to the probability of a negative test, conditioned on truly being negative (true negative rate).
- the age-related effects are caused by an increasingly aberrant DNA methylation of one or more CpG dinucleotides in the genomic regions of genes derived from cells other than the diseased or disease related cells with increasing age.
- the multivariate model is selected from one or more of the following: a sum of positive marker assay components, a principal component analysis, a logistic regression analysis, a nearest neighbor analysis, a support vector machine, a decision tree, and a neural network model.
- the sum of the positive marker components is based on individual thresholds for positivity for individual marker assay components and/or has a specific weight for individual marker assays components.
- the pre-determined specificity is at least 85 %, at least 90 %, at least 95 %, at least 98 % or at least 99 %.
- the pre-determined specificity is at least 85 %, at least 86 %, at least 87 %, at least 88 %, or at least 89 %, at least 90 %, at least 91 %, at least 92 %, at least 93 %, at least 94 %, at least 95 %, at least 96 %, at least 97 %, at least 98 %, or at least 99 %. It is preferred that the specificity is at least 90 %, or at least 95 %,
- the pre-determined sensitivity is at least 60 %, at least 65 %, at least 70 %, at least 75 %, at least 80 %, or at least 85 %.
- the pre-determined sensitivity is at least 60 %, at least 61 %, at least 62 %, at least 63 %, at least 64 %, at least 65 %, at least 66 %, at least 67 %, at least 68 %, at least 69 %, at least 70 %, at least 71 %, at least 72 %, at least 73 %, at least 74 %, at least 75 %, at least 76 %, at least 77 %, at least 78 %, at least 79 %, at least 80 %, at least 81 %, at least 82 %, at least 83 %, at least 84 %, or at least 85 %. It is preferred that the sensitivity is at least 65 %, at least 62 %, at least 63
- the specificity and the sensitivity may be related, as, for example, described by a ROC curve. It is preferred that the pre-determined sensitivity is selected as large as possible so that still an acceptable specificity can be achieved. For example, the predetermined sensitivity is at least 65 % or at least 70 %, and the pre-determined specificity is at least 85 % or at least 90 %.
- the pre-determined sensitivity is at least 65 %, and the predetermined specificity is at least 90 %.
- the pre-determined sensitivity is at least 65 % and the pre-determined specificity is at least 85 %.
- the pre-determined sensitivity is at least 70 %, and the pre-determined specificity is at least 90 %.
- the pre-determined sensitivity is at least 70 %, and the pre-determined specificity is at least 85 %.
- the disease related cells that are the source of the subpopulation of methylated DNA molecules are from a proliferative disorder.
- the detected or quantified subpopulation of at least one methylated DNA molecule from diseased or disease related cells are measured in a biological sample which comprises at least 10-fold higher number of the same DNA molecules that are unmethylated. More particular, the biological sample comprises at least 20-fold, at least 50-fold, or at least 100-fold higher number of the same DNA molecules that are unmethylated.
- the “population” or “population of DNA molecules” describes a plurality of DNA molecules present in a biological sample.
- subpopulation or “subpopulation of DNA molecules” describes a plurality of DNA molecules comprising one or more CpG dinucleotides in the same genomic region and/or the same sequence.
- the term “subpopulation of methylated DNA molecules” describes a plurality of DNA molecules that are methylated, in particular in view of DNA molecules that are unmethylated in the same genomic region (“the same DNA molecules”).
- the term “subpopulation of methylated DNA molecules” with respect to ANKRD13B and/or SEPTIN9 describes a plurality of DNA molecules that are methylated in the genomic region of or in a sequence in operative contact with the ANKRD13B gene (ANKRD13B target region) and/or in the genomic region of or in a sequence in operative contact with the SEPTIN9 gene (Septin9 target region), in particular in view of the population of the same DNA molecules that are unmethylated.
- the genomic DNA may comprise DNA derived from colorectal cancer (CRC) cells.
- the genomic DNA in particular the genomic DNA derived from CRC cells, is cell-free DNA (cell-free tumour DNA comprised in the total cell-free DNA of the sample) or more specifically circulating tumor DNA (ctDNA).
- CRC colorectal cancer
- the phrase “the genomic DNA may comprise DNA derived from colorectal cancer (CRC) cells” does, in a preferred embodiment, mean that the subject has an increased risk of CRC, is suspected of having CRC or has had CRC (i.e. has been treated to remove any detectable sign of CRC, but is suspected to relapse).
- the term "subject" may have different limitations. For example, if the method is to be used for detecting CRC or screening subjects for CRC, the subject is not known to have CRC, i.e. it may or may not have CRC. In this example, The subject preferably has an increased risk of developing or is suspected to have CRC, or has had CRC, or has CRC. "Increased risk” means that one or more risk factors for cancer generally or for CRC can be attributed to the subject, preferably as defined by the American Cancer Society for cancer generally or for CRC.
- risk factors for CRC are: heavy alcohol use (more than 3 or 4 alcohol units a day for men, or more than 2 or 3 alcohol units a day for women; an alcohol unit is defined as 10 ml (8 g) of pure alcohol), tobacco consumption (in particular smoking, but also including smokeless tobacco), being overweight (Body Mass Index (BMI) of 25 to 29.9) or obese (BMI of 30 or more), especially having a larger waistline, physical inactivity (exercise (sports) for less than 150, preferably 75 minutes per week beyond usual (non-sport) daily activities), diet rich in red meats (such as beef, pork, lamb or liver) and processed meats, age of 35 or older, preferably 40 or older, more preferably 45 or older, personal history of colorectal polyps, colorectal cancer and/or inflammatory bowel disease (e.g.
- ulcerative colitis or Crohn’s disease a familial history of colorectal cancer or adenomatous polyps (preferably first degree relative (parent, sibling or child), more preferably diagnosed at age 45 or younger and/or more than one first degree relative affected), having an inherited syndrome increased CRC risk such as preferably Lynch syndrome (hereditary non-polyposis colorectal cancer or HNPCC) or familial adenomatous polyposis (FAP), but also Peutz-Jeghers syndrome (PJS) or MYH-associated polyposis (MAP), racial and ethnic background with increased risk (e.g. African Americans or Ashkenazi Jews), and having type 2 diabetes.
- the subject has a genetic, epigenetic or other known predisposition for CRC.
- colonal cancer also known as bowel cancer and colon cancer
- CRC colonal cancer
- bowel cancer and colon cancer is used in the broadest sense and refers to all cancers that start in the colon or in the rectum. It includes the subtypes adenocarcinoma (cancer starting in cells that make mucus to lubricate the inside of the colon and rectum), carcinoid tumor (cancer starting from the interstitial cells of Cajal (ICC) in the wall of the colon), lymphoma starting in the colon or rectum, and sarcoma starting in blood vessels, muscle layers, or other connective tissues in the wall of the colon and rectum.
- ICC Cajal
- sarcoma starting in blood vessels, muscle layers, or other connective tissues in the wall of the colon and rectum.
- the most common and preferred CRC with regard to the invention is adenocarcinoma.
- the biological sample is a colon or rectum tissue sample or a liquid biopsy, preferably a blood sample, a blood-derived sample, a urine sample, a urine-derived sample, a saliva sample, or a saliva-derived sample.
- a sample derived from another sample comprises cell-free DNA from that sample.
- Preferred examples of a blood-derived sample are a plasma sample, a serum sample, or a sample derived from plasma or serum.
- a “colon or rectum tissue sample” is a tissue sample from any tissue in which CRC can occur. In one embodiment, if the subject has cancer, it is a CRC tissue sample.
- the sequence a genomic region has, or a sequence in operative contact with a gene is also referred to herein as the target region or target DNA and may be the sequence of the entire corresponding SEQ ID NO, or may be a sequence within having a length as specified below in the section “Definitions and further embodiments of the invention”.
- the target DNAs are also referred to using the designations mANKRD13B and mSEPTIN9.
- the first letter “m” means “methylation marker”, and the capital letters refer to the gene the target DNA resides in (the corresponding genomic region is provided in Table 3).
- SEQ ID NOs which correspond to the designation according to Figure 1 and Table 3, with the order of preference indicated herein.
- the genomic target DNA (the DNA region within which methylation is detected) comprises at least one CpG dinucleotide, preferably at least 2, 3, 4, or 5, most preferably at least 6 (e.g. at least 7, 8, 9, 10, 15 or 30) CpG dinucleotides.
- the methylation of at least one CpG dinucleotide comprised in the genomic DNA is detected, preferably of at least 2, 3, 4, or 5, most preferably at least 6 (e.g. at least 7, 8, 9, 10, 15 or 30) CpG dinucleotides.
- DNA methylation is detected at 7 or more (preferably 8 or more) CpG dinucleotides within the mANKRD13B target region.
- the CpG dinucleotides are in the genomic region of or in a sequence in operative contact with mANKRD13B.
- the CpG dinucleotides are in the genomic region of or in a sequence in operative contact with mSeptin9.
- the CpG dinucleotides are in the genomic region of or in a sequence in operative contact with mANKRD I 3B and mSeptin9.
- Preferred mANKRD13B target regions are as follows: the genomic DNA having a sequence comprised in SEQ ID NO: 11 has a sequence comprised in SEQ ID NO: 6, preferably in SEQ ID NO: 16, more preferably in SEQ ID NO: 1.
- Preferred mSEPTIN9 target regions are as follows: the genomic DNA having a sequence comprised in SEQ ID NO: 31 has a sequence comprised in SEQ ID NO: 36, preferably in SEQ ID NO: 26, more preferably in SEQ ID NO: 21.
- methylation of one or more CpG dinucleotides is detected in a genomic DNA polynucleotide comprised in the: (i) mANKRD13B gene, wherein the genomic DNA polynucleotide consists of genomic DNA having a sequence comprised in SEQ ID NO: 11; (ii) mSEPT9 gene, wherein the genomic DNA polynucleotide consists of genomic DNA having a sequence comprised in SEQ ID NO: 31.
- the method of the first and/or second aspect comprises the steps of
- step (b) detecting DNA methylation within the genomic DNA by detecting unconverted cytosine in the converted DNA of step (a).
- a preferred way of carrying out the method comprises the steps of
- step (c) detecting the presence or absence of DNA amplified in step (b); wherein the presence or absence of amplified DNA indicates the presence or absence, respectively, of methylated genomic DNA.
- step b) of amplifying comprises the use of oligonucleotides as comprised in the kit as described herein.
- the method comprises the steps of
- ctDNA circulating tumor DNA
- Detecting DNA methylation”, “detection or quantification of DNA methylation” or “detection and/or quantification of DNA methylation” is also referred to herein as detecting or determining whether methylated DNA is present or absent.
- Presence of methylated genomic DNA refers to a detectable amount of methylated genomic DNA, specifically detectable by PCR. In particular, it refers to the presence of a significant amount of methylated genomic DNA. A significant amount may be described as at least X molecules of the methylated target DNA per 100 pl of the sample used, preferably per 100 pl of blood, serum, plasma or urine.
- X may be as low as 1 and is usually a value between and including 1 and 50, in particular between and including 1 and 2, 3, 4, 5, 10, 15, 20, 25, 30 or 40. If X is low, the methylated targeted DNA can be detected by using a plurality of PCR reactions, e.g. between and including 2 and 10, or 2 and 5, for instance 3.
- total DNA may be, but does not necessarily have to be quantified or checked for a minimum quantity. This can be done by determining the presence or a minimum amount of control DNA, preferably of a housekeeping gene such as ACTB (actin beta). This minimum amount can be defined as a maximum cycling threshold (Ct) of a real-time PCR, e.g. within a range from 30 to 34 (e.g. 32.1), preferably as would be obtained in an assay according to Example 1. In one embodiment, the minimum amount is, e.g.
- telomere length may be determined for bisulfite-converted genomic DNA derived from blood plasma, 0.1, 0.2, 0.25, 0.3, 0.4 or preferably 0.5 ng per PCR reaction (e.g. about 0.25 ng). This may correspond to approximately the same amounts of total genomic DNA per mL of blood plasma.
- the determination may also be made by comparison to a standard, for example a standard comprising genomic DNA and therein a certain amount of substantially fully methylated DNA, e.g. the equivalence of X genomes, wherein X is as above. Detecting the presence of methylated DNA may comprise such determination whether there is sufficient DNA in the sample, preferably comprising determining the presence or a minimum amount of control DNA.
- the determination whether a significant amount of methylated DNA is present or absent is not necessarily a fully quantitative determination, but is preferably semi- quantitative, e.g. as demonstrated in Example 1.
- the determination of the presence of a significant amount may comprise a relative comparison to a control, wherein an amount higher than in the control is deemed a significant amount.
- the control would be a sample of the same sample type (e.g. plasma or urine) and from a subject who does not have CRC and preferably does not have any cancer, or the mean or average level of methylated target DNA of samples of the same sample type (e.g. plasma or urine) and from subjects who do not have CRC and preferably do not have any cancer.
- the method is carried out using a plurality of reactions for each target region and optionally each strand (sense and anti-sense) thereof (each reaction using a portion of the same sample), or “repetitions” or “replicates”, for example using duplicates or triplicates (or more).
- the results e.g. real-time PCR Ct values
- the results can be aggregated for example by using mean, median, maximum or - preferably - minimum values over repetitions for each target region and optionally each strand (sense and anti-sense) of each target region.
- Aggregated results of the sense and anti-sense strand and optionally also of the target regions can be further combined to a single value, for example to a sum, e.g. after multiplying each aggregated result with predefined weights (for example as trained by logistic regression using a training sample set).
- the aggregated results or the single value (combination of aggregated results) can be used as such and set into relation to a scale, or it can be used to determine a probability that methylated genomic DNA is present, or it can be used to classify the sample as positive or negative for methylated genomic DNA based on a pre-defined threshold that can be optimized for sensitivity/ specificity based on training data.
- each result e.g.
- real-time PCR Ct values of the individual reactions is “binarized”, i.e. classified as positive or negative (1 or 0), e.g. based on a predefined threshold.
- the plurality of binarized results obtained can be combined, e.g. as a sum.
- a real-time PCR for detecting methylation in both target regions (x2), both on the sense and the anti-sense strand (x2) in triplicates (x3) yields 12 (2x2x3) binary results which can be added to give a score of 0 to 12.
- This score can be used to classify the sample as positive or negative for methylated genomic DNA based on a pre-defined threshold (any one of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 and 12, preferably any one of 4, 5, 6, 7, 8, 9, 10, 11 and 12 or any one of 8, 9, 10, 11 and 12).
- a pre-defined threshold any one of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 and 12, preferably any one of 4, 5, 6, 7, 8, 9, 10, 11 and 12 or any one of 8, 9, 10, 11 and 12.
- a significant amount can be deemed to be present if, for example, if a certain number of PCR reactions reaches a pre-defined Ct threshold (e.g. 45, 44, 43, 42, 41, 40. 39, 38, 37, 36 or 35, wherein lower Cts are preferred).
- the certain number may for example be at least 50%, 60%, 70%, 80% or 90% of the total number of PCR reactions.
- Another example of a determination is that a significant amount can be deemed to be present if all replicates of at least one marker assay reaches the pre-defined Ct threshold, or one or more than one (but not all) of the replicates of at least two marker assays reach the pre-defined Ct threshold.
- the detection and/or quantification of the DNA methylation comprises determining the amount of methylated genomic DNA, preferably semi-quantitatively, e.g. by determining the number of amplification cycles necessary to detect an amplificate (real-time PCR Ct value), and/or the number of positive signals (i.e. amplificates detected) in a plurality of PCR reactions (optionally including PCR reactions for both the sense and the anti-sense strand, as described herein).
- the amplification is preferably performed by MSP (methylation-specific PCR) (i.e. an amplicon is produced depending on whether one or more CpG sites are converted or not) using at least one primer that is methylation-specific (and specific to bisulfite-converted DNA).
- primers may be methylation-unspecific, but specific to bisulfite-converted DNA (i.e. hybridize only to converted DNA by covering at least one converted C).
- methylation-specificity is achieved by using methylation-specific blocker oligonucleotides, which hybridize specifically to converted or non-converted CpG sites and thereby terminate the PCR polymerization.
- the step of amplifying comprises a PCR, preferably a real-time PCR, wherein the converted DNA is amplified methylation-specifically by using at least one methylation-specific primer in a pair of primers.
- the region of the converted DNA is amplified methylation- specifically by using a pair of methylation-specific primers.
- the presence of amplified DNA is preferably detected by using an oligonucleotide probe, more preferably a methylation-specific oligonucleotide probe.
- each amplificate generated with a pair of primers is detected with one or more (preferably one) of such probes, i.e. a primer pair has corresponding probes. Additional characteristics of the primers and probes are defined further below.
- the detection and/or quantification of DNA methylation at one or more CpG dinucleotides uses DNA conversion methods and detection methods for synthetic DNA derived from genomic DNA in the biological sample that has a sequence which is no longer identical to the genomic nucleic acid sequence.
- detecting and/or quantification of DNA methylation comprises a PCR using at least one methylation-specific primer.
- detecting and/or quantification of DNA methylation comprises a multiplex real-time PCR.
- the PCR is a multiplex real-time PCR comprising at least one of (1) and (2) and at least one of (3) to (5) of
- methylation-unspecific primers suitable for amplifying control DNA wherein the primers of (1) to (4) preferably are methylation-specific primers.
- the primers of (1) to (4) preferably are methylation-specific primers.
- it comprises:
- the multiplex real-time PCR comprises
- primers suitable for amplifying DNA within SEQ ID NO: 35 and optionally (5) methylation-unspecific primers suitable for amplifying control DNA, wherein the primers of (1) to (4) preferably are methylation-specific primers.
- the control DNA is preferably DNA of a housekeeping gene present in the sample, preferably ACTB (actin beta) DNA.
- the present invention relates to a method for detecting colorectal cancer (CRC) in a subject, comprising the steps
- the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject;
- the absence of detected methylated genomic DNA of both target regions indicates the absence of CRC.
- the cancer may be of any subtype and stage as defined below, i.e. the presence or absence of any subtype and/or stage can be detected.
- the method of the third aspect further comprises confirming CRC by using one or more further means for detecting CRC.
- the further means may be a cancer marker (or "biomarker") or a conventional (non-marker) detection means.
- the cancer marker can for example be a DNA methylation marker, a mutation marker (e.g. SNP), an antigen marker, a protein marker, a miRNA marker, a cancer specific metabolite, or an expression marker (e.g. RNA or protein expression).
- the conventional means can for example be a biopsy (e.g. visual biopsy examination with or without staining methods for example for protein or expression markers), an imaging technique (e.g.
- X-ray imaging CT scan
- nuclear imaging such as PET and SPECT
- ultrasound magnetic resonance imaging (MRI)
- thermography endoscopy
- digital mammography colonoscopy or virtual colonoscopy
- laparoscopy angiogram
- bone scan or sentinel node mapping for breast cancer or a physical, e.g. tactile examination.
- it is a biopsy or other means that removes and examines a solid tissue sample of the subject from colorectal tissue.
- the further means is suitable for detecting CRC.
- Preferred examples are colonoscopy, blood test for anemia and/or carcinoembryonic antigen (CEA), CR scan, MRI, ultrasound, X-ray, and PET scan.
- CEA carcinoembryonic antigen
- the “presence/absence of detected methylated genomic DNA” preferably means, here and below, the “presence/absence of a significant amount of methylated genomic DNA”.
- the present invention relates to a method for monitoring a subject suspected of having CRC, having an increased risk of developing colorectal cancer (CRC), or who has had CRC, comprising the steps:
- the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject;
- kits may comprise at least one PCT primer pair capable of specifically hybridizing with sequences in genomic region of or in a sequence in operative contact with mANKRD13B (ANKRD13B target region), and/or in the genomic region of or in a sequence in operative contact with mSeptin9 (Septin9 target region).
- the kit may comprise (i) a primer pair consisting of a forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 12 or 15, and a reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 13 or 14, respectively, wherein the primer pair is suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 12 or 15, respectively, and optionally (ii) a primer pair consisting of a forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 32 or 35, and a reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 33 or 34, respectively, wherein the primer
- the kit comprises: (1) a first primer pair consisting of a first forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 12, and a first reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 13, wherein the primer pair is suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 12, and/or (preferably and)
- a second primer pair consisting of a second forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 15, and a second reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 14, wherein the primer pair is suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 15.
- the kit further comprises
- a third primer pair consisting of a third forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 32, and a third reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 33, wherein the primer pair is suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 32, and/or (preferably and)
- a fourth primer pair consisting of a fourth forward primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 35, and a fourth reverse primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 34, wherein the primer pair is suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 35, and optionally (5) a fifth primer pair consisting of a methylation-unspecific fifth forward primer oligonucleotide and a methylation-unspecific fifth reverse primer oligonucleotide suitable for amplifying control DNA.
- the kit comprises at least one of (1) and (2) and at least one of (3) to (5) of the above.
- it comprises:
- the primer pair suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 12 is suitable for amplification of a sequence comprising 7 or more (preferably 8 or more) CpG dinucleotides of SEQ ID NO: 12, and/or the primer pair suitable for amplification of a sequence comprising one or more CpG dinucleotides of SEQ ID NO: 15 is suitable for amplification of a sequence comprising 7 or more (preferably 8 or more) CpG dinucleotides of SEQ ID NO: 15.
- Preferred mANKRD13B primers are as follows: a primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 12, 13, 14 or 15 comprises a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 7, 8, 9 or 10, respectively; preferably SEQ ID NO: 17, 18, 19 or 20, respectively; and more preferably SEQ ID NO: 2, 3, 4 or 5, respectively.
- Preferred mSEPTIN9 primers are as follows: a primer oligonucleotide comprising a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 32, 33, 34 or 35 comprises a sequence that is substantially identical to a stretch of contiguous nucleotides of SEQ ID NO: 37, 38, 39 or 40, respectively; preferably SEQ ID NO: 27, 28, 29 or 30, respectively; and more preferably SEQ ID NO: 22, 23, 24 or 25, respectively, or SEQ ID NO: 42, 43, 44 or 45, respectively.
- the kit comprises one or more (preferably one) probe oligonucleotides for each primer pair the kit comprises (“corresponding primer pair”).
- the probes are capable of binding to a strand of the amplificate the primer pair is suitable to generate in a PCR, preferably to a site between the primer binding sites or overlapping with one or both of the primer binding sites of the corresponding primer pair.
- a probe oligonucleotide is substantially identical to a stretch of contiguous nucleotides of the SEQ ID containing the sequence (or a sequence substantially identical thereto) of one of the primers of the corresponding primer pair.
- a primer or probe oligonucleotide comprises at least 1, 2 or 3 CpG dinucleotides, which makes it a methylation-specific oligonucleotide.
- This methylationspecific oligonucleotide is also specific for bisulfite-converted DNA, since it comprises at least one nucleotide derived from conversion of a C not in a CpG context (e.g. of a CpC, CpA or CpT dinucleotide) in SEQ ID NO: 11 (alternatively SEQ ID NO: 31 or a sub-sequence of either as recited above) or its complement into a T.
- a probe oligonucleotide preferably has one or more modifications selected from the group consisting of a detectable label and a quencher, and/or a length of 5-40 nucleotides. Further, a probe is preferably a methylation-specific oligonucleotide. A primer preferably has a length of 10-40 nucleotides.
- an oligonucleotide comprising a certain sequence preferably is an oligonucleotide having (or consisting of) that sequence.
- the present invention relates to the use of the method of the first aspect, the second aspect, the third aspect and/or the fourth aspect for the detection of colorectal cancer (CRC), or for monitoring a subject having an increased risk of developing a proliferative disorder, in particular CRC, suspected of having a proliferative disorder, in particular CRC or that has had a proliferative disorder, in particular CRC.
- CRC colorectal cancer
- the use is an in vitro use.
- the present invention relates to method of treating CRC in a subject, said method comprising (i) detecting the presence of CRC according to the method of the first, second or third aspect in the subject, or monitoring the subject according to the method of the fourth aspect, and (ii) treating CRC with a treatment regimen suitable for treating CRC.
- the present invention also relates to a method comprising the method of the first, the second, the third and/or the fourth aspect and subsequently referring a subject for such a cancer treatment. Definitions and further embodiments for the cancer treatment regimen are given below.
- detecting DNA methylation refers to at least qualitatively analysing for the presence or absence of methylated target DNA. Methylation is preferably determined at 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 15 or more or 30 or more CpG sites of the target DNA. Usually, the CpG sites analysed are co-methylated in cancer, such that also CpG sites of neighbouring DNA are methylated and can be analysed in addition or instead. "At least qualitatively” means that also a quantitative determination of methylated target DNA, if present, can be performed. Such a “determining the amount” can be performed as described herein.
- the presence or absence of amplified DNA can be detected by any means known in the art, e.g. autoradiography, silver staining or ethidium bromide staining.
- the presence or absence of DNA amplified in step (b) is detected by real-time PCR or by sequencing the amplified DNA.
- the presence of DNA amplified in step (b) is preferably detected by using a methylation-specific oligonucleotide which is a probe.
- the DNA is preferably amplified methylation-specifically using methylation-specific primers or alternatively a methylationspecific blocker with methylation-unspecific primers, the former being preferred.
- a detection by sequencing is preferably a detection by Next Generation Sequencing.
- the converted (e.g. bisulfite converted) methylated target DNA of the sample can be amplified bisulfite-specifically and can but must not necessarily be amplified methylation- specifically.
- the amplified sequences are sequenced and the presence of methylated template is deduced from the presence of sequences or sequence reads derived from the amplified converted target DNA.
- the absolute or relative amount of methylated target DNA may be determined by sequencing, preferably Next Generation Sequencing.
- the converted (e.g. bisulfite converted) target DNA can be amplified either methylation specifically, i.e. the target DNA is amplified only if it is methylated, or methylation-unspecifically, i.e. the target DNA is amplified whether or not it is methylated (in other words whether or not cytosines of the CpG sites are converted). This can be achieved by bisulfite-specific primers which either are or are not methylation-specific, respectively. Then, the amplified sequences are sequenced and subsequently counted.
- the number of determined sequences can be used to estimate the total number of methylated target molecules.
- the ratio of sequences derived from converted methylated DNA (identified in the sequences by CpG sites) and sequences derived from converted unmethylated DNA is calculated, resulting in a (relative) amount of methylated target DNA.
- NGS Next Generation Sequencing
- 2 nd or 3 rd generation sequencing refers to a sequencing the bases of a small fragment of DNA are sequentially identified from signals emitted as each fragment is re-synthesized from a DNA template strand.
- NGS extends this process across millions of reactions in a massively parallel fashion, rather than being limited to a single or a few DNA fragments. This advance enables rapid sequencing of the amplified DNA, with the latest instruments capable of producing hundreds of gigabases of data in a single sequencing run. See, e.g., Shendure and Ji, Nature Biotechnology 26, 1135- 1145 (2008) or Mardis, Annu Rev Genomics Hum Genet. 2008;9:387-402.
- Suitably NGS platforms are available commercially, e.g. the Roche 454 platform, the Roche 454 Junior platform, the Illumina HiSeq or MiSeq platforms, or the Life Technologies SOLiD 5500 or Ion Torrent platforms.
- Substantially fully methylated genomic DNA preferably is DNA, particularly genomic DNA, which has all or substantially all CpG sites methylated. "Substantially all” in this respect means at least 95%, 96%, 97%, 98%, 99%, 99.5% or 99.9%.
- the methylation of all or substantially all CpG sites is achieved by treating the DNA with a CpG methyltransferase in a manner that provides for the methylation of all or substantially all CpG sites.
- methylated or “hypermethylated” as used herein refers to a biochemical process involving the addition of a methyl group to the cytosine or adenine DNA nucleotides.
- CpG site or “CpG dinucleotide”, as used herein, refers to regions of DNA where a cytosine nucleotide occurs next to a guanine nucleotide in the linear sequence of bases along its length.
- CpG is shorthand for "C-phosphate-G", that is cytosine and guanine separated by only one phosphate; phosphate links any two nucleosides together in DNA.
- the "CpG” notation is used to distinguish this linear sequence from the CG base-pairing of cytosine and guanine. Cytosines in CpG dinucleotides can be methylated to form 5-methylcytosine.
- CpG site or "CpG site of genomic DNA” is also used with respect to the site of a former (unmethylated) CpG site in DNA in which the unmethylated C of the CpG site was converted to another as described herein (e.g. by bisulfite to uracil).
- methylation in the context of the present invention means hypermethylation.
- hypermethylation refers to an aberrant methylation pattern or status (i.e.
- nucleotides preferably C(s) of a CpG site(s)
- one or more nucleotides are methylated compared to the same genomic DNA from a non-cancer cell of the subject or a subject not suffering or having suffered from the cancer the subject is treated for, preferably any cancer (healthy control).
- it refers to an increased presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG dinucleotides within a healthy control DNA sample.
- Hypermethylation as a methylation status/pattern can be determined at one or more CpG site(s). If more than one CpG site is used, hypermethylation can be determined at each site separately or as an average of the CpG sites taken together. Alternatively, all assessed CpG sites must be methylated such that the requirement hypermethylation is fulfilled.
- target DNA refers to a genomic nucleotide sequence at a specific chromosomal location. In the context of the present invention, it is typically a genetic marker that is known to be methylated in the state of disease (for example in cancer cells vs. non-cancer cells).
- a genetic marker can be a coding or non-coding region of genomic DNA.
- region of the target DNA or "region of the converted DNA” as used herein refers to a part of the target DNA which is to be analysed.
- the region is at least 40, 50, 60, 70, 80, 90, 100, or 150 base pairs (bp) long and/or not longer than 150, 200, 300, 400, 500, or 1000 bp (e.g. 50-200, preferably 70-150 bp).
- it is a region comprising at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 CpG sites of the genomic DNA.
- At least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 are methylated in the target DNA.
- in operative contact refers to at least two nucleotide sequences in a functional relation, e.g. sequences having regulative functions, e.g. binding sites for enhancers, or by structural functionality e.g. by dense packaging of the DNA due to DNA methylation limiting functionality of the region.
- a sequence in operative contact with mANKRD13B can be comprised in the mANKRD13B target region or mANKRD13B genomic region.
- a sequence in operative contact with mSEPTIN9 can be comprised in the mSEPTIN9 target region or mSEPTIN9 genomic region.
- the target DNAs of the invention are given in Figure 1 and Table 3.
- sample refers to biological material obtained from a subject and comprises genomic DNA from all chromosomes, preferably genomic DNA covering the whole genome.
- the sample comprises, if a subject has cancer, cell-free genomic DNA (including the target DNA) from cancer cells, preferably circulating genomic DNA from cancer cells.
- sample comprising cell-free DNA from blood refers to a body fluid sample comprising cell-free DNA from blood. While in a preferred embodiment this sample is blood, the term also comprises other body fluids. For example, urine comprises cell- free DNA from blood.
- sample derived from a sample comprising cell-free DNA from blood refers to any sample that is derived by in vitro processing. For example, if the sample is blood, it is preferred that the sample derived therefore is plasma or serum.
- cell-free DNA refers to DNA that is not comprised within an intact cell in the respective body fluid which is the sample or from which the sample is derived, but which is freely circulating in the body liquid sample.
- Cell-free DNA usually is genomic DNA that is fragmented as described below.
- circulating tumor DNA refers to cell-free DNA, which originated from solid tumor tissue, metastases or circulating tumor cells and which usually comprises the target DNA.
- samples comprising the target DNA, especially extracellular target DNA, from cancer cells there is also target DNA from non-cancer cells which is not methylated contrary to the target DNA from cancer cells.
- said target DNA from non-cancer cells exceeds the amount from diseased cells or disease-related cells by at least 10-fold, at least 100- fold, at least 1,000-fold or at least 10,000-fold.
- the genomic DNA comprised in the sample is at least partially fragmented. "At least partially fragmented" means that at least the extracellular DNA, in particular at least the extracellular target DNA, from cancer cells, is fragmented.
- fragment genomic DNA refers to pieces of DNA of the genome of a cell, in particular a cancer cell, that are the result of a partial physical, chemical and/or biological break-up of the lengthy DNA into discrete fragments of shorter length.
- fragmentation means fragmentation of at least some of the genomic DNA, preferably the target DNA, into fragments shorter than 1,500 bp, 1,300 bp, 1,100 bp, 1,000 bp, 900 bp, 800 bp, 700 bp, 600 bp, 500 bp, 400 bp, 300 bp, 200 bp or 100 bp.
- “At least some” in this respect means at least 5%, 10%, 20%, 30%, 40%, 50% or 75%.
- genomic DNA refers to chromosomal DNA and is used to distinguish from coding DNA. As such, it includes exons, introns as well as regulatory sequences, in particular promoters, belonging to a gene.
- converting, in DNA, cytosine unmethylated in the 5-position to uracil or another base that does not hybridize to guanine refers to a process of chemically treating the DNA in such a way that all or substantially all of the unmethylated cytosine bases are converted to uracil bases, or another base which is dissimilar to cytosine in terms of base pairing behaviour, while the 5 -methylcytosine bases remain unchanged.
- the conversion of unmethylated, but not methylated, cytosine bases within the DNA sample is conducted with a converting agent.
- the term “converting agent” as used herein relates to a reagent capable of converting an unmethylated cytosine to uracil or to another base that is detectably dissimilar to cytosine in terms of hybridization properties.
- the converting agent is preferably a bisulfite such as disulfite, or hydrogen sulfite.
- the reaction is performed according to standard procedures (Frommer et al., 1992, Proc Natl Acad Sci USA 89: 1827-31; Olek, 1996, Nucleic Acids Res 24:5064-6; EP 1394172). It is also possible to conduct the conversion enzymatically, e.g by use of methylation specific cytidine deaminases.
- the converting agent is sodium bisulfite or bisulfite.
- annealing when used with respect to an oligonucleotide, is to be understood as a bond of an oligonucleotide to an at least substantially complementary sequence along the lines of the Watson-Crick base pairings in the sample DNA, forming a duplex structure, under moderate or stringent hybridization conditions.
- a single nucleotide or base it refers to the binding according to Watson-Crick base pairings, e.g. C-G, A-T and A-U.
- Stringent hybridization conditions involve hybridizing at 68°C in 5x SSC/5x Denhardt's solution/1.0% SDS, and washing in 0.2x SSC/0.1% SDS at room temperature, or involve the art-recognized equivalent thereof (e.g., conditions in which a hybridization is carried out at 60°C in 2.5 x SSC buffer, followed by several washing steps at 37°C in a low buffer concentration, and remains stable).
- Moderate conditions involve washing in 3x SSC at 42°C, or the art-recognized equivalent thereof.
- the parameters of salt concentration and temperature can be varied to achieve the optimal level of identity between the probe and the target nucleic acid.
- substantially complementary means that an oligonucleotide does not need to reflect the exact sequence of the template and can comprise mismatches and/or spacers as defined herein.
- substantially identical means that an oligonucleotide does not need to be 100% identical to a reference sequence but can comprise mismatches and/or spacers as defined herein. It is preferred that a substantially complementary or identical oligonucleotide comprises up to 1 to 3, i.e. 1, 2 or 3 mismatches and/or spacers, preferably up to one mismatch or spacer per oligonucleotide, such that the intended annealing does not fail due to the mismatches and/or spacers.
- an oligonucleotide does not comprise more than 1 mismatch per 10 nucleotides (rounded up if the first decimal is 5 or higher, otherwise rounded down) of the oligonucleotide.
- oligonucleotide refers to a linear oligomer of 5 to 50 ribonucleotides or preferably deoxyribonucleotides. Preferably, it has the structure of a singlestranded DNA fragment.
- primer oligonucleotide refers to a single-stranded oligonucleotide sequence substantially complementary to a nucleic acid sequence sought to be copied (the template) and serves as a starting point for synthesis of a primer extension product.
- a primer oligonucleotide is 10 to 40 nucleotides, more preferably 15-30 nucleotides and most preferably 19 to 25 nucleotides in length.
- blocker refers to a molecule which binds in a methylationspecific manner to a single-strand of DNA (i.e. it is specific for either the converted methylated or preferably for the converted unmethylated DNA or the amplified DNA derived from it) and prevents amplification of the DNA by binding to it, for example by preventing a primer to bind or by preventing primer extension where it binds.
- Non-limiting examples for blockers are sequence and/or methylation specific antibodies (blocking e.g. primer binding or the polymerase) and in particular blocker oligonucleotides.
- a "blocker oligonucleotide” may be a blocker that prevents the extension of the primer located upstream of the blocker oligonucleotide. It comprises nucleosides/nucleotides having a 1 backbone resistant to the 5' nuclease activity of the polymerase.
- PNA peptide nucleic acid
- LNA locked nucleic acid
- GAA glycol nucleic acid
- TAA threose nucleic acid
- BNA bridged nucleic acids
- NP N3 -P5' phosphoramidate oligomers
- MGB- linked oligonucleotides minor groove binder-linked-oligonucleotides
- PS phosphorothioate
- CrC4alkylphosphonate oligomers phosphoramidates
- P-phosphodiester oligonucleotides a-phosphodiester oligonucleotides or a combination thereof.
- it may be a non-extendable oligonucleotide with a binding site on the DNA single-strand that overlaps with the binding site of a primer oligonucleotide.
- the primer cannot bind and therefore the amplicon is not generated.
- the primer-binding site is accessible and the amplicon is generated.
- the affinity of the blocker is higher than the affinity of the primer for the DNA.
- a blocker oligonucleotide is typically 15 to 50, preferably 20 to 40 and more preferably 25 to 35 nucleotides long.
- a blocker oligonucleotide cannot by itself act as a primer (i.e. cannot be extended by a polymerase) due to a non-extensible 3' end.
- probe oligonucleotide refers to an oligonucleotide that is used to detect an amplicon by annealing to one strand of the amplicon, usually not where any of the primer oligonucleotides binds (i.e. not to a sequence segment of the one strand which overlaps with a sequence segment a primer oligonucleotide anneals to). Preferably it anneals without a mismatch or spacer, in other words it is preferably complementary to one strand of the amplicon.
- a probe oligonucleotide is preferably 5-40 nucleotides, more preferably 10 to 25 and most preferably 20 to 25 nucleotides long.
- the probe is linked, preferably covalently linked, to at least one detectable label which allows detection of the amplicon and/or at least one quencher which allows quenching the signal of a (preferably the) detectable label.
- detectable label does not exhibit any particular limitation.
- the detectable label may be selected from the group consisting of radioactive labels, luminescent labels, fluorescent dyes, compounds having an enzymatic activity, magnetic labels, antigens, and compounds having a high binding affinity for a detectable label.
- fluorescent dyes linked to a probe may serve as a detection label, e.g. in a real-time PCR.
- Suitable radioactive markers are P-32, S-35, 1-125, and H-3
- suitable luminescent markers are chemiluminescent compounds, preferably luminol
- suitable fluorescent markers are preferably dansyl chloride, fluorcein-5-isothiocyanate, and 4-fluor-7-nitrobenz-2-aza-l,3 diazole, in particular 6-Carboxyfluorescein (FAM), 6-Hexachlorofluorescein (HEX), 5(6)- Carboxytetramethylrhodamine (TAMRA), 5(6)-Carboxy-X-Rhodamine (ROX), Cyanin-5- Fluorophor (Cy5) and derivates thereof;
- suitable enzyme markers are horseradish peroxidase, alkaline phosphatase, a-galactosidase, acetylcholinesterase, or biotin.
- a probe may also be linked to a quencher.
- quencher refers to a molecule that deactivates or modulates the signal of a corresponding detectable label, e.g. by energy transfer, electron transfer, or by a chemical mechanism as defined by IUPAC (see compendium of chemical terminology 2 nd ed. 1997).
- the quencher modulates the light emission of a detectable label that is a fluorescent dye.
- a quencher may itself be a fluorescent molecule that emits fluorescence at a characteristic wavelength distinct from the label whose fluorescence it is quenching. In other cases, the quencher does not itself fluoresce (i.e., the quencher is a "dark acceptor").
- quenchers include, for example, dabcyl, methyl red, the QSY diarylrhodamine dyes, and the like.
- the term "covering a CpG site" as used herein with respect to an oligonucleotide refers to the oligonucleotide annealing to a region of DNA comprising this CpG site, before or after conversion of the C of the CpG site (i.e. the CpG site of the corresponding genomic DNA when it is referred to a bisulfite converted sequence).
- the annealing may, with respect to the CpG site (or former CpG site if the C was converted), be methylation-specific or methylation- unspecific as described below.
- methylation-specific refers generally to the dependency from the presence or absence of CpG methylation.
- methylation-specific as used herein with respect to an oligonucleotide means that the oligonucleotide does or does not anneal to a single-strand of DNA (in which cytosine unmethylated in the 5-position has been converted to uracil or another base that does not hybridize to guanine, and where it comprises at least one CpG site before conversion) without a mismatch regarding the position of the C in the at least one CpG site, depending on whether the C of the at least one CpG sites was unmethylated or methylated prior to the conversion, i.e. on whether the C has been converted or not.
- the methylation-specificity can be either positive (the oligonucleotide anneals without said mismatch if the C was not converted) or negative (the oligonucleotide anneals without said mismatch if the C was converted).
- it preferably covers at least 2, 3, 4, 5 or 6 and preferably 3 to 6 CpG sites before conversion or, if used as a primer, covers at least one CpG site in a position where within a DNA amplification reaction a mismatch would block the oligonucleotide’s extension at its 3’ prime end.
- methylation-unspecific refers generally to the independency from the presence or absence of CpG methylation.
- oligonucleotide it means that the oligonucleotide does anneal to a single-strand of DNA (in which cytosine unmethylated in the 5-position has been converted to uracil or another base that does not hybridize to guanine, and where it may or may not comprise at least one CpG site before conversion) irrespective of whether the C of the at least one CpG site was unmethylated or methylated prior to the conversion, i.e. of whether the C has been converted or not.
- the region of the singlestrand of DNA the oligonucleotide anneals to does not comprise any CpG sites (before and after conversion) and the oligonuclotide is methylation-unspecific solely for this reason. While a methylation-unspecific oligonucleotide may cover one or more CpG dinucleotides, it does so with mismatches and/ or spacers.
- mismatch refers to base-pair mismatch in DNA, more specifically a base-pair that is unable to form normal base-pairing interactions (i.e., other than “A” with “T” or “U”, or “G” with “C”).
- An oligonucleotide i.e. a probe, blocker or primer, may also cover an SNP site with an SNP -unspecific mismatch or with a spacer.
- SNP site refers to the site of an "SNP", i.e. a single nucleotide polymorphism at a particular position in the, preferably human, genome that varies among a population of individuals.
- SNPs of the genomic DNA the present application refers to are known in the art and can be found in online databases such as dbSNP of NCBI (http://www.ncbi.nlm.nih.gov/snp).
- SNP -unspecific mismatch refers to a mismatch that is due to a nucleotide substitution that does not substitute the nucleotide with one that corresponds to a nucleotide that is found at the same position in the genome of another individual of the same population.
- spacer refers to a non-nucleotide spacer molecule, which increases, when joining two nucleotides, the distance between the two nucleotides to about the distance of one nucleotide (i.e. the distance the two nucleotides would be apart if they were joined by a third nucleotide).
- spacers are Inosine, d-Uracil, halogenated bases, Amino-dT, C3, Cl 2, Spacer 9, Spacer 18, and dSpacer
- the phrase "method for detecting the presence or absence of cancer in a subject” as used herein refers to a determination whether the subject has cancer or not. As will be understood by persons skilled in the art, such assessment normally may not be correct for 100% of the subjects, although it preferably is correct. The term, however, requires that a correct indication can be made for a statistically significant part of the subjects. Whether a part is statistically significant can be determined easily by the person skilled in the art using several well known statistical evaluation tools, for example, determination of confidence intervals, determination of p values, Student's t-test, Mann- Whitney test, etc. Details are provided in Dowdy and Wearden, Statistics for Research, John Wiley & Sons, New York 1983. The preferred confidence intervals are at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%. The p values are preferably 0.05, 0.01, or 0.005.
- risk thereof' with respect to the method for detecting the presence or absence of cancer in a subject refers to the detection of an increased risk of developing the cancer or an increased probability of having it. If the subject already has an increased risk in view of one or more risk factors that can be attributed to it (as defined herein), the 'risk thereof refers to a risk that is increased further, i.e. that is in addition to the risk due to those risk factors.
- proliferative disorder or “proliferative disease” as used herein in the broadest sense an refers to a tumor or cancer.
- the proliferative disorder or proliferative disease includes colon cancer or colorectal cancer, as described herein.
- the proliferative disorder may also be endometriosis.
- colon cancer or “colorectal cancer” is used in the broadest sense and refers to (1) all stages and all forms of cancer arising from epithelial cells of the large intestine and/or rectum and/or (2) all stages and all forms of cancer affecting the lining of the large intestine and/or rectum. It includes the subtypes adenocarcinoma, gastrointestinal carcinoid tumor, gastrointestinal stromal tumor, primary colorectal lymphoma, leiomyosarcoma, melanoma or squamous cell carcinoma, each originating from the colon (colon cancer) or the rectum (rectal cancer). In the staging systems used for classification of colorectal cancer, the colon and rectum are treated as one organ.
- stage 0 Tis, NO, M0
- stage I T1-T2, NO, M0
- stage IIA T3, NO, M0
- stage IIB T4a, NO, M0
- stage IIC T4b, NO, M0
- stage IIIA T1-T2, Nl, M0; or Tl, N2a, M0
- stage IIIB T3-T4a, Nl, M0; T2-T3, N2a, M0; or T1-T2, N2b, M0
- stage IIIC T4a, N2a, M0; T3-T4a, N2b, M0; or T4b N1-N2 M0
- stage IVA any T, any N, Mia
- stage IVA any T, any N, Mlb
- TNM classification is a staging system for malignant cancer.
- TNM classification refers to the 6 th edition of the TNM stage grouping as defined in Sobin et al. (International Union against Cancer (UICC), TNM Classification of Malignant tumors, 6 th ed. New York; Springer, 2002, pp. 191-203).
- disease includes, but is not limited to a tumor disease or cancer disease, as described herein, or to endometriosis, diseases of the central nerve system (CNS) or cardiovascular disease.
- Diseases of the central nerve system include, but are not limited to dementia caused by Alzheimer and/or other diseases.
- Cardiovascular diseases include, but are not limited to, cardiovascular diseases caused for example by diagnosed or undiagnosed (thus unknown to the patient) infectious diseases, in particular by inflammatory infections, that altered the cells for example in blood vessels or heart tissue, which thus may increase the risk for heart attack and/or stroke.
- diseased cell refers to a cell affected by the tumor disease or cancer disease, as described herein, or to cells altered in comparison to healthy cells.
- a “diseased cell” or “disease related cell”, as used herein, might cause diseases, for example but not limited to endometriosis, diseases of the central nerve system (CNS) or cardiovascular disease.
- Diseases of the central nerve system include, but are not limited to dementia caused by Alzheimer and/or other diseases.
- Cardiovascular diseases include, but are not limited to, cardiovascular diseases caused for example by diagnosed or undiagnosed (thus unknown to the patient) infectious diseases, in particular by inflammatory infections, that altered the cells for example in blood vessels or heart tissue, which thus may increase the risk for heart attack and/or stroke.
- the diseased cell or disease related cell may be obtained from a patient’s sample, for example a biopsy.
- the diseased cell or disease related cell includes a colon cancer cell or a colorectal cancer cell, as described herein.
- cancer cell refers to a cell that acquires a characteristic set of functional capabilities during their development, particularly one or more of the following: the ability to evade apoptosis, self-sufficiency in growth signals, insensitivity to anti-growth signals, tissue invasion/metastasis, significant growth potential, and/or sustained angiogenesis.
- the term is meant to encompass both pre-malignant and malignant cancer cells.
- tumor DNA or "tumor DNA of a cancer cell” as used herein refers simply to DNA of a cancer cell. It is used only to distinguish DNA of a cancer cell more clearly from other DNA referred to herein. Thus, unless ambiguities are introduced, the term “DNA of a cancer cell” may be used instead.
- multivariate model refers to any algorithm or numerical method, that uses simultaneous observation and analysis of more than one outcome variable to provide a result that enables classification of subgroups defined for the data in question.
- sum of positive marker assay components refers to counting the number of all marker components expressed as 0 (negative) or 1 (positive) due to binary decisions.
- the sum of the positive marker components is based on qualitative readouts (signal, no signal) or individual thresholds for positivity used on quantitative readouts for individual marker assay components.
- components can have specific weights.
- Principal component analysis refers to techniques for analyzing datasets containing a high number of dimensions/features per observation, reducing the dimensionality of a dataset while preserving the maximum amount of information, to identify subgroups.
- logistic regression analysis refers to modeling the probability of an event (e.g. a positive diagnosis) by having the log-odds for the event be a linear combination of one or more independent variables, which can then be used for defining a binary classifier based on a cutoff, (see Cramer, JS, 2002, “The Origins of Logistic Regression”, Tinbergen Institute Discussion Paper).
- noise neighbor analysis refers to the k-nearest neighbors algorithm, a non-parametric supervised learning method used for classification (see Evelyn Fix, ’’Discriminatory Analysis: Nonparametric Discrimination Consistency Properties” 1951, Report Number 4, Project Number 21-49-004, USAF School of Aviation Medicine, Randolph Fields, Texas).
- support vector machine refers to supervised learning models with associated learning algorithms that analyze data for classification, including non-linear classification using mapping input into high-dimensional feature spaces (see Cramer JS, 2002, “The Origins of Logistic Regression”, Tinbergen Institute Discussion Paper).
- decision tree refers to algorithms that contain conditional control statements (decision rules) providing tree-like models of decisions and their possible consequences (classification trees) obtained by supervised learning approaches (see e.g. Rokach, Lior; Maimon, O. (2014) “Data mining with decision trees: theory and applications”, 2nd Edition).
- neural network model refers to models based on computing systems that use connected nodes (artificial neurons), which model the neurons in a biological brain, generating an output based on a non-linear function of the sum of its inputs, that can be trained for classification (see e.g. Hardesty, Larry (14 April 2017) "Explained: Neural networks”. MIT News Office. Retrieved 2 June 2022.).
- subject refers to an individual, such as a mammal, including a non-human primate (e.g. chimpanzees and other apes and monkey species). Preferably it is a human.
- the term does not denote a particular age or sex.
- the subject can be any subject of which the methylation status within genomic DNA having a sequence comprised in SEQ ID NO: 11 and/or within genomic DNA having a sequence comprised in SEQ ID NO: 31, in particular from a sample comprising cell-free DNA from blood or a sample derived therefrom of a subject, is not known.
- the term "subject” may have different limitations.
- the method is to be used for detecting cancer or screening subjects for cancer, the subject is not known to have cancer, i.e. it may or may not have cancer.
- the subject preferably is at risk or increased risk or is suspected to have cancer.
- At risk or increased risk means that one or more risk factors can be attributed to the subject), preferably as defined by the American Cancer Society generally or for the respective cancer.
- amplifying or “generating an amplicon” as used herein refers to amplifying a defined region of a double-stranded or single-stranded DNA template, typically with a polymerase chain reaction (PCR).
- An "amplicon” is a double-stranded fragment of DNA according to said defined region.
- primers refers to two oligonucleotides, namely a forward and a reverse primer, that have, with respect to a double-stranded nucleic acid molecule, sequences that are (at least substantially) identical to one strand each such that they each anneal to the complementary strand of the strand they are (at least substantially) identical to.
- forward primer refers to the primer which is (at least substantially) identical to the forward strand (as defined by the direction of the genomic reference sequence) of the double-stranded nucleic acid molecule
- reverse primer refers to the primer which is (at least substantially) identical to the reverse complementary strand of the forward strand in the double-stranded nucleic acid molecule.
- the distance between the sites where forward and reverse primer anneal to their template depends on the length of the amplicon the primers are supposed to allow generating. Typically, with respect to the present invention it is between 40 and 1000 bp, preferably between 40 and 200 bp and more preferably between 60 and 150 bp. Preferred amplicon sizes are specified herein.
- pair of primers suitable for generating an amplicon from a single strand of genomic DNA in which cytosine unmethylated in the 5-position has been converted to uracil or another base that does not hybridize to guanine refers to a pair of primers which takes into account a base change from unmethylated cytosines to uracil, which basepairs with adenine and is therefore replaced with thymine in the amplicon.
- diagnosis refers to a determination whether a subject does or does not have cancer, and preferably also which cancer.
- a diagnosis by methylation analysis of the target DNA as described herein may be supplemented with a further means as described herein to and/or narrow down the cancer detected with the methylation analysis.
- the diagnosis normally may not be correct for 100% of the subjects, although it preferably is correct. The term, however, requires that a correct diagnosis can be made for a statistically significant part of the subjects. For a description of statistic significance and suitable confidence intervals and p values, see above.
- monitoring refers to the repeated detection of methylated DNA or during a certain period of time, typically during at least 1 month, 6 months, 1 year, 2 years, 3 years, 5 years, 10 years, or any other period of time.
- Methylation may be detected based on the amount of methylated target DNA, particular based on changes in the amount in any type of periodical time segment, determined e.g., daily or at least once per week, month, or year.
- the result of the monitoring normally may not be correct for 100% of the subjects, although it preferably is correct. The term, however, requires that a correct result of the monitoring can be achieved for a statistically significant part of the subjects. For a description of statistic significance and suitable confidence intervals and p values, see above.
- screening of subjects refers to the use of the method of the first aspect with samples of a population of subjects.
- the subjects have an increased risk for or are suspected of having CRC.
- one or more of the following risk factors recited herein can be attributed to the subjects of the population.
- the same one or more risk factors can be attributed to all subjects of the population.
- the population may be characterized by a certain minimal age (e.g. 50 or older).
- the term "screening" does not necessarily indicate a definite diagnosis, but is intended to indicate an increased possibility of the presence or of the absence of CRC. An indicated increased possibility is preferably confirmed and/or narrowed down using a further means as described herein.
- the screening result normally may not be correct for 100% of the subjects, although it preferably is correct.
- the term requires that a correct screening result can be achieved for a statistically significant part of the subjects. For a description of statistic significance and suitable confidence intervals and p values, see above.
- treatment refers to a therapeutic treatment, wherein the goal is to reduce progression of cancer.
- beneficial or desired clinical results include, but are not limited to, release of symptoms, reduction of the length of the disease, stabilized pathological state (specifically not deteriorated), slowing down of the disease’ s progression, improving the pathological state and/or remission (both partial and total), preferably detectable.
- a successful treatment does not necessarily mean cure, but it can also mean a prolonged survival, compared to the expected survival if the treatment is not applied.
- the treatment is a first line treatment, i.e. the cancer was not treated previously.
- Cancer treatment involves a treatment regimen.
- treatment regimen refers to how the subject is treated in view of the disease and available procedures and medication.
- cancer treatment regimes are chemotherapy, surgery and/or irradiation or combinations thereof.
- the early detection of cancer the present invention enables allows in particular for a surgical treatment, especially for a curative resection.
- treatment regimen refers to administering one or more anti-cancer agents or therapies as defined below.
- anticancer agent or therapy refers to chemical, physical or biological agents or therapies, or surgery, including combinations thereof, with antiproliferative, antioncogenic and/or carcinostatic properties.
- a chemical anti-cancer agent or therapy may be selected from the group consisting of alkylating agents, antimetabolites, plant alkaloyds and terpenoids and topoisomerase inhibitors.
- the alykylating agents are platinum-based compounds.
- the platinum-based compounds are selected from the group consisting of cisplatin, oxaliplatin, eptaplatin, lobaplatin, nedaplatin, carboplatin, iproplatin, tetraplatin, lobaplatin, DCP, PLD- 147, JM1 18, JM216, JM335, and satraplatin.
- a physical anti-cancer agent or therapy may be selected from the group consisting of radiation therapy (e.g. curative radiotherapy, adjuvant radiotherapy, palliative radiotherapy, teleradiotherapy, brachytherapy or metabolic radiotherapy), phototherapy (using, e.g. hematoporphoryn or photofrin II), and hyperthermia.
- Surgery may be a curative resection, palliative surgery, preventive surgery or cytoreductive surgery. Typically, it involves an excision, e.g. intracapsular excision, marginal, extensive excision or radical excision as described in Baron and Valin (Rec. Med. Vet, Special Cane. 1990; 11(166):999-1007).
- a biological anti-cancer agent or therapy may be selected from the group consisting of antibodies (e.g. antibodies stimulating an immune response destroying cancer cells such as retuximab or alemtuzubab, antibodies stimulating an immune response by binding to receptors of immune cells an inhibiting signals that prevent the immune cell to attack "own" cells, such as ipilimumab, antibodies interfering with the action of proteins necessary for tumor growth such as bevacizumab, cetuximab or panitumumab, or antibodies conjugated to a drug, preferably a cell-killing substance like a toxin, chemotherapeutic or radioactive molecule, such as Y-ibritumomab tiuxetan, I-tositumomab or ado-trastuzumab emtansine), cytokines (e.g.
- interferons or interleukins such as INF-alpha and IL-2
- vaccines e.g. vaccines comprising cancer-associated antigens, such as sipuleucel-T
- oncolytic viruses e.g. naturally oncolytic viruses such as reovirus, Newcastle disease virus or mumps virus, or viruses genetically engineered viruses such as measles virus, adenovirus, vaccinia virus or herpes virus preferentially targeting cells carrying cancer-associated antigens such as EGFR or HER-2
- gene therapy agents e.g.
- DNA or RNA replacing an altered tumor suppressor blocking the expression of an oncogene, improving a subject's immune system, making cancer cells more sensitive to chemotherapy, radiotherapy or other treatments, inducing cellular suicide or conferring an anti-angiogenic effect) and adoptive T cells (e.g. subject-harvested tumorinvading T-cells selected for antitumor activity, or subject-harvested T-cells genetically modified to recognize a cancer-associated antigen) .
- adoptive T cells e.g. subject-harvested tumorinvading T-cells selected for antitumor activity, or subject-harvested T-cells genetically modified to recognize a cancer-associated antigen
- the one or more anti-cancer drugs is/are selected from the group consisting of Abiraterone Acetate, ABVD, ABVE, ABVE-PC, AC, AC-T, ADE, Ado- Trastuzumab Emtansine, Afatinib Dimaleate, Aldesleukin, Alemtuzumab, Aminolevulinic Acid, Anastrozole, Aprepitant, Arsenic Trioxide, Asparaginase Erwinia chrysanthemi, Axitinib, Azacitidine, BEACOPP, Belinostat, Bendamustine Hydrochloride, BEP, Bevacizumab, Bexarotene, Bicalutamide, Bleomycin, Bortezomib, Bosutinib, Brentuximab Vedotin, Busulfan, Cabazitaxel, Cabozantinib-S-Malate, CAFCapecitabine, CAPO
- a method for compensating age-related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides of said genomic region in a subpopulation of methylated DNA molecules, from diseased or disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- a method for compensating age-related effects on the detection and/or quantification of DNA methylation at one or more CpG dinucleotides in the genomic region of at least one gene for which DNA methylation or the absence thereof, is indicative of the presence or absence of a disease or of a disease state by measuring DNA methylation at said one or more CpG dinucleotides in said genomic region in at least a subpopulation of DNA molecules derived from disease related cells in a biological sample of an individual by using the measured DNA methylation and the age of the individual in a multivariate model.
- the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject;
- a method for monitoring a subject suspected of having CRC, having an increased risk of developing colorectal cancer (CRC), or who has had CRC comprising the steps:
- the multivariate model is capable of discriminating non-CRC subjects from subjects suffering from CRC, based upon the presence and/or amount of DNA methylation and the age of the subject;
- mANKRD13B gene wherein the genomic DNA polynucleotide consists of genomic DNA having a sequence comprised in SEQ ID NO: 11
- mSEPT9 gene wherein the genomic DNA polynucleotide consists of genomic DNA having a sequence comprised in SEQ ID NO: 31.
- the method according to any one of the items 1 to 10, wherein the detection and/or quantification of DNA methylation at one or more CpG dinucleotides uses DNA conversion methods and detection methods for synthetic DNA derived from genomic DNA in the biological sample that has a sequence which is no longer identical to the genomic nucleic acid sequence.
- the method of any one of items 1 to 11, wherein detecting DNA methylation comprises a PCR using at least one methylation-specific primer.
- the method of any one of items 1 to 12, wherein detecting DNA methylation comprises a multiplex real-time PCR comprising
- primers suitable for amplifying DNA within SEQ ID NO: 35 and optionally (5) methylation-unspecific primers suitable for amplifying control DNA, wherein the primers of (1) to (4) preferably are methylation-specific primers.
- the method of any one of items 1 to 14, wherein the biological sample is a colon or rectum tissue sample or a liquid biopsy, preferably a blood sample, a blood-derived sample, a urine sample, a urine-derived sample, a saliva sample, or a saliva-derived sample.
- the multivariate model is selected from one or more of the following: a sum of positive marker assay components, a principal component analysis, a logistic regression analysis, a nearest neighbor analysis, a support vector machine, a decision tree, and a neural network model.
- a method of treating CRC in a subject comprising
- decision trees were used to discriminate of CRC and control samples. Decision trees were built with different weights for the two groups using the rpart function in R (package :rpart) to receive decision trees providing a series of sensitivity/specificity pairs to find a sensitivity close to specificity of 0.9 and to simulate a ROC curve for comparison with the other methods.
- the individual DNA methylation marker measurements by Realtime-PCR as assessed by minimum Cts of duplicates (min Cts) for each of the three MSP marker assays lead to AUCs between 0.7 and 0.8 (see Figure 4 to 6) and to Sensitivities from 0.6 to 0.65, and combined by logistic regression to an AUC of 0.85 and Sensitivity of 0.71 at Specificity of 0.9 (see Figure 8).
- Age standalone lead to AUC of 0.7 and to a sensitivity of 0.35 at Specificity of 0.9 (see Figure 7).
- the discriminatory power of the biomarkers mANKRD13B and mSeptin9 for diagnosis of colorectal cancer by a DNA methylation pattern can be significantly improved if additionally the age of the subjects is considered in a multivariate model.
- the choice of multivariate model did not affect the results: improvement of discriminatory power was obtained in three independent methods: logistic regression, sum of positive calls and decision trees.
- Table 1 Number of samples by gender for colorectal cancer (CRC) and for controls.
- Table 2 Number of samples by gender for different colorectal cancer (CRC) stages, and for controls from individuals with no evidence of disease (NED) and with polyps.
- Table 3 Sequence IDs, abbreviations, names and associated regions in the human genome (GRCh38 build): genomic reference sequences and derived bisulfite converted sequences.
- rc means reverse complement
- C to T or G to A means converted by bisulfite conversion of cytosines outside of CpG context into uracil and replaced by thymidine in subsequent amplification
- bisl refers to the bisulfite converted forward strand (as recited in the SEQ ID of the respective genomic DNA) and bis2 to the bisulfite converted reverse complement strand of the forward strand (reverse complement of the SEQ ID of the respective genomic DNA), whereby the direction of the strand is defined by the direction of the genomic reference sequence as e.g. obtained from the genome build (HCGR38).
- Figure 1 For a mapping of the sequences, see Figure 1.
- Table 5 Performance for single markers (row 1 to 6) and of different marker combinations by logistic regression analysis (row 7 to 10), characterized by Sensitivity at Specificity of 0.9 and by area under the curve (AUC) of receiver operating characteristic (ROC) analysis.
- Table 6 Measurements for all biomarkers age and derived marker positivity count in all 349 individuals from Example 1.
- the first column contains the diagnostic group.
- Column 2-4 contain the three blood plasma derived minimum Ct values from duplicates of methylation specific Realtime PCR quadruplex assays: ANKRD13b assays on the bisulfite converted sense strand (ANB1), ANKRD13b assay on the reverse complement strand (ANB2), Septin9 assay on the reverse complement strand (S9B2).
- Column 5 contains the age of the individuals.
- Column 6 provides the sum of positive (measurable) DNA methylation marker realtime PCR curves from the six reaction (triplexes measured in duplicate) .
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Hospice & Palliative Care (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MX2025004113A MX2025004113A (en) | 2022-10-04 | 2025-04-04 | Improved methods for detecting cancer |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22199681 | 2022-10-04 | ||
| EP22199681.2 | 2022-10-04 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024074558A1 true WO2024074558A1 (en) | 2024-04-11 |
Family
ID=83598342
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/EP2023/077458 Ceased WO2024074558A1 (en) | 2022-10-04 | 2023-10-04 | Improved methods for detecting cancer |
Country Status (2)
| Country | Link |
|---|---|
| MX (1) | MX2025004113A (en) |
| WO (1) | WO2024074558A1 (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1394172A1 (en) | 2002-08-29 | 2004-03-03 | Boehringer Mannheim Gmbh | Improved method for bisulfite treatment |
| WO2006113466A2 (en) | 2005-04-15 | 2006-10-26 | Epigenomics Ag. | Methods and nucleic acids for analyses of cellular proliferative disorders |
| WO2017129716A1 (en) | 2016-01-29 | 2017-08-03 | Epigenomics Ag | Methods for detecting cpg methylation of tumor-derived dna in blood samples |
| WO2021122799A1 (en) * | 2019-12-16 | 2021-06-24 | Epigenomics Ag | Methods for detecting colorectal cancer |
-
2023
- 2023-10-04 WO PCT/EP2023/077458 patent/WO2024074558A1/en not_active Ceased
-
2025
- 2025-04-04 MX MX2025004113A patent/MX2025004113A/en unknown
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1394172A1 (en) | 2002-08-29 | 2004-03-03 | Boehringer Mannheim Gmbh | Improved method for bisulfite treatment |
| WO2006113466A2 (en) | 2005-04-15 | 2006-10-26 | Epigenomics Ag. | Methods and nucleic acids for analyses of cellular proliferative disorders |
| WO2017129716A1 (en) | 2016-01-29 | 2017-08-03 | Epigenomics Ag | Methods for detecting cpg methylation of tumor-derived dna in blood samples |
| WO2021122799A1 (en) * | 2019-12-16 | 2021-06-24 | Epigenomics Ag | Methods for detecting colorectal cancer |
Non-Patent Citations (16)
| Title |
|---|
| AUSUBEL ET AL.: "Current Protocols in Molecular Biology", 1995, JOHN WILEY & SONS |
| BARONVALIN, REC. MED. VET, SPECIAL CANC., vol. 11, no. 166, 1990, pages 999 - 1007 |
| CRAMER, JS, THE ORIGINS OF LOGISTIC REGRESSION, 2002 |
| DOWDYWEARDEN: "Statistics for Research", 1983, JOHN WILEY & SONS |
| FROMMER ET AL., PROC NATL ACAD SCI USA, vol. 89, 1992, pages 1827 - 31 |
| HARDESTYLARRY, EXPLAINED: NEURAL NETWORKS, 14 April 2017 (2017-04-14) |
| MAI-BRITT W. OMTOFT: "Abstract 4922: Chronic diseases and age >65 years may cause false positive results in colorectal cancer screening with the blood-based Sept9 methylation assay | Cancer Research | American Association for Cancer Research", 1 July 2016 (2016-07-01), XP093029601, Retrieved from the Internet <URL:https://aacrjournals.org/cancerres/article/76/14_Supplement/4922/612170/Abstract-4922-Chronic-diseases-and-age-gt-65-years> [retrieved on 20230307] * |
| MARDIS, ANNU REV GENOMICS HUM GENET., vol. 9, 2008, pages 387 - 402 |
| OLEK, NUCLEIC ACIDS RES, vol. 24, 1996, pages 5064 - 6 |
| PANJARIAN S.: "Abstract 1071: Age-dependent DNA methylation in normal breast epithelium and breast cancer", vol. 15, no. 15 Suppl., 1 August 2015 (2015-08-01), pages 1 - 4, XP055956536, Retrieved from the Internet <URL:https://aacrjournals.org/cancerres/article/75/15_Supplement/1071/599875/Abstract-1071-Age-dependent-DNA-methylation-in> DOI: 10.1158/1538-7445.AM2015-1071 * |
| ROKACH, LIORMAIMON, O.: "Data mining with decision trees: theory and applications", 2014 |
| SAMBROOK ET AL.: "Molecular Cloning, A Laboratory Manual", 1989, COLD SPRING HARBOR PRESS |
| SHENDUREJI, NATURE BIOTECHNOLOGY, vol. 26, 2008, pages 1135 - 1145 |
| SOBIN ET AL.: "TNM Classification of Malignant tumors", 2002, SPRINGER, article "International Union Against Cancer (UICC", pages: 191 - 203 |
| YUASA YASUHITO: "DNA methylation in cancer and ageing", MECHANISMS OF AGEING AND DEVELOPMENT., vol. 123, no. 12, 1 November 2002 (2002-11-01), CH, pages 1649 - 1654, XP093029558, ISSN: 0047-6374, DOI: 10.1016/S0047-6374(02)00100-8 * |
| ZHU TONGTONG ET AL: "CancerClock: A DNA Methylation Age Predictor to Identify and Characterize Aging Clock in Pan-Cancer", FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, vol. 7, 4 December 2019 (2019-12-04), XP093029566, DOI: 10.3389/fbioe.2019.00388 * |
Also Published As
| Publication number | Publication date |
|---|---|
| MX2025004113A (en) | 2025-07-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20250333796A1 (en) | Methods for detecting cpg methylation of tumor-derived dna in blood samples | |
| ES2397672T3 (en) | Diagnostic method of lung cancers using genetic expression profiles in peripheral blood mononuclear cells | |
| EP2794907B1 (en) | Methods and materials for assessing loss of heterozygosity | |
| US20200299779A1 (en) | Methods for detecting head and neck cancer | |
| US20220403473A1 (en) | Methods for detecting colorectal cancer | |
| US20250263789A1 (en) | Methods of Analyzing Cell Free Nucleic Acids and Applications Thereof | |
| US20240093302A1 (en) | Non-invasive cancer detection based on dna methylation changes | |
| WO2024074558A1 (en) | Improved methods for detecting cancer | |
| US20250129427A1 (en) | Improved methods for detecting colorectal cancer | |
| CN120584199A (en) | Composition for predicting the risk of developing liver cancer | |
| US20230287509A1 (en) | Methods for detecting liver cancer | |
| HK40052835A (en) | Methods for detecting cpg methylation of tumor-derived dna in blood samples | |
| JP2025009303A (en) | Method for predicting efficacy of platinum-based anticancer drugs | |
| KR20240011100A (en) | Probes for Detecting DNA Methylation and Methods for Diagnosing Colon Cancer Using the Same | |
| Vidova et al. | 144P Beyond markers: Functional assays and transcriptomics reveal CSC populations in soft tissue sarcomas | |
| WO2023085932A1 (en) | Prediction of response following folfirinox treatment in cancer patients | |
| JP2024527370A (en) | Circulating microRNA signatures for pancreatic cancer | |
| HK40000100B (en) | Methods for detecting cpg methylation of tumor-derived dna in blood samples | |
| WO2018202666A1 (en) | Cpg-site methylation markers in colorectal cancer |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23783864 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2025/004113 Country of ref document: MX |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWP | Wipo information: published in national office |
Ref document number: MX/A/2025/004113 Country of ref document: MX |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 23783864 Country of ref document: EP Kind code of ref document: A1 |