WO2025127927A1 - Procédé de séquençage de molécules d'acide nucléique et procédés associés - Google Patents
Procédé de séquençage de molécules d'acide nucléique et procédés associés Download PDFInfo
- Publication number
- WO2025127927A1 WO2025127927A1 PCT/NL2024/050664 NL2024050664W WO2025127927A1 WO 2025127927 A1 WO2025127927 A1 WO 2025127927A1 NL 2024050664 W NL2024050664 W NL 2024050664W WO 2025127927 A1 WO2025127927 A1 WO 2025127927A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- dna
- dna molecules
- sample
- sequencing
- molecules
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Definitions
- the present invention relates to the field of molecular diagnostics, more specifically to methods and kits for barcoding, amplifying, and sequencing a plurality of nucleic acid molecules and the use thereof for diagnostic purposes, particularly in the field of cancer.
- Circulating free DNA often released into the bloodstream by apoptotic and necrotic cells, serves as a rich source of genetic information, including mutations associated with cancer. The accurate identification and characterization of this circulating free DNA can be used in tailoring precise and personalized cancer treatment strategies.
- Targeted sequencing focuses on specific genomic regions of interest, offering a cost-effective and focused approach, while whole exome sequencing and whole genome sequencing provide a broader scope, enabling comprehensive analysis of the entire exome or genome, respectively.
- existing technologies face limitations in accurately capturing low- abundance tumor-derived cfDNA, and the patent application herein discloses a novel sequencing method that significantly augments the precision and reliability of cfDNA analysis in cancer patients.
- the subsequent sections provide a detailed exposition of the inventive steps and distinctive features underpinning our improved sequencing methodology. Circulating free DNA (also known as cell-free DNA) are degraded DNA fragments released to body fluids such as blood plasma, urine, cerebrospinal fluid, etc.
- cfDNA refers to DNA that is found outside of cells in the bloodstream or other bodily fluids. It is released into the circulation through processes such as cell death (apoptosis and necrosis) and is being increasingly studied for its potential as a non-invasive biomarker for various medical conditions, including cancer.
- cfDNA is typically small because it originates from the fragmentation of DNA released from cells.
- the size of cfDNA fragments is influenced by various biological processes, and the primary reasons for the small size of cfDNA include Apoptosis and Necrosis. During apoptosis, DNA is cleaved into fragments as part of the cellular breakdown. Necrosis, which is uncontrolled cell death often due to injury or disease, can also result in the release of DNA fragments.
- Various enzymes such as nucleases, are present in the cellular environment, the extracellular environment, and the various fluid streams, such as the bloodstream and urine. These enzymes can further degrade DNA into smaller fragments. DNA fragments can be released into the bloodstream as part of normal physiological processes, such as tissue turnover and repair. Other processes, such as the clearance from the body, also affect the level and the size distribution of cfDNA fragments. The combination of these factors results in the presence of small DNA fragments in the bloodstream. In the context of liquid biopsy and diagnostic applications, the small size of cfDNA is advantageous because it allows for easier extraction and analysis. The analysis of cfDNA has become a valuable tool in medical research and clinical applications, such as the detection of genetic mutations, monitoring of cancer, and prenatal testing.
- the fragments of cfDNA are often double-stranded and relatively short, typically a few hundred base pairs in length. These fragments can have different termini (ends), including blunt ends or staggered ends with overhangs.
- the size distribution of cell-free DNA (cfDNA) can vary, but there are general trends observed in its fragmentation.
- the size range of cfDNA is commonly described in terms of its median or average size and its broader range. Keep in mind that these values can be influenced by factors such as the underlying biology of the source tissue, the specific extraction methods, and the analytical techniques used for measurement.
- the median size of cfDNA is often reported to be around 160-180 base pairs (bp). This means that, on average, half of the cfDNA fragments are shorter than this size, and half are longer.
- cfDNA fragments can extend from around 70 to several hundred base pairs. In healthy individuals, cfDNA fragments may vary in size, but they typically fall within this range. In the context of cfDNA, the organization of histones is disrupted due to the fragmentation of DNA. However, certain studies have explored the possibility of analyzing nucleosome- sized fragments within cfDNA, which could potentially provide information about the chromatin structure of the cells from which the cfDNA originated. The idea being that the sensitivity of DNA to cleavage and fragmentation is not uniformly distributed over the chromosomes and can vary over time, differentiation status, activation status, mutation status, and other factors.
- the invention provides a method for sequencing nucleic acid molecules, comprising: - providing a sample comprising a plurality of preferably linear DNA molecules; - incubating said sample with a restriction enzyme that is sensitive to methylation at the recognition site for said restriction enzyme and subsequently sequencing the DNA molecules in the sample thereby determining sequences of the DNA molecules; and determining from said sequences the methylation status of a restriction enzyme site for said restriction enzyme on said plurality of DNA molecules.
- the invention also provides a method for sequencing nucleic acid molecules, comprising: - providing a sample comprising a plurality of linear DNA molecules; - contacting said sample with a terminal deoxynucleotidyl transferase (TdT) and a mixture of two or more different nucleotides under conditions in which nucleotides in the mixture are sequentially and randomly added to the 3’ termini of the linear DNA molecules by the TdT and denaturing, when present, double stranded DNA in said sample, thereby generating linear single-stranded DNA molecules comprising sequence identifiers at their 3’ termini; - contacting the linear single-stranded DNA molecules comprising the sequence identifiers with a single-stranded DNA ligase under conditions in which the ends of linear single-stranded DNA molecules comprising the sequence identifiers are self-ligated thereby generating a plurality of circular single-stranded DNA molecules each comprising a DNA molecule and a sequence identifier; -
- the invention also provides a kit for sequencing a plurality of DNA molecules, wherein the kit comprises a TdT, a single-strand DNA ligase, a polymerase with strand displacement activity, and at least two nucleotides.
- the invention also allows for the detection of circular DNA in cfDNA samples.
- Circular DNA in human cell-free DNA (cfDNA) samples is a noteworthy topic in molecular biology and genetics. Circular DNA, unlike the more common linear DNA, forms a closed loop and is often associated with extrachromosomal DNA elements. In humans, cfDNA primarily originates from the breakdown of cells and is present in bodily fluids like blood.
- Circular DNA present in the cfDNA fraction will not be provided with a sequence identifier by the TdT but it is still present in the reaction, and it will be amplified.
- Figure 1 Schematic representation of classic barcoding strategies and TdT strategies.
- Figure 2 Schematic representation of a method of the invention and the ability to distinguish between circular and linear DNA in a sample.
- Figure 3 Schematic representation of a method for selective “scarring/marking” of non- methylated DNA to enhance the detection of DNA methylation. It starts with methylation- sensitive restriction enzymes that target methylation hotspots, particularly in CpG islands, digesting only the non-methylated DNA. The DNA pool treated this way can be directly sequenced.
- the analysis of the sequencing data reveals “scars/marks” on the reads that can be used to determine the methylation status of the underlying DNA.
- two double stranded fragments are shown with the same sequence.
- One fragment contains a methylated restriction enzyme site, the other fragment contains the same restriction site but this one is not methylated.
- the fragments are incubated with a restriction enzyme (RE) that only digests unmethylated restriction sites.
- Methylated versions (m5-C) of the restriction enzyme site are not digested. Sequencing of the result of the digestion yields reads that end in the restriction enzyme site representing fragments that were digested by the enzyme and reads that contain the entire restriction enzyme site representing fragments where the enzyme did not cut (-).
- FIG. 4 Schematic representation of the technology for selective removal of non- methylated DNA to enhance the detection of DNA methylation. It starts with methylation- sensitive restriction enzymes that target methylation hotspots, particularly in CpG islands, digesting only the non-methylated DNA.
- Exonucleases then further break down these non- methylated fragments, enriching the sample in methylated DNA.
- the final step involves sequencing the enriched samples, where the analysis focuses on the coverage of CpG islands compared to surrounding areas. Higher coverage signifies preserved methylation, indicating a successful enrichment process and enabling precise methylation pattern analysis.
- Figure 5 Electrophoretic traces of a proof of concept (POC) experiment showing the selective depletion of non-methylated DNA.
- M-P* (lane H1) is the MYC promoter sequence with methylated BstUI sites
- M- P (lane A2) is a non-methylated MYC promoter
- M-T (lane B2) is the MYC terminator sequence which lack the BstUI site.
- Lanes B1, C1, and D1 show the effect of BstUI digestion on the input DNA; as expected, only the non-methylated MYC promoter sequence (M-P) was digested. Following digestion, the DNA was treated with Lambda and P1 exonucleases, which fully deplete the digested M-P sequence while M-P* and M-T are left untouched.
- FIG. 6 Overview of the synthetic sequences used for the proof of concept (POC) experiment.
- the figure depicts Integrative Genomics Viewer (IGV) tracks showing the eventual presence of a CpG island (“CpG_GRCh38.bed” track), The eventual presence of a BstUI restriction site (“BstUI +” track), and the location of the DNA sequences used for the POC experiment (“selected_targets.bed” track).
- IOV Integrative Genomics Viewer
- BstUI + BstUI restriction site
- selected_targets.bed the MYC-Promoter sequence
- the MYC-Terminator sequence is not part of any CpG islands, and it does not contain any BstUI site.
- BstUI is expected to digest only a non-methylated MYC-Promoter sequence.
- the MYC-Terminator sequence will not be digested as it doesn’t contain the restriction site, while methylation on the restriction site would block BstUI from digesting the MYC- Promoter sequence.
- Figure 7 IGV screenshot showing the result of a proof-of-concept experiment. Reads from methylated DNA (top track) and non-methylated DNA (bottom track) are compared. Non- methylated DNA was correctly digested in correspondence with the two restriction sites highlighted by the blue bars below the sequence track. The alignment shows the “scar” corresponding to the non-methylated restriction site.
- FIG. 8 IGV screenshot showing the result of methylation detection using a methylation sensitive restriction enzyme and methylation detection using bisulfite conversion.
- the figure represents a typical IGV screenshot used in this application to show examples of results obtained with a method of the invention.
- IGV visualizes the alignment of sequencing reads versus a reference genome.
- the reference genome used is listed in the top bar, left corner (1), followed by the selected chromosome (2) and the genetic coordinates of the zoomed-in area (3).
- Below the top bar is a graphical representation of the selected chromosome, its banding pattern (4), and the length of the zoomed-in area (5).
- This figure compares the sequencing data obtained with the methods of the invention (6) and bisulfite sequencing (7).
- the image visualizes the coverage (8) associated with the sequencing reads aligned to the reference genome (9).
- the reference sequence (10), genes and transcripts annotations (11), and a track highlighting the presence of CpG islands (12). Mutations with respect to the reference sequence that are detected in the sequence reads are indicated (13).
- the figure shows a comparison of the methylation detection technology (top panel) with conventional bisulfite sequencing (bottom panel) in analyzing DNA methylation and single nucleotide polymorphisms (SNPs). This figure depicts a region of Chromosome 5 of the human genome. In the top panel, sequencing reads generated using the proposed technology are aligned.
- FIG. 9 IGV screenshot comparing sequencing reads generated by a method of the invention (top) with bisulfite sequencing (bottom). This picture shows the alignments near a CpG island on Chromosome 9.
- the CpG island appears to be non-methylated, as evidenced by the sudden drop in coverage.
- the rest of the region does not include other CpG islands.
- the proposed technology delivers more mappable reads leading to a higher and more uniform coverage.
- a distinct drop in coverage is explicitly observed at the methylation site, indicating the absence of methylation at this location.
- This precise detection of methylation patterns is achieved without the need for aggressive chemical treatments, enabling simultaneous identification of SNPs in the surrounding regions.
- Figure 10 IGV screenshot (chromosome 20) comparing sequencing reads generated by a method of the invention (top) with bisulfite sequencing (bottom). This picture shows the alignments near four CpG islands on Chromosome 20.
- the first three CpG islands are non- methylated, resulting in a sudden drop in coverage in the aligned reads generated with the proposed technology.
- the right-most CpG island instead, appears to be methylated, thus protected by the effect of the particular restriction enzyme, resulting in regular coverage.
- the comparison vs bisulfite sequencing shows a striking difference in the quality of the coverage and the alignments and the ease of detection.
- Figure 11 IGV screenshot (chromosome 15) comparing sequencing reads generated by the proposed technology (top) with bisulfite sequencing (bottom) near CpG-88 on Chromosome 15.
- FIG. 12 IGV screenshot (chromosome 5) comparing sequencing reads generated by a method of the invention (top) with bisulfite sequencing (bottom).
- the IGV screenshot shows the same features as figure 8, as detailed in the legend thereto, but now for an exemplary region on chromosome 5. The method used preserves the native DNA sequence, enabling accurate SNP calling, even in GC-rich regions.
- bisulfite sequencing involves the chemical conversion of cytosines to thymines (and guanines to adenines), leading to numerous false SNPs visible as colored mismatches in the coverage track.
- the proposed method preserves the native DNA sequence, enabling accurate SNP calling, even in GC-rich regions.
- bisulfite sequencing involves chemical conversion of cytosines to thymines (and guanines to adenines), leading to numerous false SNPs visible as grayscale mismatches in the coverage track.
- a plurality of nucleic acid molecules refers to a diverse set or a multitude of DNA (deoxyribonucleic acid) or RNA (ribonucleic acid) molecules.
- the term "plurality" in this context emphasizes the presence of a variety or multiple types of nucleic acid molecules. These molecules can differ in their sequences, lengths, and functions, reflecting the genetic diversity within a biological system.
- a plurality of nucleic acid molecules may encompass the entire genetic material of an organism or be a subset thereof, representing a part of the complexity and heterogeneity of the genetic information encoded within that collection of molecules.
- the present inventions have various advantages such as but not limited to obtaining sequencing information in a single assay.
- nucleotide sequence of a target pool of DNA molecules allow the determination of the nucleotide sequence of a target pool of DNA molecules; the determination of the nucleotide sequence of the DNA edges, i.e. the complete sequence of the 3’ and 5’ ends of the target DNA; the determination of the methylation status of a pool of DNA; and the determination of the molecular structure of the DNA molecules, i.e. circular and linear DNA can be distinguished. Further advantages are that they do not require the modification of the nucleic acid of which the sequence is to be determined. Some embodiments add nucleotides to an end of nucleic acid molecules in the sample. However, also these embodiments do not change the sequence of the nucleic acid sequence that was in the sample.
- the methods of the present invention are simple and allow to obtain sequence information of whole genomes in one go.
- the methods of the invention do not require the use blunting enzymes, PCR or ligation-based barcoding strategies. It also avoids the chemical conversion of methylated bases, like bisulfite conversion.
- Blunting enzymes such as exo- and endonucleases remove nucleotides digest single stranded DNA and remove overhangs from the 5’- or 3’-ends of the target DNA thereby removing the possibility to determine these end sequences in the target DNA.
- Ligation-based barcoding involves the use of DNA ligases to attach short sequences of nucleotides (barcodes) to DNA fragments. These barcodes serve as unique identifiers that can be used to differentiate between samples during downstream analyses. In general, barcoding enables the simultaneous analysis of multiple samples, enhancing throughput and efficiency.
- ligation-based barcoding strategies are typically complex, requiring careful design and validation of barcodes to avoid overlapping sequences.
- the ligation reaction may have varying efficiencies, potentially leading to incomplete ligation and the additional reaction steps increase the risk of cross-contamination between samples, which can complicate data analysis.
- DNA is treated with bisulfite, unmethylated cytosine residues are converted to uracil, whereas methylated cytosines remain unchanged. The conversion is readily detected for instance using sequencing techniques.
- One of the problems of using the bisulfite method for detecting methylation is that the treatment also introduces other changes in the DNA rendering a concomitant sequence determination less accurate.
- a method of the invention does not introduce significant changes in the target sequence and also leave these ends of the target sequence intact.
- the barcoding methods of the invention, the methylation sensitive restriction digestion and the subsequent sequencing of the modified DNA can be achieved without changing the reaction vessel. This renders the methods of the invention robust and less prone to contamination than other methods of barcoding, methylation detection and sequencing.
- Embodiments of the present invention provide novel approaches for barcoding, using terminal deoxynucleotidyl transferase (TdT), and/or methylation detection, using selective digestion of methylated or non-methylated DNA.
- the methylation status of the target DNA can be unveiled either by alignment of DNA reads after digestion of non-methylated DNA, (method depicted in Figure 3) or for instance by depletion of non-methylated DNA (method depicted in Figure 4).
- the exact sequence of 3’ and 5’ ends of the target DNA, the barcoding of the same DNA, and the distinction between circular and linear DNA can be accomplished using methods depicted in Figure 2.
- the DNA is digested with a restriction enzyme that digests non-methylated versions of the restriction enzyme digestion site.
- the methods for methylation detection of the present invention mark a significant breakthrough in both epigenetics and cancer diagnostics.
- the present invention enables direct detection of methylation from native DNA strands. In some embodiments, this is achieved through a selective depletion step that removes unmethylated CpG islands while preserving methylated DNA.
- This selective depletion involves two key enzymatic processes. First, a restriction enzyme is used to specifically target and cleave non-methylated CpG sites. Following this, an exonuclease digests the cleaved DNA fragments, a process facilitated by the presence of phosphate groups introduced during the restriction enzyme digestion.
- the depletion step is not necessary. Instead, the digestion alone can reveal the presence of methylated or non-methylated sites directly through sequencing data analysis. After sequencing, the reads are mapped to the reference genome, and the digested sites are identified by examining the starting and ending positions of the reads. For example: If a site is methylated, the restriction enzyme cannot cleave it, and sequencing reads will span the site, indicating its protection. If a site is non-methylated, it will be cleaved by the restriction enzyme.
- sequencing reads will start or end in proximity to the digestion site, and no reads will span the site.
- a mix of reads spanning the site and reads ending at the site will be observed in approximately equal ratios.
- restriction enzymes that cut the site if it is methylated and not when it is not methylated. In such cases the sequence reads will span the site if it is not methylated and there will be no reads etc in the case where the site was methylated.
- the accompanying IGV screenshots illustrate this principle: the top image, obtained using the method of the invention, demonstrates superior coverage and preserved sequence integrity, while the bottom image, generated using conventional bisulfite sequencing, exhibits degraded regions corresponding to unmethylated CpG islands.
- a method of the invention is advantageous for several reasons such as but not limited to the possibility of obtaining information on the methylation status of essentially all occurrences of the selected restriction enzyme.
- the enzymatic methods described herein are much more accurate than chemical methods for the detection of methylation making it easier to detect the presence of mutations in the plurality of DNA molecules as opposed to errors induced by the methods used.
- Terminal transferases are enzymes that catalyze the addition of nucleotides to the 3' end (terminus) of a DNA or RNA molecule.
- Terminal deoxynucleotidyl transferase is an enzyme particularly known for its ability to add nucleotides to the 3' ends of DNA in a template-independent manner, meaning that it does not require a DNA template to guide the addition of nucleotides.
- the protein is encoded by the DNTT gene.
- Aliases for the gene or protein in humans are DNA Nucleotidylexotransferase; TDT; Terminal Deoxynucleotidyltransferase; Terminal Addition Enzyme; Terminal Transferase; or EC 2.7.7.31.
- TdT is utilized in the labeling of DNA fragments for techniques such as DNA sequencing and in the addition of labeled nucleotides for various labeling experiments.
- TdT novel applications of TdT are in the cloning and other molecular biology techniques.
- a homopolymeric tail By adding a homopolymeric tail to the 3' end of a DNA fragment, researchers can create a template for priming the synthesis of a complementary DNA strand using a primer complementary to the homopolymeric tail.
- Homopolymeric tailing can be achieved by contacting a DNA molecule with TdT in the presence of only one type of nucleotide. TdT will add the nucleotides to the 3’ end of the DNA molecule. Because only one type of nucleotide is added, the resulting sequence is known and can be used to design a primer to synthesize a reverse complementary strand of the DNA molecule.
- the TdT is incubated under conditions that allow the addition of two or more different nucleotides to the 3’ termini of the plurality of single- stranded DNA molecules.
- TdT can add nucleotides depending on the availability of the particular nucleotide in the mixture.
- the two or more different nucleotides are typically added in equimolar amounts.
- the inclusion in the nascent sequence is apparently random.
- the complexity of newly formed sequences increases with the number of different nucleotides in the mixture and the number of nucleotides that is added to the 3’ termini.
- the complexity of the new sequences added is very low.
- two different nucleotides, for instance A and T are used there are only two different ends, specifically; ends with an A and ends with a T.
- the complexity rapidly increases with the number of nucleotides that are added to the 3’ ends and the number of different nucleotides in the mixture.
- the number of different sequences formed is 2 n where n is the average length of the newly formed sequences.
- the complexity is (4 or more) n .
- the two or more different nucleotides are preferably three or more, preferably four, different nucleotides.
- the nucleotides are preferably A, C, G and T.
- the complexity does not usually need to exactly match the number of sequences in the plurality of single-stranded DNA molecules. Discrimination of different molecules in the mixture is aided by the increasing complexity of sequences in the plurality.
- the combination of an identifier and the sequence of the DNA molecule also provides identification information. So even when there are two or more identifiers with the same sequence, the linkage to target DNA molecules with a substantially different sequence allows adequate discrimination and subsequent identification of the target DNA molecules in the plurality of single-stranded DNA molecules.
- the complexity (number of different) sequence identifiers generated can be increased with an increased number of different nucleotides and increased length of the sequence identifier.
- the complexity can be increased to such an extent that each sequence identifier attached to the 3’ end of the plurality of DNA molecules is unique.
- Such sequence identifiers are also referred to as unique sequence identifiers. This is a nice feature but, as mentioned herein above, not an essential feature.
- a method of the invention is particularly suited to sequence the entirety of linear DNA fragments, including the ends of the fragments. Barcoding target DNA or providing DNA with sequence identifiers is typically accomplished by capturing the DNA with a backbone or with adaptors by means of ligation.
- Conditions encompass a suitable buffer for the enzyme to work in, a suitable temperature, and sufficient time to produce sequence identifiers of sufficient complexity at the 3’ termini.
- the DNA molecules comprising the sequence identifiers at their 3’ termini are contacted with a single-stranded DNA ligase under conditions in which the ends of the DNA molecules comprising the sequence identifiers are self-ligated, thereby generating a plurality of circular single-stranded DNA molecules each comprising a DNA molecule and a sequence identifier.
- the single-stranded DNA ligase is a CircLigase TM .
- CircLigase TM is a thermostable ligase that catalyzes the intramolecular ligation (i.e. circularization) of ssDNA and ssRNA templates. Self-ligation can further be enhanced over intermolecular ligation by reducing the concentration of DNA molecules in solution by increasing the volume of the ligation reaction. The ligation reaction is performed using buffer conditions that support efficient ligation of the single strands. If necessary the DNA molecules comprising the sequence identifiers at their 3’ termini are provided with a phosphate group at the 5’-end prior to the ligation. This can be done with a suitable polynucleotide kinase (PNK) using the protocol of the manufacturer.
- PNK polynucleotide kinase
- Methods of the invention are particularly suited to produce DNA circles that each have one single-stranded DNA molecule from the plurality of single-stranded DNA molecules and one sequence identifier.
- the plurality of circular single-stranded DNA molecules is subsequently contacted with a polymerase with strand displacement activity and one or more primers whereupon a rolling circle amplification reaction is performed, thereby generating a plurality of amplification products, each comprising one or more copies of a circular single-stranded DNA molecule from the plurality of circular single-stranded DNA molecules.
- leftover linear DNA, if any is preferably removed prior to the rolling circle amplification.
- Performing a rolling circle amplification after removing linear DNA typically produces more high molecular weight concatemers of backbone and target DNA.
- Methods further include subjecting DNA circles that are produced in the ligation reaction to an amplification reaction, preferably a rolling circle amplification (RCA).
- Rolling circle amplification produces an ordered array of copies of at least two of said DNA circles.
- Rolling circle amplification produces DNA molecules of high molecular weight. Which is suited for sequencing. The multiple copies of the same circle and, thus, the same DNA molecule from the plurality and the same sequence identifier allow for a more accurate sequence determination and reduce sequencing errors.
- Rolling circle amplification has recently been reviewed by Mohsen and Kool (2016) Acc Chem Res.
- rolling circle amplification and rolling circle replication are sometimes used interchangeably in the art.
- rolling circle replication is used to refer to the replication of naturally occurring plasmid and virus genomes.
- the terms refer to a similar underlying principle, i.e. the repeated copying of the same circular DNA producing a longer nucleic acid molecule with an ordered array of backbone-target nucleic acid copies.
- Present techniques for rolling circle amplification enable the production of large arrays containing many copies of the produced DNA circles.
- Concatemers can have 2 or more copies, preferably 4 or more copies of the produced circles.
- Rolling circle amplification is performed by a polymerase with strand displacement activity and requires the usual priming sequence to generate the start.
- Particular polymerases with strand displacement activity and with high processivity are available to produce concatemers of considerable length.
- Polymerases with high processivity are polymerases that can polymerize a thousand nucleotides or more without dissociating from the DNA template. They can preferably polymerize two, three, four thousand nucleotides or more without dissociating from the DNA template. Polymerases with high processivity are among others discussed in Kelman et al; 1998: Structure Vol 6; pp 121-125.
- Rolling circle amplification can yield very high molecular weight concatemers using polymerases with high processivity and strand-displacement capacity, such as phi29 polymerase.
- This polymerase can polymerize 10 kb or more.
- High processivity polymerases are, therefore, preferably polymerases that polymerize 10 kb or more without dissociating from the DNA template (Blanco et al; 1999. J. Biol. Chem. 264 (15): 8935–40).
- the polymerization can be started on a nick in the double-strand DNA, or the DNA can be melted and annealed in the presence of one or more suitable primers.
- primers examples include random hexamer primers, one or more backbone-specific primers, one or more target nucleic acid- specific primers, or a combination thereof. Random primers are typically preferred when target nucleic acid sequences are not known or when a variety of target nucleic acid sequences are to be sequenced. One or more specific primers can be used to sequence- specific target nucleic acids of which the basis sequence is known. A variant is one or more primers that are specific for one or more particular sequences in a plurality of DNA molecules. Such primers can be used in different situations, for instance, in the focused sequencing of particular sets of sequences, for instance, sequences known to be associated with certain cancers or certain pathogens.
- the polymerase is preferably Phi29 DNA Polymerase, Bst DNA Polymerase, or Vent (exo-) DNA Polymerase.
- the polymerase is Phi29 DNA Polymerase.
- the Phi29 DNA Polymerase is preferably EquiPhi29 DNA polymerase. Both are available from Thermofisher. EquiPhi29 DNA polymerase is a mutant of Phi29 DNA polymerase with particularly favorable properties. Sequencing methods have evolved over time. The old Sanger sequencing method has been replaced by the now common next-generation sequencing (NGS) methods. These methods have recently been reviewed by Goodwin et al. (2016; Nature Reviews
- NGS next-generation sequencing
- Illumina sequencing also known as sequencing by synthesis, is a widely adopted high- throughput DNA sequencing technology. Illumina sequencing is characterized by its high accuracy, high throughput, and relatively short read lengths. It is widely used for various applications, including whole-genome sequencing, resequencing, metagenomics, and transcriptomics. The technology has played a pivotal role in advancing genomics research and has become a cornerstone in many laboratories for its efficiency and cost-effectiveness.
- Determined sequences can be aligned based on the sequence identifiers and, optionally, the determined sequence of a single-stranded DNA molecule.
- the sequences from the aligned different reads can be used to filter out errors and determine the sequence of one single-stranded DNA molecule from the plurality of single-stranded DNA molecules.
- the plurality of linear single-stranded DNA molecules can be derived from any source. When the source is or contains linear double-stranded DNA, the source can be denatured to create the plurality of single-stranded DNA molecules.
- the denaturing step can be done before or after the step wherein the sequence identifier is added to the 3’-end of the DNA molecules. The denaturing step is preferably done after the sequence identifier is added.
- RNA source can be copied into DNA and made single-stranded to become a plurality of single-stranded DNA molecules.
- a plurality of RNA molecules is contacted with a reverse transcriptase under conditions in which RNA molecules are copied into cDNA molecules, thereby producing said plurality of single- stranded DNA molecules.
- a plurality of single-stranded nucleic acid molecules is a plurality of linear single- stranded nucleic acid molecules.
- a plurality of single-stranded DNA molecules is a plurality of linear single-stranded DNA molecules.
- a plurality of single-stranded RNA molecules is a plurality of linear single-stranded RNA molecules.
- the plurality of single-stranded molecules preferably comprises circulating free DNA. DNA that circulates freely or that is associated with cellular particles in the blood or other bodily fluid samples is typically smaller than 400 nucleotides. Target nucleic acid molecules of such lengths are particularly suited to the methods of the invention.
- samples with relatively small nucleic acid molecules are some types of forensic samples, fossil samples, samples of nucleic acid isolated from environments that are inherently hostile to nucleic acid molecule integrity, such as stool samples, surface water samples, and other samples rich in microbial organisms.
- the sample can be any sample comprising one or more nucleic acid molecules of which the sequence is to be determined.
- One example is a sample comprising tumor DNA.
- the plurality of single-stranded DNA molecules can also be circulating tumor DNA (ctDNA) or cell-free DNA (cfDNA) present in liquid biopsies, including but not limited to blood, saliva, pleural fluid, or ascites fluid.
- the plurality of single-stranded DNA molecules can also be single-stranded cDNA derived from messenger RNA, microRNA, CRISPR RNA, non-coding RNA, viral RNA, or other sources of RNA.
- the plurality of single-stranded DNA molecules can also be derived from genomic DNA, PCR products, plasmid DNA, viral DNA, or other sources.
- the means and methods of the present invention are particularly suited for the sequencing of short DNA.
- DNA in the plurality of single-stranded DNA molecules is preferably 400 base pairs or less, more preferably 300 base pairs or less, more preferably 200 base pairs or less, more preferably 150 or less.
- the lower limit of the target DNA is preferably 20 base pairs, more preferably 30 base pairs, more preferably 40 base pairs and more preferably 50 base pairs. Any lower limit can be combined with any upper limit.
- the size of DNA fragments is given in nucleotides here. This refers, of course, to the number in one strand.
- the invention further provides a kit for sequencing a plurality of DNA molecules, wherein the kit comprises a TdT, a single-strand DNA ligase, a polymerase with strand displacement activity, and at least two nucleotides.
- the kit preferably comprises CircLigase as a ligase and a Phi29 Polymerase as the polymerase.
- CfDNA holds significant diagnostic potential due to its unique properties and the wealth of information it carries.
- cfDNA represents a non-invasive source of genetic material. Its diagnostic utility lies, among others, in detecting specific genetic alterations, such as mutations, methylation patterns, and copy number variations, reflecting the genomic landscape of tissues of origin. In cancer diagnostics, the analysis of cfDNA allows for the identification of tumor-specific mutations, enabling early detection, monitoring of treatment response, and the assessment of minimal residual disease. Furthermore, cfDNA can be indicative of other pathological conditions, such as prenatal abnormalities or autoimmune disorders.
- the sample comprising circulating fluid is preferably a blood sample such as a serum sample, a urine sample, a cerebrospinal fluid sample, a lymph fluid sample, or a saliva sample.
- the sample is preferably a blood sample, preferably a serum sample.
- a method for sequencing nucleic acid molecules comprising: - providing a sample comprising a plurality of preferably linear DNA molecules; - incubating said sample with a restriction enzyme that is sensitive to methylation at the recognition site for said restriction enzyme and subsequently sequencing the DNA molecules in the sample thereby determining sequences of the DNA molecules; and determining from said sequences the methylation status of a restriction enzyme site for said restriction enzyme on said plurality of DNA molecules.
- any one of aspects 1-3 further comprising removing DNA molecules that were cut by said restriction enzyme from said sample prior to said sequencing the DNA molecules in the sample. 5.
- the method of aspect 4 wherein the majority of the plurality of DNA molecules do not have a phosphate group at their 5’ ends prior to said incubation with said restriction enzyme. 6.
- any one of aspects 1-10 further comprising adding sequence identifiers to DNA molecules in the sample prior to sequencing of the DNA molecules.
- the sequence identifiers are added by providing a sample comprising the plurality of linear DNA molecules; - contacting said sample with a terminal deoxynucleotidyl transferase (TdT) and a mixture of two or more different nucleotides under conditions in which nucleotides in the mixture are sequentially and randomly added to the 3’ termini of linear DNA molecules by the TdT and denaturing, when present, double stranded DNA in said sample, thereby generating linear single-stranded DNA molecules comprising sequence identifiers at their 3’ termini; - contacting the linear single-stranded DNA molecules comprising the sequence identifiers with a single-stranded DNA ligase under conditions in which the ends of linear single- stranded DNA molecules comprising the sequence identifiers are self-ligated thereby generating a pluralit
- the provided DNA sample comprises a plurality of linear single-stranded DNA molecules and a plurality of circular single- stranded DNA molecules.
- the sequence identifiers are unique sequence identifiers.
- the sample comprises circulating free DNA (cfDNA).
- the sequencing is performed using a short read sequencing method, preferably a next-generation sequencing method involving massively parallel sequencing, more preferably using the Illumina Sequencing Platform. 18.
- kits for sequencing a plurality of DNA molecules comprising a TdT, a single-strand DNA ligase, a polymerase with strand displacement activity, and at least two nucleotides.
- the ligase is a CircLigaseTM
- the polymerase with strand displacement activity is a Phi29 Polymerase.
- a method for sequencing nucleic acid molecules comprising: - providing a sample comprising a plurality of linear DNA molecules; - contacting said sample with a terminal deoxynucleotidyl transferase (TdT) and a mixture of two or more different nucleotides under conditions in which nucleotides in the mixture are sequentially and randomly added to the 3’ termini of the linear DNA molecules by the TdT and denaturing, when present, double stranded DNA in said sample, thereby generating linear single-stranded DNA molecules comprising sequence identifiers at their 3’ termini; - contacting the linear single-stranded DNA molecules comprising the sequence identifiers with a single-stranded DNA ligase under conditions in which the ends of linear single-stranded DNA molecules comprising the sequence identifiers are self-ligated thereby generating a plurality of circular single-stranded DNA molecules each comprising a DNA molecule and a sequence identifier; - contacting the plurality of
- 34 The method of any one of aspects 26-33, wherein the single-strand DNA ligase is a CircLigase TM .
- Aspects 1-10 are method wherein the methylation status of one or more DNA molecules is determined by determining the sequence of the DNA molecules in a situation wherein these had been previously exposed to a restriction enzyme that is sensitive to the presence or absence of methylation on the DNA.
- Aspects 25-39 are directed towards sequencing DNA molecules using sequence identifiers to be able to, among others, distinguish sequences derived from different DNA molecules with essentially the same sequence. Essentially identical sequences with different sequence identifiers are typically derived from different DNA molecules. This feature may, for instance, be useful in quantification methods. A non-limiting example of such usefulness is in determining the prevalence mutant DNA molecules.
- the sample may be a sample directly derived from a clinical or biological source or the sample may be a processing stage thereof for instance such as the samples contacted and processed in the various steps of the claims and/or aspects.
- the sample of aspect 26 can, for example, be directly derived from a clinical or biological source but the sample can also be the result of exposure of such a sample to a methylation sensitive restriction enzyme such as described in aspects 1-10 prior to determining the sequence thereof.
- a methylation sensitive restriction enzyme such as described in aspects 1-10 prior to determining the sequence thereof.
- the fact the sample typically comprises a plurality of linear DNA molecules does not exclude the fact that circular DNA molecules may also be present.
- Restriction enzymes also known as restriction endonucleases, are enzymes that recognize specific DNA sequences and cleave the DNA at or near these sequences.
- restriction enzymes play a crucial role in molecular biology by cutting DNA molecules at precise locations, enabling scientists to manipulate and study DNA in various ways.
- the sequences that restriction enzymes recognize are typically palindromic, meaning they read the same backward and forward. Such sequences are referred to as restriction enzyme sites.
- the resulting fragments can be analyzed, manipulated, or used in various molecular biology techniques, such as cloning and DNA sequencing.
- Methylation-sensitive restriction enzymes are a subclass of restriction enzymes that recognize specific DNA sequences and cleave DNA, but their activity is influenced by the methylation status of the DNA.
- Methylation is a chemical modification where a methyl group (CH3) is added to certain bases in the DNA molecule, typically cytosine bases in CpG dinucleotides.
- Methylation-sensitive restriction enzymes can be divided into two main categories, e.g. enzymes that cleave DNA only when specific sites are methylated. If the recognition site is unmethylated, these enzymes do not cleave the DNA and enzymes that cleave DNA at specific sequences only when specific methylation at the recognition site is absent. Methylation of the restriction site protects the DNA from cleavage by these enzymes.
- Non-limiting examples of methylation-sensitive restriction enzymes are: - HpaII: This enzyme recognizes the sequence 5'-CCGG-3' but only cleaves if the internal cytosine (C) is unmethylated; - MspI: Similar to HpaII, MspI recognizes the sequence 5'-CCGG-3' and cleaves if the internal cytosine is unmethylated; - BstUI: This enzyme recognizes the sequence 5'-CGCG-3' and cleaves if the cytosine (C) is unmethylated. - HhaI: HhaI recognizes the sequence 5'-GCGC-3' and cleaves only if the internal cytosine (C) is unmethylated.
- SacII recognizes the sequence 5'-CCGCGG-3' and cleaves DNA at this site. Methylation of the cytosine (C) within the recognition site inhibits cleavage by SacII.
- the methylation-sensitive restriction enzyme in methods of the invention is preferably a methylation-sensitive restriction enzyme that digests non-methylated versions of the restriction enzyme digestion site, such as , but not limited to: HpaII; MspI; BstUI, HhaI or SacII.
- DNA methylation is a fundamental epigenetic mechanism in which methyl groups are added to the DNA molecule. This modification doesn't change the DNA sequence itself but can affect how genes are expressed.
- Methylation typically occurs on cytosine bases, specifically at cytosine-guanine dinucleotide sequences (CpG sites), although it can also occur at non-CpG sites in certain contexts, such as during development.
- the methylation status of DNA refers to the presence or absence of methylation at a certain position; to the extent of such methylation at a certain site; or to the pattern and extent of methylation across the genome of an organism or a specific set of genes. It is sometimes described as methylated or unmethylated, or hyper- or hypo-methylated, or the like.
- the methylation status of DNA can be dynamic and can change in response to various environmental factors, developmental stages, and disease conditions.
- the sequences can be used to determine where the restriction enzyme has cut or not. The presence of a complete copy of the restriction enzyme site in the determined sequence indicates that the methylation-sensitive restriction enzyme did not cut this particular occurrence of the site in the sample. The presence of the resulting parts of digestion of the restriction enzyme site by the restriction enzyme at the ends of determined sequences indicates that the methylation-sensitive restriction enzyme did cut this particular occurrence of the site in the sample.
- genomic or other large DNA fragments these can be fragmented to obtain suitably sized fragments. For instance as indicated herein above by shearing or sonication or by digestion with another restriction enzyme. Exposure to the methylation-sensitive restriction enzyme will result in undigested fragments that have only unphosphorylated 5’-ends and result in digested fragments that are unphosphorylated on one 5’-end and phosphorylated on the 5’-end that is the result of the successful digestion by the enzyme. These latter ends are sensitive for digestion by the 5’-phosphate dependent exonuclease and will be removed. The removed fragments will not be sequenced. The result will be that the sample is enriched for DNA fragments that were not cut by the methylation-sensitive restriction enzyme.
- the exonuclease step not only enriches for certain types of fragments but also creates a gap in the sequence reads that is more easily noticed than without the exonuclease treatment.
- the 5’-phosphate dependent exonuclease is preferably Lambda exonuclease. This enzyme digests only strands that have a 5’- phosphate. Digestion by Lambda exonuclease of a double stranded DNA fragment that has one phosphorylated 5’-end and one unphosphorylated 5’-end will result in digestion of the strand with the 5’-phosphate group while the other unphosphorylated strand is left intact. This single strand is preferably also removed thereby increasing the enrichment further.
- the enrichment is preferably further improved by incubating the sample with a single strand nucleotide specific nuclease subsequent to and/or at the same time as said incubation with said restriction enzyme. This will remove the single strand and enrich the sequences for sequences of DNA fragments where not digested by the methylation-sensitive restriction enzyme.
- the single strand nucleotide specific nuclease is preferably Nuclease P1.
- Specially preferred samples for the enrichment are sample of small DNA fragments such as but not limited to cfDNA.
- the DNA can be treated with a methylation-sensitive restriction enzyme, without prior de-phosphorylation and without subsequent exonuclease treatment.
- the DNA can then optionally be tagged with random nucleotides using TdT, undergo melting to generate single-stranded DNA, and self-circularized using CircLigase.
- the circular DNA is then amplified using rolling circle amplification and sequenced.
- the effect of the methylation-sensitive restriction enzyme can then be deducted by the analysis of the aligned reads. Reads coming from digested DNA will terminate at the restriction site, while undigested reads will span the restriction site.
- Figure 3 Sequencing results are more easily interpreted by adding sequence identifiers to DNA molecules in the sample prior to sequencing of the DNA molecules. It is therefore preferred to add sequence identifiers to DNA molecules in the sample prior to sequencing of the DNA molecules.
- Sequence identifiers can be used to distinguish sequences of the same molecule from an identical sequence originating from a different molecule because such a sequence will typically have a different sequence identifier. Sequence identifiers can also be used to improve the accuracy of the sequence information for instance in determining difference between a mutation in the original sample from a mutation/error that occurred during the sequencing reaction. Sequence identifiers can also be used to economize and sequence several samples at the same time. In such a case the sequence identifier can contain a part that is the same for all DNA molecules in a sample but different from other samples.
- sequence identifiers are added by - contacting a sample comprising the plurality of linear DNA molecules with a terminal deoxynucleotidyl transferase (TdT) and a mixture of two or more different nucleotides under conditions in which nucleotides in the mixture are sequentially and randomly added to the 3’ termini of linear DNA molecules by the TdT and denaturing, when present, double stranded DNA in said sample, thereby generating linear single-stranded DNA molecules comprising sequence identifiers at their 3’ termini; - contacting the linear single-stranded DNA molecules comprising the sequence identifiers with a single-stranded DNA ligase under conditions in which the ends of linear single-stranded DNA molecules comprising the sequence identifiers are self- ligated thereby generating a plurality of circular single-stranded DNA molecules each comprising a DNA molecule and a sequence identifier; - contacting the plurality of circular single-strand
- the method preferably further comprises aligning respective sequences of a single- stranded DNA molecule using the sequence identifier.
- the provided DNA sample preferably comprises a plurality of linear single-stranded DNA molecules and a plurality of circular single-stranded DNA molecules.
- the sequence identifiers are preferably unique sequence identifiers.
- the sample preferably comprises circulating free DNA (cfDNA).
- the sequencing is preferably performed using a short read sequencing method, preferably a next-generation sequencing method involving massively parallel sequencing, more preferably using the Illumina Sequencing Platform. Said two or more different nucleotides are preferably three or more, preferably four, different nucleotides.
- the single- strand DNA ligase is preferably a CircLigaseTM.
- the polymerase with strand displacement activity is preferably a DNA polymerase with high processivity.
- the polymerase is preferably selected from the group consisting of Phi29 DNA Polymerase, Bst DNA Polymerase, and Vent (exo-) DNA Polymerase.
- the Phi29 DNA Polymerase is preferably EquiPhi29TM DNA polymerase.
- the primer for the RCA is preferably a specific primer.
- the kit for sequencing a plurality of DNA molecules is preferably a kit that comprises a TdT, a single-strand DNA ligase, a polymerase with strand displacement activity, and at least two nucleotides.
- the ligase is a CircLigaseTM
- the polymerase with strand displacement activity is a Phi29 Polymerase.
- cfDNA Extraction cfDNA was extracted from blood samples of patients using a standard cfDNA extraction kit (QIA kit) using the manufacturers protocol. The extracted cfDNA predominantly consisted of short, double-stranded DNA fragments. Terminal Deoxynucleotidyl Transferase (TdT) Treatment: The cfDNA was treated with TdT (obtained from New England Biolabs) and a mixture of four different nucleotides (A, T, C, G; Sigma) using the manufacturers protocol. This step randomly added a sequence of nucleotides to the 3’ ends of each cfDNA fragment, serving as unique sequence identifiers.
- TdT Terminal Deoxynucleotidyl Transferase
- the sequences obtained, including the terminal regions, can be fed into an AI-based analytical model designed to identify patterns and mutations associated with various cancers.
- the AI model can then be trained to recognize cancer-specific signatures in the cfDNA sequences, including those present in the sequence edges. Including these features is expected to bring: Improved Diagnostic Accuracy:
- the AI model is expected to use the end-sequence information, together with other features, to distinguish between benign and malignant cfDNA samples.
- Personalized Treatment Strategies The model also provided insights into the genetic makeup of individual tumors, facilitating personalized treatment planning.
- the input DNA is composed of 3 non-phosphorylated sequences derived from the human MYC gene.
- the input DNA is composed of a methylated MYC-Promoter (M-P*), a non- methylated MYC-Promoter (M-P), and a MYC-Terminator without any CpG sites (M-T).
- the input DNA is digested with a methylation-blocked restriction enzyme, BstUI.
- BstUI digests only the unmethylated MYC-Promoter, as supposed to, thus generating 5'- Phosphorylated ends only on that DNA.
- the DNA was fragmented via sonication to 300bp.
- the overhangs of the ends of the resulting molecules were repaired (blunt ended) by means of end repair module from New England Biolabs and subsequently dephosphorylated.
- the fragmented DNA was then split into two aliquots.
- One aliquot was sent to a third-party company for bisulfite sequencing, while the other was processed using the method disclosed in Example 2.
- the sequencing data obtained was analyzed by direct comparison on IGV, where images 8- 12 were taken as a screenshot.
- Figure 8 shows a zoom-in of figure 12. It shows the difference in accuracy of the resulting sequences from the method of the invention when compared to bisulfite methylation detection.
- Figure 9 show the sequence gaps obtained after treatment with the methylation sensitive enzyme BstUI and the gaps obtained with the bisulfite treatment (see legend).
- Figure 9 shows the result for a part of the chromosome X.
- Figures 10 and 11 show the results of the same experiment but now for parts of chromosomes 20 and 15 respectively.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
L'invention concerne des procédés de séquençage de molécules d'acide nucléique et des kits utiles dans de tels procédés. Selon un mode de réalisation, l'invention concerne un procédé de séquençage de molécules d'acide nucléique dans un échantillon comportant une pluralité de molécules d'ADN linéaires ; la mise en contact de cet échantillon avec une désoxynucléotidyl transférase terminale (TdT) et un mélange d'au moins deux nucléotides différents dans des conditions où les nucléotides du mélange sont ajoutés de manière successive et aléatoire aux extrémités 3' des molécules d'ADN linéaires par la TdT et la dénaturation, le cas échéant, de l'ADN double brin dans cet échantillon, générant ainsi des molécules d'ADN linéaire simple brin contenant des identificateurs de séquence à leurs extrémités 3'. Les molécules d'ADN linéaires comportant les identifiants de séquence sont circularisées à l'aide d'une ligase d'ADN simple brin et les cercles de l'échantillon sont soumis à une amplification par cercle roulant. Les produits d'amplification sont ensuite séquencés, moyennant quoi des identifiants de séquence sont utilisés pour assembler des séquences à partir de la même molécule nucléique d'origine dans l'échantillon. L'invention concerne également des kits comportant des matières utiles à la mise en œuvre des procédés. Selon un autre mode de réalisation, l'invention concerne des procédés de détection de la méthylation dans une pluralité de molécules d'ADN.
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| NL2036501 | 2023-12-12 | ||
| NL2036501A NL2036501B1 (en) | 2023-12-12 | 2023-12-12 | Method for sequencing nucleic acid molecules and related methods. |
| NL2037734 | 2024-05-21 | ||
| NL2037734 | 2024-05-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025127927A1 true WO2025127927A1 (fr) | 2025-06-19 |
Family
ID=93962426
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/NL2024/050664 Pending WO2025127927A1 (fr) | 2023-12-12 | 2024-12-12 | Procédé de séquençage de molécules d'acide nucléique et procédés associés |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025127927A1 (fr) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2016024182A1 (fr) * | 2014-08-13 | 2016-02-18 | Vanadis Diagnostics | Procédé d'estimation de la quantité d'un locus méthylé dans un échantillon |
| WO2016053638A1 (fr) * | 2014-09-30 | 2016-04-07 | Ge Healthcare Bio-Sciences Corp. | Procédé pour l'analyse d'acide nucléique directement à partir d'un échantillon biologique non purifié |
| US20180030527A1 (en) * | 2004-03-08 | 2018-02-01 | Rubicon Genomics, Inc. | Methods and compositions for generating and amplifying dna libraries for sensitive detection and analysis of dna methylation |
| WO2018035170A1 (fr) * | 2016-08-15 | 2018-02-22 | Accuragen Holdings Limited | Compositions et procédés permettant de détecter des variants de séquences rares |
| EP3798318A1 (fr) * | 2019-09-30 | 2021-03-31 | Diagenode S.A. | Procédé et kit de séquençage à haut débit |
-
2024
- 2024-12-12 WO PCT/NL2024/050664 patent/WO2025127927A1/fr active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180030527A1 (en) * | 2004-03-08 | 2018-02-01 | Rubicon Genomics, Inc. | Methods and compositions for generating and amplifying dna libraries for sensitive detection and analysis of dna methylation |
| WO2016024182A1 (fr) * | 2014-08-13 | 2016-02-18 | Vanadis Diagnostics | Procédé d'estimation de la quantité d'un locus méthylé dans un échantillon |
| WO2016053638A1 (fr) * | 2014-09-30 | 2016-04-07 | Ge Healthcare Bio-Sciences Corp. | Procédé pour l'analyse d'acide nucléique directement à partir d'un échantillon biologique non purifié |
| WO2018035170A1 (fr) * | 2016-08-15 | 2018-02-22 | Accuragen Holdings Limited | Compositions et procédés permettant de détecter des variants de séquences rares |
| EP3798318A1 (fr) * | 2019-09-30 | 2021-03-31 | Diagenode S.A. | Procédé et kit de séquençage à haut débit |
Non-Patent Citations (4)
| Title |
|---|
| BLANCO ET AL., J. BIOL. CHEM, vol. 264, no. 15, 1999, pages 8935 - 40 |
| GOODWIN ET AL.: "Nature Reviews", GENETICS, vol. 17, 2016, pages 333 - 351 |
| KELMAN ET AL., STRUCTURE, vol. 6, 1998, pages 121 - 125 |
| MOHSENKOOL, ACC CHEM RES, vol. 49, no. 11, 2016, pages 2540 - 2550 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20230392191A1 (en) | Selective degradation of wild-type dna and enrichment of mutant alleles using nuclease | |
| JP2024060054A (ja) | ヌクレアーゼ、リガーゼ、ポリメラーゼ、及び配列決定反応の組み合わせを用いた、核酸配列、発現、コピー、またはdnaのメチル化変化の識別及び計数方法 | |
| JP5237126B2 (ja) | ライゲーションアッセイを用いてハイスループットシークエンスに基づき遺伝子関連配列を検出する方法 | |
| EP3368688B1 (fr) | Compositions et procédés de détermination de cytosines modifiées par séquençage | |
| US8586310B2 (en) | Method for multiplexed nucleic acid patch polymerase chain reaction | |
| JP7240337B2 (ja) | ライブラリー調製方法ならびにそのための組成物および使用 | |
| US20090047680A1 (en) | Methods and compositions for high-throughput bisulphite dna-sequencing and utilities | |
| JP2020513801A (ja) | メチル化状態が維持されるdna増幅方法 | |
| EP3102699B1 (fr) | Amorces de pcr | |
| JP6767870B2 (ja) | エラーのないdnaシークエンシング | |
| JP2014520525A (ja) | オリゴヌクレオチド・ライゲーション・アッセイに基づく配列ベースの遺伝子型決定 | |
| EP2722401B1 (fr) | Addition d'un adaptateur par clivage invasif | |
| US20240301466A1 (en) | Efficient duplex sequencing using high fidelity next generation sequencing reads | |
| CN110468179B (zh) | 选择性扩增核酸序列的方法 | |
| US11111514B2 (en) | Method for multiplexed nucleic acid patch polymerase chain reaction | |
| US20180051330A1 (en) | Methods of amplifying nucleic acids and compositions and kits for practicing the same | |
| WO2025127927A1 (fr) | Procédé de séquençage de molécules d'acide nucléique et procédés associés | |
| US9909170B2 (en) | Method for multiplexed nucleic acid patch polymerase chain reaction | |
| KR20230124636A (ko) | 멀티플렉스 반응에서 표적 서열의 고 감응성 검출을위한 조성물 및 방법 | |
| JP2022546485A (ja) | 腫瘍高精度アッセイのための組成物および方法 | |
| NL2036501B1 (en) | Method for sequencing nucleic acid molecules and related methods. | |
| US12091715B2 (en) | Methods and compositions for reducing base errors of massive parallel sequencing using triseq sequencing | |
| WO2024137316A1 (fr) | Oligonucléotides et procédés pour capturer des matrices simple brin et/ou des matrices avec des surplombs 3' | |
| KR20250011954A (ko) | 종양학 검정을 위한 조성물 및 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24827936 Country of ref document: EP Kind code of ref document: A1 |