[go: up one dir, main page]

EP4642929A2 - Method for detecting minority genetic variants - Google Patents

Method for detecting minority genetic variants

Info

Publication number
EP4642929A2
EP4642929A2 EP23841633.3A EP23841633A EP4642929A2 EP 4642929 A2 EP4642929 A2 EP 4642929A2 EP 23841633 A EP23841633 A EP 23841633A EP 4642929 A2 EP4642929 A2 EP 4642929A2
Authority
EP
European Patent Office
Prior art keywords
sequence
dna
origin
biological sample
genetic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP23841633.3A
Other languages
German (de)
French (fr)
Inventor
Chiara Leo
Anna ROCCHI
Alekos SIMONI
Greta IMMOBILE MOLARO
Andrea Crisanti
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Polo D'innovazione Di Genomica Genetica E Biologica Srl
Original Assignee
Polo D'innovazione Di Genomica Genetica E Biologica Srl
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Polo D'innovazione Di Genomica Genetica E Biologica Srl filed Critical Polo D'innovazione Di Genomica Genetica E Biologica Srl
Publication of EP4642929A2 publication Critical patent/EP4642929A2/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Definitions

  • the present invention relates to a method for detecting a first nucleic acid sequence in a biological sample, which comprises a second nucleic acid sequence potentially competing in the detection of such first nucleic acid sequence.
  • Cell-free DNA can originate from various different mechanisms and can be for example released: by cells, as a communication and adjustment mechanism of the immune system (Korabecna et al., (2020) Cell-free DNA in plasma as an essential immune system regulator. Ski Rep 10, 17478 DOI: https://DOI.org/10.1038/s41598-020-74288-2); as a consequence of apoptosis and necrosis of the cells (Ranucci R (2019) Cell-Free DNA: Applications in Different Diseases. Methods Mol Biol. 1909:3-12.
  • circulating tumor DNA ctDNA
  • ctDNA circulating tumor DNA
  • the quantity of cell-free DNA is very low in the blood flow in healthy individuals.
  • the concentration of cfDNA in blood samples tends to increase during the development of tumors, especially in the advanced stage, during the gestation period in maternal blood, and in the presence of cardiovascular and chronic diseases.
  • EVs are particularly stable in plasma and they can be considered a promising marker, for example in the diagnosis of patients with tumors at the initial stage (Fernando et al., WL (2017) New evidence that a large proportion of human blood plasma cell-free DNA is localized in exosomes.
  • EVs have been shown to be crucial bio-markers in prognoses.
  • the study of EV content can be of great help in early identification of the formation of premetastatic niches, since the tumoral cells have shown a greater activity of secreting exosomes with altered content (Chin and Wang, (2016) Cancer Tills the Premetastatic Field: Mechanistic Basis and Clinical Implications. Clin Cancer Res. 2016 Aug l;22(15):3725-33. DOI: 10.1158/1078-0432.CCR-16-0028).
  • BEAMing Safe-Sequencing System BEAMing Safe-Sequencing System
  • TamSeq Tagged- Amplicon deep Sequencing
  • CAPP-Seq Cancer Individual Profiling by deep Sequencing
  • digital PCR for detecting single-nucleotide mutations in cfDNA
  • WGS whole-genome sequencing
  • technologies can be divided into targeted approaches, aimed at detecting mutations in a set of predefined genes [for example, mutations in the EGFR gene are important for the response of patients with non- small-cell lung cancer (NSCLC) to blocking by tyrosine kinase inhibitors (TKI)] or non-targeted approaches (for example, comparative genomic hybridization on arrays, WGS or exome sequencing), which entail screening of the whole genome.
  • NSCLC non- small-cell lung cancer
  • TKI tyrosine kinase inhibitors
  • non-targeted approaches for example, comparative genomic hybridization on arrays, WGS or exome sequencing
  • the aim of the present invention is to provide a method for detecting a rare genetic variant in a biological sample which overcomes the limitations of the known art.
  • an object of the invention is to provide a method that makes it possible to detect a specific rare genetic variant in a context of nonmutated sequences, so reducing the need for ultra-deep sequencing with respect to the known art.
  • Another object of the invention is to provide a method that makes it possible to detect an exogenous DNA sequence in a biological sample.
  • Another object of the present invention is to provide a method that makes it possible to detect a DNA sequence of fetal or embryonic origin in a biological sample.
  • Another object of the invention is to provide a method that is highly reliable, and relatively easy to execute.
  • a method for identifying a first nucleic acid sequence in a biological sample comprising the following steps:
  • nucleic acids comprising:
  • step (ii) optionally fragmenting DNA obtained in step (i) or step (i-a), obtaining DNA fragments with a length comprised between 100 and 10,000 base pairs;
  • step (iii) circularizing DNA fragments present in the DNA isolated in step (i) or in the DNA obtained in step (ii), obtaining a mixture of circularized DNAs comprising circular DNA comprising the "first sequence” and circular DNA comprising the "second sequence”;
  • step (iv) placing the mixture of circularized DNAs obtained in step (iii) in contact with a CRISPR system characterized by an sgRNA chosen from an sgRNA that is complementary to the "second sequence” of DNA and an sgRNA comprising from 1 to 3 nucleotides that are not complementary to the "second sequence” of DNA, thus obtaining a mixture of DNA comprising circular DNA comprising the "first sequence” and linear DNA comprising the "second sequence”;
  • step (v) selectively amplifying the circular DNA obtained in step (iv) by rolling circle amplification (RCA), thus obtaining an amplification product comprising a plurality of copies of the first DNA sequence;
  • step (vi) identifying the first DNA sequence in the amplification product obtained in step (v).
  • first sequence means a genetic sequence selected from the group constituted by a genetic sequence of tumoral origin, a genetic sequence of fetal origin, a genetic sequence of embryonic origin, a genetic sequence of bacterial origin, a genetic sequence of viral origin, a genetic sequence of plant origin, a genetic sequence of animal origin.
  • second sequence means any genetic sequence that potentially competes in the detection of the marker of interest (“first sequence”).
  • the method of the invention entails isolating the nucleic acids from the biological sample with methods known to the person skilled in the art such as, for example, extraction of nucleic acids with phenol-chloroform, extraction of nucleic acids using magnetic beads, and extraction of nucleic acids using silica membrane columns.
  • the biological sample is selected from the group consisting of blood, saliva, sputum, urine, feces, cerebrospinal fluid, and liquid biopsies.
  • the method includes a step of removing the cellular portion. For example, if the method is applied to the detection of cfDNA of tumoral origin, the presence of cells would entail an excessive presence of DNA of the “second sequence”, so compromising the sensitivity and specificity of the method.
  • the “first sequence” is a DNA sequence.
  • Isolated nucleic acids comprise the sequence of interest (“first sequence”) often in the presence of a great excess of one or more sequences (“second sequence”) potentially competing in the detection of the “first sequence”.
  • first sequence is mutated DNA of tumoral origin present in the sample together with an excess of non-mutated DNA (“second sequence”).
  • second sequence is exogenous DNA present in the sample together with an excess of endogenous DNA (“second sequence”).
  • first sequence is fetal or embryonic DNA in a sample constituted predominantly by maternal DNA.
  • the circulating DNA extracted in step (i) of the method is generally sufficiently fragmented that it can be circularized as described in step (iii) of the method of the invention.
  • the method of the invention further comprises the optional step of (ii) fragmenting DNA obtained in step (i) or step (i-a), obtaining DNA fragments with a length comprised between 100 and 10,000 base pairs.
  • fragmentation can be obtained with chemical or physical methods known to the person skilled in the art, such as for example sonication and tagmentation.
  • the method of the invention then comprises a step of (iii) circularizing DNA fragments present in the DNA isolated in step (i) or in the DNA obtained in step (ii), obtaining a mixture of circularized DNAs comprising circular DNA comprising the "first sequence” and circular DNA comprising the "second sequence”.
  • Circularization of the fragments is done with methods known to the person skilled in the art (“Application of Circular Ligase to Provide Template for Rolling Circle Amplification of Low Amounts of Fragmented DNA”. Ada N. Nunez, MSFS, Mark F. Kavlick, BS, James M. Robertson, PHD, CFSRU, and Bruce Budowle, PHD, FBI Laboratory, FBI Academy, Quantico, VA 22135).
  • step (iii) is obtained by means of the following steps: a. repairing the ends of the DNA fragments; b. adding a nucleotide to the 3' end of each fragment; c. adding an adapter comprising: cl. a nucleotide complementary to the nucleotide added in step b. to one end of the adapter; c2. a restriction enzyme recognition sequence "a”; c3. a sequence "0" comprised between 15 and 35 base pairs, characterized in that it does not form secondary structures or homodimers. c4. an arbitrary sequence "y” comprising 150 to 1000 base pairs; d. adding a ligase, thus obtaining the formation of a mixture of circularized DNAs comprising:
  • the function of the “y” sequence of the adapter is to support facilitation of circularization via a reduction in conformational entropy. This is a synthetic sequence designed so that under reaction conditions it will not form hybrids with the “first sequence”, the “second sequence” or the sgRNA of the CRISPR systems used.
  • step (iv) the mixture of circularized DNAs obtained in step (iii) is brought into contact with a CRISPR system characterized by an sgRNA that is substantially complementary to the “second sequence” of DNA.
  • a CRISPR system characterized by an sgRNA that is substantially complementary to the “second sequence” of DNA.
  • This system makes it possible to specifically cut the “second sequence” which is thus linearized, while the circular DNA that comprises the “first sequence” remains intact.
  • the specificity can be optimized by conducting the reaction under conditions of low salinity and/or by using CRISPR systems characterized by sgRNA designed to maximize discrimination during hybridization.
  • the specificity can be optimized by using CRISPR systems characterized by “destabilized” sgRNA, i.e. sgRNA that comprise from 1 to 3 bases that are not complementary to the second sequence, preferably chosen from CG residues.
  • the CRISPR system is a CRISPR-Cas9 system.
  • the high specificity of the CRISPR system makes it possible to simultaneously degrade multiple sequences using several CRISPR systems characterized by respectively complementary sgRNA, simultaneously in multiplex mode. In this manner the method of the invention makes it possible to simultaneously linearize several sequences (“second sequence”) which potentially compete with one or more sequences of interest (“first sequence”).
  • the “second sequence” is chosen from the group consisting of the sequences from SEQ ID NO: 1 to SEQ ID NO:2I023.
  • the method of the invention advantageously combines CRISPR technology to linearize the second sequence, with a step of (v) selectively amplifying the circular DNA obtained in step (iv) by rolling circle amplification, thus obtaining an amplification product comprising a plurality of copies of the first DNA sequence; the combination of steps (iv) and (v) results in an extremely selective amplification of the one or more markers of interest (“first sequence”).
  • the method then entails (vi) identifying the first DNA sequence in the amplification product obtained in step (v).
  • the selective amplification of the non-linearized sequences in step (iv) using RCA can be obtained with a single primer complementary to the sequence “P” introduced to the circular template in step (iii) of the method.
  • the method therefore makes it possible to investigate in a multiplexed manner multiple “first” sequences without needing to use specific primers for the amplification step.
  • the identification in step (vi) can be done using various methods such as for example real-time PCR, illumina sequencing, or nanopore sequencing.
  • identification of the first sequence in step (vi) is obtained by sequencing the amplification product obtained in step (v).
  • the identification does not require ultra-deep sequencing.
  • the DNA extracted in step (i) and optionally fragmented in step (ii) can be subjected to one or more steps of enrichment with one or more gene sequences of interest before proceeding with the circularization in step (iii).
  • the method of the invention further comprises, prior to step (iii), the step of (ii)a capturing one or both of the "first sequence" and the "second sequence” by means of a probe complementary thereto.
  • the method of the invention comprises, prior to step (iii), the step of (ii)b amplifying one or both of the "first sequence" and the "second sequence".
  • the amplification in step (ii)b is preferably obtained by PCR.
  • the method of the invention therefore makes it possible to detect and identify rare genetic variants in a biological sample.
  • the method does not require knowledge of the sequence of the variation of interest.
  • the combination of CRISPR systems, designed to cut potentially competing sequences, with RCA amplification makes it possible to amplify the rare variant with extraordinarily specificity, resulting in a simplification of the procedure with respect to the methods currently used and with a reduced need for ultra-deep sequencing to identify the variants.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention relates to a method for detecting a first nucleic acid sequence in a biological sample, which comprises a second nucleic acid sequence competing in the detection of said first sequence.

Description

METHOD FOR DETECTING MINORITY GENETIC VARIANTS
The present invention relates to a method for detecting a first nucleic acid sequence in a biological sample, which comprises a second nucleic acid sequence potentially competing in the detection of such first nucleic acid sequence.
The presence of cell-free DNA (cfDNA) was first identified in samples of human peripheral blood in 1948 (Mandel and Metais, (1948) “Les acides nucleiques du plasma sanguin chez 1’homme” Biologie 3- 4:241-243). Right from its identification, cell-free DNA rapidly became the focus of intense interest for research into bio-markers with clinical applications and in various different research areas.
One of the greatest strengths of the application of cell-free DNA as a bio-marker is associated with the possibility to analyze it using non-invasive sampling, such as for example a blood sample.
Cell-free DNA can originate from various different mechanisms and can be for example released: by cells, as a communication and adjustment mechanism of the immune system (Korabecna et al., (2020) Cell-free DNA in plasma as an essential immune system regulator. Ski Rep 10, 17478 DOI: https://DOI.org/10.1038/s41598-020-74288-2); as a consequence of apoptosis and necrosis of the cells (Ranucci R (2019) Cell-Free DNA: Applications in Different Diseases. Methods Mol Biol. 1909:3-12. DOI: 10.1007/978-1-4939-8973-7 1); from the fetus, during embryonic development in the maternal blood flow (Ranucci (2019); Bronkhorst et al., (2022) New Perspectives on the Importance of Cell-Free DNA Biology. Diagnostics, 12(9):2147. DOI: 10.3390/diagnosticsl2092147); from pathogens during the development of infections (Blauwkamp et al., (2019) Analytical and clinical validation of a microbial cell-free DNA sequencing test for infectious disease. Nat Microbiol 4, 663-674 DOI: https://DOI.org/10.1038/s41564-018-0349-6); from the human microbiome, as well as from assimilated environmental DNA (Bronkhorst et al., 2022). Cell-free DNA often is in highly fragmented form, with fragment dimensions of approximately 160-200bp, and is present not only in the peripheral blood of patients, but also in various different types of biological samples, such as for example saliva, sputum, urine, feces, cerebrospinal liquid and, in general, in samples referred to as “liquid biopsies”.
The variety of possible cell-free DNA derivations makes possible a wide spectrum of application thereof via the selection of specific markers. The scopes of application therefore vary from oncology to prenatal diagnosis, from pathogen identification to monitoring of transplanted organs, from the study and control of cardiovascular and chronic diseases to the control of the presence of contaminants through searching for assimilated environmental DNA (Bronkhorst et al., 2022).
In addition to the variety of possible sources of cell-free DNA, another advantage of its application is associated with the duration of its half-life, generally limited over time. This aspect enables, especially in the clinical setting, non-invasive and continuous monitoring over time of the conditions of the patient under observation. For example, the application of circulating tumor DNA (ctDNA) is well known and widespread in the phases of diagnosis, prognosis and choice/monitoring of therapy in cancer patients.
Generally, the quantity of cell-free DNA is very low in the blood flow in healthy individuals. However, the concentration of cfDNA in blood samples tends to increase during the development of tumors, especially in the advanced stage, during the gestation period in maternal blood, and in the presence of cardiovascular and chronic diseases.
Together with cell-free DNA, in recent years, particular interest has been directed, especially in oncology, to the study and identification of biomarkers in the DNA present in extracellular vesicles (EVs).
It has been observed that EVs are particularly stable in plasma and they can be considered a promising marker, for example in the diagnosis of patients with tumors at the initial stage (Fernando et al., WL (2017) New evidence that a large proportion of human blood plasma cell-free DNA is localized in exosomes. PLoS ONE 12(8): e0183915 DOI: https://DOI.org/10.1371/joumal.pone.0183915).
Furthermore, EVs have been shown to be crucial bio-markers in prognoses. In fact, the study of EV content can be of great help in early identification of the formation of premetastatic niches, since the tumoral cells have shown a greater activity of secreting exosomes with altered content (Chin and Wang, (2016) Cancer Tills the Premetastatic Field: Mechanistic Basis and Clinical Implications. Clin Cancer Res. 2016 Aug l;22(15):3725-33. DOI: 10.1158/1078-0432.CCR-16-0028).
In recent years, various highly-sensitive and specific methods have been developed for detecting cfDNA and its specific bio-markers, including “BEAMing Safe-Sequencing System” (BEAMing Safe-SeqS), “Tagged- Amplicon deep Sequencing” (TamSeq), “Cancer Individual Profiling by deep Sequencing” (CAPP-Seq), digital PCR for detecting single-nucleotide mutations in cfDNA, whole-genome sequencing (WGS) for establishing changes in the number of copies, or the application of CRISPR-Cas systems to cut a target of interest in a highly specific manner.
However, the above mentioned techniques require a prior knowledge of the sequence of interest, together with the need for ultra- deep sequencing in order to be able to identify the first sequence of interest with statistical significance and be able to distinguish it from the second sequence. The need for ultra-deep sequencing limits the repeatability of analysis on samples of the same origin, for example from the same patient, owing to the long analysis times and high costs.
In principle, technologies can be divided into targeted approaches, aimed at detecting mutations in a set of predefined genes [for example, mutations in the EGFR gene are important for the response of patients with non- small-cell lung cancer (NSCLC) to blocking by tyrosine kinase inhibitors (TKI)] or non-targeted approaches (for example, comparative genomic hybridization on arrays, WGS or exome sequencing), which entail screening of the whole genome. Usually, targeted approaches have greater analytical sensitivity than non-targeted approaches, but considerable efforts are being made to improve the detection limitations of the latter.
Such limitations are mainly caused by the very nature of cfDNA (short half-life, low concentrations, and high fragmentation levels) which requires a great deal of time and highly optimized workflows, as well as an ultra-deep sequencing for a significant analysis of the markers of interest.
In light of the foregoing, it is necessary to improve the reliability of cfDNA analysis techniques in terms of sensitivity and specificity, as well as to identify protocols that can become standard and more accessible both in terms of applicability and of times and costs.
The aim of the present invention is to provide a method for detecting a rare genetic variant in a biological sample which overcomes the limitations of the known art.
Within this aim, an object of the invention is to provide a method that makes it possible to detect a specific rare genetic variant in a context of nonmutated sequences, so reducing the need for ultra-deep sequencing with respect to the known art.
Another object of the invention is to provide a method that makes it possible to detect an exogenous DNA sequence in a biological sample.
Another object of the present invention is to provide a method that makes it possible to detect a DNA sequence of fetal or embryonic origin in a biological sample.
Another object of the invention is to provide a method that is highly reliable, and relatively easy to execute.
This aim and these and other objects which will become better apparent hereinafter are achieved by a method for identifying a first nucleic acid sequence in a biological sample, said method comprising the following steps:
(i) isolating nucleic acids from the biological sample, said nucleic acids comprising:
(a) a first nucleic acid sequence, and
(b) a second nucleic acid sequence;
(i-a) optionally reverse-transcribing RNA present in the nucleic acids;
(ii) optionally fragmenting DNA obtained in step (i) or step (i-a), obtaining DNA fragments with a length comprised between 100 and 10,000 base pairs;
(iii) circularizing DNA fragments present in the DNA isolated in step (i) or in the DNA obtained in step (ii), obtaining a mixture of circularized DNAs comprising circular DNA comprising the "first sequence" and circular DNA comprising the "second sequence";
(iv) placing the mixture of circularized DNAs obtained in step (iii) in contact with a CRISPR system characterized by an sgRNA chosen from an sgRNA that is complementary to the "second sequence" of DNA and an sgRNA comprising from 1 to 3 nucleotides that are not complementary to the "second sequence" of DNA, thus obtaining a mixture of DNA comprising circular DNA comprising the "first sequence" and linear DNA comprising the "second sequence";
(v) selectively amplifying the circular DNA obtained in step (iv) by rolling circle amplification (RCA), thus obtaining an amplification product comprising a plurality of copies of the first DNA sequence;
(vi) identifying the first DNA sequence in the amplification product obtained in step (v).
In the present invention the term “first sequence” means a genetic sequence selected from the group constituted by a genetic sequence of tumoral origin, a genetic sequence of fetal origin, a genetic sequence of embryonic origin, a genetic sequence of bacterial origin, a genetic sequence of viral origin, a genetic sequence of plant origin, a genetic sequence of animal origin.
In the present invention, the term “second sequence” means any genetic sequence that potentially competes in the detection of the marker of interest (“first sequence”).
The method of the invention entails isolating the nucleic acids from the biological sample with methods known to the person skilled in the art such as, for example, extraction of nucleic acids with phenol-chloroform, extraction of nucleic acids using magnetic beads, and extraction of nucleic acids using silica membrane columns.
The biological sample is selected from the group consisting of blood, saliva, sputum, urine, feces, cerebrospinal fluid, and liquid biopsies.
Some applications require cell-free biological samples. In such an eventuality, if cell samples are used, the method includes a step of removing the cellular portion. For example, if the method is applied to the detection of cfDNA of tumoral origin, the presence of cells would entail an excessive presence of DNA of the “second sequence”, so compromising the sensitivity and specificity of the method.
In a preferred embodiment the “first sequence” is a DNA sequence.
Isolated nucleic acids comprise the sequence of interest (“first sequence”) often in the presence of a great excess of one or more sequences (“second sequence”) potentially competing in the detection of the “first sequence”. For example, for a liquid biopsy in oncology, the “first sequence” is mutated DNA of tumoral origin present in the sample together with an excess of non-mutated DNA (“second sequence”). In the event a contamination is detected, the “first sequence” is exogenous DNA present in the sample together with an excess of endogenous DNA (“second sequence”). In the field of prenatal diagnosis, the “first sequence” is fetal or embryonic DNA in a sample constituted predominantly by maternal DNA.
The circulating DNA extracted in step (i) of the method is generally sufficiently fragmented that it can be circularized as described in step (iii) of the method of the invention.
In an embodiment, particularly when the DNA is extracted from EVs, the method of the invention further comprises the optional step of (ii) fragmenting DNA obtained in step (i) or step (i-a), obtaining DNA fragments with a length comprised between 100 and 10,000 base pairs. Such fragmentation can be obtained with chemical or physical methods known to the person skilled in the art, such as for example sonication and tagmentation.
The method of the invention then comprises a step of (iii) circularizing DNA fragments present in the DNA isolated in step (i) or in the DNA obtained in step (ii), obtaining a mixture of circularized DNAs comprising circular DNA comprising the "first sequence" and circular DNA comprising the "second sequence”. Circularization of the fragments is done with methods known to the person skilled in the art (“Application of Circular Ligase to Provide Template for Rolling Circle Amplification of Low Amounts of Fragmented DNA”. Ada N. Nunez, MSFS, Mark F. Kavlick, BS, James M. Robertson, PHD, CFSRU, and Bruce Budowle, PHD, FBI Laboratory, FBI Academy, Quantico, VA 22135).
In an embodiment the circularization of step (iii) is obtained by means of the following steps: a. repairing the ends of the DNA fragments; b. adding a nucleotide to the 3' end of each fragment; c. adding an adapter comprising: cl. a nucleotide complementary to the nucleotide added in step b. to one end of the adapter; c2. a restriction enzyme recognition sequence "a"; c3. a sequence "0" comprised between 15 and 35 base pairs, characterized in that it does not form secondary structures or homodimers. c4. an arbitrary sequence "y" comprising 150 to 1000 base pairs; d. adding a ligase, thus obtaining the formation of a mixture of circularized DNAs comprising:
- circular DNA comprising the adapter and the "first sequence"; and
- circular DNA comprising the adapter and the "second sequence".
The function of the “y” sequence of the adapter is to support facilitation of circularization via a reduction in conformational entropy. This is a synthetic sequence designed so that under reaction conditions it will not form hybrids with the “first sequence”, the “second sequence” or the sgRNA of the CRISPR systems used.
In step (iv) the mixture of circularized DNAs obtained in step (iii) is brought into contact with a CRISPR system characterized by an sgRNA that is substantially complementary to the “second sequence” of DNA. This system makes it possible to specifically cut the “second sequence” which is thus linearized, while the circular DNA that comprises the “first sequence” remains intact. For the purposes of the specificity of the method it is therefore critically important to minimize hybridization of the sgRNA to the “first sequence”. This can be achieved using various different approaches known to the person skilled in the art. For example, the specificity can be optimized by conducting the reaction under conditions of low salinity and/or by using CRISPR systems characterized by sgRNA designed to maximize discrimination during hybridization. In a preferred embodiment, particularly when the first and the second sequence differ by a single nucleotide, the specificity can be optimized by using CRISPR systems characterized by “destabilized” sgRNA, i.e. sgRNA that comprise from 1 to 3 bases that are not complementary to the second sequence, preferably chosen from CG residues.
In a preferred embodiment, the CRISPR system is a CRISPR-Cas9 system. The high specificity of the CRISPR system makes it possible to simultaneously degrade multiple sequences using several CRISPR systems characterized by respectively complementary sgRNA, simultaneously in multiplex mode. In this manner the method of the invention makes it possible to simultaneously linearize several sequences (“second sequence”) which potentially compete with one or more sequences of interest (“first sequence”).
In an embodiment, the “second sequence” is chosen from the group consisting of the sequences from SEQ ID NO: 1 to SEQ ID NO:2I023.
The method of the invention advantageously combines CRISPR technology to linearize the second sequence, with a step of (v) selectively amplifying the circular DNA obtained in step (iv) by rolling circle amplification, thus obtaining an amplification product comprising a plurality of copies of the first DNA sequence; the combination of steps (iv) and (v) results in an extremely selective amplification of the one or more markers of interest (“first sequence”). The method then entails (vi) identifying the first DNA sequence in the amplification product obtained in step (v).
Advantageously, the selective amplification of the non-linearized sequences in step (iv) using RCA can be obtained with a single primer complementary to the sequence “P” introduced to the circular template in step (iii) of the method. The method therefore makes it possible to investigate in a multiplexed manner multiple “first” sequences without needing to use specific primers for the amplification step.
The identification in step (vi) can be done using various methods such as for example real-time PCR, illumina sequencing, or nanopore sequencing.
In a preferred embodiment, identification of the first sequence in step (vi) is obtained by sequencing the amplification product obtained in step (v). Advantageously, with respect to the known art, by virtue of the selective amplification of the marker of interest, the identification does not require ultra-deep sequencing.
The DNA extracted in step (i) and optionally fragmented in step (ii) can be subjected to one or more steps of enrichment with one or more gene sequences of interest before proceeding with the circularization in step (iii).
In a preferred embodiment the method of the invention further comprises, prior to step (iii), the step of (ii)a capturing one or both of the "first sequence" and the "second sequence" by means of a probe complementary thereto.
Alternatively, or in addition, in another embodiment the method of the invention comprises, prior to step (iii), the step of (ii)b amplifying one or both of the "first sequence" and the "second sequence". The amplification in step (ii)b is preferably obtained by PCR.
The method of the invention therefore makes it possible to detect and identify rare genetic variants in a biological sample. Advantageously, the method does not require knowledge of the sequence of the variation of interest. The combination of CRISPR systems, designed to cut potentially competing sequences, with RCA amplification makes it possible to amplify the rare variant with exquisite specificity, resulting in a simplification of the procedure with respect to the methods currently used and with a reduced need for ultra-deep sequencing to identify the variants.
The disclosures in Italian Patent Application No. 102022000027141 from which this application claims priority are incorporated herein by reference.

Claims

1. A method for identifying a first nucleic acid sequence in a biological sample, said method comprising the following steps:
(i) isolating nucleic acids from the biological sample, said nucleic acids comprising:
(a) a first nucleic acid sequence, and
(b) a second nucleic acid sequence;
(i-a) optionally reverse-transcribing RNA present in the nucleic acids;
(ii) optionally fragmenting DNA obtained in step (i) or step (i-a), obtaining DNA fragments with a length comprised between 100 and 10,000 base pairs;
(iii) circularizing DNA fragments present in the DNA isolated in step (i) or in the DNA obtained in step (ii), obtaining a mixture of circularized DNAs comprising circular DNA comprising the "first sequence" and circular DNA comprising the "second sequence";
(iv) placing the mixture of circularized DNAs obtained in step (iii) in contact with a CRISPR system characterized by an sgRNA chosen from an sgRNA that is complementary to the "second sequence" of DNA and an sgRNA comprising from 1 to 3 nucleotides that are not complementary to the "second sequence" of DNA, thus obtaining a mixture of DNA comprising circular DNA comprising the "first sequence" and linear DNA comprising the "second sequence";
(v) selectively amplifying the circular DNA obtained in step (iv) by rolling circle amplification, thus obtaining an amplification product comprising a plurality of copies of the first DNA sequence;
(vi) identifying the first DNA sequence in the amplification product obtained in step (v).
2. The method according to claim 1, wherein the biological sample is selected from a cell-free biological sample and a biological sample from which the cellular portion has been removed.
3. The method according to claim 1 or 2, wherein the identification of the first sequence in step (vi) is obtained by sequencing the amplification product obtained in step (v).
4. The method according to any one of the preceding claims, wherein said "first sequence" is a DNA sequence.
5. The method according to any one of the preceding claims, further comprising, prior to step (iii), the step of (ii)a capturing one or both of the "first sequence" and the "second sequence" by means of a probe complementary thereto.
6. The method according to any one of the preceding claims, further comprising, prior to step (iii), the step of (ii)b amplifying one or both of the "first sequence" and the "second sequence".
7. The method according to claim 6, wherein said amplification is obtained by PCR.
8. The method according to any one of the preceding claims, wherein step (iii) comprises the following steps: a. repairing the ends of the DNA fragments; b. adding a nucleotide to the 3' end of each fragment; c. adding an adapter comprising: cl. a nucleotide complementary to the nucleotide added in step b. to one end of said adapter; c2. a restriction enzyme recognition sequence "a"; c3. a sequence "0" comprised between 15 and 35 base pairs, characterized in that it does not form secondary structures or homodimers. c4. an arbitrary sequence "y" comprising 150 to 1000 base pairs; d. adding a ligase, thus obtaining the formation of a mixture of circularized DNAs comprising:
- circular DNA comprising the adapter and the "first sequence"; and
- circular DNA comprising the adapter and the "second sequence".
9. The method according to any one of the preceding claims, wherein said CRISPR system is a CRISPR-Cas9 system.
10. The method according to any one of preceding claims, wherein said "second sequence" is chosen from the group consisting of SEQ ID NO: 1 to SEQ ID NO:21023.
11. The method according to any one of the preceding claims, wherein said biological sample is selected from the group constituted by blood, saliva, sputum, urine, feces, cerebrospinal fluid, and liquid biopsies.
12. The method according to any one of the preceding claims, wherein said "first sequence" is selected from the group constituted by a genetic sequence of tumoral origin, a genetic sequence of fetal origin, a genetic sequence of embryonic origin, a genetic sequence of bacterial origin, a genetic sequence of viral origin, a genetic sequence of plant origin, a genetic sequence of animal origin.
EP23841633.3A 2022-12-29 2023-12-29 Method for detecting minority genetic variants Pending EP4642929A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IT102022000027141A IT202200027141A1 (en) 2022-12-29 2022-12-29 Method for detecting minority genetic variants.
PCT/IB2023/063372 WO2024142006A2 (en) 2022-12-29 2023-12-29 Method for detecting minority genetic variants

Publications (1)

Publication Number Publication Date
EP4642929A2 true EP4642929A2 (en) 2025-11-05

Family

ID=86099767

Family Applications (1)

Application Number Title Priority Date Filing Date
EP23841633.3A Pending EP4642929A2 (en) 2022-12-29 2023-12-29 Method for detecting minority genetic variants

Country Status (4)

Country Link
EP (1) EP4642929A2 (en)
CN (1) CN120530205A (en)
IT (1) IT202200027141A1 (en)
WO (1) WO2024142006A2 (en)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8383345B2 (en) * 2008-09-12 2013-02-26 University Of Washington Sequence tag directed subassembly of short sequencing reads into long sequencing reads
WO2018183897A1 (en) * 2017-03-31 2018-10-04 Grail, Inc. Higher target capture efficiency using probe extension
EP3650558A1 (en) * 2018-11-07 2020-05-13 Siemens Healthcare GmbH Liquid sample workflow for nanopore sequencing
ES3013495T3 (en) * 2019-01-31 2025-04-14 Guardant Health Inc Method for isolating and sequencing cell-free dna
CN111876472B (en) * 2020-06-17 2023-12-01 江门市灿明生物科技有限公司 Method for detecting trace nucleic acid in multiple mixed nucleic acids
WO2022026891A1 (en) * 2020-07-31 2022-02-03 Element Biosciences, Inc. Multiplexed covid-19 padlock assay

Also Published As

Publication number Publication date
WO2024142006A3 (en) 2024-08-08
IT202200027141A1 (en) 2024-06-29
CN120530205A (en) 2025-08-22
WO2024142006A2 (en) 2024-07-04

Similar Documents

Publication Publication Date Title
CN104745679B (en) A kind of method and kit of Non-invasive detection EGFR genetic mutation
US9422592B2 (en) System and method of detecting RNAS altered by cancer in peripheral blood
US20190203285A1 (en) Method for predicting organ transplant rejection using next-generation sequencing
JP7232643B2 (en) Deep sequencing profiling of tumors
CN110964814B (en) Primers, compositions and methods for nucleic acid sequence variation detection
CN102586420B (en) Method and kit for assaying breast cancer susceptibility genes
JP7761619B2 (en) Method for accurate parallel detection and quantification of nucleic acids - Patent Application 20070122997
TW202305143A (en) Highly sensitive methods for accurate parallel quantification of nucleic acids
EP3105324B1 (en) Ngs systems control and methods involving the same
CN116377036A (en) CRISPR/Cas12 a-based lung cancer patient peripheral blood EGFR gene mutation detection system and method
US20180291369A1 (en) Error-proof nucleic acid library construction method and kit
CN105296471B (en) DNA tags, PCR primers and their applications
EP4642929A2 (en) Method for detecting minority genetic variants
JP2025542506A (en) Methods for detecting minority gene variants
US20220127601A1 (en) Method of determining the origin of nucleic acids in a mixed sample
CN102533991B (en) Method and kit for detecting habitual drinking gene
KR20190038464A (en) The composition for detecting mutations of RAS/BRAF and the kit consisting of the compounds
US20220145368A1 (en) Methods for noninvasive prenatal testing of fetal abnormalities
HK40080902A (en) Gene methylation markers or combinations thereof for concomitant diagnosis of her2 in gastric carcinoma, and use thereof
KR20240032631A (en) Highly sensitive methods for accurate parallel quantification of variant nucleic acids
CN118291630A (en) PCR non-diagnostic detection method for multiple mutation sites in circulating tumor cells and application thereof
HK40072445A (en) Highly sensitive methods for accurate parallel quantification of nucleic acids
CN120464786A (en) A rapid detection system for specific SNP sites based on (RT)RPA-CRISPR/Cas12a, its construction method and application
CN115044671A (en) Gene methylation marker for gastric cancer HER2 concomitant diagnosis or combination and application thereof
EA047854B1 (en) METHODS OF NON-INVASIVE PRENATAL TESTING OF FETAL ANOMALIES

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20250711

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR