[go: up one dir, main page]

WO2013056640A1 - 核酸文库的制备方法及其应用以及试剂盒 - Google Patents

核酸文库的制备方法及其应用以及试剂盒 Download PDF

Info

Publication number
WO2013056640A1
WO2013056640A1 PCT/CN2012/083036 CN2012083036W WO2013056640A1 WO 2013056640 A1 WO2013056640 A1 WO 2013056640A1 CN 2012083036 W CN2012083036 W CN 2012083036W WO 2013056640 A1 WO2013056640 A1 WO 2013056640A1
Authority
WO
WIPO (PCT)
Prior art keywords
dop
nucleic acid
primer
pcr amplification
region
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2012/083036
Other languages
English (en)
French (fr)
Inventor
殷旭阳
张春雷
蒋慧
张秀清
陈盛培
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Shenzhen Co Ltd
Original Assignee
BGI Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Shenzhen Co Ltd filed Critical BGI Shenzhen Co Ltd
Priority to EP12841309.3A priority Critical patent/EP2770090B1/en
Priority to US14/352,492 priority patent/US9359642B2/en
Publication of WO2013056640A1 publication Critical patent/WO2013056640A1/zh
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6853Nucleic acid amplification reactions using modified primers or templates
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/686Polymerase chain reaction [PCR]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the invention belongs to the field of genetic engineering technology, and in particular relates to a method for preparing a nucleic acid library, an application thereof and a kit.
  • BACKGROUND OF THE INVENTION Research tools that are indispensable in the field of research.
  • the next-generation sequencing technology enables simultaneous sequencing of each base in a short period of time by parallel sequencing of millions of short DNA fragments, and the cost is greatly reduced.
  • NGS technology is used in many applications, such as genomics, transcriptomics, epigenomics, clinical diagnosis, and more.
  • NGS platforms for next-generation sequencing technologies on the market, including Illumina's Genome Analyzer, Hiseq, Miseq series sequencing platforms, Roche's 454 sequencing platform, Life Technologies' SOLID sequencing platform, Ion Torrent sequencing platform, and more.
  • DNA/RNA samples need to be processed prior to sequencing to prepare a library of DNA fragments.
  • the preparation of the library requires a microgram of starting DNA/RNA amount.
  • optimization can reduce the initial amount of library construction, for single-cell or very small nucleic acid samples, library preparation cannot be directly performed, so it is serious. Blocking the application of single cell and quantitative nucleic acid sequencing.
  • the present invention aims to solve at least one of the technical problems existing in the prior art.
  • Another object of the present invention is to provide a kit suitable for use in the above method.
  • the invention provides a method of preparing a nucleic acid library.
  • the method comprises the steps of: performing a DOP-PCR amplification on a nucleic acid sample to obtain a first PCR amplification product; and performing a second PCR amplification on the first PCR amplification product using a DOP-Amp primer And obtaining a second PCR amplification product; and performing linker ligation PCR on the second PCR amplification product to obtain a third PCR amplification product, the third PC amplification product constituting the nucleic acid library.
  • nucleic acid samples can be efficiently prepared by using the method, and the obtained nucleic acid library can be effectively applied to a high-throughput sequencing platform, thereby being able to effectively determine Nucleic acid sequence information of the nucleic acid sample.
  • the method for preparing a nucleic acid library of the present invention has a single process, is extremely easy to operate, and is easy to standardize in operation flow, and is easy to push. Wide, low cost, high sensitivity, high accuracy and repeatability.
  • the invention also provides a method of determining a nucleic acid sequence of a nucleic acid sample.
  • the method comprises the steps of: (i) constructing a nucleic acid library of a nucleic acid sample using a method of preparing a nucleic acid library according to an embodiment of the invention; and (ii) sequencing and analyzing the nucleic acid library, To determine the nucleic acid sequence of the nucleic acid sample.
  • the inventors have found that the method can efficiently determine nucleic acid sequence information of nucleic acid samples, especially single cell and micronucleic acid samples, and has a single operation, high sensitivity, high precision, good repeatability and low cost.
  • the present invention also provides a kit.
  • the kit comprises: (1) a DOP primer; (2) a DOP-Amp primer; and (3) a linker primer.
  • the kit is suitable for the above method of preparing a nucleic acid library and a method of determining a nucleic acid sequence of a nucleic acid sample.
  • the inventors have found that, by using the kit of the present invention, in combination with the above method for preparing a nucleic acid library or a method for determining a nucleic acid sequence of a nucleic acid sample, a nucleic acid sample, particularly a nucleic acid library of a single cell and a micronucleic acid sample, can be efficiently prepared and obtained.
  • the nucleic acid library can be effectively applied to a high-throughput sequencing platform, thereby effectively determining the nucleic acid sequence information of the nucleic acid sample, and the obtained information has high accuracy and repeatability.
  • the method of preparing a nucleic acid library of the present invention the method of determining the nucleic acid sequence of the nucleic acid sample, and the characteristics of the kit can also be described as follows:
  • the invention provides a method of preparing a nucleic acid library. According to an embodiment of the invention, the method comprises the steps of:
  • the sample to be tested is subjected to DOP-PC (Degenerate Oligonucleotide Primed PC) amplification to obtain a first PCR amplification product;
  • DOP-PC Degenerate Oligonucleotide Primed PC
  • the third PCR amplification product has a linker at the 5' end and a linker at the 3' end.
  • the sample described in step (a) is selected from the group consisting of:
  • nucleic acid sample containing 1-200 single cells or
  • the sample is selected from the group consisting of:
  • Quantitative genomic DNA Quantitative genomic DNA, co-immunoprecipitation product DNA, free DNA, cDNA, or a combination thereof.
  • the DNA is from the environment, more preferably from the soil and/or water body.
  • the DNA is derived from body fluids or excretions, more preferably from plasma and/or urine.
  • the DNA is treated chemically or physically, more preferably by a solution of sulphate.
  • step (b) uses a DOP with a cartridge and an oligonucleotide region? The analytes were randomly amplified from the sample DNA.
  • the DOP primer has an untubular oligonucleotide region at the 5' end and an anchor region at the middle of the cartridge and the 3' end; or at 5' The non-tubular oligonucleotide region at the end and the cartridge and oligonucleotide region at the middle and 3' ends.
  • the 3'-terminal anchoring region of the DOP primer has a sequence length of 2 to 12 nucleotides, preferably 4 to 8 nucleotides.
  • the 3'-end anchoring region of the DOP primer is selected from the group consisting of TG, ATGTGG, TGTGG, or GTCT.
  • the 5'-end non-tubular oligonucleotide sequence of the DOP primer is as shown in SEQ ID NO: 2, or has a homology of > 50% to the sequence shown in SEQ ID NO: 2.
  • the 5'-end non-tubular oligonucleotide of the DOP primer has a length of 5-30 bp, preferably 5-20 bp, more preferably 6-13 bp.
  • the cartridge-derived oligonucleotide sequence is represented by (N)m, wherein N at each base position includes A, T, G, and C, and m is 3-20 Integer.
  • the DOP-Amp primer of step (c) is complementary or substantially complementary to the 5'-end non-tubular oligonucleotide described in step (b).
  • the DOP-Amp primer sequence binds to the 5'-end non-tubular oligonucleotide region of the DOP primer.
  • the DOP-Amp sequence is as shown in SEQ ID NO: 2.
  • step (d) is carried out by a linker primer for PCR amplification.
  • the linker primers are P5 and P7, and the 3' ends of P5 and P7 each have an untubular oligonucleotide region that binds to the DOP-Amp primer sequence, the non-tubular
  • the sequence of the oligonucleotide region is identical or fully complementary to the sequence set forth in SEQ ID NO: 2 or has > 80% homology to the sequence set forth in SEQ ID NO: 2.
  • the linker primer P7 also has a barcode or index sequence.
  • the selection according to the fragment size described in the step (e) is: selecting a fragment of a length of 100-100 bp in the third PCR amplification product.
  • the selecting according to the fragment size is: selecting a fragment of 200-500 bp in length from the third PCR amplification product.
  • the invention also provides a method of detecting a nucleotide sequence in a micronucleic acid sample.
  • the method comprises the steps of:
  • the sequencing is a second generation high throughput sequencing method, the second generation high throughput sequencing method being selectable but not limited to being performed on a Roche 454 FLX, Illumina Solexa or ABI SOLID sequencing platform.
  • the sequencing comprises the steps of:
  • the nucleic acid library to be sequenced is hybridized with a sequencing probe immobilized on a flow cell, and subjected to solid phase bridge PCR amplification to form a sequencing cluster; the "synthesis-edge sequencing" method is used for the sequencing cluster Sequencing is performed to obtain nucleotide sequence information in the sample.
  • the invention also provides a kit which can be used in any of the first and second aspects of the invention.
  • the kit comprises:
  • the kit further comprises: a reagent required for performing PCR amplification, a reagent for nucleic acid purification, a flow cell for performing high-throughput sequencing, or a combination thereof .
  • FIG. 1 is a flow chart showing a method of determining a nucleic acid sequence of a nucleic acid sample of the present invention according to an embodiment of the present invention
  • FIG. 2 shows a schematic flow chart of a method for preparing a nucleic acid library of the present invention according to an embodiment of the present invention
  • FIG. 3 shows a method for preparing a single cell genomic DNA library by the method for preparing a nucleic acid library of the present invention according to an embodiment of the present invention.
  • FIG. 4 is a diagram showing the ligation of a PCR product by a linker of four samples (YH1, YH2, YH3, T21) after fragment selection and purification, according to one embodiment of the present invention, when a nucleic acid library is prepared by the method for preparing a nucleic acid library of the present invention.
  • the fragment detection result wherein, FIG. 4A is the detection result of the sample YH1, and the required main band is located at 377 bp; FIG. 4B is the detection result of the sample YH2, and the required main band is located at 326 bp; FIG. 4C is the detection result of the sample YH3, and the required main The band is located at 339 bp;
  • Figure 4D shows the result of the sample T21, and the required main band is located at 360 bp;
  • FIG. 5 shows an electrophoretic detection of a PCR product of each linker when a nucleic acid library of a micronucleic acid sample (IP product DNA, Plasma DNA, cDNA, gDNA) is prepared by the method for preparing a nucleic acid library of the present invention according to an embodiment of the present invention. Result;
  • Figure 6 is a view showing the result of fragment detection of a nucleic acid library after fragment selection and purification when a nucleic acid library of a micro DNA/cDNA sample is prepared by the method for producing a nucleic acid library of the present invention according to an embodiment of the present invention, wherein Figure 6A For the micro-genomic DNA-200pg test results, Figure 6B shows the micro-genomic DNA-40pg test results, 6C is immunoprecipitated (IP) DNA-200pg detection result, FIG 6D is immunoprecipitation (IP) DNA-40pg detection result, 6E is a cDNA- original density detection result, FIG. 6F is a cDNA-5- 1 detection result, FIG. 6G For plasma free DNA - 200pg test results. Detailed description of the invention
  • first means two or more.
  • One of the objects of the present invention is to provide a method and use for library preparation of single cell or micronucleic acid samples.
  • Another object of the present invention is to provide a kit suitable for use in the above method.
  • the invention provides a method of preparing a nucleic acid library. According to an embodiment of the present invention, referring to Figure 2, the method includes the following steps:
  • the nucleic acid sample is subjected to DOP-PCR amplification to obtain a first PCR amplification product.
  • the method for producing a nucleic acid library of the present invention is particularly suitable for single cell and micronucleic acid samples, wherein the nucleic acid content in the nucleic acid sample is not particularly limited.
  • the nucleic acid sample comprises 2 picograms to 1 microgram of nucleic acid.
  • the source of the nucleic acid sample is also not particularly limited.
  • the nucleic acid sample may be from 1-200 single cells.
  • the nucleic acid sample may be at least one selected from the group consisting of micro genomic DNA, immunoprecipitation product DNA, free DNA, and cDNA.
  • the nucleic acid sample may be derived from at least one of an environment, a bodily fluid, and a fecal matter. #>
  • the nucleic acid sample may be derived from at least one of soil and water.
  • the nucleic acid sample may be derived from at least one of plasma and urine.
  • a nucleic acid sample can be processed before the nucleic acid sample is subjected to DOP-PC amplification, for example, when the nucleic acid sample is a single cell, it can be subjected to cell lysis so that it can release genomic DNA, thereby enabling Increasing the efficiency of nucleic acid samples for DOP-PC amplification facilitates subsequent steps.
  • the nucleic acid sample when the nucleic acid sample is a single cell, before the DOP-PCR amplification of the nucleic acid sample, the single cell may be further treated with the alkaline cell lysate.
  • the nucleic acid sample prior to performing DOP-PCR amplification on the nucleic acid sample, may be further subjected to chemical treatment or physical treatment.
  • the aforementioned chemical treatment is hydrogen sulfite Salt treatment.
  • the nucleic acid sample can be adapted for methylation library construction.
  • the conditions for performing DOP-PCR amplification are not particularly limited.
  • DOP-PCR amplification is performed using a DOP primer having a cartridge and an oligonucleotide region, the DOP primer having a non-tubular oligonucleotide region at the 5' end and a cartridge at the middle And an oligonucleotide region and an anchor region at the 3' end.
  • the DOP primer has a non-tubular oligonucleotide region at the 5' end and a cartridge and oligonucleotide region at the middle and 3' ends.
  • the anchoring region at the 3' end of the DOP primer, the non-tubular oligonucleotide region at the 5' end, and the specific structure of the cartridge-and-oligonucleotide region are not particularly limited.
  • the 3'-end anchoring region of the DOP primer has 2-12 nucleotides.
  • the 3'-end anchor region of the DOP primer has 4-8 nucleotides.
  • the 3'-terminal anchoring region of the DOP primer has a nucleotide sequence selected from at least one of TG, ATGTGG, TGTGG, and GTCT.
  • the 5'-end non-tubular oligonucleotide region of the DOP primer has the nucleotide sequence set forth in SEQ ID NO: 2. According to further embodiments of the invention, the nucleotide sequence of the 5'-end non-tubular oligonucleotide region of the DOP primer has >50% homology to the nucleotide sequence set forth in SEQ ID NO: 2. According to an embodiment of the present invention, the nucleotide sequence of the 5'-end non-tubular oligonucleotide region of the DOP primer is 5-30 bp, preferably 5-20 bp, more preferably 6-13 bp.
  • the cartridge-and-oligonucleotide region has 3-20 nucleotides.
  • the first PCR amplification product is subjected to a second PCR amplification using a DOP-Amp primer to obtain a second PCR amplification product.
  • the specific structure of the DOP-Amp primer is not particularly limited.
  • the DOP-Amp primer is complementary or substantially complementary to the nucleotide sequence of the 5'-end non-tubular oligonucleotide region of the aforementioned DOP primer.
  • the DOP-Amp primer is capable of binding to the 5' non-tubular oligonucleotide region of the aforementioned DOP primer.
  • the DOP-Amp primer has the nucleotide sequence set forth in SEQ ID NO: 2. Thereby, the efficiency of the second PCR amplification can be effectively improved.
  • the second PCR amplification product is ligated to PC to obtain a third PCR amplification product, and the third PC amplification product constitutes the nucleic acid library.
  • the condition for connecting the connector to the PC is not particularly limited.
  • a linker-ligated PL is used to link the PC, wherein the 3' end of the linker-ligand has a nucleotide sequence that is compatible with the aforementioned DOP-Amp primer.
  • the linker primer comprises a tag sequence.
  • the nucleic acid libraries of the plurality of nucleic acid samples are separately constructed by the method for preparing a nucleic acid library of the present invention, and the tag sequences of the respective libraries are different from each other, the nucleic acid libraries of the plurality of nucleic acid samples can be simultaneously subjected to high-throughput sequencing, thereby It can effectively reduce the cost of sequencing and improve the sequencing efficiency while ensuring accurate and repeatable results.
  • the obtained third PCR amplification product has a linker at both the 5' end and the 3' end.
  • the method of producing a nucleic acid library of the present invention may further comprise: performing fragment selection on the third PCR amplification product.
  • the third amplification product can be selected depending on the size of the fragment to remove interference.
  • the specific conditions for fragment selection are not particularly limited, and a third amplification product of a suitable length range may be selected according to a specific experimental situation, for example, when the prepared nucleic acid library is used for the Illumina Hiseq2000 sequencing system, A third amplification product having a length of 200-800 bp was selected.
  • the segment selection can be Further comprising: selecting a third PCR amplification product having a length of 100-1000 bp.
  • the fragment selection may further comprise: selecting a third PCR amplification product having a length of 200-500 bp.
  • nucleic acid samples can be efficiently prepared by using the method, and the obtained nucleic acid library can be effectively applied to a high-throughput sequencing platform, thereby being able to effectively determine Nucleic acid sequence information of the nucleic acid sample.
  • the method for preparing a nucleic acid library of the present invention has a process cartridge, is extremely easy to operate, is easy to standardize in operation, is easy to generalize, and has low cost, high sensitivity, high precision, and good repeatability.
  • the second PCR amplification product is subjected to ligated PCR can be selected depending on the use of the nucleic acid library.
  • the second PCR amplification product may be directly subjected to electrophoresis or concentration detection, followed by Sanger sequencing, or may be prepared as a plasmid. The Sanger method was followed by sequencing.
  • the pre-prepared nucleic acid library is used for high-throughput sequencing such as Illlumina sequencing
  • the second PCR amplification product can be ligated to the PC so that the obtained nucleic acid library carries the sequencing linker for subsequent sequencing.
  • the invention also provides a method of determining a nucleic acid sequence of a nucleic acid sample.
  • the method may include the following steps:
  • a nucleic acid library of a nucleic acid sample is constructed using a method of preparing a nucleic acid library according to an embodiment of the present invention.
  • the kind of the nucleic acid sample is not particularly limited, and may be an untreated single cell or a nucleic acid such as DNA or cDNA.
  • the nucleic acid sample may be at least one selected from the group consisting of a single cell and a micro nucleic acid sample.
  • sequencing and data analysis of the nucleic acid library to determine the nucleic acid sequence of the nucleic acid sample are not particularly limited.
  • sequencing can be performed using a high throughput sequencing platform.
  • sequencing can be performed using at least one selected from the group consisting of Roche454 FLX. Illumina Solexa and ABI SOLID sequencing platforms. Thereby, the sequencing efficiency can be improved, and the nucleic acid sequence of the nucleic acid sample can be efficiently determined.
  • the inventors have found that the method can efficiently determine the nucleic acid sequence information of nucleic acid samples, especially single cells and micronucleic acid samples, and operate the cartridges with high sensitivity, high precision, good repeatability and low cost.
  • the present invention also provides a kit.
  • the kit comprises: (1) a DOP primer; (2) a DOP-Amp primer; and (3) a linker primer.
  • the DOP primer, the DOP-Amp primer and the linker-ligating primer are respectively disposed in different containers.
  • the sequence and structure of the DOP primer are not particularly limited.
  • the DOP primer has a cartridge and an oligonucleotide region, wherein the DOP primer has a non-tubular oligonucleotide region at the 5' end, a cartridge and oligonucleotide region located at the middle, and is located The anchoring zone at the 3' end.
  • the DOP primer has a non-tubular oligonucleotide region at the 5' end, and a cartridge and oligonucleoplex at the middle and 3' ends Glycosyl acid region.
  • the anchoring region at the 3' end of the DOP primer, the non-tubular oligonucleotide region at the 5' end, and the specific structure of the cartridge-and-oligonucleotide region are not particularly limited.
  • the 3'-end anchor region of the DOP primer has 2-12 nucleotides.
  • the 3'-end anchor region of the DOP primer has 4-8 nucleotides.
  • the 3'-end anchor region of the DOP primer has a nucleotide sequence selected from at least one of TG, ATGTGG, TGTGG and GTCT.
  • the 5'-end non-tubular oligonucleotide region of the DOP primer has the nucleotide sequence set forth in SEQ ID NO: 2.
  • the nucleotide sequence of the 5'-end non-tubular oligonucleotide region of the DOP primer has >50% homology to the nucleotide sequence set forth in SEQ ID NO: 2.
  • the 5'-end non-tubular oligonucleotide region of the DOP primer has a nucleotide sequence length of 5-30 bp, preferably 5-20 bp, more preferably 6-13 bp.
  • the cartridge and oligonucleotide regions have from 3 to 20 nucleotides.
  • the specific structure of the DOP-Amp primer is not particularly limited.
  • the DOP-Amp primer is complementary or substantially complementary to the nucleotide sequence of the 5' non-tubular oligonucleotide region of the aforementioned DOP primer. Therefore, when the nucleic acid library of the single cell or the micro nucleic acid sample is prepared by using the kit of the present invention, the second PCR amplification product can be efficiently subjected to the second PCR amplification, thereby enabling the nucleic acid library to be efficiently prepared according to the present invention.
  • the DOP-Amp primer is capable of binding to the 5' non-tubular oligonucleotide region of the DOP primer.
  • the DOP-Amp primer has the nucleotide sequence as shown in SEQ ID NO: 2.
  • the specific structure of the linker-ligating primer is not particularly limited.
  • the 3' end of the linker-ligand has a nucleotide sequence that is compatible with the DOP-Amp primer.
  • the linker primer comprises a tag sequence.
  • nucleic acid library of a plurality of nucleic acid samples is separately constructed by using the kit of the present invention, and the tag sequences of the respective libraries are different from each other, the nucleic acid library of the plurality of nucleic acid samples can be simultaneously subjected to high-throughput sequencing, thereby ensuring Under the premise of accurate and repeatable results, the cost of sequencing can be effectively reduced and the sequencing efficiency can be improved.
  • the kit is suitable for the above method of preparing a nucleic acid library and a method of determining a nucleic acid sequence of a nucleic acid sample.
  • the inventors have found that, by using the kit of the present invention, in combination with the above method for preparing a nucleic acid library or a method for determining a nucleic acid sequence of a nucleic acid sample, a nucleic acid sample, particularly a nucleic acid library of a single cell and a nucleic acid sample, can be efficiently prepared and obtained.
  • the nucleic acid library can be effectively applied to a high-throughput sequencing platform, thereby effectively determining the nucleic acid sequence information of the nucleic acid sample, and the obtained information has high accuracy and repeatability.
  • nucleic acid library for a single cell or a micronucleic acid sample was first established by the inventors of the present invention through extensive and intensive research.
  • the nucleic acid library can be prepared and sequenced for the body fluid of the patient, and the gene information related to the patient's disease in the sample can be effectively obtained, thereby providing important information for the prevention and treatment of human diseases.
  • specific meanings of the terms and expressions used in this article are as follows:
  • primer is a generic term for an oligonucleotide which is capable of complementary pairing with a template and capable of synthesizing a DNA strand complementary to a template under the action of a DNA polymerase.
  • Primers may be natural RNA, DNA, or any form of natural nucleotide, or even non-natural nucleotides such as LNA or ZNA.
  • DOP-PC a cartridge-and-oligonucleotide primer PCR
  • a method for amplification of micro-DNA or single cells including but not limited to: low stringency preamplification and high stringency amplification. step.
  • Low stringency amplification was performed based on DOP primers with the aim of adding a stretch of sequence to the DNA fragment as a PCR primer binding region; and using DOP-Amp primers for high stringency amplification of low stringency amplification products.
  • the DOP primer i.e., the cartridge and oligonucleotide primer, comprises at least two portions, from 5' to 3', respectively: a 5'-end non-tubular oligonucleotide region and a downstream calf-oligonucleotide region.
  • the DOP primer sequence is represented by SEQ ID NO: 1: GCTCTTCCGATCT NN, wherein GCTCTTCCGATCT is a 5'-end non-tubular oligonucleotide region, and NNN is a cartridge-and-oligonucleotide region,
  • the sequence of the oligonucleotide region can be written as: (N) m, m is a positive integer selected from 3-20, and N is independently selected from A, T, G and C.
  • the DOP primer sequence may also include a 3' end anchor sequence such as "TG”, "ATGTGG”, “TGTGG”, “GTCT” and the like.
  • PCR amplification using DOP primers allows random binding of DOP primers to the sample by base-pair pairing to allow random fragmentation of the sample.
  • the nucleic acid sample is subjected to DOP-PCR amplification using a DOP primer to obtain a first PCR amplification product.
  • the DOP-Amp primer which is substantially complementary to the specific nucleotide sequence of the DOP primer, enables further amplification of the product amplified by PCR using the DOP primer.
  • the DOP-Amp primer sequence is complementary or substantially complementary to the 5' non-tubular nucleotide region of the above DOP primer.
  • the DOP-Amp primer sequence is identical or completely complementary to the nucleotide sequence of the 5'-end non-tubular nucleotide region of the DOP primer or at least covers the 5'-end non-tubular nucleotide
  • the 50% region of the nucleotide sequence of the region or the nucleotide sequence of the non-tubular nucleotide region of its 5' end is >80%.
  • the nucleotide sequence of the DOP-Amp primer is set forth in SEQ ID NO: 2: 5'-GCTCTTCCGATCT-3'.
  • the first PCR amplification product is subjected to second PCR amplification using a DOP-Amp primer to obtain a second PCR amplification product.
  • Linker PCR Adaptor-Ligation PCR
  • Linker PCR refers to the addition of a linker to template DNA while performing PC At both ends of the fragment, where the "linker” is the linker sequence of the high-throughput sequencing library, the linker-ligand is an important component of the linker-linked PCR.
  • the above-described linker-ligating primers include, but are not limited to, P5 and P7, wherein P5 has a sequence of SE'h AATGATACGGCGACCACCGAGATCTACACTCTTTCCCT AC ACGACGCTCTTCCGATCT shown by SEg ID NO: 3; P7 has SEQ ID NO: a sequence as shown in any one of 4-7, which can be expressed as
  • a method for preparing a nucleic acid library according to the present invention which is capable of comprehensively covering a DNA fragment of a nucleic acid sample by DOP-PCR amplification of DNA in the nucleic acid sample by a DOP;
  • the DOP primer sequence has an at least 5'-end non-tubular nucleotide region and a downstream calf-oligonucleotide region, and the obtained first PCR amplification product band
  • the second PCR amplification product can be subjected to the second PCR amplification to obtain the second PCR amplification product, and the second PCR amplification is high stringency expansion. Increase, can greatly improve the sensitivity of amplification;
  • the linker-ligated PC uses a linker-ligating primer to add a linker to the second PCR amplification product to obtain a third PCR amplification product, thereby being directly usable for the next step of sequencing. ;
  • a plurality of samples can be simultaneously constructed and sequenced without interference from a fluorescent background;
  • the method for preparing a nucleic acid library of the present invention and a method for determining a nucleic acid sequence of a nucleic acid sample are not restricted by species, and a method for preparing a nucleic acid library of the present invention and a method for determining a nucleic acid sample can be utilized by a human, an animal, a microorganism, a plant or the like. a nucleic acid sequence method for individual library building and sequencing;
  • the method for preparing a nucleic acid library of the present invention and a method for determining a nucleic acid sequence of a nucleic acid sample have low test cost, high sensitivity, high precision, and good repeatability.
  • a single-cell genomic DNA library of each sample is prepared by using a single-stage lymphocyte in human blood as a sample, and the method for preparing a nucleic acid library of the present invention is separately prepared as follows:
  • Sample source Blood single cell samples were obtained from peripheral blood of a normal person (YH, Yanhuang plan sample) and a Down syndrome patient (T21).
  • YH1, YH2, YH3 Three YH (YH1, YH2, YH3) and one T21 peripheral blood single lymphocytes were isolated by a mouth pipette method and placed in PC tubes containing 2 ⁇ 1 alkaline cell lysate (200 mM KOH, 50 mM DTT), respectively. Store at 80 ° C for more than 30 minutes, spare.
  • DOP-PCR amplification (ie low stringency amplification, preamplification):
  • the DOP primer sequence is GCTCTTCCGAT CT N (SEQ ID NO: 1).
  • the 5' non-tubular oligonucleotide sequence is GCTCTTCCGAT CT
  • the 3'-and-oligole oligonucleotide sequence is N N
  • N can be any of A, T, G and C.
  • each DOP reaction system obtained in the above configuration was subjected to DOP-PC amplification according to the reaction conditions shown in Table 2 to obtain a first PCR amplification product.
  • Table 2
  • Second PCR amplification (ie high stringency amplification)
  • the DOP-Amp primer capable of binding to the 5' uncoiled oligonucleotide sequence of the above DOP primer was separately added to each DOP reaction system amplified by DOP-PC, and the final concentration of the DOP-Amp primer was 2 ⁇ .
  • the DOP-Amp bower sequence is GCTCTTCCGATCT (SEQ ID NO: 2), that is, the DOP-Amp primer sequence is the 5'-end non-tubular oligonucleotide sequence portion of the above DOP primer.
  • each of the second PCR amplification reaction systems was subjected to a second PCR amplification reaction to obtain a second PCR amplification product.
  • each of the second PCR amplification products obtained above is subjected to electrophoresis and concentration detection, and is used as a standard. 5.
  • Connector connection PC Adaptor-Ligation PCR
  • the linker-ligation PCR reaction system of each of the second PCR amplification products obtained in the step (4) was separately prepared.
  • the DNA polymerase used is a thermostable DNA polymerase pfx enzyme.
  • sequences of the ⁇ 5 linker primer and the ⁇ 7 linker primer are:
  • each linker is ligated into the PCR reaction system, and the P5 linker ligation primer sequence is the same, and both are the linker sequences in the Illumina Hiseq2000 library construction shown in SEQ ID NO: 3, which include a linker that binds to Flowcell and sequencing.
  • the sequence of the P7 adaptor-ligating primer as described above has the tag sequence XXXXX, and the label is different in each of the linker-connected PC reaction systems.
  • the last 13 bp of the P5 and P7 linker sequences were identical to the DOP-Amp primer sequence.
  • the linker of YH1 is linked to the P7 sequence of the PC reaction system:
  • CTTCCGATCT SEQ ID NO: 4
  • the linker of YH2 is linked to the P7 sequence of the PC reaction system.
  • CTTCCGATCT SEQ ID NO: 5
  • the linker of YH3 is linked to the P7 sequence of the PC reaction system.
  • CTTCCGATCT SEQ ID NO: 6
  • the linker of T21 is linked to the P7 sequence of the PC reaction system: CTTCCGATCT ( SEQ ID NO: 7 )##
  • each of the linkers was ligated to the PCR reaction system to perform a ligation-ligation PCR reaction, respectively, to obtain respective third PCR amplification products, and each of the third PCR amplification products respectively constituted a nucleic acid library of each sample.
  • YH1 library, YH2 library, YH3 library and T21 library constituted a nucleic acid library of each sample.
  • Each of the above nucleic acid libraries that is, the third PCR amplification product, contains impurities such as primer dimers, and the fragment size distribution is relatively dispersed, so that fragment selection and purification are required, and the specific steps are as follows:
  • the gel fragments in the size range of 300 bp to 500 bp were cut out, and then the QIAquick Gel Extraction Kit of Qiagen was used for DNA fragment recovery and purification. For details, see the kit instructions.
  • Fig. 4A shows the detection result of the YH1 library, and the required main band is located at 377 bp
  • Fig. 4B shows the detection result of the YH2 library, and the required main band is located at 326 bp
  • Fig. 4C shows the detection result of the YH3 library, and the required main The band is located at 339 bp
  • Figure 4D shows the results of the T21 library, and the required main band is located at 360 bp. The results showed that the selected and purified fragments were qualified.
  • T21 T21ADDPPEI-5 20980937 (21.0M) 94.36% 90.27% Further, the ratio of the data volume of each autosome in the total number of autosomes in the four nucleic acid libraries was counted, and the number of reads was used for statistics. 7.
  • Example 1 was repeated, which differs from Example 1 in that the reaction conditions for DOP-PCR amplification are different, and the reaction conditions for DOP-PCR amplification of this example are shown in Table 8 below: Table 8
  • the sample of this example is a micro DNA/cDNA sample, which is a trace amount of genomic DNA, immunoprecipitation (IP) product DNA, plasma free DNA (Plasma DNA), and RNA reverse transcription cDNA product.
  • IP immunoprecipitation
  • gDNA genomic DNA
  • cDNA samples are obtained from the total RNA of mouse 1 ⁇ ⁇ reverse transcribed by Superscript II reverse transcriptase with a random hexanucleotide primer, after 5-fold dilution of the original concentration gradient, respectively, 5-1, a library was prepared starting concentration 5-2, 5-3 is performed.
  • a nucleic acid library of each micro DNA/cDNA sample is separately prepared according to the following steps:
  • the DOP reaction system of each DNA/cDNA sample was configured according to the ratio in Table 9.
  • each DOP reaction system obtained in the above configuration was separately subjected to DOP-PC amplification according to the reaction conditions shown in Table 10 below to obtain a first PC amplification product.
  • the DOP-Amp primers were separately added to each DOP reaction system amplified by DOP-PCR, and the final concentration of the DOP-Amp primer was 0.4 ⁇ M to obtain respective second PCR amplification reaction systems. Then, according to the reaction conditions of Table 11, each of the second PCR amplification reaction systems was subjected to a second PCR amplification reaction to obtain a second PCR amplification product.
  • the linker-ligated PCR reaction systems of the respective second PCR amplification products obtained above were separately prepared.
  • the DNA polymerase used is a thermostable DNA polymerase pfx enzyme.
  • the sequence of the ⁇ 5 linker primer in Table 12 is the same as that of the ⁇ 5 linker primer in Example 1.
  • the ⁇ 7 linker primers for each linker to the PC reaction system are:
  • CTTCCGATCT SEQ ID NO: 4
  • the linker of the Plasma DNA sample is ligated to the P7 linker of the PCR reaction system.
  • CTTCCGATCT SEQ ID NO: 5
  • CAAGCAGAAGACGGCATAC CTTCCGATCT SEQ ID NO: 6
  • CTTCCGATCT SEQ ID NO: 7
  • each linker was ligated to the PCR reaction system for joint connection to PC.
  • each of the third PCR amplification products constitutes a nucleic acid library of each sample.
  • nucleic acid libraries were subjected to electrophoresis detection at 4V, and the electrophoresis gel pattern is shown in Fig. 5.
  • the left-to-right lanes in the above figure are: a nucleic acid library of IP product DNA with a starting amount of 200 pg, 40 pg, and 8 pg, respectively, Marker (D2000), starting amounts of 200 pg, 40 pg, respectively.
  • Figure 5 shows that this example successfully prepared a nucleic acid library of four micro DNA/cDNA samples.
  • Fragment selection and purification of each of the above nucleic acid libraries are carried out according to the following steps: excision of a size range of 200 bp to 500 bp The gel block was then subjected to DNA fragment recovery and purification using Qiagen's QIAquick Gel Extraction Kit. See the kit instructions for specific instructions.
  • FIG. 6A shows the results of micro-genome DN A-200pg
  • Figure 6B shows the results of micro-genomic DNA-40pg
  • Figure 6C shows the results of immunoprecipitation (IP) DNA-200pg
  • Figure 6D shows the immunoprecipitation IP.
  • DNA-40pg detection result 6E is a cDNA- original density detection result
  • FIG. 6F is a cDNA-5- result of detection 1
  • FIG. 6G is a plasma free DNA-200pg detection result.
  • each library obtained by the above preparation was separately sequenced by the sequencing method of Example 1, and the results showed that the nucleic acid library of the micro DNA/cDNA sample can be efficiently prepared by using the method for preparing a nucleic acid library of the present invention, and the obtained nucleic acid is prepared.
  • the library can be used to detect and confirm the disease.
  • a kit for preparing a library of single cells and micronucleic acids comprising the following components:
  • the method for preparing a nucleic acid library of the present invention, a method for determining a nucleic acid sequence of a nucleic acid sample, and a kit can be effectively used for preparation and sequencing of a nucleic acid library of a single cell and a nucleic acid sample, and have high sensitivity, high accuracy, and repeatability. Good sex, so it can be effectively used for the detection and diagnosis of diseases.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biomedical Technology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

提供了制备核酸文库的方法、确定核酸样本的核酸序列的方法以及试剂盒。其中,制备核酸文库的方法包括以下步骤:对核酸样本进行DOP-PCR扩增,以便获得第一PCR扩增产物;利用DOP-Amp引物对第一PCR扩增产物进行第二PCR扩增,以便获得第二PCR扩增产物;以及对第二PCR扩增产物进行接头连接PCR,以便获得第三PCR扩增产物,该第三PCR扩增产物构成该核酸文库。

Description

核酸文库的制备方法及其应用以及试剂盒 优先权信息
本申请请求 201 1 年 10 月 18 日向中国国家知识产权局提交的、 专利申请号为 201110316066.8的专利申请的优先权和权益, 并且通过参照将其全文并入此处。 技术领域
本发明属于基因工程技术领域,具体地涉及核酸文库的制备方法及其应用以及试剂 盒。 背景技术 究领域中必不可少的研究工具。 新一代测序技术是通过对上百万条 DNA短片段的同时 的平行测序, 使得在短时间内就能够完成每个碱基的测序, 且成本大幅度降低。 NGS 技术在很多方面得到应用, 如基因组学、 转录组学、 表观基因组学、 临床诊断等。
目前市场上新一代测序技术 NGS 平台有好几种, 包括 Illumina公司的 Genome Analyzer、 Hiseq、 Miseq系列测序平台, Roche公司的 454测序平台, Life Technologies 公司的 SOLID测序平台、 Ion Torrent测序平台等等。
但是无论何种 NGS平台, 在测序前都需对 DNA/RNA样本进行处理, 制备成 DNA 片段文库。 通常情况下, 文库的制备需要微克级的起始 DNA/RNA量, 虽然经优化可以 将建库起始量降低, 但对于单细胞或极微量的核酸样本, 仍无法直接进行文库制备, 因 此严重阻碍了对单细胞和 量核酸测序的应用。
因此本领域迫切需要开发针对单细胞和微量核酸样本的文库构建方法。 发明内容
本发明旨在至少解决现有技术中存在的技术问题之一。
本发明的目的是提供一种对单细胞或微量核酸样本进行文库制备的方法和用途。
本发明的另一目的是提供一种适用于上述方法的试剂盒。
根据本发明的一个方面, 本发明提供了一种制备核酸文库的方法。 根据本发明的实施 例, 该方法包括以下步骤: 对核酸样本进行 DOP-PCR扩增, 以便获得第一 PCR扩增产物; 利用 DOP-Amp引物对第一 PCR扩增产物进行第二 PCR扩增, 以便获得第二 PCR扩增产 物; 以及对第二 PCR扩增产物进行接头连接 PCR, 以便获得第三 PCR扩增产物, 该第三 PC 扩增产物构成该核酸文库。 根据本发明的实施例, 利用该方法能够高效地制备核酸样 本, 尤其是单细胞和微量核酸样本的核酸文库, 并且获得的核酸文库能够有效地应用于高 通量测序平台, 进而能够有效地确定该核酸样本的核酸序列信息。 另外, 发明人惊奇地发 现, 本发明的制备核酸文库的方法, 过程筒单, 极易操作, 操作流程极易标准化, 易于推 广, 并且费用低、 灵敏度高、 精确度高、 可重复性好。
根据本发明的另一方面, 本发明还提供了一种确定核酸样本的核酸序列的方法。 根据 本发明的实施例, 该方法包括以下步骤: (i)利用根据本发明实施例的制备核酸文库的方法, 构建核酸样本的核酸文库; 以及 (ii)对该核酸文库进行测序和数据分析, 以便确定该核酸样 本的核酸序列。 发明人发现, 利用该方法能够高效地确定核酸样本尤其是单细胞和微量核 酸样本的核酸序列信息, 并且操作筒单、 灵敏度高、 精确度高、 可重复性好、 成本低。
根据本发明的又一方面, 本发明还提供了一种试剂盒。 根据本发明的实施例, 该试剂 盒包括: (l) DOP引物; (2) DOP-Amp引物; 以及 (3)接头连接引物。 根据本发明的实施例, 该试剂盒适用于上述制备核酸文库的方法和确定核酸样本的核酸序列的方法。 发明人发现, 利用本发明的试剂盒, 结合上述制备核酸文库的方法或确定核酸样本的核酸序列的方法, 能够有效地制备核酸样本, 尤其是单细胞和微量核酸样本的核酸文库, 并且获得的核酸文 库能够有效地应用于高通量测序平台, 进而能够有效地确定该核酸样本的核酸序列信息, 且获得的信息精确度高、 可重复性好。
具体地, 根据本发明的实施例, 本发明的制备核酸文库的方法、 确定核酸样本的核酸 序列的方法以及试剂盒的特征还可以描述如下:
在本发明的第一方面, 本发明提供了一种制备核酸文库的方法。 根据本发明的实施例, 该方法包括步骤:
a. 提供一待测样本, 所述样本含有的核酸总量为 2皮克〜 1微克;
b. 对待测样本进行 DOP-PC (Degenerate Oligonucleotide Primed PC )扩增,获得第一 PCR扩增产物;
c. 用 DOP-Amp引物对第一 PCR扩增产物进行第二次 PCR扩增, 获得第二 PCR扩增 产物;
d. 对获得的第二 PCR扩增产物进行接头连接 PCR (adaptor-ligation PCR),获得第三 PCR 扩增产物, 即为核酸文库。
根据本发明的实施例, 所述的第三 PCR扩增产物的 5'端具有接头, 且 3'端具有接头。 根据本发明的实施例, 还包括步骤 (e): 对第三 PCR扩增产物依据片段大小进行选择。 才艮据本发明的实施例, 步骤 (a)中所述的样本选自下组:
1-200个单细胞构成的样本, 或
含有 1-200个单细胞的核酸样本, 或
核酸总含量为 2皮克〜 1 克的核酸样本。
根据本发明的实施例, 所述样本选自下组:
ί量基因组 DNA、 免疫共沉淀产物 DNA、 游离 DNA、 cDNA、 或其组合。
根据本发明的实施例, 所述 DNA来自环境, 更佳地, 来自土壤和 /或水体。
根据本发明的实施例, 所述 DNA来自体液或排泄物, 更佳地, 来自血浆和 /或尿液。 才艮据本发明的实施例, 所述的 DNA经化学或物理方法处理, 更佳地, 经亚石直酸氢盐处 理。 根据本发明的实施例, 步骤 (b)使用带有筒并寡核苷酸区的 DOP? I物对样本 DNA进行 随机扩增。
才艮据本发明的实施例, 所述的 DOP引物具有位于 5'端的非筒并寡核苷酸区和位于中部 的筒并寡核苷酸区和 3'端的锚定区;或者位于 5'端的非筒并寡核苷酸区和位于中部和 3'端的 筒并寡核苷酸区。
根据本发明的实施例, 所述 DOP引物的 3'端锚定区的序列长度为 2-12个核苷酸,较佳 地 4-8个核苷酸。
根据本发明的实施例,所述 DOP引物的 3'端锚定区任选自下组: TG、 ATGTGG、TGTGG, 或 GTCT。
根据本发明的实施例, 所述 DOP引物的 5'端非筒并寡核苷酸序列如 SEQ ID NO: 2所 示, 或与 SEQ ID NO: 2所示序列的同源性 > 50%。
根据本发明的实施例, 所述的 DOP引物的 5'端非筒并寡核苷酸长度为 5-30bp, 较佳地 5-20bp, 更佳地 6-13bp。
根据本发明的实施例, 所述的筒并寡核苷酸序列如 (N)m所示, 其中每个碱基位置上的 N包括 A、 T、 G和 C, m为 3-20的正整数。
根据本发明的实施例, 步骤 (c)所述的 DOP-Amp引物与步骤 (b)所述的 5'端非筒并寡核 苷酸互补或基本上互补。
根据本发明的实施例, DOP-Amp引物序列结合于 DOP引物 5'端非筒并寡核苷酸区。 根据本发明的实施例, DOP-Amp弓 I物序列如 SEQ ID NO: 2所示。
根据本发明的实施例, 步骤 (d)用接头引物进行接头连接 PCR扩增。
根据本发明的实施例, 所述的接头引物为 P5和 P7, 且 P5和 P7的 3'端都具有可结合 于 DOP-Amp 引物序列的非筒并寡核苷酸区, 所述的非筒并寡核苷酸区的序列与 SEQ ID NO: 2所示序列相同或完全互补, 或者与 SEQ ID NO: 2所示序列有 > 80%的同源性。
根据本发明的实施例, 所述的接头引物 P7还具有标签 (barcode或 index)序列。
根据本发明的实施例, 步骤 (e)中所述的依据片段大小进行选择为: 在第三 PCR扩增产 物中选择 100-lOOObp长度的片段。
根据本发明的实施例, 所述的依据片段大小进行选择为: 在第三 PCR扩增产物中选择 200-500bp长度的片段。
在本发明的第二方面, 本发明还提供了一种检测微量核酸样本中核苷酸序列的方法。 根据本发明的实施例, 该方法包括步骤:
(i) 对于所提供的单细胞及微量核酸样本, 用本发明第一方面任一所述的方法制备所述 样本的核酸文库;
(ii)对所述核酸文库中的片段进行测序和数据分析。
根据本发明的实施例, 所述的测序为第二代高通量测序法, 所述第二代高通量测序法 可选择但不限于在 Roche454 FLX、 Illumina Solexa或 ABI SOLID测序平台上进行。
根据本发明的实施例, 所述的测序包括步骤: 将需要测序的核酸文库与测序芯片(flow cell)上固定的测序探针进行杂交, 并进行固相 桥式 PCR扩增, 形成测序簇; 对所述测序簇用"边合成-边测序"法进行测序, 获得样本中核 苷酸序列信息。
在本发明的第三方面, 本发明还提供了一种可用于本发明第一方面和第二方面任一方 法的试剂盒。 才艮据本发明的实施例, 该试剂盒包括:
(1)第一容器以及位于容器内的用于进行第一 PCR扩增的 DOP引物;
(2)第二容器以及位于容器内的用于进行第二 PCR扩增的 DOP-Amp引物;
(3)第三容器以及位于容器内的用于进行第三 PCR扩增的接头连接引物;
(4)说明书。
根据本发明的实施例, 所述试剂盒还包括: 用于进行 PCR扩增所需的试剂、 用于核酸 纯化的试剂、 用于进行高通量测序的测序芯片(flow cell), 或其组合。 本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得 明显, 或通过本发明的实践了解到。 附图说明
本发明的上述和 /或附加的方面和优点从结合下面附图对实施例的描述中将变得明 显和容易理解, 其中:
图 1 显示了根据本发明一个实施例的本发明的确定核酸样本的核酸序列的方法的 流程示意图;
图 2显示了根据本发明一个实施例的本发明的制备核酸文库的方法的流程示意图; 图 3显示了根据本发明的一个实施例,利用本发明的制备核酸文库的方法制备单细 胞基因组 DNA文库时各接头连接 PCR产物的电泳检测结果;
图 4显示了根据本发明的一个实施例,利用本发明的制备核酸文库的方法制备核酸 文库时, 经片段选择和纯化后四个样本 (YH1 , YH2 , YH3 , T21)的接头连接 PCR产物 的片段检测结果, 其中, 图 4A为样本 YH1 的检测结果, 需要的主带位于 377bp; 图 4B为样本 YH2的检测结果, 需要的主带位于 326bp; 图 4C为样本 YH3的检测结果, 需要的主带位于 339bp; 图 4D为样本 T21的检测结果, 需要的主带位于 360bp;
图 5显示了根据本发明的一个实施例, 利用本发明的制备核酸文库的方法制备微量 核酸样本 (IP产物 DNA, Plasma DNA, cDNA, gDNA)的核酸文库时, 各接头连接 PCR产 物的电泳检测结果; 以及
图 6显示了根据本发明的一个实施例, 利用本发明的制备核酸文库的方法制备微量 DNA/cDNA样本的核酸文库时, 经片段选择和纯化后的核酸文库的片段检测结果, 其中, 图 6A为微量基因组 DNA-200pg检测结果, 图 6B为微量基因组 DNA-40pg检测结果, 图 6C为免疫沉淀 (IP)DNA-200pg检测结果, 图 6D为免疫沉淀 (IP)DNA-40pg检测结果, 图 6E 为 cDNA-原浓度检测结果, 图 6F为 cDNA-5-1检测结果, 图 6G为血浆游离 DNA-200pg检 测结果。 发明详细描述
下面详细描述本发明的实施例, 所述实施例的示例在附图中示出, 其中自始至终相 同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附 图描述的实施例是示例性的, 仅用于解释本发明, 而不能理解为对本发明的限制。
需要说明的是, 术语 "第一,, 、 "第二,, 、 "第三,, 仅用于描述目的, 而不能理解 为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有 "第一"、 "第二,, 、 "第三,, 的特征可以明示或者隐含地包括一个或者更多个该特征。 进一步地, 在本发明的描述中, 除非另有说明, "多个,, 的含义是两个或两个以上。
本发明的目的之一在于提供一种对单细胞或微量核酸样本进行文库制备的方法和用 途。
本发明的另一目在于提供一种适用于上述方法的试剂盒。 制备核酸文库的方法
根据本发明的一个方面, 本发明提供了一种制备核酸文库的方法。 根据本发明的实施 例, 参照图 2, 该方法包括以下步骤:
首先, 对核酸样本进行 DOP-PCR扩增, 以便获得第一 PCR扩增产物。 根据本发明的 实施例, 本发明的制备核酸文库的方法尤其适用于单细胞和微量核酸样本, 其中, 核酸样 本中的核酸含量不受特别限制。 根据本发明的具体示例, 核酸样本包含 2皮克〜 1微克的核 酸。 才艮据本发明的实施例, 核酸样本的来源也不受特别限制。 才艮据本发明的一些实施例, 核酸样本可以来自 1-200个单细胞。根据本发明的一些具体示例,核酸样本可以为选自微量 基因组 DNA、 免疫共沉淀产物 DNA、 游离 DNA和 cDNA的至少一种。 根据本发明的一些 实施例, 核酸样本可以来源于环境、 体液和排泄物的至少一种。 #>据本发明的一些具体示 例, 核酸样本可以来源于土壤和水体的至少一种。 根据本发明的另一些具体示例, 核酸样 本可以来源于血浆和尿液的至少一种。
根据本发明的实施例,对核酸样本进行 DOP-PC 扩增之前,可以将核酸样本进行处理, 例如当核酸样本为单细胞时, 可以将其进行细胞裂解, 以便使其释放基因组 DNA, 进而能 够提高核酸样本进行 DOP-PC 扩增的效率, 从而有利于后续步骤的进行。根据本发明的一 个具体示例, 当核酸样本为单细胞时, 对核酸样本进行 DOP-PCR扩增之前, 可以进一步包 括利用碱性细胞裂解液将单细胞进行处理。
根据本发明的一些实施例, 对核酸样本进行 DOP-PCR扩增之前, 可以进一步包括将核 酸样本进行化学处理或物理处理。 根据本发明的一些具体示例, 前述化学处理为亚硫酸氢 盐处理。 由此, 该核酸样本能够适用于甲基化文库构建。
根据本发明的实施例, 进行 DOP-PCR扩增的条件不受特别限制。根据本发明的一些实 施例, 利用 DOP引物进行 DOP-PCR扩增, 其中 DOP引物具有筒并寡核苷酸区, DOP引 物具有位于 5'端的非筒并寡核苷酸区、位于中部的筒并寡核苷酸区和位于 3'端的锚定区。根 据本发明的另一些实施例, DOP引物具有位于 5'端的非筒并寡核苷酸区, 以及位于中部和 3'端的筒并寡核苷酸区。 其中, DOP引物的 3'端的锚定区、 5'端的非筒并寡核苷酸区以及筒 并寡核苷酸区的具体结构不受特别限制。 才艮据本发明的实施例, DOP引物的 3'端锚定区具 有 2-12个核苷酸。才艮据本发明的另一些实施例, DOP引物的 3'端锚定区具有 4-8个核苷酸。 根据本发明的实施例, DOP引物的 3'端锚定区具有选自 TG、 ATGTGG、 TGTGG和 GTCT 的至少一种的核苷酸序列。 才艮据本发明的一些实施例, DOP引物的 5'端非筒并寡核苷酸区 具有 SEQ ID NO: 2所示的核苷酸序列。 根据本发明的另一些实施例, DOP引物的 5'端非 筒并寡核苷酸区的核苷酸序列与 SEQ ID NO: 2所示核苷酸序列的同源性 > 50%。根据本发 明的实施例, DOP引物的 5'端非筒并寡核苷酸区的核苷酸序列长度为 5-30bp,优选 5-20bp, 更优选 6-13bp。 才艮据本发明的实施例, 筒并寡核苷酸区具有 3-20个核苷酸。 由此, 能够有 效地提高 DOP-PCR扩增的效率, 从而能够有效地提高制备核酸文库的效率。
接着,利用 DOP-Amp引物对第一 PCR扩增产物进行第二 PCR扩增,以便获得第二 PCR 扩增产物。 根据本发明的实施例, DOP-Amp引物的具体结构不受特别限制。 根据本发明的 一些具体示例, DOP-Amp引物与前述的 DOP引物的 5'端非筒并寡核苷酸区的核苷酸序列 互补或基本上互补。 由此, 能够有效地对第一 PCR扩增产物进行第二 PCR扩增。 根据本发 明的另一些实施例, DOP-Amp引物能够与前述的 DOP引物的 5'端非筒并寡核苷酸区结合。 根据本发明的一些具体示例, DOP-Amp引物具有如 SEQ ID NO: 2所示的核苷酸序列。 由 此, 能够有效地提高第二 PCR扩增的效率。
然后, 对第二 PCR扩增产物进行接头连接 PC , 以便获得第三 PCR扩增产物, 该第三 PC 扩增产物构成该核酸文库。 根据本发明的实施例, 进行接头连接 PC 的条件不受特别 限制。 根据本发明的一些实施例, 利用接头连接引物进行接头连接 PC , 其中接头连接引 物的 3'端具有能够与前述的 DOP-Amp 引物匹配的核苷酸序列。 根据本发明的另一些实施 例, 接头连接引物包含标签序列。 由此, 能够有效地制备包含标签序列的核酸文库。 进而, 当利用本发明的制备核酸文库的方法分别构建多个核酸样本的核酸文库, 且使各文库的标 签序列相互不同时, 可以将多个核酸样本的核酸文库同时进行高通量测序, 从而能够在保 证结果准确和可重复性的前提下, 有效地降低测序成本, 提高测序效率。
根据本发明的实施例, 获得的第三 PCR扩增产物的 5'端和 3'端均具有接头。
根据本发明的实施例, 本发明的制备核酸文库的方法可以进一步包括: 对第三 PCR扩 增产物进行片段选择。 由此, 可以依据片段大小, 将第三扩增产物进行选择, 以便去除干 扰。 根据本发明的实施例, 片段选择的具体条件不受特别限制, 可以根据具体实验情况选 取适宜长度范围的第三扩增产物, 例如, 当制备的核酸文库是用于 Illumina Hiseq2000测序 系统时, 可以选择长度为 200-800bp的第三扩增产物。 根据本发明的具体示例, 片段选择可 以进一步包括:选择长度为 100-lOOObp的第三 PCR扩增产物。根据本发明的另一些实施例, 片段选择可以进一步包括: 选择长度为 200-500bp的第三 PCR扩增产物。
根据本发明的实施例, 利用该方法能够高效地制备核酸样本, 尤其是单细胞和微量核 酸样本的核酸文库, 并且获得的核酸文库能够有效地应用于高通量测序平台, 进而能够有 效地确定该核酸样本的核酸序列信息。 另外, 发明人惊奇地发现, 本发明的制备核酸文库 的方法, 过程筒单, 极易操作, 操作流程极易标准化, 易于推广, 并且费用低、 灵敏度高、 精确度高、 可重复性好。
此外, 需要说明的是, 可以根据核酸文库的用途选择是否对第二 PCR扩增产物进行接 头连接 PCR。 具体地, 例如当预制备的核酸文库是用于进行 Sanger法测序时, 可以不进行 接头连接 PCR, 而将第二 PCR扩增产物直接进行电泳或浓度检测后进行 Sanger法测序, 或 者制备成质粒后进行 Sanger 法测序。 而当预制备的核酸文库是用于进行高通量测序例如 Illlumina测序时, 可以对第二 PCR扩增产物进行接头连接 PC , 以便使获得的核酸文库携 带测序接头, 便于后续测序的进行。 确定核酸样本的核酸序列的方法
根据本发明的另一方面, 本发明还提供了一种确定核酸样本的核酸序列的方法。 根据 本发明的实施例, 参照图 1和图 2, 该方法可以包括以下步骤:
(i)利用根据本发明实施例的制备核酸文库的方法,构建核酸样本的核酸文库。根据本发 明的实施例, 核酸样本的种类不受特别限制, 可以为未处理过的单细胞, 也可以为 量的 DNA或 cDNA等核酸。 根据本发明的实施例, 核酸样本可以为选自单细胞和微量核酸样本 的至少一种。
(ii)对该核酸文库进行测序和数据分析, 以便确定该核酸样本的核酸序列。根据本发明 的实施例, 进行测序的方法和装置不受特别限制。 根据本发明的一些实施例, 可以利用高 通量测序平台进行测序。根据本发明的一些具体示例,可以利用选自 Roche454 FLX. Illumina Solexa和 ABI SOLID测序平台的至少一种进行测序。 由此, 能够提高测序效率, 高效地确 定核酸样本的核酸序列。
发明人发现, 利用该方法能够高效地确定核酸样本尤其是单细胞和微量核酸样本的核 酸序列信息, 并且操作筒单、 灵敏度高、 精确度高、 可重复性好、 成本低。
根据本发明的又一方面, 本发明还提供了一种试剂盒。 根据本发明的实施例, 该试剂 盒包括: (l) DOP引物; (2) DOP-Amp引物; 以及 (3)接头连接引物。
根据本发明的实施例, 在本发明的试剂盒中, DOP引物、 DOP-Amp引物和接头连接引 物分别设置于不同容器。
才艮据本发明的实施例, 在本发明的试剂盒中, DOP 引物的序列及结构不受特别限制。 才艮据本发明的一些实施例, DOP引物具有筒并寡核苷酸区, 其中 DOP引物具有位于 5'端的 非筒并寡核苷酸区、 位于中部的筒并寡核苷酸区和位于 3'端的锚定区。 根据本发明的另一 些实施例, DOP引物具有位于 5'端的非筒并寡核苷酸区, 以及位于中部和 3'端的筒并寡核 苷酸区。根据本发明的实施例, DOP引物的 3'端的锚定区、 5'端的非筒并寡核苷酸区以及筒 并寡核苷酸区的具体结构不受特别限制。 才艮据本发明的一些实施例, DOP引物的 3'端锚定 区具有 2-12个核苷酸。 才艮据本发明的另一些实施例, DOP引物的 3'端锚定区具有 4-8个核 苷酸。根据本发明的一些实施例, DOP引物的 3'端锚定区具有选自 TG、 ATGTGG、 TGTGG 和 GTCT的至少一种的核苷酸序列。 才艮据本发明的一些具体示例, DOP引物的 5'端非筒并 寡核苷酸区具有 SEQ ID NO: 2所示的核苷酸序列。 才艮据本发明的一些实施例, DOP引物 的 5'端非筒并寡核苷酸区的核苷酸序列与 SEQ ID NO: 2所示核苷酸序列的同源性 > 50%。 根据本发明的一些实施例, DOP引物的 5'端非筒并寡核苷酸区的核苷酸序列长度为 5-30bp, 优选 5-20bp, 更优选 6-13bp。 根据本发明的另一些实施例, 筒并寡核苷酸区具有 3-20个核 苷酸。 由此, 利用本发明的试剂盒制备单细胞或微量核酸样本的核酸文库时, 能够有效地 提高 DOP-PCR扩增的效率, 从而能够有效地提高制备核酸文库的效率。
才艮据本发明的实施例,在本发明的试剂盒中, DOP-Amp引物的具体结构不受特别限制。 根据本发明的一些具体示例, DOP-Amp引物与前述的 DOP引物的 5'端非筒并寡核苷酸区 的核苷酸序列互补或基本上互补。 由此, 利用本发明的试剂盒制备单细胞或微量核酸样本 的核酸文库时, 能够有效地对第一 PCR扩增产物进行第二 PCR扩增, 从而能够有效地制备 核酸文库才艮据本发明的另一些实施例, DOP-Amp引物能够与所述 DOP引物的 5'端非筒并 寡核苷酸区结合。 根据本发明的一些具体示例, DOP-Amp引物具有如 SEQ ID NO: 2所 示的核苷酸序列。 由此, 利用本发明的试剂盒制备单细胞或微量核酸样本的核酸文库时, 能够有效地提高第二 PCR扩增的效率, 从而能够有效地提高制备核酸文库的效率。
根据本发明的实施例, 在本发明的试剂盒中, 接头连接引物的具体结构不受特别限制。 根据本发明的一些具体示例,接头连接引物的 3'端具有能够与 DOP-Amp引物匹配的核苷酸 序列。 根据本发明的另一些实施例, 接头连接引物包含标签序列。 由此, 利用本发明的试 剂盒能够有效地制备包含标签序列的核酸文库。 进而, 当利用本发明的试剂盒分别构建多 个核酸样本的核酸文库, 且使各文库的标签序列相互不同时, 可以将多个核酸样本的核酸 文库同时进行高通量测序, 从而能够在保证结果准确和可重复性的前提下, 有效地降低测 序成本, 提高测序效率。
根据本发明的实施例, 该试剂盒适用于上述制备核酸文库的方法和确定核酸样本的核 酸序列的方法。 发明人发现, 利用本发明的试剂盒, 结合上述制备核酸文库的方法或确定 核酸样本的核酸序列的方法, 能够有效地制备核酸样本, 尤其是单细胞和 ί量核酸样本的 核酸文库, 并且获得的核酸文库能够有效地应用于高通量测序平台, 进而能够有效地确定 该核酸样本的核酸序列信息, 且获得的信息精确度高、 可重复性好。
需要说明的是, 上述对单细胞或微量核酸样本进行核酸文库制备的方法, 是本发明的 发明人经过广泛而深入的研究, 首次建立的。 利用本发明的方法, 可以针对患者的体液制 备核酸文库并测序, 进而能够有效得到样本中与患者疾病相关的基因信息, 从而能够为人 类疾病的预防和治疗提供重要的信息。 此外, 本文中所使用的术语及表达方式的具体含义如下:
在本文中所用的术语 "含有,,可以表示 "具有 (comprise)"、 "基本上由…构成"和 "由… 构成" 的含义。
在本文中所用的术语 "以上,, 和 "以下,, 包括本数, 例如 "80%以上" 指> 80%, "2% 以下" 指< 2%。 引物
在本文中所用的术语 "引物"是能与模板互补配对, 在 DNA聚合酶的作用下能够合成 与模板互补的 DNA链的寡聚核苷酸的总称。 引物可以是天然的 RNA、 DNA, 也可以是任 何形式的天然核苷酸, 甚至可以是非天然的核苷酸如 LNA或 ZNA等。
DOP-PCR (Degenerate Oligonucleotide Primed PCR)
DOP-PC , 即筒并寡核苷酸引物 PCR, 是一种对微量 DNA或单细胞进行扩增的方法, 包括 (但不限于): 低严谨度预扩增和高严谨度扩增两个步骤。 低严谨度扩增基于 DOP 引物 进行, 目的是在 DNA片段加上了一段固定序列作为 PCR引物结合区;再用 DOP-Amp引物 对低严谨度扩增产物进行高严谨度扩增。
DOP引物, 即筒并寡核苷酸引物, 其至少包括两部分, 从 5'到 3'分别为: 5'端非筒并 寡核苷酸区及其下游的筒并寡核苷酸区。 根据本发明的实施例, DOP 引物序列如 SEQ ID NO: 1所示: GCTCTTCCGATCT N N, 其中, GCTCTTCCGATCT为 5'端的非 筒并寡核苷酸区, N NN 为筒并寡核苷酸区, 筒并寡核苷酸区的序列可记为: (N)m, m为任选自 3-20的正整数, N独立地任选自 A、 T、 G和 C。 根据本发明的一个实 施例, DOP引物序列还可以包括 3'端锚定序列,如 "TG" , " ATGTGG" , "TGTGG" , "GTCT" 等。 利用 DOP引物进行 PCR扩增, 能够通过碱基互补配对使 DOP引物与样本随机结合, 以便对样本进行随机片段化的扩增。 在本发明的制备核酸文库的方法中, 利用 DOP引物对 核酸样本进行 DOP-PCR扩增, 获得第一 PCR扩增产物。
DOP-Amp引物, 其与 DOP引物的特定核苷酸序列基本上互补, 从而能够对采用 DOP 引物进行 PCR扩增的产物进行进一步扩增。 根据本发明的实施例, DOP-Amp 引物序列与 上述 DOP 引物的 5' 端非筒并核苷酸区互补或基本上互补。 根据本发明的一个实施例, DOP-Amp引物序列与 DOP引物的 5'端非筒并核苷酸区的核苷酸序列相同或完全互补或至 少覆盖了其 5'端非筒并核苷酸区核苷酸序列的 50%区域或与其 5'端非筒并核苷酸区核苷酸 序列的同源性 > 80%。根据本发明的另一个实施例, DOP-Amp引物的核苷酸序列如 SEQ ID NO: 2所示: 5'-GCTCTTCCGATCT-3'。在本发明的制备核酸文库的方法中,利用 DOP-Amp 引物对第一 PCR扩增产物进行第二 PCR扩增, 获得第二 PCR扩增产物。 接头连接 PCR (Adaptor-Ligation PCR)
接头连接 PCR(Adaptor-Ligation PCR), 是指在进行 PC 的同时将接头加到模板 DNA 片段两端, 其中, 这里的 "接头" 为高通量测序文库的接头序列, 接头连接引物是接头连 接 PCR的重要组成部分。
根据本发明的一个实施例, 上述接头连接引物包括 (但不限于): P5和 P7 , 其中 P5具有 ^口 SEg ID NO: 3所示的序歹' h AATGATACGGCGACCACCGAGATCTACACTCTTTCCCT AC ACGACGCTCTTCCGATCT; P7具有 SEQ ID NO: 4-7任一项所示的序列, 其可表示为
CCGATCT , 其中 XXXXXX为标签 (barcode)序列, 用于区分不同的核酸样本。 此外, 需要说明的是, 本发明的制备核酸文库的方法以及确定核酸样本的核酸序列的 方法至少具有以下优点:
1. 本发明的制备核酸文库的方法,通过 DOP弓 I物对核酸样本中的 DNA进行 DOP-PCR 扩增, 能够全面覆盖核酸样本的 DNA片段;
2. 在本发明的制备核酸文库的方法中, DOP引物序列具有至少 5'端的非筒并核苷酸区 及其下游的筒并寡核苷酸区, 则获得的第一 PCR扩增产物带有特定的核苷酸序列, 在加入 DOP-Amp引物后,能够对第一 PCR扩增产物进行第二 PCR扩增,获得第二 PCR扩增产物, 且第二 PCR扩增为高严谨性扩增, 能够大大提高扩增的灵敏度;
3. 在本发明的制备核酸文库的方法中, 接头连接 PC 使用接头连接引物在第二 PCR 扩增产物两端加上接头, 获得第三 PCR扩增产物, 从而能够直接用于下一步的测序;
4. 利用本发明的制备核酸文库的方法以及确定核酸样本的核酸序列的方法, 可以同时 对多个样品进行建库和测序, 且没有荧光背景的干扰;
5. 本发明的制备核酸文库的方法以及确定核酸样本的核酸序列的方法, 不受物种的限 制, 人、 动物、 微生物、 植物等均可以利用本发明的制备核酸文库的方法以及确定核酸样 本的核酸序列的方法, 进行个体式建库和测序;
6. 本发明的制备核酸文库的方法以及确定核酸样本的核酸序列的方法, 试验费用低, 灵敏度高、 精确度高、 重复性好。
下面将结合实施例对本发明的方案进行解释。 本领域技术人员将会理解, 下面的实施 例仅用于说明本发明, 而不应视为限定本发明的范围。 实施例中未注明具体技术或条件的, 按照本领域内的文献所描述的技术或条件(例如参考 J.萨姆布鲁克等著, 黄培堂等译的《分 子克隆实验指南》, 第三版, 科学出版社)或者按照产品说明书进行。 所用试剂或仪器未注 明生产厂商者, 均为可以通过市购获得的常规产品, 例如可以采购自 Illumina公司。 实施例 1: 单细胞基因组 DNA文库的制备及测序
本实施例以人血液中的单个淋巴细胞为样本, 利用本发明的制备核酸文库的方法, 分 别制备各样本的单细胞基因组 DNA文库, 具体步骤如下:
1. 样本来源: 血液单细胞样本分别来自于一个正常人 (YH, 炎黄计划样本)和一个唐氏综合症患者 (T21)的外周血。
2. 单细胞分离:
用口吸管方法共分离出 3个 YH ( YH1、 YH2、 YH3 )和 1个 T21外周血单个淋巴细胞, 分别置于含 2μ1碱性细胞裂解液 (200mM KOH, 50mM DTT)的 PC 管中, -80°C冻存 30分 钟以上, 备用。
3. DOP-PCR扩增(即低严谨度扩增、 预扩增):
将上述各 PC 管分别于 65 °C下处理 15分钟, 以便分别获得 4份单细胞 DNA, 然后分 别按照表 1中的配比配置各单细胞 DNA的 DOP反应体系。 表 1
组分 终浓度
lOx Pfx扩增緩冲液 lx
10x Enhancer Solution lx
10mM dNTP混合液 各 ImM
50mM MgS04 ImM
DOP引物 0.3 ~ 2μΜ
pfx DNA聚合酶 1 ~ 2.5单位
单细胞 DNA (按需要)
灭菌蒸馏水 至目标总体积 其中, 配置 DOP反应体系时, 应将表 1中的上述各组分混合, 并覆盖适量的矿物油, 盖上管盖后, 瞬时离心。 此外, 在各 DOP反应体系中, DOP引物序列为 GCTCTTCCGAT CT N(SEQ ID NO: 1)。 其中, 其 5'非筒并寡核苷酸序列为 GCTCTTCCGAT CT, 3'筒并寡核苷酸序列为 N N, N可以任意为 A、 T、 G和 C。
然后, 按照表 2所示的反应条件, 将上述配置获得的各 DOP反应体系进行 DOP-PC 扩增, 以便获得第一 PCR扩增产物。 表 2
温度 时间 循环数
94~98°C 2~5分钟 1
94~98°C 20秒 ~ 2分钟
10~ 19°C 5~30分钟
20~30°C 5~30分钟 1 - 10
31 ~40°C 5~30分钟
65~72°C 15秒 ~ 3分钟
4~ 12°C 保持 由于 DOP引物含 5 '非筒并寡核苷酸序列和 3 '筒并寡核苷酸序列, DOP-PCR扩增即可 将 DOP引物的 5'非筒并寡核苷酸序列加在第一 PCR扩增产物片段的两端。
4. 第二 PCR扩增 (即高严谨度扩增)
将能够与上述 DOP引物的 5'非筒并寡核苷酸序列结合的 DOP-Amp引物分别加入经过 DOP-PC 扩增的各 DOP反应体系中, 并使 DOP-Amp引物的终浓度为 2μΜ, 以便分别获 得各第二 PCR扩增反应体系。其中, DOP-Amp弓 |物序列为 GCTCTTCCGATCT( SEQ ID NO: 2), 即 DOP-Amp引物序列为上述 DOP引物的 5'端非筒并寡核苷酸序列部分。
然后, 按照表 3的反应条件, 将各第二 PCR扩增反应体系进行第二 PCR扩增反应, 以 便获得第二 PCR扩增产物。
表 3
温度 时间 循环数
94~98°C 2~5分钟 1
94~98°C 15秒~2分钟
50~60°C 15秒~2分钟 5~30
65~72°C 15秒~2分钟
4~ 12°C 保持
然后, 将上述获得的各第二 PCR扩增产物分别进行电泳及浓度检测, 合格后备用。 5. 接头连接 PC (Adaptor-Ligation PCR)
按照表 4中的配比,分别配制步骤 (4)获得的各第二 PCR扩增产物的接头连接 PCR反应 体系。 其中, 所采用的 DNA聚合酶为耐热的 DNA聚合酶 pfx酶。 表 4
组分 终浓度
lOx Pfx扩增緩冲液 2x
10x Enhancer Solution lx
lOmM dNTP混合液 各 OlmM
50mM MgS04 ImM
P5接头连接引物 2μΜ
P7接头连接引物 2μΜ
pfx DNA聚合酶 2U
DOP-PCR产物模板 (按需要)
灭菌蒸馏水 至目标总体积
其中, Ρ5接头连接引物和 Ρ7接头连接引物的序列分别为:
Ρ5接头连接引物:
CGCTCTTCCGATCT ( SEQ ID NO: 3 )„
P7接头连接引物:
CAGACGTGTGCTCTTCCGATCT。
其中, 各接头连接 PCR反应体系中, P5接头连接引物序列相同, 均为 SEQ ID NO: 3 所示的 Illumina Hiseq2000文库构建中的接头序列, 其包括结合 Flowcell和测序的接头。 如 前所述的 P7接头连接引物的序列, 其具有标签序列 XXXXXX, 并且在各接头连接 PC 反 应体系中, 标签不同。 并且, P5和 P7接头序列的最后 13bp均与 DOP-Amp引物序列一致。
具体地, 各接头连接 PC 反应体系的 P7序列如下:
YH1的接头连接 PC 反应体系的 P7序列:
CTTCCGATCT ( SEQ ID NO: 4 );
YH2的接头连接 PC 反应体系的 P7序列
CTTCCGATCT ( SEQ ID NO: 5 );
YH3的接头连接 PC 反应体系的 P7序列
CTTCCGATCT ( SEQ ID NO: 6 );
T21的接头连接 PC 反应体系的 P7序列: CTTCCGATCT ( SEQ ID NO: 7 )„
其中, 在 SEQ ID NO: 4-7中, 加粗斜体的 6个序列为标签序列。
然后, 按照下表 5中的反应条件, 将各接头连接 PCR反应体系分别进行接头连接 PCR 反应, 以便分别获得各第三 PCR扩增产物, 各第三 PCR扩增产物分别构成各样本的核酸文 库: YH1文库、 YH2文库、 YH3文库和 T21文库。 表 5
温度 时间 循环数
94~98°C 2~5分钟 1
94~98°C 15秒〜 •2分钟
50~60°C 15秒〜 •2分钟 0~10
65~72°C 15秒〜 •2分钟
94~98°C 15秒〜 •2分钟
58~65°C 15秒〜 •2分钟 5~20
65~72°C 15秒〜 •2分钟
4~ 12°C 保持 将上述各核酸文库进行电泳检测, 其电泳胶图见图 3。 如图 3所示, 泳道从左到右分别 为: Marker(D2000)、 YH1文库、 YH2文库、 YH3文库、 T21文库和阴性对照。
6.片段选择和纯化
上述各核酸文库, 即第三 PCR扩增产物, 均含引物二聚体等杂质, 且片段大小分布较 分散, 因此需进行片段选择和纯化, 具体步骤如下:
切取 300bp-500bp大小范围的胶块,然后利用 Qiagen公司的 QIAquick Gel Extraction Kit 进行 DNA片段回收和纯化。 具体操作参见试剂盒说明书。
然后, 将上述经片段选择和纯化的各核酸文库进行 Agilent 2100 bioanalyzer检测, 检测 结果见图 4。 如图 4所示, 图 4A为 YH1文库的检测结果, 需要的主带位于 377bp; 图 4B 为 YH2文库的检测结果, 需要的主带位于 326bp; 图 4C为 YH3文库的检测结果, 需要的 主带位于 339bp; 图 4D为 T21文库的检测结果, 需要的主带位于 360bp。 结果表明, 选择 和纯化的片段合格。
由此, 制备获得 4个单细胞样本的血液淋巴单细胞基因组 DNA文库。
7. Illumina Hiseq2000测序 利用 Single-End测序法, 通过一个测序通道, 将上述获得的各核酸文库同时进行测序, 其中, 读长 50bp。 测序数据统计结果见表 6。
表 6
样品名 文库编号 Read数 比对率 (%) 唯一比对率 (%)
YH1 YH1AADPPEI-1 20673324(20.7M) 93.86% 89.80%
YH2 YH2ABDPPEI-2 21352924(21.4M) 94.32% 90.03%
YH3 YH3ACDPPEI-3 21290934(21.3M) 93.46% 89.61%
T21 T21ADDPPEI-5 20980937(21.0M) 94.36% 90.27% 进一步, 分别统计 4 个核酸文库中, 每条常染色体的数据量占常染色体总数据量的比 例, 其中以 read数进行统计, 统计结果见表 7。
^7
染色体 YH1 YH2 YH3 T21
chrl 0.085 0.084 0.083 0.083
chr2 0.096 0.094 0.093 0.093
chr3 0.082 0.078 0.078 0.076
chr4 0.079 0.079 0.076 0.075
chr5 0.070 0.071 0.071 0.070
chr6 0.071 0.067 0.069 0.068
chr7 0.057 0.057 0.058 0.057
chr8 0.057 0.054 0.058 0.056
chr9 0.039 0.042 0.043 0.042
chrlO 0.047 0.046 0.045 0.046
chrll 0.048 0.050 0.050 0.049
chrl2 0.048 0.049 0.051 0.049
chrl 3 0.038 0.038 0.039 0.038
chrl4 0.031 0.034 0.034 0.034
chrl 5 0.026 0.029 0.029 0.028
chrl 6 0.025 0.023 0.023 0.024
chrl 7 0.021 0.021 0.021 0.021
chrl 8 0.029 0.031 0.028 0.029 chrl9 0.011 0.011 0.011 0.012
chr20 0.019 0.020 0.020 0.020
chr21 0.013 0.013 0.014 0.020
chr22 0.008 0.008 0.008 0.008 如表 7所示, T21文库的测序结果显示了 21三体单细胞 (T21)的 21号染色体数据量比 例明显高于 YH单细胞 (YH1, YH2, YH3, 均为核型正常), JLT21: YH接近 3: 2的比率。 表明利用本发明的制备核酸文库的方法能够有效地制备单细胞的基因组 DNA文库, 并且制 备获得的单细胞基因组 DNA文库可以有效地用于染色体数目异常的检测。 实施例 2
重复实施例 1, 其中与实施例 1的不同之处在于: DOP-PCR扩增的反应条件不同, 本 实施例的 DOP-PCR扩增的反应条件见下表 8: 表 8
温度 时间 循环数
94~98°C 2~5分钟 1
94~98°C 15秒~2分钟
10~ 19°C 20秒 ~ 5分钟
20~30°C 20秒 ~ 5分钟 5~25
31 ~40°C 20秒 ~ 5分钟
65~72°C 15秒~3分钟
4~ 12°C 保持
结果表明, 采用本实施例中的 DOP-PCR扩增反应条件, 同样能够达到构建核酸文库并 用于下一步测序及检测的目的。 实施例 3微量 DNA/cDNA样本的核酸文库的制备
本实施例的样本为微量 DNA/cDNA样本, 分别为微量的基因组 DNA, 免疫共沉淀 (IP) 产物 DNA, 血浆游离 DNA(PlasmaDNA), 以及 RNA反转录的 cDNA产物。 其中, IP产物 DNA、血浆游离 DNA、基因组 DNA(gDNA)样本均需进行 5倍梯度稀释,然后分别以 200pg、 40pg、 8pg的起始量进行文库制备。 cDNA样本则是由 1μβ的小鼠总 RNA用六核苷酸随机 引物经 Superscript II反转录酶进行反转录而获得的, 经过 5倍梯度稀释后分别以原浓度、 5—1、 5—2、 5—3的起始浓度进行文库制备。
本实施例利用本发明的制备核酸文库的方法, 按照以下步骤, 分别制备各微量 DNA/cDNA样本的核酸文库:
1、 DOP-PCR扩增
分别按照表 9中的配比配置各 量 DNA/cDNA样本的 DOP反应体系。
表 9
组分 终浓度
lOx Pfx扩增緩冲液 2x
10x Enhancer Solution lx
10mM dNTP混合液 各 0.5mM
50mM MgS04 ImM
DOP引物 0.8μΜ
pfx DNA聚合酶 2U
微量 DNA/cDNA (按需要)
灭菌蒸馏水 至目标总体积 其中, 配置 DOP反应体系时, 应将表 9中的上述各组分混合, 并覆盖适量的矿物油, 盖上管盖后, 瞬时离心。
然后, 按照下表 10 所示的反应条件, 将上述配置获得的各 DOP反应体系分别进行 DOP-PC 扩增, 以便获得第一 PC 扩增产物。
表 10
温度 时间 循环数
95 V 5分钟 1
95 V 1分钟
15 V 20分钟
25 V 20分钟 1 ~ 5
35 V 20分钟
68 V 1分钟
4V 保持
2、 第二 PCR扩增
将 DOP-Amp引物分别加入经过 DOP-PCR扩增的各 DOP反应体系中, 并使 DOP-Amp 引物的终浓度为 0.4μΜ, 以便分别获得各第二 PCR扩增反应体系。 然后, 按照表 11的反应条件 , 将各第二 PCR扩增反应体系进行第二 PCR扩增反应 , 以便获得第二 PCR扩增产物。
表 11
温度 时间 循环数
95 V 5分钟 1
95 V 1分钟
54V 30秒 15 - 20
68 V 30秒 ~ 1分钟
4V 保持
然后, 将上述获得的各第二 PCR扩增产物分别进行电泳及浓度检测, 合格后备用。 3、 接头连接 PCR
按照表 12中的配比, 分别配制上述获得的各第二 PCR扩增产物的接头连接 PCR反应 体系。 其中, 所采用的 DNA聚合酶为耐热的 DNA聚合酶 pfx酶。
表 12
组分 终浓度
lOx Pfx扩增緩冲液 2x
10x Enhancer Solution lx
10mM dNTP混合液 各 0.5mM
50mM MgS04 ImM
P5接头连接引物 0.4μΜ
P7接头连接引物 0.4μΜ
pfx DNA聚合酶 1单位
DOP-PCR产物模板 ΙΟμΙ
灭菌蒸馏水 至目标总体积 其中, 表 12中的 Ρ5接头连接引物与实施例 1中的 Ρ5接头连接引物的序列相同。 各接头连接 PC 反应体系的 Ρ7接头连接引物分别为:
IP产物 DNA样本的接头连接 PC 反应体系的 P7接头连接引物:
CTTCCGATCT ( SEQ ID NO: 4 );
Plasma DNA样本的接头连接 PCR反应体系的 P7接头连接引物: CTTCCGATCT ( SEQ ID NO: 5 );
cDNA样本的接头连接 PCR反应体系的 P7接头连接引物:
CAAGCAGAAGACGGCATAC CTTCCGATCT ( SEQ ID NO: 6 );
gDNA样本的接头连接 PCR反应体系的 P7接头连接引物:
CTTCCGATCT ( SEQ ID NO: 7 )„
其中, 在 SEQ ID NO: 4-7中, 加粗斜体的 6个序列为标签序列。
然后, 按照下表 13中的反应条件, 将各接头连接 PCR反应体系分别进行接头连接 PC
R反应, 以便分别获得各第三 PCR扩增产物, 各第三 PCR扩增产物分别构成各样本的核酸 文库。
表 13
温度 时间 循环数
95 V 5分钟 1
95 V 1分钟
54V 30秒 3 ~ 10
68 V 30秒 ~ 1分钟
95 V 1分钟
62 V 30秒 5 ~ 10
68 V 30秒 ~ 1分钟
4V 保持 将上述各核酸文库进行电泳检测, 其电泳胶图见图 5。 如图 5所示, 其中, 上图从左至 右的泳道分别为: 起始量分别为 200pg、 40pg、 8pg 的 IP 产物 DNA 的核酸文库、 Marker(D2000)、 起始量分别为 200pg、 40pg、 8pg的 Plasma DNA (血浆游离 DNA )的核酸 文库、 Marker(D2000); 下图从左至右的泳道分别为: Marker(D2000)、 起始浓度分别为原浓 度、 5人 5 5-3的 cDNA样本的核酸文库、 Marker(D2000)、 起始量分别为 200pg、 40pg、 8pg的 gDNA (基因组 DNA )样本的核酸文库、 阴性对照。
图 5表明, 本实施例成功制备了四种微量 DNA/cDNA样本的核酸文库。
5、 片段选择和纯化
按照以下步骤,对上述各核酸文库进行片段选择和纯化: 切取 200bp-500bp大小范围的 胶块, 然后利用 Qiagen公司的 QIAquick Gel Extraction Kit进行 DNA片段回收和纯化。 具 体操作见试剂盒说明书。
然后, 将上述经片段选择和纯化的各核酸文库进行 Agilent 2100 bioanalyzer检测, 检测 结果见图 6。 如图 6所示, 图 6 A为微量基因组 DN A-200pg检测结果, 图 6B为微量基因组 DNA-40pg检测结果, 图 6C 为免疫沉淀 (IP)DNA-200pg检测结果, 图 6D 为免疫沉淀 IP)DNA-40pg检测结果, 图 6E为 cDNA-原浓度检测结果, 图 6F为 cDNA-5— 1检测结果, 图 6G为血浆游离 DNA-200pg检测结果。
由此, 制备获得 4个微量 DNA/cDNA样本的核酸文库。
然后, 利用实施例 1 的测序方法分别对上述制备获得的各文库进行测序, 结果表明, 利用本发明的制备核酸文库的方法能够有效地制备微量 DNA/cDNA样本的核酸文库, 并且 制备获得的核酸文库可以用来进行疾病的检测和确诊。 实施例 4试剂盒
一种用于制备单细胞及微量核酸的文库的试剂盒, 其包括以下组分:
(1)第一容器以及位于容器内的 DOP引物;
(2)第二容器以及位于容器内的 DOP-Amp引物;
(3)第三容器以及位于容器内的接头连接引物;
(4)第四容器以及位于容器内的用于进行 DOP-PCR扩增所需的试剂;
(5)第五容器以及位于容器内的用于核酸纯化的试剂;
(6)第六容器以及位于容器内的用于进行高通量测序的测序芯片(flow cell);
(7)说明书。 工业实用性
本发明的制备核酸文库的方法、 确定核酸样本的核酸序列的方法以及试剂盒, 能够有 效地用于单细胞及 ί核酸样本的核酸文库的制备和测序, 并且灵敏度高、 准确度高、 可重 复性好, 从而能够有效地用于疾病的检测和确诊。 尽管本发明的具体实施方式已经得到详细的描述, 本领域技术人员将会理解。 根据已 经公开的所有教导, 可以对那些细节进行各种修改和替换, 这些改变均在本发明的保护范 围之内。 本发明的全部范围由所附权利要求及其任何等同物给出。
在本说明书的描述中, 参考术语 "一个实施例"、 "一些实施例"、 "示意性实施例"、 "示 例"、 "具体示例"、 或 "一些示例" 等的描述意指结合该实施例或示例描述的具体特征、 结 构、 材料或者特点包含于本发明的至少一个实施例或示例中。 在本说明书中, 对上述术语 的示意性表述不一定指的是相同的实施例或示例。 而且, 描述的具体特征、 结构、 材料或 者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。

Claims

权利要求书
1、 一种制备核酸文库的方法, 其特征在于, 包括以下步骤:
对核酸样本进行 DOP-PC 扩增, 以便获得第一 PCR扩增产物;
利用 DOP-Amp引物对所述第一 PCR扩增产物进行第二 PCR扩增,以便获得第二 PCR 扩增产物; 以及
对所述第二 PCR扩增产物进行接头连接 PCR, 以便获得第三 PCR扩增产物,所述第三 PC 扩增产物构成所述核酸文库。
2、根据权利要求 1所述的方法,其特征在于,所述核酸样本包含 2皮克〜 1微克的核酸。
3、根据权利要求 1所述的方法, 其特征在于, 所述第三 PCR扩增产物的 5'端和 3'端均 具有接头。
4、 根据权利要求 1所述的方法, 其特征在于, 进一步包括:
对所述第三 PCR扩增产物进行片段选择。
5、 根据权利要求 1所述的方法, 其特征在于, 所述核酸样本来自 1-200个单细胞。
6、 根据权利要求 1所述的方法, 其特征在于, 所述核酸样本为选自微量基因组 DNA、 免疫共沉淀产物 DNA、 游离 DNA和 cDNA的至少一种。
7、 根据权利要求 1所述的方法, 其特征在于, 所述核酸样本来源于环境、 体液和排泄 物的至少一种。
8、 根据权利要求 7所述的方法, 其特征在于, 所述核酸样本来源于土壤和水体的至少 一种。
9、 根据权利要求 7所述的方法, 其特征在于, 所述核酸样本来源于血浆和尿液的至少 一种。
10、 根据权利要求 1所述的方法, 其特征在于, 对所述核酸样本进行 DOP-PCR扩增之 前, 进一步包括将所述核酸样本进行化学处理或物理处理。
11、 根据权利要求 10所述的方法, 其特征在于, 所述化学处理为亚硫酸氢盐处理。
12、 根据权利要求 1所述的方法, 其特征在于, 利用 DOP引物进行所述 DOP-PCR扩 增, 其中所述 DOP引物具有筒并寡核苷酸区, 所述 DOP引物具有位于 5'端的非筒并寡核 苷酸区、 位于中部的筒并寡核苷酸区和位于 3'端的锚定区。
13、 根据权利要求 12所述的方法, 其特征在于, 所述 DOP引物具有位于 5'端的非筒 并寡核苷酸区, 以及位于中部和 3'端的筒并寡核苷酸区。
14、根据权利要求 12所述的方法,其特征在于, 所述 DOP引物的 3'端锚定区具有 2-12 个核苷酸。
15、 根据权利要求 14所述的方法, 其特征在于, 所述 DOP引物的 3'端锚定区具有 4-8 个核苷酸。
16、 根据权利要求 12所述的方法, 其特征在于, 所述 DOP引物的 3'端锚定区具有选 自 TG、 ATGTGG、 TGTGG和 GTCT的至少一种的核苷酸序列。
17、 根据权利要求 12或 13所述的方法, 其特征在于, 所述 DOP引物的 5'端非筒并寡 核苷酸区具有 SEQ ID NO: 2所示的核苷酸序列。
18、 根据权利要求 12或 13所述的方法, 其特征在于, 所述 DOP引物的 5'端非筒并寡 核苷酸区的核苷酸序列与 SEQ ID NO: 2所示核苷酸序列的同源性 > 50%。
19、 根据权利要求 12或 13所述的方法, 其特征在于, 所述 DOP引物的 5'端非筒并寡 核苷酸区的核苷酸序列长度为 5-30bp, 优选 5-20bp, 更优选 6-13bp。
20、根据权利要求 12所述的方法, 其特征在于, 所述筒并寡核苷酸区具有 3-20个核苷 酸。
21、根据权利要求 12或 13所述的方法,其特征在于,所述 DOP-Amp引物与所述 DOP 引物的 5'端非筒并寡核苷酸区的核苷酸序列互补或基本上互补。
22、根据权利要求 12或 13所述的方法, 其特征在于, 所述 DOP-Amp引物能够与所述 DOP引物的 5'端非筒并寡核苷酸区结合。
23、根据权利要求 12或 13所述的方法, 其特征在于, 所述 DOP-Amp引物具有如 SEQ ID NO: 2所示的核苷酸序列。
24、 根据权利要求 1、 22或 23所述的方法, 其特征在于, 利用接头连接引物进行所述 接头连接 PCR, 其中所述接头连接引物的 3'端具有能够与所述 DOP-Amp引物匹配的核苷 酸序列。
25、 根据权利要求 24所述的方法, 其特征在于, 所述接头连接引物包含标签序列。
26、 根据权利要求 4 所述的方法, 其特征在于, 所述片段选择进一步包括: 选择长度 为 100-lOOObp的第三 PCR扩增产物。
27、 根据权利要求 4 所述的方法, 其特征在于, 所述片段选择进一步包括: 选择长度 为 200-500bp的第三 PCR扩增产物。
28、 一种确定核酸样本的核酸序列的方法, 其特征在于, 包括以下步骤:
(i)利用权利要求 1-27任一所述的方法, 构建所述核酸样本的核酸文库; 以及
(ii)对所述核酸文库进行测序和数据分析, 以便确定所述核酸样本的核酸序列。
29、 根据权利要求 28所述的方法, 其特征在于, 所述核酸样本为选自单细胞和微量核 酸样本的至少一种。
30、 根据权利要求 28所述的方法, 其特征在于, 利用高通量测序平台进行所述测序。
31、根据权利要求 30所述的方法,其特征在于,利用选自 Roche454 FLX. Illumina Solexa 和 ABI SOLID测序平台的至少一种进行所述测序。
32、 一种试剂盒, 其特征在于, 包括:
(1) DOP引物;
(2) DOP-Amp引物; 以及
(3)接头连接引物。
33、 根据权利要求 32所述的试剂盒, 其特征在于, 所述 DOP引物、 DOP-Amp引物和 接头连接? I物分别设置于不同容器。
34、 根据权利要求 32所述的试剂盒, 其特征在于, 所述 DOP引物具有筒并寡核苷酸 区, 其中所述 DOP引物具有位于 5'端的非筒并寡核苷酸区、 位于中部的筒并寡核苷酸区和 位于 3'端的锚定区。
35、 根据权利要求 32所述的试剂盒, 其特征在于, 所述 DOP引物具有位于 5'端的非 筒并寡核苷酸区, 以及位于中部和 3'端的筒并寡核苷酸区。
36、根据权利要求 34所述的方法,其特征在于, 所述 DOP引物的 3'端锚定区具有 2-12 个核苷酸。
37、 根据权利要求 36所述的试剂盒, 其特征在于, 所述 DOP引物的 3'端锚定区具有 4-8个核苷酸。
38、 根据权利要求 34所述的试剂盒, 其特征在于, 所述 DOP引物的 3'端锚定区具有 选自 TG、 ATGTGG、 TGTGG和 GTCT的至少一种的核苷酸序列。
39、 根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP引物的 5'端非筒并 寡核苷酸区具有 SEQ ID NO: 2所示的核苷酸序列。
40、 根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP引物的 5'端非筒并 寡核苷酸区的核苷酸序列与 SEQ ID NO: 2所示核苷酸序列的同源性 > 50%。
41、 根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP引物的 5'端非筒并 寡核苷酸区的核苷酸序列长度为 5-30bp, 优选 5-20bp, 更优选 6-13bp。
42、根据权利要求 34或 35所述的试剂盒,其特征在于,所述筒并寡核苷酸区具有 3-20 个核苷酸。
43、 根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP-Amp引物与所述
DOP引物的 5'端非筒并寡核苷酸区的核苷酸序列互补或基本上互补。
44、根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP-Amp引物能够与所 述 DOP引物的 5'端非筒并寡核苷酸区结合。
45、 根据权利要求 34或 35所述的试剂盒, 其特征在于, 所述 DOP-Amp引物具有如 SEQ ID NO: 2所示的核苷酸序列。
46、 根据权利要求 43或 44所述的试剂盒, 其特征在于, 所述接头连接引物的 3'端具 有能够与所述 DOP-Amp引物匹配的核苷酸序列。
47、 根据权利要求 32所述的试剂盒, 其特征在于, 所述接头连接引物包含标签序列。
PCT/CN2012/083036 2011-10-18 2012-10-16 核酸文库的制备方法及其应用以及试剂盒 Ceased WO2013056640A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP12841309.3A EP2770090B1 (en) 2011-10-18 2012-10-16 Method for preparing nucleic acid library, its uses and kits
US14/352,492 US9359642B2 (en) 2011-10-18 2012-10-16 Method for preparing nucleic acid library, its uses and kits

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110316066.8A CN103060924B (zh) 2011-10-18 2011-10-18 微量核酸样本的文库制备方法及其应用
CN201110316066.8 2011-10-18

Publications (1)

Publication Number Publication Date
WO2013056640A1 true WO2013056640A1 (zh) 2013-04-25

Family

ID=48103808

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/083036 Ceased WO2013056640A1 (zh) 2011-10-18 2012-10-16 核酸文库的制备方法及其应用以及试剂盒

Country Status (5)

Country Link
US (1) US9359642B2 (zh)
EP (1) EP2770090B1 (zh)
CN (1) CN103060924B (zh)
TW (1) TW201321518A (zh)
WO (1) WO2013056640A1 (zh)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104153004A (zh) * 2014-08-11 2014-11-19 上海美吉生物医药科技有限公司 一种用于扩增子测序的建库方法
CN105368920A (zh) * 2014-08-20 2016-03-02 北京百诺奇生物科技有限公司 一种检测结核分枝杆菌及其耐药基因突变的试剂盒及应用
CN105506063A (zh) * 2014-09-22 2016-04-20 深圳华大基因科技有限公司 引物组合物及其用途
CN104263726A (zh) * 2014-09-25 2015-01-07 天津诺禾致源生物信息科技有限公司 适用于扩增子测序文库构建的引物及扩增子测序文库的构建方法
CN105524983B (zh) * 2014-09-30 2019-06-21 大连晶泰生物技术有限公司 基于高通量测序的标记和捕获多个样本的一个或多个特定基因的方法和试剂盒
US10344317B2 (en) * 2014-10-13 2019-07-09 Mgi Tech Co., Ltd Method and a sequence combination for producing nucleic acid fragments
US10563196B2 (en) * 2014-10-17 2020-02-18 Mgi Tech Co., Ltd Primer for nucleic acid random fragmentation and nucleic acid random fragmentation method
CN104357918B (zh) * 2014-11-25 2016-08-17 北京阅微基因技术有限公司 血浆游离dna文库的构建方法
CN104963000B (zh) * 2014-12-15 2021-04-06 杭州贝瑞和康基因诊断技术有限公司 一种快速构建单细胞dna测序文库的方法和试剂盒
US20170349926A1 (en) * 2014-12-22 2017-12-07 DNAe Group Holdings LTD. Bubble primers
CA2995468A1 (en) * 2015-08-12 2017-02-16 Circulogene Theranostics, Llc Method of preparing cell free nucleic acid molecules by in situ amplification
CN105332063B (zh) * 2015-08-13 2017-04-12 厦门飞朔生物技术有限公司 一种单管高通量测序文库的构建方法
CN105219766B (zh) * 2015-11-10 2017-12-22 东华大学 一种三轮扩增的多重pcr方法
CN106555226B (zh) * 2016-04-14 2019-07-23 大连晶泰生物技术有限公司 一种构建高通量测序文库的方法和试剂盒
CN106011230A (zh) * 2016-05-10 2016-10-12 人和未来生物科技(长沙)有限公司 用于检测碎片化dna目标区域的引物组合物及其应用
CN106283201B (zh) * 2016-09-20 2019-08-06 中国医学科学院肿瘤医院 基于高通量测序的tcr多样性检测和文库构建
CN106636063A (zh) * 2016-09-27 2017-05-10 广州精科医学检验所有限公司 引物组合物、其用途、构建文库和确定核酸序列的方法
CN106554999B (zh) * 2016-10-25 2020-03-17 大连晶泰生物技术有限公司 高通量检测新生儿糖尿病致病基因突变位点的测序文库构建方法、试剂盒及其用途
CN106554955B (zh) * 2016-10-25 2020-07-14 大连晶泰生物技术有限公司 构建pkhd1基因突变的测序文库的方法和试剂盒及其用途
CN108504649B (zh) * 2017-02-24 2021-03-05 金弗康生物科技(上海)股份有限公司 编码pcr二代测序建库方法、试剂盒及检测方法
JP7047373B2 (ja) * 2017-12-25 2022-04-05 トヨタ自動車株式会社 次世代シーケンサー用プライマー並びにその製造方法、次世代シーケンサー用プライマーを用いたdnaライブラリー並びにその製造方法、及びdnaライブラリーを用いたゲノムdna解析方法
CN108103175B (zh) * 2018-01-02 2021-09-17 序康医疗科技(苏州)有限公司 一种用于检测eml4-alk、ros1和ret融合基因突变的方法
CN108103055B (zh) * 2018-01-09 2021-05-28 序康医疗科技(苏州)有限公司 一种单细胞rna逆转录与文库构建的方法
IL271454B2 (en) * 2018-05-17 2025-04-01 Illumina Inc Rapid single-cell genetic sequencing with low Aggregation bias
CN109056077A (zh) * 2018-09-13 2018-12-21 武汉菲沙基因信息有限公司 一种适用于PacBio测序平台的扩增子混样测序文库构建方法
WO2020077400A1 (en) * 2018-10-17 2020-04-23 PerkinElmer Health Sciences (Australia) Pty Ltd Barcoding of nucleic acids
AU2020204908A1 (en) * 2019-01-04 2021-07-29 William Marsh Rice University Quantitative amplicon sequencing for multiplexed copy number variation detection and allele ratio quantitation
GB201911515D0 (en) * 2019-08-12 2019-09-25 Univ London Queen Mary Methods for generating a population of polynucleotide molecules
CN111041069B (zh) * 2019-12-26 2021-01-19 人和未来生物科技(长沙)有限公司 一种低起始量dna样本的高通量测序文库构建方法及其应用
CN111139315A (zh) * 2020-04-03 2020-05-12 杭州启棣生物技术有限公司 一种利用二代测序高通量检测呼吸道病毒的方法与应用
CN112359101B (zh) * 2020-11-13 2023-10-03 苏州金唯智生物科技有限公司 一种质检寡核苷酸交叉污染的方法
CN112626214A (zh) * 2020-12-29 2021-04-09 嘉兴允英医学检验有限公司 检测1p/19q杂合性缺失的引物组、试剂盒及方法
CN114015751A (zh) * 2021-10-26 2022-02-08 江苏海伯基因科技有限公司 扩增基因组dna的方法、试剂盒及其获得扩增引物的方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0522310D0 (en) * 2005-11-01 2005-12-07 Solexa Ltd Methods of preparing libraries of template polynucleotides
US20090099040A1 (en) * 2007-10-15 2009-04-16 Sigma Aldrich Company Degenerate oligonucleotides and their uses
US9914950B2 (en) * 2012-08-23 2018-03-13 Tufts University Homopolymer mediated nucleic acid amplification

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
See also references of EP2770090A4 *
VIVIAN, G. ET AL.: "Whole genome amplification using a degenerate oligonucleotide primer allows hundreds of genotypes to be performed on less than one nanogram of genomic DNA", PROC. NATL. ACAD. SCI. USA, vol. 93, 1996, pages 14676 - 14679, XP002126260 *
WAN, XIUQING ET AL.: "Identification of Tobacco Transgenic Events with Adaptor Ligation PCR Walking Methods", TOBACCO SCIENCE & TECHNOLOGY, 2007, pages 49 - 53, XP008170310 *
WANG, JING ET AL.: "Study on the Factors Influencing the Whole Genome Amplification from a Single Cell by Degenerate Oligonucleotide Primered PCR", JOURNAL OF ANHUI AGRICULTURAL SCIENCES, vol. 36, no. 16, 2008, pages 6675 - 6677, XP008170451 *

Also Published As

Publication number Publication date
EP2770090A4 (en) 2014-10-22
EP2770090B1 (en) 2018-04-18
US20140296084A1 (en) 2014-10-02
CN103060924A (zh) 2013-04-24
TW201321518A (zh) 2013-06-01
EP2770090A1 (en) 2014-08-27
CN103060924B (zh) 2016-04-20
HK1183698A1 (zh) 2014-01-03
US9359642B2 (en) 2016-06-07

Similar Documents

Publication Publication Date Title
WO2013056640A1 (zh) 核酸文库的制备方法及其应用以及试剂盒
CN110997932B (zh) 用于甲基化测序的单细胞全基因组文库
JP7379418B2 (ja) 腫瘍のディープシークエンシングプロファイリング
CN103088433B (zh) 全基因组甲基化高通量测序文库的构建方法及其应用
CN108138364B (zh) 一种核酸单链环状文库的构建方法和试剂
CN103806111A (zh) 高通量测序文库的构建方法及其应用
WO2012159564A1 (zh) 甲基化高通量检测方法
CN108138228B (zh) 用于下一代测序的高分子量dna样品追踪标签
AU2020220461A1 (en) Haplotagging - haplotype phasing and single-tube combinatorial barcoding of nucleic acid molecules using bead-immobilized Tn5 transposase
WO2013071876A1 (zh) 高通量测序文库的构建方法及其应用
WO2012037878A1 (zh) 核酸标签及其应用
CN111808854B (zh) 带有分子条码的平衡接头及快速构建转录组文库的方法
CN113710815B (zh) 用于多重拷贝数变异检测和等位基因比率定量的定量扩增子测序
CN114787385A (zh) 用于检测核酸修饰的方法和系统
KR20200005658A (ko) 서열-기반의 유전 검사용 대조군을 제조하기 위한 조성물 및 방법
WO2023116373A1 (zh) 一种生成标记的核酸分子群的方法及其试剂盒
CN117915922A (zh) 与假尿苷和5-羟甲基胞嘧啶的修饰和检测相关的组合物和方法
US20230383339A1 (en) Multiplexed methylated dna immunoprecipitation sequencing to study dna methylation using low amounts of dna
WO2023237180A1 (en) Optimised set of oligonucleotides for bulk rna barcoding and sequencing
WO2023025784A1 (en) Optimised set of oligonucleotides for bulk rna barcoding and sequencing
EP4430209A1 (en) Target enrichment and quantification utilizing isothermally linear-amplified probes
CN117545852A (zh) 用于消耗和/或富集由生物样品制备的文库片段的固体载体和方法
HK40027672B (zh) 用於甲基化测序的单细胞全基因组文库
HK1260078A1 (zh) 腫瘤的深度測序概況分析
HK1225074B (zh) 探针及其用途

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12841309

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 14352492

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2012841309

Country of ref document: EP