CN105200530A - Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing - Google Patents
Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing Download PDFInfo
- Publication number
- CN105200530A CN105200530A CN201510657439.6A CN201510657439A CN105200530A CN 105200530 A CN105200530 A CN 105200530A CN 201510657439 A CN201510657439 A CN 201510657439A CN 105200530 A CN105200530 A CN 105200530A
- Authority
- CN
- China
- Prior art keywords
- sample
- construction process
- sequence
- library
- genome
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 56
- 238000012070 whole genome sequencing analysis Methods 0.000 title abstract 3
- 239000012634 fragment Substances 0.000 claims abstract description 24
- 238000012163 sequencing technique Methods 0.000 claims abstract description 16
- 238000006243 chemical reaction Methods 0.000 claims description 29
- 230000008569 process Effects 0.000 claims description 28
- 238000010276 construction Methods 0.000 claims description 25
- 238000012408 PCR amplification Methods 0.000 claims description 22
- 239000000047 product Substances 0.000 claims description 22
- 229920002684 Sepharose Polymers 0.000 claims description 11
- 230000004087 circulation Effects 0.000 claims description 7
- 238000000926 separation method Methods 0.000 claims description 7
- 238000013467 fragmentation Methods 0.000 claims description 6
- 238000006062 fragmentation reaction Methods 0.000 claims description 6
- 238000012268 genome sequencing Methods 0.000 claims description 6
- 230000026731 phosphorylation Effects 0.000 claims description 6
- 238000006366 phosphorylation reaction Methods 0.000 claims description 6
- 238000000746 purification Methods 0.000 claims description 6
- 239000012264 purified product Substances 0.000 claims description 5
- 238000005520 cutting process Methods 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 230000004044 response Effects 0.000 claims description 2
- 238000001514 detection method Methods 0.000 abstract description 9
- 230000002068 genetic effect Effects 0.000 abstract description 5
- 238000011161 development Methods 0.000 abstract description 4
- 230000004907 flux Effects 0.000 abstract description 3
- 238000010219 correlation analysis Methods 0.000 abstract 1
- 239000011324 bead Substances 0.000 description 17
- 239000003153 chemical reaction reagent Substances 0.000 description 16
- 239000003292 glue Substances 0.000 description 14
- 238000001962 electrophoresis Methods 0.000 description 11
- 241000209094 Oryza Species 0.000 description 7
- 235000007164 Oryza sativa Nutrition 0.000 description 7
- 239000002184 metal Substances 0.000 description 7
- 235000009566 rice Nutrition 0.000 description 7
- 238000012360 testing method Methods 0.000 description 6
- 238000007405 data analysis Methods 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000012141 concentrate Substances 0.000 description 3
- 230000001186 cumulative effect Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000007689 inspection Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 108090000623 proteins and genes Proteins 0.000 description 3
- 238000003908 quality control method Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a method for establishing a multi-sample hybrid library suitable for high-flux whole-genome sequencing. The method comprises the following steps: breaking genome DNA of a plurality of samples, distributing a concentration zone at 500bp, simultaneously carrying out the 5'-end restoration and adding a basic group A to a 3' end after the sample is broken, connecting DNA fragments having 5'phosphoration and 3' viscous end A of different samples and connectors containing different barcode sequences, carrying out low-circulation PCR for a connection product, purifying and mixing an amplified product to obtain a multi-sample hybrid sequencing library suitable for the high-flux whole-genome sequencing. The method has the advantages of high flux, low cost and short time, the generated library sequencing quality is high, the method is suitable for establishing a large-scale multi-sample sequencing library, and the application prospect is very wide in the research such as high-flux SNP detection and identifier development, genetic map drawing, whole-genome correlation analysis and the like.
Description
Technical field
The present invention relates to field of bioinformatics, specifically, relate to a kind of construction process being applicable to the Multi-example mixing sequencing library of high-flux sequence.
Background technology
In the order-checking of two generations, usually Illumina, NEB or Vazyme test kit Library development flow used is mature and stable, after sample has interrupted, the follow-up DNA chain end carried out is repaired, 3 ' end adds base step A does not all need purifying, next step reaction can be carried out, therefore build storehouse process simple and efficient, and it is high to build Kucheng's power; But build storehouse for multiple sample, test kit conveniently builds storehouse, then complex operation, especially the library fragments in later stage is selected, need single sample to be carried out respectively electrophoresis cut glue screening fragment or utilize magnetic bead to screen fragment, cause building storehouse cost intensive, need consume a large amount of time and build storehouse expense.The high-throughput of prior art simplifies gene order-checking library constructing method mainly for not with reference to genomic sample in addition, cut prediction by enzyme to carry out enzyme and cut and interrupt, build library, randomness is good not, again by removing tumor-necrosis factor glycoproteins and other redundant sequences, by remaining data analysis, obtain relevant associated region, accuracy is not high enough.
Summary of the invention
The object of this invention is to provide a kind of construction process being applicable to the Multi-example mixing sequencing library of high-throughput genome sequencing, to make up in prior art the deficiency building such library longer, complex operation consuming time.
The construction process being applicable to the Multi-example mixing sequencing library of high-throughput genome sequencing provided by the invention, comprises the following steps:
(1) genomic dna of different sample interrupts, and obtains the DNA fragmentation concentrating band to be distributed in 500bp;
(2) sample DNA of having no progeny of simultaneously fighting each other carries out the flat end reparation of 5 ' phosphorylation and 3 ' end interpolation base A, obtains the DNA fragmentation with 5 ' phosphorylation and 3 ' sticky end A;
(3) respectively 5 ' the phosphorylation that has of different sample is connected with the joint containing the unique barcode sequence can distinguishing sample with the DNA fragmentation of 3 ' sticky end A, obtains the connection product can distinguishing sample source;
(4) respectively pcr amplification is carried out to the connection product containing different barcode, obtain the amplified production containing different bar code sequence, purifying, quantitatively, mixed sample; Mixed PCR primer carries out sepharose separation and purification, and purified product forms the multiple sample mixing sequencing libraries being suitable for high-throughput genome sequencing.
In the inventive method, the genomic dna concentration described in step (1) is 20 ~ 100ng/ μ L.Described genomic dna be through agarose gel electrophoresis detect and ultraviolet spectrophotometer method detection.
Wherein, the sample gene group DNA of step (1) also comprises magnetic beads for purifying step after interrupting.
In the inventive method, the reaction system of step (2) is:
Wherein, the reaction conditions of step (2) is: 20 DEG C of 30min, 65 DEG C of 30min.
In the inventive method, the sequence of step (3) joint contains the normal chain of 8 base bar code sequences and the minus strand of 8 base bar code sequences.
Step (3) described joint, its sequence is:
Normal chain 5 '-3 ':
AATGATACGGCGACCACCGAGATCTACACXXXXXXXXTCTTTCCCTACACGACGCTCTTCCGATCT
Minus strand 5 '-3 ':
CAAGCAGAAGACGGCATACGAGATYYYYYYYYGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT;
Wherein: the XXXXXXXX in normal chain represents normal chain bar code sequence, the YYYYYYYY in minus strand represents minus strand bar code sequence.
The bar code sequence of above-mentioned positive and negative chain is the random combine of A, T, C, G tetra-kinds of bases, can design according to the actual needs of those skilled in the art; The bar code sequence of positive and negative chain need make for distinguishing sample simultaneously.
In step (4), pcr amplification primer sequence used is:
P1:5’-3’:AATGATACGGCGACCGAGATCTACAC
P2:5’-3’:CAAGCAGAAGACGGCATACGAGAT。
In the inventive method, the PCR response procedures of step (4) is: 95-98 DEG C of 3min; 95-98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30-40sec, totally 10 circulations; 72 DEG C of 5min.
It is that the fragment cutting 650bp lower edges carries out purifying that step (4) mixed PCR primer carries out sepharose separation and purification.
The invention provides the application of above-mentioned construction process in Multi-example genome association is analyzed.
In the step (4) of the inventive method, after purifying, the quantitative detecting method of amplified production can be ultraviolet spectrophotometer method, nucleic acid fluorescent quantitative method, agarose gel electrophoresis detection.
The inventive method is the improvement on Vazyme test kit banking process, the method of building storehouse is carried out for batch sample, magnetic beads for purifying is carried out after sample interrupts, DNA end reparation afterwards, 3 ' end add base step A and all operate on 96 orifice plates, greatly improve flux and efficiency, when jointing, add the both-end Index that can identify different sample simultaneously, PCR carries out enrichment, is then mixed in the library after different pcr amplification, cuts glue carry out library fragments selection finally by electrophoresis.The method is compared with traditional method, and 96 orifice plate batch operations can improve flux greatly, becomes a library after final multiple sample mixes sample, selects fragment step saves a large amount of reagent cost, human cost and time at follow-up glue of cutting.The method is specially adapted to resurvey sequence and the low genome of the order-checking degree of depth of the little biological genome of genome and resurveys sequence.
Technical scheme of the present invention can bring following beneficial effect:
(1) sample initial amount requires to reduce.In the prior art, conventional single library construction needs at least 5 μ g samples, therefore higher to the demand of sample, cause the rare sample of part because less not the reaching of total amount builds library standard, the minimizing of purification step in Library development flow of the present invention, initial amount can be reduced to 2.5 μ g by 5 original μ g, the DNA initial amount of this level, both reduced the requirement to sample, also meet the output to follow-up data and analysis, and the reduction of initial amount had made to build storehouse reagent cost reduction simultaneously simultaneously;
(2) for the sample had with reference to genome sequence, carry out machinery and interrupt at random, build library, final literature data covers full-length genome, and randomness is better; By order-checking gained all data by with compare with reference to genome, therefore higher to the SNP site of interpretation of result, Indel site accuracy;
(3) before mixed sample, the advanced performing PCR amplification of each sample, namely PCR enriched product is obtained, carry out quantitative and mixed sample again, if first carry out mixed sample, first mixed sample can be caused irregular due to the content difference of different sample, and the difference of this sample room can be amplified further by the PCR process after mixed sample, therefore larger on the output impact of later stage different sample data amount, and the present invention first carries out pcr amplification to each sample, and then carry out mixed sample, this operation can avoid mixed sample error to be amplified further after pcr amplification, makes the homogeneity of sample better;
(4) pcr amplification has Preference, in pcr amplification process, primer may be partial in conjunction with genomic certain section of region, a large amount of amplification this section of region, therefore the product obtained there will be genomic covering uneven, and then redundancy is high to make sequencing data occur, so, in experimentation, in order to avoid too high and the reducing genomic coverage of causing of later data redundancy, the present invention by the design of the cycle number of pcr amplification lower than 10 circulations, like this, effectively can reduce the Preference that pcr amplification brings, ensure that the validity of data analysis;
(5) in conventional Library development flow, single sample needs to cut glue screening fragment respectively, each sample runs glue from point sample, electrophoresis, contaminate glue, cut glue to expend time in length to purifying, waste time and energy especially for Multi-example, and present design carries out cutting glue after being mixed by sample, such as, 100 samples cut the glue time, the present invention can be utilized completely required time to be down to the time of 1 sample, therefore to greatly reduce time and cost that single sample electrophoresis cuts glue.
For the library construction of 100 samples, conventionally build storehouse, building the storehouse cycle reaches 10 working dayss, and utilizes the inventive method 100 samples can be built storehouse cycle time to 4 working days, greatly saves and builds the storehouse time.On the other hand, to resurvey sequence for resurvey sequence and the low genome of the order-checking degree of depth of the biological genome that genome is little, library is built separately according to existing routine techniques, not only require higher to genomic dna initial amount, and it is higher to build storehouse reagent cost, utilizing the inventive method, reduce the DNA initial amount building storehouse, making to build this reduction of Kucheng by reducing using amount of reagent simultaneously.
Accompanying drawing explanation
Fig. 1 is the distribution plan of data analysis Insert Fragment after the embodiment of the present invention 1 checks order.
Fig. 2 is the distribution plan of data analysis Insert Fragment after the embodiment of the present invention 2 checks order.
Fig. 3 is the distribution plan of data analysis Insert Fragment after the embodiment of the present invention 3 checks order.
Fig. 4 is the schema that the Multi-example mixing sequencing library of high-flux sequence of the present invention builds.
Embodiment
Following examples for illustration of the present invention, but are not used for limiting the scope of the invention.If do not specialize, the conventional means that technique means used in embodiment is well known to those skilled in the art.Biochemical reagents used in following examples are commercially available.
In following embodiment, its sequence of the joint of use is:
Normal chain 5 '-3 ':
AATGATACGGCGACCACCGAGATCTACACXXXXXXXXTCTTTCCCTACACGACGCTCTTCCGATCT
Minus strand 5 '-3 ':
CAAGCAGAAGACGGCATACGAGATYYYYYYYYGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT;
Wherein: the XXXXXXXX in normal chain represents normal chain bar code sequence, the YYYYYYYY in minus strand represents minus strand bar code sequence.
In following embodiment, pcr amplification primer sequence used is:
P1:5’-3’:AATGATACGGCGACCGAGATCTACAC
P2:5’-3’:CAAGCAGAAGACGGCATACGAGAT。
Embodiment 120 paddy rice samples mix the construction process (1) in sample library
Select 20 paddy rice samples, library construction comprises the steps:
1, genomic DNA fragment and purifying are got 20 paddy rice sample gene group DNA5 μ g respectively and are moved into the low absorption centrifuge tube of 1.5mL, use ddH
2o mends to cumulative volume 55 μ L, and utilize Ultrasonic Cell Disruptor to carry out machinery to sample and interrupt, the time of interrupting is 5min, and electrophoresis detection concentrates band to be positioned at 500bp.Residue interrupts product and is determining that getting half after segment ranges correctly interrupts product (25 μ L), carries out purifying with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes, finally with 18 μ LddH
2o carries out back dissolving.
2, end reparation, add reagent to interrupting in product after the 1st step purifying, reaction system is as follows:
Reaction system is placed in constant-temperature metal bath 20 DEG C and hatches 30min, carry out purifying with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes, finally with 17 μ LddH
2o carries out back dissolving.
3,3 '-end adds A reaction, and in the product after the 2nd step purifying, add reagent, reaction system is:
Reaction system is placed in constant-temperature metal bath 37 DEG C and hatches 30min, carries out purifying with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes, finally with 30 μ LddH
2o carries out back dissolving.
4, jointing reaction, in the product after the 3rd step purifying, add reagent, wherein C4-N is the joint with Index, synthesizes in Takara) reaction system is as follows:
Reaction system is placed in constant-temperature metal bath 20 DEG C and hatches 30min, carries out purifying with the AMpure magnetic bead (72 μ L) of 1.8 times of volumes, finally with 30 μ LddH
2o carries out back dissolving.
5, pcr amplification PCR reaction system:
Pcr amplification is carried out: 98 DEG C of 3min according to the program of following settings; 98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30sec, 10 circulations; 72 DEG C of 5min.
Get 5 μ LPCR products, carry out electrophoresis (1.8% sepharose) with 100bpDNALadderMarker, detect between 500bp-700bp whether have disperse.Purifying is carried out, finally with 30 μ LddH with the AMpure magnetic bead (40 μ L) of 1 times of volume
2o carries out back dissolving.
6, Concentration Testing and mixed sample
The PCR primer of getting after 2 μ L above-mentioned steps purifying carries out Nanodrop detection.Mixed sample is carried out according to detected result.By unified for the sample Cmin value be diluted in sample, each sample is got 150ng and is carried out mixed sample.
7, cut glue to reclaim and get the mixed PCR purified product of 600ng and carry out sepharose (2%) electrophoretic separation, cut the fragment of 650bp lower edges.QIAquickGelExtractionKit purifying is utilized to cut film segment, 35 μ LElutionBuffer wash-outs.
8, corresponding reagent is added: Q5E is that Q5 surpasses fidelity dna polysaccharase.
Pcr amplification is carried out: 98 DEG C of 3min according to the program of following settings; 98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30sec, 10 circulations; 72 DEG C of 5min.
Get 5 μ LPCR products, carry out electrophoresis (1.8% sepharose) with 100bpDNALadderMarker, detect whether clip size is about 650bp, carry out purifying with the AMpure magnetic bead (40 μ L) of 1 times of volume, finally with 30 μ LddH
2o carries out back dissolving.
9, the sequencing library that Library Quality controls the 8th step obtains carries out quality examination on Agilent2100Bioanalyzer, and segment ranges meets substantially cuts glue scope.
10, check order and to check order being diluted to proper concn by the library of quality inspection in the 9th step, order-checking platform is IlluminaHighSeq2500, and order-checking length is both-end 125bp, and the Reads Insert Fragment size distribution that order-checking produces is shown in Fig. 1.Insert Fragment main peak is distributed between 400-500bp as seen from Figure 1, but between 150-400bp, there is a large amount of small segment, reason may be that the global cycle number of twice PCR is too much, small segment excessive amplification caused, in addition, pcr amplification cycle number too much can cause data redudancy high, genome coverage low (see table 1), the redundancy of 1X data is up to 70%, genome coverage is on average only 20%, be not enough to carry out genetic map analysis, therefore, carry out embodiment 2, flow process is optimized, save second step PCR process, reduce total PCR cycle number.
Table 1 single sample data measured amount, data redudancy and genome coverage
In the present embodiment, because pcr amplification cycle number is too much, cause later data to analyze Insert Fragment abnormal, data redudancy is high, and coverage is low, is not enough to carry out genetic map analysis.
Embodiment 220 paddy rice samples mix sample sequencing library construction process (2)
The present embodiment selects 20 paddy rice samples, on the basis of embodiment 1 method, is optimized flow process, saves second step PCR process, thus reduces total PCR cycle number.Library construction comprises the steps:
1, genomic DNA fragment and purifying
Get genomic dna 5 μ g and move into the low absorption centrifuge tube of 1.5mL, use ddH
2o mends to cumulative volume 55 μ L, utilize Ultrasonic Cell Disruptor to carry out machinery to sample to interrupt, the time of interrupting is 5min, electrophoresis detection concentrates band to be positioned at 500bp, residue interrupts product and is determining that getting half after segment ranges correctly interrupts product (25 μ L), purifying is carried out, finally with 18 μ LddH with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes
2o carries out back dissolving.
2, end reparation, add reagent to interrupting in product after the 1st step purifying, reaction system is as follows:
Reaction system is placed in constant-temperature metal bath 20 DEG C and hatches 30min, carry out purifying with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes, finally with 17 μ LddH
2o carries out back dissolving.
3,3 '-end adds A reaction, and in the product after the 2nd step purifying, add reagent, reaction system is:
Reaction system is placed in constant-temperature metal bath 37 DEG C and hatches 30min, carries out purifying with the AMpure magnetic bead (45 μ L) of 1.8 times of volumes, finally with 30 μ LddH
2o carries out back dissolving.
4, jointing reaction, in the product after the 3rd step purifying, add reagent, reaction system is as follows:
Reaction system is placed in constant-temperature metal bath 20 DEG C and hatches 30min, carries out purifying with the AMpure magnetic bead (72 μ L) of 1.8 times of volumes, finally with 30 μ LddH
2o carries out back dissolving.
5, pcr amplification adds corresponding reagent:
Pcr amplification is carried out: 98 DEG C of 3min according to the program of following settings; 98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30sec, 10 circulations; 72 DEG C of 5min.
Get 5 μ LPCR products, carry out electrophoresis (1.8% sepharose) with 100bpDNALadderMarker, detect between 500bp-700bp whether have disperse, carry out purifying with the AMpure magnetic bead (40 μ L) of 1 times of volume, finally with 30 μ LddH
2o carries out back dissolving.
6, Concentration Testing and mixed sample
The PCR primer of getting after 2 μ L above-mentioned steps purifying carries out Nanodrop detection.Mixed sample is carried out according to detected result.By unified for the sample Cmin value be diluted in sample, each sample is got 150ng and is carried out mixed sample.
7, cut glue to reclaim and get the mixed PCR purified product of 600ng and carry out sepharose (2%) electrophoretic separation, cut the fragment of 650bp lower edges.QIAquickGelExtractionKit purifying is utilized to cut film segment, 35 μ LElutionBuffer wash-outs.
8, Library Quality controls
The sequencing library that 7th step obtains is carried out quality examination on Agilent2100Bioanalyzer, and segment ranges meets 650bp size substantially.
9, check order and to check order being diluted to proper concn by the library of quality inspection in the 8th step, order-checking platform is IlluminaHighSeq2500, and order-checking length is both-end 125bp.In the present embodiment, the main peak of Insert Fragment is positioned at about 400-500bp, coincidence theory value, see Fig. 2, meanwhile, conclusion as can be drawn from Table 2, the redundancy of data is only 1%, illustrate same area on genome repeat lower, therefore surveyed data distribution uniform on genome, genome coverage is high
Meet the structure of genetic map.But, in the present embodiment library construction process, end reparation, add A reaction and add joint reaction carry out respectively, all need to carry out purifying after each step simultaneously, not only expend the operating time, also make DNA further loss in each step purification step, therefore for above-mentioned deficiency, carry out embodiment 3, the flow process of mixed sample being built to storehouse is optimized further.
Table 2 single sample data measured amount, data redudancy and genome coverage
The construction process of embodiment 3 20 paddy rice sample mix sequencing libraries of the present invention
Select 20 the paddy rice samples identical with the first two embodiment, library construction comprises the steps:
1, genomic DNA fragment and purifying
Get genomic dna 2.5 μ g and move into the low absorption centrifuge tube of 1.5mL, use ddH
2o mends to cumulative volume 55 μ L, utilize Ultrasonic Cell Disruptor to carry out machinery to sample to interrupt, the time of interrupting is 5min, and electrophoresis detection determination fragment concentrates scope to be 500bp, the AMpure magnetic bead (45 μ L) that residue interrupts product 1.8 times of volumes carries out purifying, finally with 27 μ LddH
2o carries out back dissolving.
2, end reparation, add reagent respectively to interrupting in product after the 1st step purifying, reaction system is as follows:
Pressure-vaccum mixing also brief centrifugation, reacts in PCR instrument according to following program.After reaction terminates, carry out next step joint connection immediately.20℃30min;65℃30min。
3, jointing reaction, in the 2nd step reaction product, add reagent, reaction system is as follows:
Reaction system is placed in constant-temperature metal bath 20 DEG C and hatches 15min, carries out purifying with the AMpure magnetic bead (96 μ L) of 1.8 times of volumes, finally with 30 μ LddH
2o carries out back dissolving.
4, pcr amplification adds corresponding reagent according to following table: Q5E is that Q5 surpasses fidelity dna polysaccharase.
Pcr amplification is carried out: 98 DEG C of 3min according to the program of following settings; 98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30sec, 10 circulations; 72 DEG C of 5min.
Get 5 μ LPCR products, carry out electrophoresis (1.8% sepharose) with 100bpDNALadderMarker, detect between 500bp-700bp whether have disperse, carry out purifying with the AMpure magnetic bead (40 μ L) of 1 times of volume, finally with 30 μ LddH
2o carries out back dissolving.
5, the PCR primer after 2 μ L above-mentioned steps purifying got by Concentration Testing and mixed sample carries out Nanodrop detection.Mixed sample is carried out according to detected result.By unified for the sample Cmin value be diluted in sample, each sample is got 150ng and is carried out mixed sample.
6, cut glue recovery and mixed PCR purified product is carried out sepharose (2%) electrophoretic separation, cut the fragment of 650bp lower edges.QIAquickGelExtractionKit purifying is utilized to cut film segment, 35 μ LElutionBuffer wash-outs.
7, the sequencing library that Library Quality controls the 6th step obtains carries out quality examination on Agilent2100Bioanalyzer, and segment ranges meets 650bp size substantially.
8, check order and to check order being diluted to proper concn by the library of quality inspection in the 7th step, order-checking platform is IlluminaHighSeq2500, and order-checking length is both-end 125bp.
The present invention is through the optimization gradually of embodiment 1-3, and finally obtain the method for Multi-example genome sequencing library construction in embodiment 3, schema is shown in Fig. 4.The inventive method (embodiment 3) is relative to embodiment 1 and 2, in step prepared by library, more save the operating time, data aspect, Insert Fragment normal in size, is shown in Fig. 3, and data redudancy reduces, genome coverage higher (see table 3), meets the structure carrying out genetic map.
Table 3 single sample data measured amount, data redudancy and genome coverage
Although above the present invention is described in detail with a general description of the specific embodiments, on basis of the present invention, can make some modifications or improvements it, this will be apparent to those skilled in the art.Therefore, these modifications or improvements without departing from theon the basis of the spirit of the present invention, all belong to the scope of protection of present invention.
Claims (10)
1. be applicable to a construction process for multiple sample mixing sequencing libraries of high-throughput genome sequencing, comprise the following steps:
(1) genomic dna of different sample interrupts, and obtains the DNA fragmentation concentrating band to be distributed in 500bp;
(2) sample DNA of having no progeny of simultaneously fighting each other carries out the flat end reparation of 5 ' phosphorylation and 3 ' end interpolation base A, obtains the DNA fragmentation with 5 ' phosphorylation and 3 ' sticky end A;
(3) respectively 5 ' the phosphorylation that has of different sample is connected with the joint containing the unique barcode sequence can distinguishing sample with the DNA fragmentation of 3 ' sticky end A, obtains the connection product can distinguishing sample source;
(4) respectively pcr amplification is carried out to the connection product containing different barcode, obtain the amplified production containing different bar code sequence, purifying, quantitatively, mixed sample; Mixed PCR primer carries out sepharose separation and purification, and purified product forms the multiple sample mixing sequencing libraries being suitable for high-flux sequence.
2. construction process according to claim 1, is characterized in that, the genomic dna concentration described in step (1) is 50 ~ 100ng/ μ L.
3. construction process according to claim 1, is characterized in that, the reaction system of step (2) is:
4. construction process according to claim 1, is characterized in that, the reaction conditions of step (2) is: 20 DEG C of 30min, 65 DEG C of 30min.
5. construction process according to claim 1, is characterized in that, the sequence of step (3) joint contains the normal chain of 8 base bar code sequences and the minus strand of 8 base bar code sequences.
6. construction process according to claim 1, is characterized in that, step (3) described joint, and its sequence is:
Normal chain 5 '-3 ':
AATGATACGGCGACCACCGAGATCTACACXXXXXXXXTCTTTCCCTACACGACGCTCTTCCGATCT
Minus strand 5 '-3 ':
CAAGCAGAAGACGGCATACGAGATYYYYYYYYGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT;
Wherein: the XXXXXXXX in normal chain represents normal chain bar code sequence, the YYYYYYYY in minus strand represents minus strand bar code sequence.
7. according to the arbitrary described construction process of claim 1-6, it is characterized in that, in step (4), pcr amplification primer sequence used is:
P1:5’-3’:AATGATACGGCGACCGAGATCTACAC
P2:5’-3’:CAAGCAGAAGACGGCATACGAGAT。
8., according to the arbitrary described construction process of claim 1-6, it is characterized in that, the PCR response procedures of step (4) is: 95 DEG C of-98 DEG C of 3min; 95 DEG C of-98 DEG C of 10sec, 65 DEG C of 30sec, 72 DEG C of 30-40sec, totally 10 circulations; 72 DEG C of 5min.
9., according to the arbitrary described construction process of claim 1-6, it is characterized in that, it is that the fragment cutting 650bp lower edges carries out purifying that step (4) mixed PCR primer carries out sepharose separation and purification.
10. the application of the arbitrary described construction process of claim 1-9 in Multi-example genome association is analyzed.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510657439.6A CN105200530A (en) | 2015-10-13 | 2015-10-13 | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510657439.6A CN105200530A (en) | 2015-10-13 | 2015-10-13 | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN105200530A true CN105200530A (en) | 2015-12-30 |
Family
ID=54948476
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201510657439.6A Pending CN105200530A (en) | 2015-10-13 | 2015-10-13 | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN105200530A (en) |
Cited By (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105861700A (en) * | 2016-05-17 | 2016-08-17 | 湖州昂朴医学检验有限公司 | High throughput test method for neuromuscular disorders |
| CN105926043A (en) * | 2016-04-19 | 2016-09-07 | 苏州贝康医疗器械有限公司 | Method for improving proportion of fetal free DNA in maternal plasma free DNA sequencing library |
| CN105950612A (en) * | 2016-07-08 | 2016-09-21 | 北京全式金生物技术有限公司 | High-efficiency method for ligating DNA (Deoxyribonucleic Acid) with adapters |
| CN105985949A (en) * | 2015-11-02 | 2016-10-05 | 中国动物卫生与流行病学中心 | RNA high-throughput sequencing library construction method |
| CN106086189A (en) * | 2016-06-22 | 2016-11-09 | 杭州和壹基因科技有限公司 | A kind of RAD banking process improving data homogeneity |
| CN106591956A (en) * | 2016-11-15 | 2017-04-26 | 上海派森诺医学检验所有限公司 | Sequencing library construction method |
| CN108315320A (en) * | 2017-01-16 | 2018-07-24 | 李辉辉 | A kind of preparation method of RNA high-throughput sequencing libraries |
| CN108504651A (en) * | 2017-02-27 | 2018-09-07 | 深圳市乐土精准医疗科技有限公司 | The library constructing method and reagent in library are built in PCR product large sample size mixing based on high-flux sequence |
| CN108728903A (en) * | 2017-04-21 | 2018-11-02 | 深圳市乐土精准医疗科技有限公司 | The banking process of thalassemia large sample screening is used for based on high-flux sequence |
| CN108753921A (en) * | 2018-06-04 | 2018-11-06 | 广州微芯生物科技有限公司 | A kind of method building gene order-checking library and corresponding joint sequence and kit |
| CN108866155A (en) * | 2018-06-11 | 2018-11-23 | 中国农业科学院深圳农业基因组研究所 | A kind of preparation method of next generation's sequencing library |
| CN108949942A (en) * | 2018-07-17 | 2018-12-07 | 浙江大学 | A kind of mitochondria genome sequencing method based on high-flux sequence |
| CN109477141A (en) * | 2016-05-17 | 2019-03-15 | D名-It股份有限公司 | Sample identification method |
| CN110872615A (en) * | 2018-08-31 | 2020-03-10 | 成都先导药物开发股份有限公司 | A high-throughput next-generation sequencing method for sequencing nucleotide duplexes |
| CN113774121A (en) * | 2021-09-13 | 2021-12-10 | 武汉大学 | A low-volume m6A high-throughput sequencing method based on RNA-linked tags |
| WO2022199242A1 (en) * | 2021-03-25 | 2022-09-29 | 南方医科大学 | Set of barcode linkers and medium-flux multi-single-cell representative dna methylation library construction and sequencing method |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102978206A (en) * | 2012-11-27 | 2013-03-20 | 北京诺禾致源生物信息科技有限公司 | High-throughput sequencing joint applied to hybrid library building and library building method thereof |
| CN104005090A (en) * | 2014-05-28 | 2014-08-27 | 北京诺禾致源生物信息科技有限公司 | Method for building high throughput sequencing library of low-quality sample deoxyribonucleic acid (DNA) |
| CN104032377A (en) * | 2014-06-30 | 2014-09-10 | 北京诺禾致源生物信息科技有限公司 | Construction method of single cell transcriptome sequencing library and application of construction method |
| CN104232627A (en) * | 2013-06-13 | 2014-12-24 | 深圳华大基因科技有限公司 | 2b-RAD pooling technology |
| CN104694635A (en) * | 2015-02-12 | 2015-06-10 | 北京百迈客生物科技有限公司 | Method for constructing high-flux simplified genome sequencing library |
-
2015
- 2015-10-13 CN CN201510657439.6A patent/CN105200530A/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102978206A (en) * | 2012-11-27 | 2013-03-20 | 北京诺禾致源生物信息科技有限公司 | High-throughput sequencing joint applied to hybrid library building and library building method thereof |
| CN104232627A (en) * | 2013-06-13 | 2014-12-24 | 深圳华大基因科技有限公司 | 2b-RAD pooling technology |
| CN104005090A (en) * | 2014-05-28 | 2014-08-27 | 北京诺禾致源生物信息科技有限公司 | Method for building high throughput sequencing library of low-quality sample deoxyribonucleic acid (DNA) |
| CN104032377A (en) * | 2014-06-30 | 2014-09-10 | 北京诺禾致源生物信息科技有限公司 | Construction method of single cell transcriptome sequencing library and application of construction method |
| CN104694635A (en) * | 2015-02-12 | 2015-06-10 | 北京百迈客生物科技有限公司 | Method for constructing high-flux simplified genome sequencing library |
Cited By (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105985949A (en) * | 2015-11-02 | 2016-10-05 | 中国动物卫生与流行病学中心 | RNA high-throughput sequencing library construction method |
| CN105926043A (en) * | 2016-04-19 | 2016-09-07 | 苏州贝康医疗器械有限公司 | Method for improving proportion of fetal free DNA in maternal plasma free DNA sequencing library |
| CN105861700A (en) * | 2016-05-17 | 2016-08-17 | 湖州昂朴医学检验有限公司 | High throughput test method for neuromuscular disorders |
| CN105861700B (en) * | 2016-05-17 | 2019-07-30 | 上海昂朴生物科技有限公司 | A kind of high-flux detection method for neuromuscular disease |
| CN109477141A (en) * | 2016-05-17 | 2019-03-15 | D名-It股份有限公司 | Sample identification method |
| CN109477141B (en) * | 2016-05-17 | 2022-07-12 | D名-It股份有限公司 | Sample identification method |
| CN106086189A (en) * | 2016-06-22 | 2016-11-09 | 杭州和壹基因科技有限公司 | A kind of RAD banking process improving data homogeneity |
| CN105950612A (en) * | 2016-07-08 | 2016-09-21 | 北京全式金生物技术有限公司 | High-efficiency method for ligating DNA (Deoxyribonucleic Acid) with adapters |
| CN105950612B (en) * | 2016-07-08 | 2019-06-21 | 北京全式金生物技术有限公司 | A kind of efficient DNA connector connecting method |
| CN106591956A (en) * | 2016-11-15 | 2017-04-26 | 上海派森诺医学检验所有限公司 | Sequencing library construction method |
| CN108315320A (en) * | 2017-01-16 | 2018-07-24 | 李辉辉 | A kind of preparation method of RNA high-throughput sequencing libraries |
| CN108504651A (en) * | 2017-02-27 | 2018-09-07 | 深圳市乐土精准医疗科技有限公司 | The library constructing method and reagent in library are built in PCR product large sample size mixing based on high-flux sequence |
| CN108728903A (en) * | 2017-04-21 | 2018-11-02 | 深圳市乐土精准医疗科技有限公司 | The banking process of thalassemia large sample screening is used for based on high-flux sequence |
| CN108753921A (en) * | 2018-06-04 | 2018-11-06 | 广州微芯生物科技有限公司 | A kind of method building gene order-checking library and corresponding joint sequence and kit |
| CN108866155A (en) * | 2018-06-11 | 2018-11-23 | 中国农业科学院深圳农业基因组研究所 | A kind of preparation method of next generation's sequencing library |
| CN108949942A (en) * | 2018-07-17 | 2018-12-07 | 浙江大学 | A kind of mitochondria genome sequencing method based on high-flux sequence |
| CN110872615A (en) * | 2018-08-31 | 2020-03-10 | 成都先导药物开发股份有限公司 | A high-throughput next-generation sequencing method for sequencing nucleotide duplexes |
| WO2022199242A1 (en) * | 2021-03-25 | 2022-09-29 | 南方医科大学 | Set of barcode linkers and medium-flux multi-single-cell representative dna methylation library construction and sequencing method |
| CN113774121A (en) * | 2021-09-13 | 2021-12-10 | 武汉大学 | A low-volume m6A high-throughput sequencing method based on RNA-linked tags |
| CN113774121B (en) * | 2021-09-13 | 2024-02-20 | 武汉大学 | Low sample size m based on RNA (ribonucleic acid) connection tag 6 A high throughput sequencing method |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105200530A (en) | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing | |
| Dumschott et al. | Oxford Nanopore sequencing: new opportunities for plant genomics? | |
| EP2835752B1 (en) | Method and system for detecting copy number variation | |
| CN104894651B (en) | Building method of high-throughput sequencing library of trace starter DNA (deoxyribonucleic acid) and high-throughput sequencing library built by building method | |
| CN104894271B (en) | Method and device for detecting gene fusion | |
| Wang et al. | Forensic nanopore sequencing of microhaplotype markers using QitanTech’s QNome | |
| CN113564266B (en) | SNP typing genetic marker combination, detection kit and application | |
| Turchi et al. | Assessment of the Precision ID Identity Panel kit on challenging forensic samples | |
| CN105463116B (en) | A kind of Forensic medicine composite detection kit and detection method based on 20 triallelic SNP genetic markers | |
| CN105002567A (en) | Construction method of high-throughput simplified methylation sequencing library without reference genome | |
| CN106636063A (en) | Primer compound, application thereof and method for establishing library and confirming nucleotide sequence | |
| CN104532360A (en) | Whole-genome methylation sequencing library and construction method thereof | |
| CN113981048A (en) | Primer composition, kit and method for detecting micro-haplotype locus based on next-generation sequencing technology and application thereof | |
| CN110387438A (en) | Multi-primers, kit and method for enterovirus high-flux sequence | |
| CN112029842A (en) | Kit and method for ABO blood type genotyping based on high-throughput sequencing | |
| Sukser et al. | Assessment of Illumina® Human mtDNA Genome assay: workflow evaluation with development of analysis and interpretation guidelines | |
| CN108866155B (en) | A kind of preparation method of next generation sequencing library | |
| CN111378732B (en) | Mitochondrial genome sequencing primer, kit and method | |
| CN107236037B (en) | Mutant MSH6 protein, and coding gene and application thereof | |
| CN104357563A (en) | Method for performing high-throughput sequencing on haplotype of genome subjected to two-time DNA fragmentation | |
| CN114015749A (en) | Construction method of mitochondrial genome sequencing library based on high-throughput sequencing and amplification primer | |
| CN104988223A (en) | Primer and method for simultaneously detecting VKORC1 and CYP2C9 gene polymorphisms | |
| Holtgräwe et al. | A partially phase-separated genome sequence assembly of the Vitis rootstock ‘Börner’(Vitis riparia× Vitis cinerea) and its exploitation for marker development and targeted mapping | |
| CN111944807A (en) | Human sequencing sample tracking marker, and monitoring method and monitoring device for human sequencing sample cross contamination | |
| CN105349627A (en) | Kit for SNP genotyping of Chinese children short stature susceptibilty genes and utilization method of kit |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20151230 |