TW202146659A - Comparing copies of polynucleotides with different features - Google Patents
Comparing copies of polynucleotides with different features Download PDFInfo
- Publication number
- TW202146659A TW202146659A TW110118675A TW110118675A TW202146659A TW 202146659 A TW202146659 A TW 202146659A TW 110118675 A TW110118675 A TW 110118675A TW 110118675 A TW110118675 A TW 110118675A TW 202146659 A TW202146659 A TW 202146659A
- Authority
- TW
- Taiwan
- Prior art keywords
- polynucleotide
- populations
- polynucleotides
- copies
- population
- Prior art date
Links
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 329
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 329
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 328
- 238000000034 method Methods 0.000 claims abstract description 190
- 108091034117 Oligonucleotide Proteins 0.000 claims abstract description 84
- 239000000758 substrate Substances 0.000 claims abstract description 50
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims abstract description 49
- 238000002360 preparation method Methods 0.000 claims description 25
- 238000004519 manufacturing process Methods 0.000 claims description 24
- NOIRDLRUNWIUMX-UHFFFAOYSA-N 2-amino-3,7-dihydropurin-6-one;6-amino-1h-pyrimidin-2-one Chemical compound NC=1C=CNC(=O)N=1.O=C1NC(N)=NC2=C1NC=N2 NOIRDLRUNWIUMX-UHFFFAOYSA-N 0.000 claims description 10
- 238000003199 nucleic acid amplification method Methods 0.000 description 85
- 230000003321 amplification Effects 0.000 description 84
- 125000003729 nucleotide group Chemical group 0.000 description 79
- 239000002773 nucleotide Substances 0.000 description 77
- 230000008569 process Effects 0.000 description 69
- 150000007523 nucleic acids Chemical class 0.000 description 52
- 230000000295 complement effect Effects 0.000 description 51
- 102000039446 nucleic acids Human genes 0.000 description 48
- 108020004707 nucleic acids Proteins 0.000 description 48
- 239000000523 sample Substances 0.000 description 41
- 108020004414 DNA Proteins 0.000 description 31
- 102000053602 DNA Human genes 0.000 description 31
- 238000012163 sequencing technique Methods 0.000 description 29
- 210000004027 cell Anatomy 0.000 description 22
- 239000012530 fluid Substances 0.000 description 22
- 239000012634 fragment Substances 0.000 description 22
- 238000006243 chemical reaction Methods 0.000 description 21
- 230000015572 biosynthetic process Effects 0.000 description 20
- 238000003752 polymerase chain reaction Methods 0.000 description 20
- 239000000243 solution Substances 0.000 description 19
- 239000003153 chemical reaction reagent Substances 0.000 description 17
- 239000007787 solid Substances 0.000 description 17
- 108091093088 Amplicon Proteins 0.000 description 15
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 13
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 13
- 239000000499 gel Substances 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 239000000463 material Substances 0.000 description 12
- 239000007790 solid phase Substances 0.000 description 12
- 230000000694 effects Effects 0.000 description 10
- 230000007717 exclusion Effects 0.000 description 10
- 238000013467 fragmentation Methods 0.000 description 10
- 238000006062 fragmentation reaction Methods 0.000 description 10
- 238000006073 displacement reaction Methods 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- 230000002441 reversible effect Effects 0.000 description 9
- 230000008901 benefit Effects 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 8
- 229920002477 rna polymer Polymers 0.000 description 8
- 239000011324 bead Substances 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 230000000670 limiting effect Effects 0.000 description 7
- 238000006116 polymerization reaction Methods 0.000 description 7
- 241000191025 Rhodobacter Species 0.000 description 6
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 6
- 238000000576 coating method Methods 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 239000011521 glass Substances 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- -1 polypropylene Polymers 0.000 description 6
- 239000011148 porous material Substances 0.000 description 6
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102100034343 Integrase Human genes 0.000 description 5
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 5
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 5
- 102000018120 Recombinases Human genes 0.000 description 5
- 108010091086 Recombinases Proteins 0.000 description 5
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 239000011248 coating agent Substances 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 239000007850 fluorescent dye Substances 0.000 description 5
- 238000003384 imaging method Methods 0.000 description 5
- 238000011901 isothermal amplification Methods 0.000 description 5
- 239000002751 oligonucleotide probe Substances 0.000 description 5
- 239000010703 silicon Substances 0.000 description 5
- 229910052710 silicon Inorganic materials 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- PPBRXRYQALVLMV-UHFFFAOYSA-N Styrene Chemical compound C=CC1=CC=CC=C1 PPBRXRYQALVLMV-UHFFFAOYSA-N 0.000 description 4
- 239000000654 additive Substances 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 230000005670 electromagnetic radiation Effects 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 241000219357 Cactaceae Species 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101001038339 Homo sapiens LIM homeobox transcription factor 1-alpha Proteins 0.000 description 3
- 102100040290 LIM homeobox transcription factor 1-alpha Human genes 0.000 description 3
- 101100074794 Mesocricetus auratus Lhx1 gene Proteins 0.000 description 3
- 108060004795 Methyltransferase Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000000919 ceramic Substances 0.000 description 3
- 230000036425 denaturation Effects 0.000 description 3
- 238000004925 denaturation Methods 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000009545 invasion Effects 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- BPUBBGLMJRNUCC-UHFFFAOYSA-N oxygen(2-);tantalum(5+) Chemical compound [O-2].[O-2].[O-2].[O-2].[O-2].[Ta+5].[Ta+5] BPUBBGLMJRNUCC-UHFFFAOYSA-N 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000000527 sonication Methods 0.000 description 3
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 229920000089 Cyclic olefin copolymer Polymers 0.000 description 2
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- DRBBFCLWYRJSJZ-UHFFFAOYSA-N N-phosphocreatine Chemical compound OC(=O)CN(C)C(=N)NP(O)(O)=O DRBBFCLWYRJSJZ-UHFFFAOYSA-N 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 238000003508 chemical denaturation Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000003505 heat denaturation Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000010899 nucleation Methods 0.000 description 2
- 238000000059 patterning Methods 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 238000005498 polishing Methods 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 108090000623 proteins and genes Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000005096 rolling process Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 238000005507 spraying Methods 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 229910001936 tantalum oxide Inorganic materials 0.000 description 2
- LLTDOAPVRPZLCM-UHFFFAOYSA-O 4-(7,8,8,16,16,17-hexamethyl-4,20-disulfo-2-oxa-18-aza-6-azoniapentacyclo[11.7.0.03,11.05,9.015,19]icosa-1(20),3,5,9,11,13,15(19)-heptaen-12-yl)benzoic acid Chemical compound CC1(C)C(C)NC(C(=C2OC3=C(C=4C(C(C(C)[NH+]=4)(C)C)=CC3=3)S(O)(=O)=O)S(O)(=O)=O)=C1C=C2C=3C1=CC=C(C(O)=O)C=C1 LLTDOAPVRPZLCM-UHFFFAOYSA-O 0.000 description 1
- UBKVUFQGVWHZIR-UHFFFAOYSA-N 8-oxoguanine Chemical compound O=C1NC(N)=NC2=NC(=O)N=C21 UBKVUFQGVWHZIR-UHFFFAOYSA-N 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 108020005174 Archaeal RNA Proteins 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010014080 DNA Polymerase gamma Proteins 0.000 description 1
- 102000016903 DNA Polymerase gamma Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 108010076551 DNA polymerase C Proteins 0.000 description 1
- 102100029765 DNA polymerase lambda Human genes 0.000 description 1
- 101710177421 DNA polymerase lambda Proteins 0.000 description 1
- 108010061914 DNA polymerase mu Proteins 0.000 description 1
- 241001137858 Euryarchaeota Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 239000004642 Polyimide Substances 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 102000017143 RNA Polymerase I Human genes 0.000 description 1
- 108010013845 RNA Polymerase I Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 241001430267 Rhodobacteraceae Species 0.000 description 1
- 229910052581 Si3N4 Inorganic materials 0.000 description 1
- 229910004298 SiO 2 Inorganic materials 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 229920006362 Teflon® Polymers 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 241000205188 Thermococcus Species 0.000 description 1
- 101100388071 Thermococcus sp. (strain GE8) pol gene Proteins 0.000 description 1
- 241000589500 Thermus aquaticus Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 108700002693 Viral Replicase Complex Proteins Proteins 0.000 description 1
- YAIQCYZCSGLAAN-UHFFFAOYSA-N [Si+4].[O-2].[Al+3] Chemical compound [Si+4].[O-2].[Al+3] YAIQCYZCSGLAAN-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 108010013829 alpha subunit DNA polymerase III Proteins 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 241000617156 archaeon Species 0.000 description 1
- IVRMZWNICZWHMI-UHFFFAOYSA-N azide group Chemical group [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005513 bias potential Methods 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000003851 biochemical process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 238000005530 etching Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 239000005350 fused silica glass Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229910000449 hafnium oxide Inorganic materials 0.000 description 1
- WIHZLLGSGQNAGK-UHFFFAOYSA-N hafnium(4+);oxygen(2-) Chemical compound [O-2].[O-2].[Hf+4] WIHZLLGSGQNAGK-UHFFFAOYSA-N 0.000 description 1
- 239000008241 heterogeneous mixture Substances 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000007403 mPCR Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 238000004377 microelectronic Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000000465 moulding Methods 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003499 nucleic acid array Methods 0.000 description 1
- 108091014756 nucleotide binding proteins Proteins 0.000 description 1
- 102000026415 nucleotide binding proteins Human genes 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 229950007002 phosphocreatine Drugs 0.000 description 1
- 238000000206 photolithography Methods 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 229920000058 polyacrylate Polymers 0.000 description 1
- 229920001083 polybutene Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920001721 polyimide Polymers 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 229920002635 polyurethane Polymers 0.000 description 1
- 239000004814 polyurethane Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000005464 sample preparation method Methods 0.000 description 1
- 230000002000 scavenging effect Effects 0.000 description 1
- 150000003376 silicon Chemical class 0.000 description 1
- HQVNEWCFYHHQES-UHFFFAOYSA-N silicon nitride Chemical compound N12[Si]34N5[Si]62N3[Si]51N64 HQVNEWCFYHHQES-UHFFFAOYSA-N 0.000 description 1
- 229910052814 silicon oxide Inorganic materials 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- PBCFLUZVCVVTBY-UHFFFAOYSA-N tantalum pentoxide Inorganic materials O=[Ta](=O)O[Ta](=O)=O PBCFLUZVCVVTBY-UHFFFAOYSA-N 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 235000012431 wafers Nutrition 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6851—Quantitative amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2537/00—Reactions characterised by the reaction format or use of a specific feature
- C12Q2537/10—Reactions characterised by the reaction format or use of a specific feature the purpose or use of
- C12Q2537/16—Assays for determining copy number or wherein the copy number is of special importance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2545/00—Reactions characterised by their quantitative nature
- C12Q2545/10—Reactions characterised by their quantitative nature the purpose being quantitative analysis
- C12Q2545/113—Reactions characterised by their quantitative nature the purpose being quantitative analysis with an external standard/control, i.e. control reaction is separated from the test/target reaction
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2565/00—Nucleic acid analysis characterised by mode or means of detection
- C12Q2565/50—Detection characterised by immobilisation to a surface
- C12Q2565/514—Detection characterised by immobilisation to a surface characterised by the use of the arrayed oligonucleotides as identifier tags, e.g. universal addressable array, anti-tag or tag complement array
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Saccharide Compounds (AREA)
Abstract
Description
本申請案係關於製造包含標識符(identifier)序列之多核苷酸群體之拷貝、使寡核苷酸與標識符序列雜合及偵測或比較雜合寡核苷酸之量的方法。This application relates to methods of making copies of a population of polynucleotides comprising an identifier sequence, hybridizing oligonucleotides to the identifier sequence, and detecting or comparing the amount of hybrid oligonucleotides.
目前許多定序平台使用「合成定序」(sequencing by synthesis,SBS)技術及基於螢光之偵測方法。在一些實例中,將待定序之自一或多個核苷酸群體分離之許多多核苷酸連接於基板表面且進行拷貝。隨後可對表面連接的拷貝執行SBS。製造多核苷酸之拷貝或擴增多核苷酸且對拷貝定序增加在定序期間發射之螢光信號,且藉此增強定序過程。Many current sequencing platforms use "sequencing by synthesis" (SBS) technology and fluorescence-based detection methods. In some examples, a number of polynucleotides isolated from one or more populations of nucleotides to be sequenced are attached to the surface of the substrate and copied. SBS can then be performed on the surface-linked copies. Making copies of a polynucleotide or amplifying a polynucleotide and sequencing the copies increases the fluorescent signal emitted during sequencing, and thereby enhances the sequencing process.
連接於基板之多核苷酸之拷貝可藉由固相核酸擴增方法合成,該方法使得擴增產物固定於固體支撐物上以便形成包括固定化核酸分子之群集的陣列。此類陣列上之各群集或群落為目標多核苷酸股之複數個拷貝及複數個與其互補之固定化多核苷酸股。群集擴增方法或聚類方法為其中針對SBS合成目標多核苷酸之表面連接拷貝及互補序列的方法之實例。亦可用於產生表面連接拷貝等的適合方法之一些實例包括橋式擴增、動力排除擴增(kinetic exclusion amplification,「ExAmp」)或其他方法。Copies of the polynucleotide attached to the substrate can be synthesized by solid-phase nucleic acid amplification methods that immobilize the amplification products on a solid support to form an array comprising clusters of immobilized nucleic acid molecules. Each cluster or colony on such an array is a plurality of copies of a target polynucleotide strand and a plurality of immobilized polynucleotide strands complementary thereto. Cluster amplification methods or clustering methods are examples of methods in which surface-ligated copies and complementary sequences of a polynucleotide of interest are synthesized for SBS. Some examples of suitable methods that may also be used to generate surface-linked copies, etc., include bridge amplification, kinetic exclusion amplification ("ExAmp"), or other methods.
聚類(clustering)包括使用聚合酶合成表面連接之群集。然而,某些聚合酶及聚合方法之已知問題係與目標多核苷酸之各種特徵相關的定量合成偏向。舉例而言,在一些情況下,聚類方法可相比於GC含量相對較高之多核苷酸,偏向於擴增更多的具有較低百分比之鳥嘌呤(G)-胞嘧啶(C)鹼基對之目標多核苷酸之拷貝。在其他情況下,聚類方法可相對於相對較長的多核苷酸,偏向於擴增更多的相對較短的目標多核苷酸之拷貝。在又一些其他實例中,其他理論來源的偏向可影響多核苷酸之相對擴增水準,諸如多核苷酸樣品製備方法或其他差異。Clustering involves the use of polymerases to synthesize surface-linked clusters. However, a known problem with certain polymerases and polymerization methods is the quantitative synthesis bias associated with various characteristics of the target polynucleotide. For example, in some cases, clustering methods may be biased toward amplifying more guanine (G)-cytosine (C) bases with lower percentages than polynucleotides with relatively higher GC content The base pair's copy of the target polynucleotide. In other cases, the clustering method may be biased towards amplifying more copies of relatively short polynucleotides of interest relative to relatively long polynucleotides. In still other examples, other theoretical sources of bias may affect the relative amplification levels of polynucleotides, such as polynucleotide sample preparation methods or other differences.
至少鑒於前述內容,定序技術將因此受益於在聚類及其他擴增過程中確定此類偏向之存在,及鑑別、分離及修改此類技術可使此類偏向最小化且產生更精確定序結果之態樣的方法。At least in view of the foregoing, sequencing techniques would therefore benefit from determining the presence of such biases during clustering and other amplification processes, and identifying, isolating and modifying such techniques could minimize such biases and yield more precise sequencing method of results.
在一態樣中,提供一種方法,其包括:製造兩個或更多個包括標識符序列之多核苷酸群體之拷貝,其中該等拷貝連接於基板,使寡核苷酸與該等標識符序列雜合,及比較與該等兩個或更多個多核苷酸群體之拷貝雜合的寡核苷酸之量,其中至少一個特徵在該等兩個或更多個多核苷酸群體之間或在連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造之間不同。In one aspect, there is provided a method comprising: making two or more copies of a population of polynucleotides comprising an identifier sequence, wherein the copies are attached to a substrate such that the oligonucleotides are associated with the identifiers Sequence hybridization, and comparing the amount of oligonucleotides hybridized to copies of the two or more polynucleotide populations, wherein at least one characteristic is between the two or more polynucleotide populations Or differ between the production of copies of the two or more polynucleotide populations attached to the substrate.
在一實例中,該至少一個特徵係選自長度、鳥嘌呤-胞嘧啶含量及製備方法。在另一實例中,該至少一個特徵包括鳥嘌呤-胞嘧啶含量。在又一實例中,該至少一個特徵包括長度。在又一實例中,該至少一個特徵包括製備方法。在另一實例中,至少一個特徵在連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造之間不同。在仍另一實例中該等寡核苷酸包括螢光團。In one example, the at least one characteristic is selected from the group consisting of length, guanine-cytosine content, and method of preparation. In another example, the at least one characteristic includes guanine-cytosine content. In yet another example, the at least one characteristic includes length. In yet another example, the at least one feature includes a method of making. In another example, at least one characteristic differs between the production of copies of the two or more polynucleotide populations attached to the substrate. In yet another example the oligonucleotides comprise fluorophores.
在一實例中,該方法進一步包括偵測與連接於該基板之該等兩個或更多個多核苷酸群體之拷貝雜合的寡核苷酸的量之間的差異,其中該差異為至少約10%。在另一實例中,該差異為至少約20%。在又一實例中,該差異為至少約30%。In one example, the method further comprises detecting a difference between the amount of oligonucleotides hybridized to copies of the two or more polynucleotide populations attached to the substrate, wherein the difference is at least about 10%. In another example, the difference is at least about 20%. In yet another example, the difference is at least about 30%.
在一實例中,該至少一個特徵包括組合,且該組合包括鳥嘌呤-胞嘧啶含量、長度、製備方法及連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造中之兩者或更多者,該等兩個或更多個多核苷酸群體包括三個或更多個多核苷酸群體,及該等三個或更多個多核苷酸群體中之每一者之組合不同於另一多核苷酸群體之組合。In one example, the at least one feature includes a combination, and the combination includes guanine-cytosine content, length, method of preparation, and production of copies of the two or more polynucleotide populations attached to the substrate two or more, the two or more polynucleotide populations include three or more polynucleotide populations, and each of the three or more polynucleotide populations The combination is different from the combination of another polynucleotide population.
另一實例進一步包括偵測與連接於該基板之該等三個或更多個多核苷酸群體中之兩者或更多者之拷貝雜合的寡核苷酸的量之間的差異,其中該差異為至少約10%。在一實例中,該差異為至少約20%。在另一實例中,該差異為至少約30%。Another example further includes detecting a difference between the amount of oligonucleotides hybridized to copies of two or more of the three or more polynucleotide populations attached to the substrate, wherein The difference is at least about 10%. In one example, the difference is at least about 20%. In another example, the difference is at least about 30%.
在另一態樣中,提供一種方法,其包括:製造兩個或更多個包括標識符序列之多核苷酸群體之拷貝,其中該等拷貝連接於基板,使包括螢光團之寡核苷酸與該等標識符序列雜合,及偵測與該等兩個或更多個多核苷酸群體之拷貝雜合的寡核苷酸之量,其中至少一個特徵在該等兩個或更多個多核苷酸群體之間或在連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造之間不同,及該至少一個特徵係選自長度、鳥嘌呤-胞嘧啶含量、製備方法及連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造。In another aspect, there is provided a method comprising: making two or more copies of a population of polynucleotides comprising an identifier sequence, wherein the copies are attached to a substrate such that an oligonucleotide comprising a fluorophore The acid hybridizes to the identifier sequences, and detects the amount of oligonucleotides hybridized to copies of the two or more polynucleotide populations, wherein at least one characteristic is in the two or more Differences between populations of polynucleotides or between the production of copies of the two or more populations of polynucleotides attached to the substrate, and the at least one characteristic is selected from the group consisting of length, guanine-cytosine content , methods of preparation and manufacture of copies of the two or more polynucleotide populations linked to the substrate.
在一實例中,該至少一個特徵包括鳥嘌呤-胞嘧啶含量。在另一實例中,該至少一個特徵包括長度。在又一實例中,該至少一個特徵包括製備方法。在又一實例中,至少一個特徵在連接於該基板之該等兩個或更多個多核苷酸群體之拷貝的製造之間不同。In one example, the at least one characteristic includes guanine-cytosine content. In another example, the at least one characteristic includes length. In yet another example, the at least one feature includes a method of making. In yet another example, at least one characteristic differs between the production of copies of the two or more polynucleotide populations attached to the substrate.
另一實例進一步包括偵測與該等兩個或更多個多核苷酸群體之拷貝雜合之寡核苷酸的量之間的差異,其中該差異為至少約10%。在一實例中,該差異為至少約20%。在又一實例中,該差異為至少約30%。Another example further includes detecting a difference between the amounts of oligonucleotides hybridized to copies of the two or more polynucleotide populations, wherein the difference is at least about 10%. In one example, the difference is at least about 20%. In yet another example, the difference is at least about 30%.
本發明係關於一種用於評估諸如作為SBS過程之一部分的多核苷酸拷貝中之偏向的方法。特定言之,包括一種鑑別相對於由不同群體製造之拷貝,由給定群體製造的多核苷酸拷貝相對更多或更少的偏向的存在之過程。不同群體之多核苷酸可藉由一個群體與另一群體間不同之特徵而彼此區分。特徵可為群體之多核苷酸之任何特徵,包括多核苷酸股之物理屬性或作為樣品製備之態樣的多核苷酸群體經歷之過程。The present invention relates to a method for assessing bias in polynucleotide copies such as those that are part of the SBS process. In particular, includes a process of identifying the presence of a bias for relatively more or less copies of a polynucleotide made by a given population relative to copies made by a different population. Polynucleotides of different populations can be distinguished from one another by characteristics that differ from one population to another. The characteristic can be any characteristic of the polynucleotides of the population, including the physical properties of the polynucleotide strands or the processes that the polynucleotide population undergoes as an aspect of sample preparation.
舉例而言,相對於來自另一群體之多核苷酸,來自一個群體之多核苷酸之C及/或G鹼基:A及/或T鹼基之比可更低或更高。在另一實例中,來自群體之多核苷酸可為數個核苷酸長,其中一個群體之多核苷酸的長度不同於另一群體之多核苷酸的長度。在另一實例中,不同多核苷酸群體可已經歷不同製備方法。舉例而言,其可已經歷將目標分子片段化為用於拷貝及定序之較短多核苷酸的不同方法,或將寡核苷酸序列或標識符添加至多核苷酸的不同方法(有時被稱作編索引(indexing)、索引標記或標條碼之過程,其為對多核苷酸進行標記或編索引以鑑別隨後所製造之其拷貝的方式),或將多核苷酸序列與初始樣品分離的不同方法(諸如分離選擇預定尺寸或預定尺寸範圍內之多核苷酸)。在其他實例中,由一個多核苷酸群體形成群集之方法可不同於由不同多核苷酸群體形成群集之方法。For example, a polynucleotide from one population can have a lower or higher ratio of C and/or G bases:A and/or T bases relative to a polynucleotide from another population. In another example, polynucleotides from a population can be several nucleotides long, wherein the length of the polynucleotides of one population is different from the length of the polynucleotides of the other population. In another example, different polynucleotide populations may have undergone different production methods. For example, it may have undergone different methods of fragmenting target molecules into shorter polynucleotides for copying and sequencing, or adding oligonucleotide sequences or identifiers to polynucleotides (with a process known as indexing, indexing, or barcoding, which is the way in which a polynucleotide is labeled or indexed to identify subsequently manufactured copies of it), or the association of a polynucleotide sequence with an original sample Different methods of separation (such as separation of polynucleotides that select a predetermined size or range of predetermined sizes). In other examples, the method of forming clusters from one population of polynucleotides can be different from the method of forming clusters from different populations of polynucleotides.
在一些實例中,任何特徵,無論是否直接關於不同群體之多核苷酸之物理特徵,或間接指示由於其製備、儲存、處理、操作或製備或聚類過程的多核苷酸本身之特徵,或與其他特徵相關,諸如可與多核苷酸一起存在之其他組分等,均可區分兩個或更多個群體。如本文中所揭示之方法可用以判定特徵之差異是否在拷貝時,諸如在聚類過程期間引起偏向,其產生相對於另一群體,由一個群體拷貝多核苷酸之不成比例地更高的量或速率。In some instances, any characteristic, whether directly related to a physical characteristic of a polynucleotide of a different population, or indirectly indicative of a characteristic of the polynucleotide itself as a result of its preparation, storage, handling, manipulation, or process of preparation or clustering, or is associated with Correlation of other characteristics, such as other components that may be present with the polynucleotide, can distinguish two or more populations. The methods as disclosed herein can be used to determine whether differences in features are causing a bias in copying, such as during a clustering process, that results in a disproportionately higher amount of polynucleotides being copied by one population relative to another population or rate.
在一些實例中,群體可在多於一個特徵(包括來自GC含量、長度、製備方法或諸如在聚類期間製造拷貝之過程中之特徵)方面不同。舉例而言,群體之長度(例如群體多核苷酸中之核苷酸數目)及GC含量(例如與多核苷酸群體中之A及/或T殘基相比,多核苷酸群體中之G及/或C殘基之相對量)可不同。或其可關於此等及製備方法、在聚類期間拷貝方法中之任一者或前述中之兩者或更多者之任何組合而不同。在一些實例中,群體可關於一或多個特徵或關於任何兩個或更多個特徵之組合,或關於任何三個或更多個特徵之組合(諸如,長度、GC含量或製備多核苷酸以用於拷貝或聚類之方法,及或諸如聚類方法之拷貝方法)而不同。In some examples, populations may differ in more than one characteristic, including characteristics from GC content, length, method of preparation, or a characteristic in a process such as making copies during clustering. For example, the length of a population (eg, the number of nucleotides in a population polynucleotide) and GC content (eg, G and/or G in a polynucleotide population compared to A and/or T residues in a polynucleotide population) and/or relative amounts of C residues) may vary. Or it may differ with respect to any of these and methods of preparation, methods of copying during clustering, or any combination of two or more of the foregoing. In some examples, a population can be related to one or more characteristics or to a combination of any two or more characteristics, or to a combination of any three or more characteristics (such as length, GC content, or polynucleotide production) by the method used for copying or clustering, and or copying methods such as clustering methods).
構成群體之特徵的製備不同多核苷酸群體之方法中的差異可賦予群體不同結構特徵,諸如獲得具有所欲尺寸之多核苷酸之有效性、群體內多核苷酸尺寸之一致性、群體內多少多核苷酸恰當地具有轉接子(adapter)或連接於其之其他序列等的差異,其均可引起在聚類之後顯現出來的偏向或拷貝差異。本文中所揭示之方法可用於確定製備方法之差異的此類影響。Variations in the methods of making different populations of polynucleotides that characterize a population can confer different structural characteristics on the population, such as the effectiveness of obtaining polynucleotides of a desired size, the uniformity of polynucleotide size within a population, how many within a population Polynucleotides suitably have differences in adapters or other sequences linked thereto, etc., which can all cause biases or copy differences that manifest after clustering. The methods disclosed herein can be used to determine such effects of differences in manufacturing methods.
在一些實例中,可預先選擇兩個或更多個多核苷酸群體中之一或多者之特徵,包括前述特徵中之任一者或前述特徵中之任何兩者或更多者彼此組合。舉例而言,可能有利的係確定聚類過程或拷貝過程之其他態樣是否引起、增加、減少、消除或以其他方式影響多核苷酸長度、GC含量、製備過程或其他特徵,或製造拷貝之方法,或其中兩者或更多者之任何組合之偏向。因此,多核苷酸群體之特徵可經預先選擇且經組態以反映此類潛在或假設之偏向原因或來源,及所進行之聚類或其他拷貝過程及所比較之兩個或更多個多核苷酸群體之拷貝的量。藉由在拷貝開始時每一者之起始量正規化,一個群體之拷貝的量大於另一群體之拷貝的量可表示在使用之拷貝條件下對於或對抗具有預先選擇之特徵的多核苷酸之偏向。In some examples, the characteristics of one or more of the two or more polynucleotide populations may be preselected, including any of the foregoing characteristics or any two or more of the foregoing characteristics in combination with each other. For example, it may be beneficial to determine whether other aspects of the clustering process or copying process cause, increase, decrease, eliminate, or otherwise affect polynucleotide length, GC content, production process, or other characteristics, or the rate at which copies are made. method, or any combination of two or more of them. Thus, the characteristics of a polynucleotide population can be preselected and configured to reflect such potential or hypothesized causes or sources of bias, as well as the clustering or other copying processes performed and the two or more polynuclei being compared The amount of copies of the nucleotide population. By normalizing the starting amount of each at the onset of copying, the amount of copies in one population that is greater than the amount of copies in another population can be indicative of a polynucleotide with a preselected characteristic for or against the copy conditions used. the bias.
在圖1的流程圖中說明此方法的實例。製備兩個或更多個多核苷酸群體用於諸如藉由聚類過程拷貝。在製備過程中包括向群體之多核苷酸中添加寡核苷酸序列,且向另一群體之多核苷酸中添加另一寡核苷酸序列。在其中使用超過兩個多核苷酸群體之實例中,可將寡核苷酸序列添加至群體之多核苷酸中,該等寡核苷酸序列不同於添加至其他群體中之每一者之多核苷酸中的寡核苷酸序列,使得各群體之多核苷酸包括對該群體之多核苷酸具有特異性且不同於添加至任何其他群體之多核苷酸中之寡核苷酸的寡核苷酸序列。此類寡核苷酸之序列,稱為標識符序列的差異可使得其雜合至具有與其互補之序列的寡核苷酸。舉例而言,添加至其中兩個或更多個群體之多核苷酸之各標識符序列可與寡核苷酸序列雜合,該寡核苷酸序列與其兩個或更多個群體中之任何其他者之多核苷酸之標識符序列不可雜合。如下文進一步闡述,藉此可在來自不同群體之多核苷酸之間進行區分的標識符序列之存在可准許根據如本文所揭示之方法鑑別多核苷酸的拷貝來自給定群體,而非任何其他群體。An example of this method is illustrated in the flowchart of FIG. 1 . Two or more polynucleotide populations are prepared for copying, such as by a clustering process. The preparation process involves adding an oligonucleotide sequence to a polynucleotide of a population and adding another oligonucleotide sequence to a polynucleotide of another population. In instances where more than two populations of polynucleotides are used, oligonucleotide sequences may be added to the polynucleotides of the population that differ from the polynuclei added to each of the other populations An oligonucleotide sequence in nucleotides such that the polynucleotides of each population include oligonucleotides that are specific to the polynucleotides of that population and are different from oligonucleotides added to the polynucleotides of any other population acid sequence. Differences in the sequence of such oligonucleotides, referred to as identifier sequences, allow them to hybridize to oligonucleotides having sequences complementary thereto. For example, each identifier sequence added to a polynucleotide of two or more of these populations can hybridize to an oligonucleotide sequence that is any of its two or more populations The identifier sequences of the other polynucleotides are not hybridizable. As set forth further below, the presence of an identifier sequence whereby polynucleotides from different populations can be distinguished may allow for the identification of copies of polynucleotides from a given population, but not any other, according to the methods as disclosed herein group.
兩個或更多個群體之單股多核苷酸可隨後經拷貝,其中拷貝連接於基板。舉例而言,如上文所提及,可根據固態排除擴增聚類過程、橋式擴增聚類或其他過程執行拷貝。在一非限制性實例中,多核苷酸之3'端可雜合至連接於基板之引子序列,且進行聚合過程以產生多核苷酸之互補序列,自表面連接引子開始且延伸至各多核苷酸之5'端的互補序列。來自兩個或更多個群體的多核苷酸可隨後自其表面連接互補序列擴增。根據橋式PCR過程,作為非限制性實例,兩個或更多個群體之多核苷酸的表面連接互補序列之游離3'端可隨後與連接於基板之另一引子序列雜合。隨後可藉由聚合酶反應拷貝互補序列,產生兩個或更多個多核苷酸群體之多核苷酸的拷貝,以及自表面延伸的其互補序列。表面結合互補序列及拷貝可隨後自彼此去雜合,且進行另一輪聚合,其中拷貝兩個或更多個多核苷酸群體的多核苷酸的表面連接拷貝及其互補序列(接在其游離3'端雜合至表面連接引子後,作為聚合酶反應之起始位點),隨後互補的表面連接多核苷酸對自彼此去雜合。藉由重複此過程,可形成連接於基板的兩個或更多個群體之多核苷酸之拷貝及其互補序列的群集。製造多核苷酸群體之拷貝的其他相當方法亦可用於其他實例中,無論係PCR、滾環擴增、多重置換擴增、隨機引發擴增、等溫擴增等。Two or more populations of single-stranded polynucleotides can then be copied, where the copies are attached to the substrate. For example, as mentioned above, copying may be performed according to a solid state exclusion amplification clustering process, bridge amplification clustering, or other processes. In a non-limiting example, the 3' end of the polynucleotide can be hybridized to a primer sequence attached to the substrate, and a polymerization process performed to generate the complementary sequence of the polynucleotide, starting from the surface-attached primer and extending to each polynucleotide The complementary sequence to the 5' end of the acid. Polynucleotides from two or more populations can then be amplified from their surface ligated complementary sequences. According to a bridge PCR procedure, as a non-limiting example, the free 3' ends of the surface-linked complementary sequences of the polynucleotides of the two or more populations can then be hybridized to another primer sequence linked to the substrate. The complementary sequence can then be copied by a polymerase reaction, producing copies of the polynucleotides of the two or more polynucleotide populations, and their complementary sequences extending from the surface. The surface-bound complementary sequence and the copy can then be dehybridized from each other, and another round of polymerization is performed in which the surface-bound copy of the polynucleotide of the two or more polynucleotide populations and its complementary sequence (connected to its free 3) are copied. After the 'end is hybridized to the surface-linked primer, which serves as the initiation site for the polymerase reaction), the complementary pairs of surface-linked polynucleotides are subsequently dehybridized from each other. By repeating this process, clusters of copies of the polynucleotides and their complements of two or more populations linked to the substrate can be formed. Other comparable methods of making copies of a polynucleotide population can also be used in other examples, whether PCR, rolling circle amplification, multiple displacement amplification, random priming amplification, isothermal amplification, and the like.
隨後可測定來自兩個或更多個多核苷酸群體中之一者的多核苷酸之基板連接拷貝的量。舉例而言,可添加可與兩個或更多個群體中之一者之多核苷酸之標識符序列雜合的寡核苷酸以使得其與該等多核苷酸上存在之該標識符序列雜合。可雜合寡核苷酸可包括可偵測標記物,諸如能夠在藉由給定波長之電磁輻射刺激後發射可偵測螢光之螢光標記物。藉由誘導此類雜合寡核苷酸發螢光且偵測所發射螢光之量,可評估來自兩個或更多個群體中之一者的多核苷酸之拷貝之量。The amount of substrate-attached copies of a polynucleotide from one of the two or more polynucleotide populations can then be determined. For example, an oligonucleotide that can hybridize to an identifier sequence of a polynucleotide of one of two or more populations can be added such that it matches the identifier sequence present on the polynucleotides mixed. Hybridizable oligonucleotides can include detectable labels, such as fluorescent labels capable of emitting detectable fluorescence upon stimulation by electromagnetic radiation of a given wavelength. By inducing such hybrid oligonucleotides to fluoresce and detecting the amount of fluorescence emitted, the amount of copies of the polynucleotide from one of the two or more populations can be assessed.
該寡核苷酸可隨後去雜合,隨後與另一寡核苷酸一起培育,該另一寡核苷酸可與兩個或更多個多核苷酸群體中之另一者之多核苷酸之標識符序列雜合。該另一可雜合寡核苷酸可包括可偵測標記物,諸如能夠在藉由給定波長之電磁輻射刺激後發射可偵測螢光之螢光標記物。藉由誘導此類其他雜合寡核苷酸發螢光且偵測所發射螢光之量,可評估來自兩個或更多個群體中之另一者的多核苷酸之拷貝之量。在其中由超過兩個多核苷酸群體之多核苷酸之特徵或特徵組合引起或產生之潛在拷貝偏向的實例中,可重複雜合可與個別核苷酸群體之多核苷酸之各標識符序列雜合之寡核苷酸,量測雜合寡核苷酸之量,隨後其去雜合(若之後有另一寡核苷酸之雜合)之過程,以獲得各類型寡核苷酸之量的量測值,作為來自兩個或更多個多核苷酸群體中之每一者的多核苷酸之拷貝的量的度量。The oligonucleotide can then be dehybridized and then incubated with another oligonucleotide that can be combined with a polynucleotide of the other of the two or more polynucleotide populations The identifier sequence is heterozygous. The other hybridizable oligonucleotide may include a detectable label, such as a fluorescent label capable of emitting detectable fluorescence upon stimulation by electromagnetic radiation of a given wavelength. By inducing such other hybrid oligonucleotides to fluoresce and detecting the amount of fluorescence emitted, the amount of copies of the polynucleotide from the other of the two or more populations can be assessed. In instances where potential copy bias is caused or arises from features or combinations of features of polynucleotides of more than two polynucleotide populations, each identifier sequence of polynucleotides of individual nucleotide populations may be complexed with each other Hybrid oligonucleotides, the process of measuring the amount of hybrid oligonucleotides, followed by its de-hybridization (if there is subsequent hybridization of another oligonucleotide), to obtain the results of various types of oligonucleotides. A measure of quantity as a measure of the quantity of copies of a polynucleotide from each of two or more populations of polynucleotides.
在一實例中,與兩個或更多個多核苷酸群體中之每一者之拷貝雜合之寡核苷酸的量之間的差異可藉由比較例如可與各別標識符序列雜合之寡核苷酸發射之螢光之相關量來偵測。舉例而言,樣品可包括兩個多核苷酸群體,其包括不同標識符序列且藉由具有不同特徵界定特徵。不同樣品可包括不同相對比例之來自兩個群體中之每一者的多核苷酸。舉例而言,一個群體可構成樣品之總核苷酸含量之約0%、約5%、約10%、約15%、約20%、約25%、約30%、約35%、約40%、約45%、約50%、約55%、約60%、約65%、約70%、約75%、約80%、約85%、約90%或約100%,且另一群體構成樣品之其餘部分。可隨後根據本文揭示內容諸如在聚類過程中製造群體之拷貝及其互補序列。In one example, the difference between the amounts of oligonucleotides that are hybridized to copies of each of the two or more polynucleotide populations can be, for example, hybridized to the respective identifier sequences by comparison. detected by the relative amount of fluorescence emitted by the oligonucleotide. For example, a sample can include two populations of polynucleotides that include different identifier sequences and define features by having different characteristics. Different samples can include different relative proportions of polynucleotides from each of the two populations. For example, a population can constitute about 0%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40% of the total nucleotide content of the sample %, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 100%, and another population Make up the rest of the sample. Copies of the population and its complement can then be made in accordance with the disclosure herein, such as in a clustering process.
寡核苷酸可隨後雜合至多核苷酸群體之拷貝之標識符序列。可量測雜合寡核苷酸之量,諸如在寡核苷酸包括螢光團之一實例中,可偵測且定量螢光發射,作為雜合至給定群體之標識符序列之寡核苷酸之總量的度量。以此方式,可量測及比較與各群體雜合之寡核苷酸之量,得到在拷貝之後各群體之多核苷酸之拷貝之相對豐度的指示。在一實例中,當樣品包括不同相對比例之各多核苷酸群體時,差異可為可偵測的。舉例而言,在諸如藉由聚類拷貝之前,當一個群體構成樣品之核苷酸含量之約0%、約5%、約10%、約15%、約20%、約25%、約30%、約35%、約40%或約45%且另一群體構成其餘部分時,差異可為可偵測的。The oligonucleotides can then hybridize to an identifier sequence of copies of the polynucleotide population. The amount of hybrid oligonucleotides can be measured, such as in one instance where the oligonucleotides include a fluorophore, and fluorescent emission can be detected and quantified as oligonuclei hybridized to an identifier sequence of a given population A measure of the total amount of nucleotides. In this way, the amount of oligonucleotides heterozygous with each population can be measured and compared, resulting in an indication of the relative abundance of copies of the polynucleotides of each population after copying. In one example, differences may be detectable when the samples include different relative proportions of each polynucleotide population. For example, when a population constitutes about 0%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30% of the nucleotide content of a sample prior to copying, such as by clustering %, about 35%, about 40%, or about 45%, and another population makes up the remainder, the difference may be detectable.
在一實例中,自寡核苷酸量測螢光發射,該等寡核苷酸包括與各群體之標識符序列雜合之螢光團,且確定螢光差異。舉例而言,可與一個多核苷酸群體之標識符序列雜合之寡核苷酸可包括與可與另一寡核苷酸群體之標識符序列雜合之寡核苷酸可偵測地不同的螢光團,使得來自一者之螢光發射可獨立於來自另一者之螢光發射偵測,且反之亦然(例如,Alexa 647、Alexa 532等)。在一實例中,與一個標識符序列雜合之寡核苷酸發射之螢光可比與另一標識符序列雜合之寡核苷酸發射之螢光大或小至少約10%、或大或小至少約15%、或大或小至少約20%、或大或小至少約25%、或大或小至少約30%、或大或小至少約35%、或大或小至少約40%、或大或小至少約45%、或大或小至少約50%。在另一實例中,與一個標識符序列雜合之寡核苷酸發射之螢光可比與另一標識符序列雜合之寡核苷酸發射之螢光大或小約10%、或大或小約15%、或大或小約20%、或大或小約25%、或大或小約30%、或大或小約35%、或大或小約40%、或大或小約45%、或大或小約50%。In one example, fluorescence emission is measured from oligonucleotides comprising a fluorophore hybridized to the identifier sequence of each population, and differences in fluorescence are determined. For example, an oligonucleotide that can hybridize to an identifier sequence of one population of polynucleotides can include a detectably different oligonucleotide that can hybridize to an identifier sequence of another population of oligonucleotides fluorophores such that fluorescence emission from one can be detected independently of fluorescence emission from the other, and vice versa (eg, Alexa 647, Alexa 532, etc.). In one example, the fluorescence emitted by an oligonucleotide hybridized to one identifier sequence may be at least about 10% greater or less than the fluorescence emitted by an oligonucleotide hybridized to another identifier sequence, or greater or lesser At least about 15%, or at least about 20% larger or smaller, or at least about 25% larger or smaller, or at least about 30% larger or smaller, or at least about 35% larger or smaller, or at least about 40% larger or smaller, At least about 45% larger or smaller, or at least about 50% larger or smaller. In another example, an oligonucleotide hybridized to one identifier sequence may emit about 10% greater or less fluorescence than an oligonucleotide hybridized to another identifier sequence, or greater or lesser About 15%, more or less about 20%, more or less about 25%, more or less about 30%, more or less about 35%, more or less about 40%, more or less about 45% %, more or less about 50%.
實例展示於圖2中。在最左圖區中,展示兩種多核苷酸,一種來自兩個多核苷酸群體中的每一者。各自包括索引或標識符序列。在實際實例中,可使用複數個來自兩個或更多個群體中之每一者的多核苷酸。展示上面將發生固相拷貝之表面。在此實例中,表面為流體槽之表面。連接於表面之引子(例如引子P5及P7)與多核苷酸之3'端之一部分互補且可與其雜合。多核苷酸隨後與表面連接引子雜合,該等引子隨後藉由聚合酶延伸以形成多核苷酸之互補序列。多核苷酸隨後去雜合,留下表面連接互補序列,自原來為表面連接引子者延伸。使用兩個或更多個群體之多核苷酸之表面連接互補序列作為模板聚合,結果為形成群體之多核苷酸之表面連接拷貝。此類股隨後經線性化且自彼此去雜合,之後重複該過程。藉由反覆地重複此過程,兩個或更多個群體之多核苷酸之表面連接拷貝及其表面連接互補序列之數目經擴增,產生表面連接群集。代表群體的多核苷酸之拷貝或互補序列的一組股隨後可諸如藉由酶促裂解自基板移除。參看圖2中之指示「聚類及線性化」的箭頭。An example is shown in FIG. 2 . In the far left panel area, two polynucleotides are shown, one from each of the two polynucleotide populations. Each includes an index or sequence of identifiers. In practical examples, a plurality of polynucleotides from each of two or more populations can be used. Show the surface on which solid phase copying will occur. In this example, the surface is the surface of the fluid bath. Primers attached to the surface (eg, primers P5 and P7) are complementary to and can hybridize to a portion of the 3' end of the polynucleotide. The polynucleotide is then hybridized with surface-linked primers, which are then extended by a polymerase to form the complementary sequence of the polynucleotide. The polynucleotide is then dehybridized, leaving a surface-linked complementary sequence, extending from what was originally a surface-linked primer. Polymerization using the surface-linked complementary sequences of the polynucleotides of two or more populations as templates results in the formation of surface-linked copies of the polynucleotides of the populations. Such strands are then linearized and dehybridized from each other, after which the process is repeated. By iteratively repeating this process, the number of surface-linked copies of the polynucleotides of two or more populations and their surface-linked complements are amplified, resulting in surface-linked clusters. A set of strands representing copies of a population of polynucleotides or complementary sequences can then be removed from the substrate, such as by enzymatic cleavage. See the arrows in Figure 2 indicating "Clustering and Linearization".
若區分兩個或更多個群體中之一者與另一者之特徵或其組合引起拷貝過程中之偏向,或若拷貝過程影響不同,則此類偏向可以反映於與另一群體相比來自一個群體之多核苷酸之表面連接拷貝的量之間的差異中。此類差異可藉由雜合可與一個群體之標識符序列雜合但不與任何其他群體之標識符序列雜合且攜帶可偵測連接物,諸如螢光標記物的寡核苷酸探針來確定。如圖2之第三圖區中所示,此類探針可與一個群體之拷貝雜合,洗掉過量未結合之探針,隨後藉由量測在用已知誘發來自連接於寡核苷酸探針之螢光標記物的發射的一定波長之電磁輻射刺激表面之後,發射了多少螢光來偵測雜合探針之量。If characteristics or combinations of characteristics that distinguish one from the other of two or more populations cause a bias in the copying process, or if the copying process affects differently, such bias may be reflected in the In the difference between the amounts of surface-linked copies of a polynucleotide of a population. Such differences can be achieved by hybridizing oligonucleotide probes that can hybridize to an identifier sequence of one population but not to any other population and carry a detectable linker, such as a fluorescent label to make sure. As shown in the third panel of Figure 2, such probes can hybridize to copies of a population, wash away excess unbound probe, and subsequently induce induction from oligonucleotides linked to oligonucleotides by measuring The amount of hybrid probe is detected by how much fluorescence is emitted after stimulating the surface with electromagnetic radiation of a certain wavelength emitted by the fluorescent label of the acid probe.
隨後,第一探針可去雜合且洗掉,之後用另一探針進行雜合。此另一寡核苷酸可與另一群體之標識符序列(但不與任何其他群體之標識符序列)雜合且攜帶可偵測連接物,諸如螢光標記物。如圖2之最後一個圖區中所示,此另一探針可與此另一群體之拷貝雜合,洗掉過量未結合之探針,隨後藉由量測在用已知誘發來自連接於該另一寡核苷酸探針之螢光標記物的發射的一定波長之電磁輻射刺激表面之後,發射了多少螢光來偵測雜合探針之量。比較自第一雜合探針及自第二雜合探針偵測到多少螢光指示多少來自兩個或更多個群體中之兩者之多核苷酸之拷貝連接於表面的差異。Subsequently, the first probe can be dehybridized and washed away, followed by hybridization with another probe. This other oligonucleotide can hybridize to an identifier sequence of another population (but not to any other population's identifier sequence) and carry a detectable linker, such as a fluorescent label. As shown in the last panel of Figure 2, this other probe can hybridize to a copy of this other population, wash away excess unbound probe, and subsequently induce induction from ligation in the The amount of hybrid probe is detected by how much fluorescence is emitted after the surface is stimulated with electromagnetic radiation of a certain wavelength emitted by the fluorescent label of the other oligonucleotide probe. Comparing how much fluorescence is detected from the first hybrid probe and from the second hybrid probe indicates the difference in how many copies of the polynucleotide from both of the two or more populations are attached to the surface.
藉由將此差異與來自用於起始聚類過程之兩個或更多個多核苷酸群體中之每一者的多核苷酸之量的差異進行比較,可鑑別一或多個區分兩個或更多個多核苷酸群體的特徵對拷貝偏向或由拷貝方法產生之偏向之影響。亦即,藉由彼此比較與兩個或更多個群體中之每一者之拷貝雜合的各此類寡核苷酸之量,其藉由來自用於拷貝之各群體之多核苷酸的相對量正規化,可確定給定拷貝偏向之存在及幅度。舉例而言,若特徵引起拷貝之偏向,則來自藉由該特徵界定特徵(諸如較高GC含量、較長長度、樣品製備過程、前述中之兩者或更多者之任何組合等)之兩個或更多個多核苷酸群體中之一者的多核苷酸之拷貝之相對量可超過來自不同地藉由該特徵界定特徵(諸如較低GC含量、較短長度、不同樣品製備過程、前述中之兩者或更多者之另一不同組合等)之兩個或更多個多核苷酸群體中之另一者的多核苷酸之拷貝之相對量。繼而,此類差異之偵測可指示對於或對抗拷貝藉由該特徵或特徵組合界定特徵之多核苷酸的偏向。By comparing this difference to the difference in the amount of polynucleotides from each of the two or more polynucleotide populations used to initiate the clustering process, one or more can be identified that distinguish two Effects of characteristics of one or more populations of polynucleotides on copy bias or bias generated by the method of copying. That is, by comparing the amount of each such oligonucleotide that hybridizes to copies of each of the two or more populations with each other, it is determined by the amount of polynucleotides from each population used for copying. Relative quantitative normalization can determine the presence and magnitude of a given copy bias. For example, if a feature causes a bias in copying, it is from two of the features defined by that feature (such as higher GC content, longer length, sample preparation process, any combination of two or more of the foregoing, etc.) The relative amount of copies of a polynucleotide of one of the one or more populations of polynucleotides may exceed that from a characteristic that is differently defined by that characteristic (such as lower GC content, shorter length, different sample preparation procedures, the aforementioned the relative amount of copies of the polynucleotide of the other of the two or more polynucleotide populations, another different combination of two or more of them, etc.). In turn, detection of such differences can indicate a bias toward or against copies of the polynucleotide characterized by the feature or combination of features.
固態擴增過程使得由與表面結合之初始群體形成多核苷酸之拷貝及互補序列。多核苷酸之拷貝包括標識符序列。繼而,該等拷貝之互補序列包括標識符序列之互補序列,且標識符序列之該等互補序列亦可獨特地雜合至不與連接於表面之多核苷酸之其他拷貝或互補序列雜合之寡核苷酸探針。使寡核苷酸雜合至多核苷酸之表面結合拷貝上的標識符序列且量測此類雜合寡核苷酸之量,其指示在拷貝過程期間發生多少多核苷酸拷貝。類似地,使寡核苷酸雜合至多核苷酸之拷貝的表面結合互補序列上的標識符序列的互補序列且量測此類雜合寡核苷酸之數量,其亦指示在拷貝過程期間發生多少多核苷酸拷貝。偵測可雜合及雜合至多核苷酸之表面連接拷貝之標識符序列,或表面連接互補序列上之標識符序列的互補序列的探針之量可用作發生多少多核苷酸群體拷貝之指示。The solid state amplification process results in the formation of copies and complementary sequences of polynucleotides from the initial population bound to the surface. A copy of the polynucleotide includes the identifier sequence. In turn, the complement of such copies includes the complement of the identifier sequence, and the complement of the identifier sequence may also be uniquely hybridized to not hybridized to other copies or complements of the polynucleotide attached to the surface. Oligonucleotide probes. The oligonucleotides are hybridized to the surface of the polynucleotide to bind an identifier sequence on the copies and the amount of such hybrid oligonucleotides is measured, which indicates how many copies of the polynucleotide occurred during the copying process. Similarly, oligonucleotides are hybridized to the surface of a copy of a polynucleotide that binds the complement of an identifier sequence on a complementary sequence and the number of such hybrid oligonucleotides is measured, which is also indicative of during the copying process How many polynucleotide copies occur. The amount of probes that detect the complement of an identifier sequence that can hybridize and hybridize to a surface-linked copy of a polynucleotide, or a surface-linked identifier sequence on a complementary sequence, can be used as a measure of how many copies of a population of polynucleotides occur. instruct.
儘管在一些實例中,無來自兩個群體之多核苷酸可具有彼此相同之標識符序列,但在其他實例中,來自兩個或更多個群體之多核苷酸可包括彼此相同的標識符序列。舉例而言,多核苷酸可包括超過一個標識符序列。來自所有兩個或更多個多核苷酸群體之多核苷酸可具有各群體獨有之第一標識符序列。其可具有在兩個或更多個群體中之兩者或更多者之間共有但不同於兩個或更多個群體中之任何其他群體的第二標識符序列。群體可具有亦由一些群體共有但其他群體區分之第三標識符序列。群體可進一步具有由所有群體共有之第四標識符序列。在此實例中,群體之間給定標識符序列的差異可使得在此類雜合序列處可與一個序列雜合之寡核苷酸在此類標識符序列處不可與另一序列雜合。表面結合拷貝及互補序列的多核苷酸所來自之群體可因此藉由雜合對給定標識符序列具有特異性之探針來確定。Although in some instances no polynucleotides from two populations may have identifier sequences identical to each other, in other instances polynucleotides from two or more populations may include identical identifier sequences to each other . For example, a polynucleotide can include more than one identifier sequence. Polynucleotides from all two or more populations of polynucleotides can have a first identifier sequence unique to each population. It may have a second identifier sequence that is common between two or more of the two or more populations but different from any other of the two or more populations. A population may have a third sequence of identifiers that is also shared by some populations but distinguished by others. The population may further have a fourth identifier sequence common to all populations. In this example, differences between populations for a given identifier sequence can be such that an oligonucleotide that can hybridize to one sequence at such a hybrid sequence cannot hybridize to another sequence at such an identifier sequence. The population of polynucleotides from which surface-bound copies and complementary sequences are derived can thus be determined by hybridizing probes specific for a given identifier sequence.
在一非限制性實例中,可存在四個多核苷酸群體。兩個群體可具有比其他兩個群體高的GC含量,且兩個群體可具有長度比其他兩個群體長的多核苷酸。長度及GC含量可在四個群體之間混合,其中第一群體具有高GC含量之長多核苷酸,第二群體具有低GC含量之長多核苷酸,第三群體具有高GC含量之短多核苷酸,且第四群體具有低GC含量之短多核苷酸。各群體可具有一個、兩個、三個、四個或更多個標識符序列。第一標識符序列對於各群體可為獨特的。第二標識符序列可在具有不同長度之群體之間區分,其中第一群體及第二群體具有彼此相同的第二標識符序列且第三群體及第四群體具有彼此相同的標識符序列,其中第一群體及第二群體之第二標識符序列與第三群體及第四群體之第二標識符序列不同。第三標識符序列可在具有不同GC含量之群體之間區分,其中第一群體及第三群體具有彼此相同的第二標識符序列且第二群體及第四群體具有彼此相同的標識符序列,其中第一群體及第三群體之第三標識符序列與第二群體及第四群體之第三標識符序列不同。第四標識符序列可由所有群體共有。In a non-limiting example, there may be four populations of polynucleotides. Two populations can have higher GC content than the other two populations, and two populations can have polynucleotides that are longer in length than the other two populations. Length and GC content can be mixed between four populations, where the first population has long polynucleotides with high GC content, the second population has long polynucleotides with low GC content, and the third population has short polynucleotides with high GC content nucleotides, and the fourth population has short polynucleotides with low GC content. Each population can have one, two, three, four or more identifier sequences. The first identifier sequence may be unique to each population. The second identifier sequence is distinguishable between populations of different lengths, wherein the first and second populations have the same second identifier sequence as each other and the third and fourth populations have the same identifier sequence as each other, wherein The second identifier sequences of the first population and the second population are different from the second identifier sequences of the third population and the fourth population. The third identifier sequence is distinguishable between populations with different GC content, wherein the first and third populations have the same second identifier sequence as each other and the second and fourth populations have the same identifier sequence as each other, The third identifier sequences of the first group and the third group are different from the third identifier sequences of the second group and the fourth group. The fourth identifier sequence can be shared by all groups.
在拷貝之後,對給定標識符序列之序列具有特異性之探針之雜合及其雜合之量測值可指示多核苷酸之不同群體或差異組合根據區分其或由其共有之特徵,不同量之表面結合拷貝,亦即發生多少拷貝。舉例而言,各群體之量可個別地藉由量測對第一雜合序列之各序列具有特異性之探針之雜合來確定。長及短多核苷酸之拷貝量可藉由量測寡核苷酸與第二標識符序列之各序列之雜合來確定。高GC及低GC含量多核苷酸之拷貝量可藉由量測核苷酸與第三標識符序列之各序列之雜合來確定。且總體拷貝總量可藉由量測核苷酸與第四標識符序列之雜合來確定。在其他實例中,在一些或全部多核苷酸群體中可包括更多或更少數目之標識符序列,且在不同群體中以不同方式組合。在其他實例中,可有超過兩個可存在於給定標識符序列的序列,諸如其中比較給定特徵之若干實例(例如,低、中或高GC含量,或短、中及長多核苷酸長度等)。Following copying, the heterozygosity of probes specific for the sequence of a given identifier sequence, and the measure of their heterozygosity, can be indicative of different populations or differential combinations of polynucleotides according to the characteristics that distinguish them or share them, Different amounts of surface bound copies, ie how many copies occur. For example, the amount of each population can be individually determined by measuring hybridization of probes specific for each sequence of the first hybrid sequence. The copy amounts of the long and short polynucleotides can be determined by measuring the hybridization of the oligonucleotide to each sequence of the second identifier sequence. The copy amounts of high GC and low GC content polynucleotides can be determined by measuring the hybridization of the nucleotides to each sequence of the third identifier sequence. And the total copy amount can be determined by measuring the hybridization of nucleotides to the fourth identifier sequence. In other examples, a greater or lesser number of identifier sequences may be included in some or all polynucleotide populations, and combined in different ways in different populations. In other instances, there may be more than two sequences that may be present in a given identifier sequence, such as several instances in which a given feature is compared (eg, low, medium, or high GC content, or short, medium, and long polynucleotides length, etc.).
在聚類之後但在寡核苷酸與標識符序列雜合之前,如上文所描述,兩個或更多個群體之多核苷酸的拷貝及互補序列結合於表面。在評估寡核苷酸與標識符序列之雜合之前移除表面結合互補序列可為有利的。或者,在另一實例中,在量測寡核苷酸與表面結合互補序列上之標識符序列的互補序列的雜合之前,移除表面結合拷貝可為有利的。移除兩個或更多個群體之多核苷酸之表面結合拷貝或互補序列可藉由在此類拷貝及互補序列所延伸自之表面結合引子中包括可選擇性地裂解的殘基來實現,從而在聚類之後移除自其延伸之拷貝或互補序列。舉例而言,引子可包括去氧尿苷(deoxyuridine,dU)部分。隨後用酶調配物,諸如LMX1處理可使引子在dU殘基處裂解且釋放自其延伸之多核苷酸。在另一實例中,表面連接引子可包括8-側氧基鳥嘌呤(側氧基-G)殘基。隨後用酶調配物,諸如LMX2處理可使引子在側氧基-G殘基處裂解且釋放自其延伸之多核苷酸。After clustering but before hybridization of the oligonucleotides to the identifier sequence, copies of the polynucleotides of the two or more populations and complementary sequences are bound to the surface as described above. It may be advantageous to remove the surface-bound complementary sequence prior to assessing hybridization of the oligonucleotide to the identifier sequence. Alternatively, in another example, it may be advantageous to remove the surface-bound copy prior to measuring hybridization of the oligonucleotide to the complement of the identifier sequence on the surface-bound complement. Removing surface-bound copies or complementary sequences of polynucleotides of two or more populations can be accomplished by including selectively cleavable residues in the surface-bound primers from which such copies and complementary sequences extend, The copies or complementary sequences extending therefrom are thus removed after clustering. For example, a primer can include a deoxyuridine (dU) moiety. Subsequent treatment with an enzymatic formulation, such as LMX1, can cleave the primer at the dU residue and release the polynucleotide extending therefrom. In another example, the surface-attached primer can include an 8-oxoguanine (penox-G) residue. Subsequent treatment with an enzyme formulation, such as LMX2, can cleave the primer at the pendant oxy-G residue and release the polynucleotide extending therefrom.
此外,諸如聚類過程之拷貝過程之態樣可經修改或比較以判定此類態樣是否減少、消除、部分消除、惡化或以其他方式影響由多核苷酸群體之特徵產生之偏向。舉例而言,若區分多核苷酸與兩個或更多個不同群體之特徵產生、引起特徵或根據如本文中所揭示之方法判定為與該特徵相關聯,則可在不同條件下執行拷貝,諸如聚類過程,且可判定拷貝條件之此類差異對此類偏向之影響。製造多核苷酸群體之拷貝的過程之態樣(例如聚類過程之態樣)可為特徵,且此類特徵可在不同群體之間不同。在一實例中,來自在第一特徵(作為非限制性實例,諸如GC含量、長度及/或製備過程)上不同的兩個不同群體之多核苷酸可在兩種不同條件(第二特徵)中之每一者下拷貝。當在一組條件下拷貝時,表示第一特徵與拷貝之偏向有關的來自兩個群體之多核苷酸之拷貝之量的任何差異,可隨後與當在另一組條件下拷貝時來自兩個群體之多核苷酸之拷貝之量的任何差異進行比較。若此類差異彼此不同,則其將指示由於與特徵相關之偏向的拷貝條件之差異可能受條件影響(亦即,可能受第二特徵影響)。在另一實例中,區分特徵可為製造兩個或更多個多核苷酸群體之拷貝的方法之態樣,諸如聚類過程之態樣,無亦不同的其他特徵。In addition, aspects of copying processes, such as clustering processes, can be modified or compared to determine whether such aspects are reduced, eliminated, partially eliminated, exacerbated, or otherwise affected by biases resulting from the characteristics of a polynucleotide population. For example, copying can be performed under different conditions if a feature that distinguishes the polynucleotide from two or more different populations is generated, causes the feature, or is determined to be associated with the feature according to methods as disclosed herein, Processes such as clustering, and the effect of such differences in copying conditions on such biases can be determined. The aspect of the process of making copies of a polynucleotide population (eg, the aspect of the clustering process) can be a feature, and such features can vary from population to population. In one example, polynucleotides from two different populations that differ in a first characteristic (such as, by way of non-limiting example, GC content, length, and/or preparation process) can be used in two different conditions (a second characteristic) Copy of each of them. Any difference in the amount of copies of the polynucleotides from the two populations that represents the first characteristic associated with the bias of the copies when copied under one set of conditions can be subsequently compared to when copied under another set of conditions from the two Any differences in the number of copies of the polynucleotides of the populations are compared. If such differences were different from each other, it would indicate that differences in copy conditions due to a bias associated with a trait may be affected by the condition (ie, possibly by a second trait). In another example, a distinguishing characteristic may be an aspect of the method of making copies of two or more polynucleotide populations, such as an aspect of a clustering process, and no other characteristic that is different.
在一實例中,當在一組條件下拷貝時,兩個群體拷貝數量之間的差異中反映之特徵相關偏向可比當在另一組條件下拷貝時,兩個群體拷貝數量之間的差異中反映之特徵相關偏向更少或更多(由兩個群體之間拷貝之量的較小或較大差異表示)。可修改或測試進行拷貝之任何組分、情況、環境或其他態樣,以得到對關於特徵區分的兩個多核苷酸群體拷貝數量差異中所反映的偏向之影響。舉例而言,不同聚合酶、聚合反應中之添加劑(諸如聚乙二醇、鹽、核苷酸等)、基板、基板之聚合物塗層、流體槽特徵、溫度、時間安排或聚合循環數目、用於使多核苷酸之拷貝及其互補序列再雜合或線性化之組分(例如LMX1或LMX2,其在聚類之後,但在再合成表面連接多核苷酸以用於後續定序輪之前用於一些生物化學過程中以釋放表面連接多核苷酸的子集)或任何其他條件可經修改及比較。條件之多於一個實例可與多於一個其他實例相比。另外,可修改多個條件以判定例如特徵相關拷貝偏向上其間是否存在相互作用。此外,多個特徵可針對如上文所描述之其對偏向之個別及組合影響進行比較,且一或多個條件(單獨、以組合形式或兩者)可針對對任何一或多個特徵相關偏向之影響進行測試。In one example, the characteristic correlation bias reflected in the difference between the copy numbers of the two populations when copied under one set of conditions is comparable to the difference between the copy numbers of the two populations when copied under another set of conditions. The traits reflected were less or more biased (indicated by smaller or larger differences in the number of copies between the two populations). Any component, situation, environment, or other aspect of the copying can be modified or tested for effect on the bias reflected in the difference in copy number between the two polynucleotide populations for the distinction of characteristics. For example, different polymerases, additives in the polymerization reaction (such as polyethylene glycols, salts, nucleotides, etc.), substrates, polymer coatings of substrates, fluid cell characteristics, temperature, timing or number of polymerization cycles, Components for rehybridizing or linearizing copies of polynucleotides and their complements (eg, LMX1 or LMX2, after clustering, but before resynthesis of surface-linked polynucleotides for subsequent sequencing rounds) used in some biochemical processes to release a subset of surface-linked polynucleotides) or any other conditions can be modified and compared. More than one instance of the condition is comparable to more than one other instance. In addition, a number of conditions can be modified to determine whether there is an interaction therebetween, eg, on a feature-related copy bias. In addition, multiple features can be compared for their individual and combined effects on bias as described above, and one or more conditions (alone, in combination, or both) can be compared with respect to bias associated with any one or more features impact to be tested.
根據本發明之方法對於評估在用於下一代定序技術中之聚類或其他拷貝過程中的潛在偏向提供優於其他方法之優勢。舉例而言,如本文所揭示,可在無需對經拷貝多核苷酸進行定序之情況下評估偏向來源。可在無需耗費進行及分析多核苷酸定序的額外時間、費用及計算負擔之情況下鑑別潛在偏向來源及使偏向最小化、消除偏向或以其他方式影響偏差之可能調節。此外,本文中所揭示之實例提供用於評估諸如呈單獨或組合形式之多核苷酸特徵形式之多個可能偏向來源的高通量方法,以及其修改可使得消除或以其他方式修改拷貝中之偏向的多個變數,諸如在其下或根據其發生拷貝,例如聚類之條件或另外可影響在SBS過程中定序之前發生的多核苷酸拷貝之任何態樣的條件。Methods according to the present invention provide advantages over other methods for assessing potential bias in clustering or other copying processes for use in next-generation sequencing technologies. For example, as disclosed herein, the source of bias can be assessed without sequencing the copied polynucleotide. Potential sources of bias and possible adjustments to minimize, eliminate, or otherwise affect bias can be identified without the additional time, expense, and computational burden of performing and analyzing polynucleotide sequencing. Furthermore, the examples disclosed herein provide high-throughput methods for assessing multiple sources of potential bias, such as polynucleotide signatures, either individually or in combination, and modifications of which may result in the elimination or otherwise modification of A number of variables of bias, such as the conditions under which or according to which copies occur, eg, clustering or otherwise conditions that may affect any aspect of polynucleotide copying that occurs prior to sequencing in the SBS process.
關於評估可歸因於作為特徵的GC含量之偏向,多核苷酸群體可藉由平均相對GC含量界定特徵。舉例而言,已知某些微生物物種與例如人類相比具有相對較高或較低的GC含量平均百分比,人類基因體平均具有大約相等比例之GC及AT含量。已知一些微生物,諸如紅細菌屬(Rhodobacter group)之細菌具有較高GC含量,諸如高於60% GC含量。已知其他微生物,諸如仙人掌桿菌(Bacillus cereus)具有較低GC含量,諸如低於40% GC含量。在一實例中,GC含量可為特徵。多核苷酸群體可為由紅細菌製備之多核苷酸,其呈現相對於其他群體更高之GC含量作為特徵,另一群體由人類製備,其呈現相對於其他群體中等之GC含量作為特徵,或由仙人掌桿菌製備,其呈現相對於其他群體更低之GC含量作為特徵。此處相對地使用「更高(higher)」及「更低(lower)」。因此,使用人類作為群體,視另一群體(例如,分別仙人掌桿菌及紅細菌)之GC含量而定,其相對於此另一群體可具有更高GC或更低GC作為特徵。With regard to assessing the bias attributable to GC content as a feature, a polynucleotide population can be characterized by an average relative GC content. For example, certain microbial species are known to have relatively higher or lower average percentages of GC content compared to, for example, humans, who on average have approximately equal proportions of GC and AT content. Some microorganisms, such as bacteria of the Rhodobacter group, are known to have higher GC content, such as above 60% GC content. Other microorganisms, such as Bacillus cereus are known to have lower GC content, such as below 40% GC content. In one example, the GC content can be characteristic. A population of polynucleotides may be polynucleotides prepared from Rhodobacter that are characterized by higher GC content relative to other populations, another population prepared by humans and characterized by moderate GC content relative to other populations, or Prepared from Bacillus cacti, which is characterized by a lower GC content relative to other populations. "Higher" and "lower" are used relatively here. Thus, using humans as a population, depending on the GC content of another population (eg, Bacillus cacti and Rhodobacter, respectively), it may be characterized by a higher GC or a lower GC relative to this other population.
在其他實例中,多核苷酸可來自具有預定GC含量之合成或人工來源,該預定GC含量藉由例如直接確定樣品之多核苷酸序列或化學計量控制股中給定類型之核苷酸之相對量的併入,視其合成方法而定(例如使用用於序列合成之模板非依賴性方法)。多核苷酸群體可包括任何所欲或已知百分比之GC,意謂核鹼基之總數(G、C、A及T總計)中鳥嘌呤及胞嘧啶核鹼基之總合併數目。群體可由作為群體整體之特徵的GC含量定義,即使群體之個別多核苷酸可具有與群體整體之GC含量不同的GC含量。In other examples, polynucleotides can be derived from synthetic or artificial sources with predetermined GC content by, for example, directly determining the sequence of a polynucleotide in a sample or the relative relationship of nucleotides of a given type in a stoichiometric control strand The amount of incorporation depends on the method of its synthesis (eg, using template-independent methods for sequence synthesis). A polynucleotide population can include any desired or known percentage of GC, meaning the total combined number of guanine and cytosine nucleobases in the total number of nucleobases (G, C, A, and T total). A population can be defined by the GC content that is characteristic of the population as a whole, even though individual polynucleotides of the population can have a different GC content than the GC content of the population as a whole.
群體可具有約5% GC含量、約10% GC含量、約15% GC含量、約20% GC含量、約25% GC含量、約30% GC含量、約35% GC含量、約40% GC含量、約45% GC含量、約50% GC含量、約55% GC含量、約60% GC含量、約65% GC含量、約70% GC含量、約75% GC含量、約80% GC含量、約85% GC含量或約90% GC含量,或任何中間量之GC含量。所有其他可能的比較明確地包括為本發明之態樣。The population can have about 5% GC content, about 10% GC content, about 15% GC content, about 20% GC content, about 25% GC content, about 30% GC content, about 35% GC content, about 40% GC content , about 45% GC content, about 50% GC content, about 55% GC content, about 60% GC content, about 65% GC content, about 70% GC content, about 75% GC content, about 80% GC content, about 85% GC content or about 90% GC content, or any intermediate amount of GC content. All other possible comparisons are expressly included as aspects of the invention.
關於評估可歸因於多核苷酸長度之偏向,多核苷酸群體可藉由平均相對多核苷酸長度界定特徵。舉例而言,核酸分子可自樣品,諸如細胞或其他生物來源分離,且在樣品製備期間藉由各種方法中之任一者片段化。藉由調整片段化方法中使用之參數,諸如音波處理時間,可產生各種長度之多核苷酸。所要長度之多核苷酸可隨後自所得片段分離。群體可由作為群體整體之特徵的多核苷酸長度定義,即使群體之個別多核苷酸的長度可不同於如所確定的群體之多核苷酸長度。在另一實例中,作為群體之特徵的多核苷酸長度可藉由由設計之模板聚合預定長度之多核苷酸來預定。With regard to assessing bias attributable to polynucleotide length, a polynucleotide population can be characterized by an average relative polynucleotide length. For example, nucleic acid molecules can be isolated from samples, such as cells or other biological sources, and fragmented by any of a variety of methods during sample preparation. By adjusting parameters used in the fragmentation method, such as sonication time, polynucleotides of various lengths can be generated. Polynucleotides of the desired length can then be isolated from the resulting fragments. A population can be defined by the length of polynucleotides that are characteristic of the population as a whole, even though the lengths of individual polynucleotides of the population can be different from the polynucleotide lengths of the population as determined. In another example, a polynucleotide length characteristic of a population can be predetermined by polymerizing polynucleotides of predetermined length from a designed template.
多核苷酸群體可具有以下長度:約100個核苷酸、約150個核苷酸、約200個核苷酸、約250個核苷酸、約300個核苷酸、約350個核苷酸、約400個核苷酸、約450個核苷酸、約500個核苷酸、約550個核苷酸、約600個核苷酸、約650個核苷酸、約700個核苷酸、約750個核苷酸、約800個核苷酸、約850個核苷酸、約900個核苷酸、約950個核苷酸、約1,000個核苷酸、約1,050個核苷酸、約1,200個核苷酸、約1,250個核苷酸、約1,300個核苷酸、約1,350個核苷酸、約1,400個核苷酸、約1,450個核苷酸、約1,500個核苷酸、約1,550個核苷酸、約1,600個核苷酸、約1,650個核苷酸、約1,700個核苷酸、約1,750個核苷酸、約1,800個核苷酸、約1,850個核苷酸、約1,900個核苷酸、約1,950個核苷酸、約2000個核苷酸或更長。The population of polynucleotides can have the following lengths: about 100 nucleotides, about 150 nucleotides, about 200 nucleotides, about 250 nucleotides, about 300 nucleotides, about 350 nucleotides , about 400 nucleotides, about 450 nucleotides, about 500 nucleotides, about 550 nucleotides, about 600 nucleotides, about 650 nucleotides, about 700 nucleotides, about 750 nucleotides, about 800 nucleotides, about 850 nucleotides, about 900 nucleotides, about 950 nucleotides, about 1,000 nucleotides, about 1,050 nucleotides, about 1,200 nucleotides, about 1,250 nucleotides, about 1,300 nucleotides, about 1,350 nucleotides, about 1,400 nucleotides, about 1,450 nucleotides, about 1,500 nucleotides, about 1,550 nucleotides, about 1,600 nucleotides, about 1,650 nucleotides, about 1,700 nucleotides, about 1,750 nucleotides, about 1,800 nucleotides, about 1,850 nucleotides, about 1,900 nucleotides Nucleotides, about 1,950 nucleotides, about 2000 nucleotides or longer.
特徵亦可包括群體製備之其他態樣,諸如藉由不同庫製備方法或來自不同供應商之庫製備套組製備之核苷酸群體(DNA庫)。亦可比較聚類過程之態樣之影響以判定此類條件是否及如何影響與特徵相關之偏向或假設偏向。藉由一或多個特徵彼此區分之兩個或更多個多核苷酸群體中之每一者可經歷一種、兩種或更多種拷貝過程,諸如聚類過程,其中條件在兩種或更多種過程之間變化。拷貝偏向是否由在一個條件下及/或另一條件下的特徵差異引起抑或偏向的量或存在是否取決於其經歷何種拷貝條件而不同,可指示與特徵相關的偏向是否可藉由如此修改條件來改變。在一實例中,可修改多個條件,或可比較條件之若干不同實例。可修改之條件之實例包括用於固態拷貝,諸如聚類過程之反應溶液之組分(諸如,使用之聚合酶、pH、多核苷酸之濃度或任何組分、使用之線性化酶諸如LMX1或LMX2或其他,包括之效能添加劑諸如GP32或UvsX或其他核苷酸結合蛋白、聚乙二醇、磷酸肌酸或其他添加劑,基板表面之類型,或在其上發生固相PCR或聚類之表面之聚合物塗層之類型、存在或厚度)、拷貝輪之數目或持續時間、溫度等。在一些實例中,製備多核苷酸群體之方法及/或拷貝方法,諸如在由多核苷酸群體形成群集期間之任何態樣可根據本文所揭示之方法修改及評價以確定偏向可能性。Features may also include other aspects of population preparation, such as nucleotide populations (DNA libraries) prepared by different library preparation methods or library preparation kits from different suppliers. The effects of aspects of the clustering process can also be compared to determine whether and how such conditions affect biases or assumptions associated with features. Each of two or more populations of polynucleotides that are distinguished from each other by one or more characteristics can undergo one, two or more copying processes, such as clustering processes, where conditions are vary between processes. Whether copy bias results from differences in traits under one condition and/or another condition or whether the amount or presence of bias differs depending on which copy condition it experiences conditions to change. In one example, multiple conditions may be modified, or several different instances of the conditions may be compared. Examples of conditions that can be modified include components of the reaction solution for solid-state copying, such as clustering processes (such as the polymerase used, pH, concentration of polynucleotide or any component, linearase used such as LMX1 or LMX2 or others, including potency additives such as GP32 or UvsX or other nucleotide binding proteins, polyethylene glycol, phosphocreatine or other additives, the type of substrate surface, or the surface on which solid phase PCR or clustering occurs type, presence or thickness of polymer coating), number or duration of copy wheels, temperature, etc. In some examples, methods of making polynucleotide populations and/or copying methods, such as any aspect during cluster formation from polynucleotide populations, can be modified and evaluated according to the methods disclosed herein to determine bias potential.
可作為特徵在用於製造拷貝之方法(例如ExAmp、橋式或其他聚類過程)中使用之試劑中改變之參數的非限制性實例包括各種酶濃度(及比率)、添加劑(濃度及比率)、溶液pH、包括於用於拷貝之溶液中之多核苷酸濃度、包括於用於拷貝之溶液中之核苷酸濃度。Non-limiting examples of parameters that can be altered as features in reagents used in methods used to make copies (eg, ExAmp, bridge or other clustering processes) include various enzyme concentrations (and ratios), additives (concentrations and ratios) , solution pH, polynucleotide concentration included in the solution for copying, nucleotide concentration included in the solution for copying.
發生拷貝的溫度(例如,在一實例中,處於、高於或低於約20℃)、聚合或洗滌步驟之持續時間、試劑補充方法或持續時間、試劑至流體槽或所使用之其他基板中之流動速率等之非限制性實例可相應地經修改及研究。聚類時間可作為特徵變化(例如持續小於約30分鐘,或在約30分鐘至約一小時內,或在約一小時至約兩小時內,或在約兩小時至約三小時內,或在約三小時至約四小時內,或在約四小時至約五小時內,或在約五小時至約六小時內,或在約六小時至約七小時內,或在約七小時至約八小時內,或在約八小時至約九小時內,或在約九小時至約十小時內,或在約十小時至約十一小時內,或在約十一小時至約十二小時內,或在約十二小時至約二十四小時內,或在約二十四小時至約三十六小時內,或在約三十六小時至約四十八小時內,或在約四十八小時至約七十二小時內,或更長)。拷貝或聚類過程之各態樣之持續時間,諸如在溶液中培育試劑之持續時間可作為特徵變化(例如持續約10秒、或約20秒、或約30秒、或約40秒、或約50秒、或約60秒、或約70秒、或約80秒、或約90秒、或約100秒、或約110秒、或約120秒或更長)。The temperature at which copying occurs (eg, at, above or below about 20°C, in one example), the duration of the polymerization or wash steps, the method or duration of reagent replenishment, the introduction of reagents into fluid cells or other substrates used Non-limiting examples of flow rates, etc., can be modified and studied accordingly. Clustering time can vary as a characteristic (eg, lasts less than about 30 minutes, or within about 30 minutes to about one hour, or within about one hour to about two hours, or within about two hours to about three hours, or within within about three hours to about four hours, or within about four hours to about five hours, or within about five hours to about six hours, or within about six hours to about seven hours, or within about seven hours to about eight within hours, or within about eight hours to about nine hours, or within about nine hours to about ten hours, or within about ten hours to about eleven hours, or within about eleven hours to about twelve hours, or within about twelve hours to about twenty-four hours, or within about twenty-four hours to about thirty-six hours, or within about thirty-six hours to about forty-eight hours, or within about forty-eight hours to about seven within twelve hours, or longer). The duration of various aspects of the copying or clustering process, such as the duration of incubation of the reagent in solution, can vary as a characteristic (eg, for about 10 seconds, or about 20 seconds, or about 30 seconds, or about 40 seconds, or about 50 seconds, or about 60 seconds, or about 70 seconds, or about 80 seconds, or about 90 seconds, or about 100 seconds, or about 110 seconds, or about 120 seconds or more).
流體速度或試劑流動穿過流體槽之速率可作為特徵變化(例如,流動速率可為約10 μl/min、或約20 μl/min、或約30 μl/min、或約40 μl/min、或約50 μl/min、或約60 μl/min、或約70 μl/min、或約80 μl/min、或約90 μl/min、或約100 μl/min、或約110 μl/min、或約120 μl/min、或約130 μl/min、或約140 μl/min、或約150 μl/min或更高速率)。可作為特徵修改之其他態樣包括pH(例如高於或低於或處於約pH 7.5)、緩衝液類型(例如基於Tris或其他)及緩衝液或其他組分或聚類溶液組分之濃度(例如約100 nM,或少於或多於約100 nM)。The fluid velocity or the rate at which the reagent flows through the fluid cell can vary as a characteristic (eg, the flow rate can be about 10 μl/min, or about 20 μl/min, or about 30 μl/min, or about 40 μl/min, or about 50 μl/min, or about 60 μl/min, or about 70 μl/min, or about 80 μl/min, or about 90 μl/min, or about 100 μl/min, or about 110 μl/min, or about 120 μl/min, or about 130 μl/min, or about 140 μl/min, or about 150 μl/min or higher). Other aspects that can be modified as features include pH (eg above or below or at about pH 7.5), buffer type (eg Tris based or others) and concentration of buffer or other components or cluster solution components ( For example about 100 nM, or less or more than about 100 nM).
在一實例中,可在溶液中組合來自兩個或更多個群體之多核苷酸,且將溶液添加至諸如用於拷貝,包括例如藉由聚類過程拷貝之流體槽之基板。兩個或更多個群體之多核苷酸上的不同標識符序列准許鑑別表面連接拷貝為拷貝的群體。在一實例中,可控制來自群體之多核苷酸占所添加之多核苷酸總量的相對比例,且在若干不同溶液中變化。舉例而言,添加至流體槽或流體槽之泳道中之溶液中之多核苷酸的總數目可具有約相等比例之來自兩個多核苷酸群體中之每一者的多核苷酸。添加至流體槽或流體槽之泳道中之另一溶液中之多核苷酸的總數目可具有約25%來自一個多核苷酸群體之多核苷酸及約75%來自另一多核苷酸群體之多核苷酸,而添加至流體槽或流體槽之泳道中之又一溶液中之多核苷酸的總數目可具有約75%來自另一多核苷酸群體之多核苷酸及約25%來自一個多核苷酸群體之多核苷酸。來自一個群體及另一群體之多核苷酸之比例之間的任何其他分割可用於不同溶液中(例如約5%/95%、約10%/90%、約15%/85%、約20%/80%、約25%/75%、約30%/70%、約35%/65%、約40%/60%、約45%/55%、約50%/50%、約55%/45%、約60%/40%、約65%/35%、約70%/30%、約75%/25%、約80%/20%、約85%/15%、約90%/10%、約95%/5%,或任何中間相對比例)。In one example, polynucleotides from two or more populations can be combined in solution, and the solution added to a substrate such as a fluid cell for copying, including, for example, copying by a clustering process. Different identifier sequences on the polynucleotides of the two or more populations allow for the identification of populations whose surface-linked copies are copies. In one example, the relative proportion of polynucleotides from the population to the total amount of polynucleotides added can be controlled and varied in several different solutions. For example, the total number of polynucleotides in solution added to a fluid cell or lane of a fluid cell can have about equal proportions of polynucleotides from each of the two polynucleotide populations. The total number of polynucleotides in another solution added to the fluid cell or lane of the fluid cell can have about 25% polynucleotides from one polynucleotide population and about 75% polynucleotides from another polynucleotide population polynucleotides, and the total number of polynucleotides added to another solution in the fluid cell or lane of the fluid cell can have about 75% polynucleotides from another polynucleotide population and about 25% from one polynucleotide Polynucleotides of acid groups. Any other splits between the proportions of polynucleotides from one population and another can be used in different solutions (eg, about 5%/95%, about 10%/90%, about 15%/85%, about 20% /80%, about 25%/75%, about 30%/70%, about 35%/65%, about 40%/60%, about 45%/55%, about 50%/50%, about 55%/ 45%, about 60%/40%, about 65%/35%, about 70%/30%, about 75%/25%, about 80%/20%, about 85%/15%, about 90%/10 %, about 95%/5%, or any intermediate relative ratio).
庫製備library preparation
包括多核苷酸之庫可以任何適合方式製備以將寡核苷酸轉接子連接於目標多核苷酸。如本文所用,「庫(library)」為來自給定來源或樣品之多核苷酸群體。庫包括複數個目標多核苷酸。如本文所用,「目標多核苷酸(target polynucleotide)」為期望包括在諸如聚類過程之拷貝過程中的多核苷酸。目標多核苷酸可基本上為任何具有已知或未知序列之多核苷酸。其可為例如基因體DNA或cDNA之片段。目標多核苷酸可來源於已隨機片段化之初級多核苷酸樣品。目標多核苷酸可加工成適用於擴增的模板,其藉由將引子序列置放於各目標片段,諸如標識符序列、與表面連接引子互補之序列等末端。目標多核苷酸亦可藉由反轉錄成cDNA而自初級RNA樣品獲得。Libraries comprising polynucleotides can be prepared in any suitable manner to link oligonucleotide adaptors to target polynucleotides. As used herein, a "library" is a population of polynucleotides from a given source or sample. The library includes a plurality of target polynucleotides. As used herein, a "target polynucleotide" is a polynucleotide desired to be included in a copying process such as a clustering process. The target polynucleotide can be essentially any polynucleotide having a known or unknown sequence. It can be, for example, a fragment of genomic DNA or cDNA. The target polynucleotide can be derived from a sample of the primary polynucleotide that has been randomly fragmented. Target polynucleotides can be processed into templates suitable for amplification by placing primer sequences at the ends of each target fragment, such as identifier sequences, sequences complementary to surface-linked primers, and the like. Target polynucleotides can also be obtained from primary RNA samples by reverse transcription into cDNA.
如本文所用,術語「多核苷酸(polynucleotide)」及「寡核苷酸(oligonucleotide)」可互換地使用且係指包括兩個或更多個典型地經由磷酸二酯鍵彼此共價鍵結之核苷酸單體的分子。多核苷酸典型地含有比寡核苷酸更多的核苷酸。出於說明而非限制之目的,多核苷酸可視為含有15、20、30、40、50、100、200、300、400、500個或更多個核苷酸,而寡核苷酸可視為含有100、50、20、15個或更少個核苷酸。As used herein, the terms "polynucleotide" and "oligonucleotide" are used interchangeably and refer to a compound comprising two or more covalently bonded to each other, typically via phosphodiester bonds Molecules of nucleotide monomers. Polynucleotides typically contain more nucleotides than oligonucleotides. For purposes of illustration and not limitation, polynucleotides may be considered to contain 15, 20, 30, 40, 50, 100, 200, 300, 400, 500 or more nucleotides, and oligonucleotides may be considered Contains 100, 50, 20, 15 or fewer nucleotides.
多核苷酸及寡核苷酸可包括去氧核糖核酸(deoxyribonucleic acid,DNA)或核糖核酸(ribonucleic acid,RNA)。該等術語應理解為等效地包括由核苷酸類似物製成之DNA或RNA類似物且適用於單股(諸如有義或反義)及雙股多核苷酸。如本文所用,該術語亦涵蓋cDNA,其為例如藉由反轉錄酶作用由RNA模板產生之互補或拷貝DNA。Polynucleotides and oligonucleotides may include deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). These terms should be understood to include equivalently DNA or RNA analogs made from nucleotide analogs and apply to both single-stranded (such as sense or antisense) and double-stranded polynucleotides. As used herein, the term also encompasses cDNA, which is complementary or copy DNA produced from an RNA template, eg, by the action of reverse transcriptase.
初級多核苷酸分子可起源於雙股DNA(double-stranded DNA,dsDNA)形式(例如基因體DNA片段、PCR及擴增產物及其類似物),或可作為DNA或RNA起源於單股形式,且轉化為dsDNA形式。舉例而言,可使用此項技術中熟知之標準技術將mRNA分子拷貝為雙股cDNA。初級多核苷酸之精確序列一般不為本文中所呈現之揭示內容的材料,且可為已知或未知的。Primary polynucleotide molecules can originate in double-stranded DNA (dsDNA) form (eg, genomic DNA fragments, PCR and amplification products, and the like), or can originate in single-stranded form as DNA or RNA, and converted into dsDNA form. For example, an mRNA molecule can be copied into double-stranded cDNA using standard techniques well known in the art. The precise sequence of the primary polynucleotide is generally not material for the disclosure presented herein, and may or may not be known.
在一些實例中,初級目標多核苷酸為RNA分子。在此類實例之一態樣中,自特定樣品分離之RNA首先使用此項技術中已知之技術轉化成雙股DNA。雙股DNA可隨後用庫特異性標籤進行索引標記。包括庫特異性索引標籤之此類雙股DNA之不同製備物可以並行地由自不同來源或樣品分離之RNA產生。隨後,包括不同庫特異性索引標籤之雙股DNA之不同製備物可混合全體拷貝,且藉助於庫特異性索引標籤序列之存在關於其分離/來源自的群體確定各定序片段之身分。In some examples, the primary target polynucleotide is an RNA molecule. In one aspect of such an example, RNA isolated from a particular sample is first converted to double-stranded DNA using techniques known in the art. Double-stranded DNA can then be indexed with library-specific tags. Different preparations of such double-stranded DNA, including library-specific index tags, can be generated in parallel from RNA isolated from different sources or samples. Subsequently, different preparations of double-stranded DNA including different library-specific index tags can be mixed with all copies, and the identity of each sequenced fragment determined with respect to the population from which it was isolated/derived by virtue of the presence of library-specific index tag sequences.
在一些實例中,初級目標多核苷酸為DNA分子。舉例而言,初級多核苷酸可表示生物體之完整基因互補序列,且為基因體DNA分子,諸如人類DNA分子,其包括內含子及外顯子序列(編碼序列),以及非編碼調節序列,諸如啟動子及強化子序列。儘管可設想亦可使用特定子集之多核苷酸序列或基因體DNA,諸如特定染色體或其部分。在許多實例中,初級多核苷酸之序列為未知的。DNA目標多核苷酸可在片段化過程,諸如隨機片段化過程之前或之後且在轉接子寡核苷酸接合之前、期間或之後化學或酶處理。In some examples, the primary target polynucleotide is a DNA molecule. For example, a primary polynucleotide can represent the complete genetic complement of an organism, and is a genomic DNA molecule, such as a human DNA molecule, that includes intronic and exonic sequences (coding sequences), as well as non-coding regulatory sequences , such as promoter and enhancer sequences. Although it is envisaged that specific subsets of polynucleotide sequences or genomic DNA may also be used, such as specific chromosomes or portions thereof. In many instances, the sequence of the primary polynucleotide is unknown. DNA target polynucleotides can be chemically or enzymatically treated before or after a fragmentation process, such as a random fragmentation process, and before, during, or after adaptor oligonucleotide ligation.
在一個實例中,使初級目標多核苷酸片段化為適合於定序之適當長度。目標多核苷酸可以任何適合方式片段化。較佳地,使目標多核苷酸隨機片段化。隨機片段化係指藉由例如酶、化學或機械手段以無序方式片段化多核苷酸。可採用任何適合之片段化方法。為清楚起見,經由較大段多核苷酸的較小片段之特異性PCR擴增產生此類較小片段不等效於使較大段多核苷酸片段化,因為較大段多核苷酸保持完整,亦即未藉由PCR擴增片段化(但可對藉由任一技術產生之多核苷酸群體進行如本文所揭示之方法)。此外,隨機片段化經設計以無關於包括及/或圍繞斷裂處之核苷酸的序列身分或位置產生片段。In one example, the primary target polynucleotide is fragmented to an appropriate length for sequencing. The polynucleotide of interest can be fragmented in any suitable manner. Preferably, the target polynucleotides are randomly fragmented. Random fragmentation refers to fragmentation of polynucleotides in a disordered manner, eg, by enzymatic, chemical, or mechanical means. Any suitable fragmentation method can be used. For clarity, generating such smaller fragments via specific PCR amplification of smaller fragments of larger polynucleotides is not equivalent to fragmenting the larger polynucleotides because the larger polynucleotides remain Intact, that is, not fragmented by PCR amplification (although methods as disclosed herein can be performed on polynucleotide populations generated by either technique). In addition, random fragmentation is designed to generate fragments regardless of the sequence identity or position of the nucleotides including and/or surrounding the break.
在一些實例中,隨機片段化藉由機械手段,諸如噴霧或音波處理進行以產生長度為約50個鹼基對至約長度為1500個鹼基對,諸如長度為50至700個鹼基對或長度為50至500個鹼基對之片段。In some examples, random fragmentation is performed by mechanical means, such as spraying or sonication, to produce a length of about 50 base pairs to about 1500 base pairs in length, such as a length of 50 to 700 base pairs or Fragments of 50 to 500 base pairs in length.
藉由機械手段(例如噴霧、音波處理及Hydroshear)之多核苷酸分子之片段化可產生具有鈍端及3'懸垂端及5'懸垂端之非均質混合物的片段。片段末端可使用此項技術中已知之方法或套組(諸如Lucigen DNA終止子末端修復套組)修復以產生對於插入,例如插入選殖載體之鈍位點中而言最佳的末端。在一些實例中,核酸群體之片段末端係鈍端。片段末端可為鈍端及磷酸化的。磷酸酯部分可經由酶處理,例如使用多核苷酸激酶來引入。Fragmentation of polynucleotide molecules by mechanical means such as spraying, sonication, and Hydroshear can generate fragments with blunt ends and a heterogeneous mixture of 3' and 5' overhanging ends. Fragment ends can be repaired using methods or kits known in the art, such as the Lucigen DNA Terminator End Repair Kit, to generate ends that are optimal for insertion, eg, into blunt sites of a selection vector. In some examples, fragments of the nucleic acid population are blunt-ended. Fragment ends can be blunt and phosphorylated. Phosphate moieties can be introduced via enzymatic treatment, eg, using polynucleotide kinases.
在一些實例中,目標多核苷酸序列係用單一懸垂核苷酸,藉由例如某些類型之DNA聚合酶,諸如Taq聚合酶或Klenow exo minus聚合酶之活性製備,該等聚合酶具有將單一去氧核苷酸,例如去氧腺苷(A)添加至例如PCR產物之3'端的非模板依賴性末端轉移酶活性。此類酶可用於向目標多核苷酸雙螺旋之各股之鈍端3'端添加單一核苷酸『A』。因此,『A』可藉由與Taq或Klenow exo minus聚合酶反應而添加至目標多核苷酸雙螺旋之各端經修復雙螺旋股之3'端,而轉接子多核苷酸構築體可為具有在轉接子構築體之各雙螺旋區之3'端上存在的相容性『T』懸垂物之T-構築體。此末端修飾亦防止目標多核苷酸自接合,使得存在朝向形成組合的接合轉接子-目標多核苷酸的偏向。In some examples, the target polynucleotide sequence is prepared using a single overhanging nucleotide by, for example, the activity of certain types of DNA polymerases, such as Taq polymerase or Klenow exo minus polymerase, which have the ability to convert a single Deoxynucleotides, such as deoxyadenosine (A), are added to, for example, the 3' end of PCR products for template-independent terminal transferase activity. Such enzymes can be used to add a single nucleotide "A" to the blunt 3' end of each strand of a polynucleotide duplex of interest. Thus, "A" can be added to the 3' end of the repaired duplex strands at each end of the target polynucleotide duplex by reaction with Taq or Klenow exo minus polymerase, and the adaptor polynucleotide construct can be T-constructs with compatible "T" pendants present on the 3' end of each duplex region of the adaptor construct. This end modification also prevents the target polynucleotide from self-ligation, so that there is a bias towards forming a combined ligated adaptor-target polynucleotide.
在一些實例中,片段化經由標籤化完成。在此類方法中,轉位酶用於使雙股多核苷酸片段化且將通用引子序列連接於雙股多核苷酸之一股中。所得分子可經空位填充(gap-filled)且例如藉由PCR擴增,使用包括具有與所連接之通用引子序列互補的序列的3'端及含有轉接子之其他序列的5'端之引子來進行延伸。In some instances, fragmentation is done via tagging. In such methods, a translocase is used to fragment the double-stranded polynucleotide and ligate a universal primer sequence into one of the strands of the double-stranded polynucleotide. The resulting molecule can be gap-filled and amplified, for example, by PCR, using primers that include the 3' end with a sequence complementary to the sequence of the universal primer to which it is attached and the 5' end of the other sequence containing the adaptor to extend.
轉接子可以任何其他適合方式連接於目標多核苷酸。在一些實例中,轉接子可以單步過程引入。在一些實例中,轉接子可以多步過程,諸如雙步過程引入,其涉及將轉接子之一部分接合於具有通用引子序列之目標多核苷酸。第二步驟包括例如藉由PCR擴增,使用包括具有與所連接之通用引子序列互補的序列的3'端及含有轉接子之其他序列的5'端之引子來進行延伸。可進行額外延伸以提供額外序列至所得先前延伸之多核苷酸的5'端。The adaptor can be attached to the target polynucleotide in any other suitable manner. In some instances, adaptors can be introduced in a single-step process. In some examples, the adaptor can be introduced in a multi-step process, such as a two-step process, which involves ligating a portion of the adaptor to a target polynucleotide having a universal primer sequence. The second step involves extension, eg, by PCR amplification, using primers that include a 3' end having a sequence complementary to the sequence of the universal primer to which it is attached and a 5' end containing additional sequences of the adaptor. Additional extensions can be performed to provide additional sequence to the 5' end of the resulting previously extended polynucleotide.
在一些實例中,整個轉接子接合於片段化目標多核苷酸。較佳地,接合之轉接子包括接合於雙股目標多核苷酸之雙股區。較佳地,雙股區在無功能損失之情況下儘可能短。在此情形下,「功能(function)」係指在標準反應條件下雙股區形成穩定雙螺旋之能力。在一些實例中,標準反應條件係指酶催化之多核苷酸接合反應(例如在適合於酶之接合緩衝液中在4℃至25℃範圍內之溫度下培育)使得形成轉接子之兩股在轉接子接合於目標分子期間保持部分黏合的反應條件。接合方法利用諸如DNA接合酶之接合酶來實現或催化在此情況下轉接子雙螺旋寡核苷酸及目標多核苷酸雙螺旋之兩個多核苷酸股之末端的接合,使得形成共價鍵聯。轉接子雙螺旋寡核苷酸可含有5'-磷酸酯部分以促進與目標多核苷酸3'-OH接合。目標多核苷酸可含有5'-磷酸酯部分,其係自剪切過程殘餘,或使用酶處理步驟添加,且已經末端修復,且視情況藉由一或多個懸垂鹼基延伸,得到適合於接合之3'-OH。在此情形下,連接意謂先前未共價連接之多核苷酸股的共價鍵聯。在一態樣中,此類連接藉由在兩個多核苷酸股之間形成磷酸二酯鍵聯進行,但可使用其他共價鍵聯(例如非磷酸二酯主鏈鍵聯)。In some examples, the entire adaptor is ligated to the fragmented target polynucleotide. Preferably, the ligated adaptor comprises a double-stranded region ligated to the double-stranded target polynucleotide. Preferably, the double-stranded region is as short as possible without loss of function. In this context, "function" refers to the ability of the double-stranded region to form a stable double helix under standard reaction conditions. In some examples, standard reaction conditions refer to an enzyme-catalyzed polynucleotide ligation reaction (eg, incubation in a ligation buffer suitable for the enzyme at a temperature in the range of 4°C to 25°C) such that two strands of the adaptor are formed Reaction conditions that maintain partial adhesion during adaptor ligation to target molecules. Ligation methods utilize ligases such as DNA ligases to effect or catalyze, in this case, the ligation of the ends of the two polynucleotide strands of the adaptor duplex oligonucleotide and the target polynucleotide duplex, such that a covalent bond. The adaptor duplex oligonucleotide may contain a 5'-phosphate moiety to facilitate conjugation to the 3'-OH of the target polynucleotide. The polynucleotide of interest may contain a 5'-phosphate moiety, which is residual from the cleavage process, or added using an enzymatic treatment step, and which has been end repaired, and optionally extended by one or more overhanging bases, resulting in a suitable Joined 3'-OH. Linking in this context means covalent linkage of polynucleotide strands that were not previously covalently linked. In one aspect, such linkage is by forming a phosphodiester linkage between the two polynucleotide strands, although other covalent linkages (eg, non-phosphodiester backbone linkages) can be used.
任何適合的轉接子可經由任何適合的過程,諸如上文所述過程連接於目標多核苷酸。轉接子包括庫特異性索引標籤序列。在樣品固定化以進行定序之前,索引標籤序列可連接於來自各庫之目標多核苷酸。索引標籤自身不由目標多核苷酸之一部分形成,但變為用於擴增之模板之一部分。索引標籤可為作為模板製備步驟之一部分添加至目標之核苷酸的合成序列。因此,庫特異性索引標籤係連接於特定庫之目標分子中之每一者的核酸序列標籤,其存在指示或用於鑑別目標分子分離自之庫。Any suitable adaptor can be ligated to the target polynucleotide via any suitable process, such as those described above. The adaptor includes library-specific indexing tag sequences. Index tag sequences can be ligated to target polynucleotides from each library prior to sample immobilization for sequencing. The index tag itself is not formed from part of the target polynucleotide, but becomes part of the template used for amplification. The index tag can be a synthetic sequence of nucleotides added to the target as part of the template preparation step. Thus, a library-specific index tag is a nucleic acid sequence tag attached to each of the target molecules of a particular library, the presence of which indicates or is used to identify the library from which the target molecule was isolated.
較佳地,索引標籤序列長度為20個核苷酸或更少。舉例而言,索引標籤序列之長度可為1至10個核苷酸或4至6個核苷酸。四種核苷酸索引標籤產生在同一陣列上多工256個樣品之可能性,六種鹼基索引標籤使得能夠在同一陣列上處理4,096個樣品。Preferably, the index tag sequence is 20 nucleotides or less in length. For example, the index tag sequence can be 1 to 10 nucleotides or 4 to 6 nucleotides in length. The four nucleotide index tags give the possibility to multiplex 256 samples on the same array, and the six base index tags enable processing of 4,096 samples on the same array.
轉接子可含有超過一種索引標籤(或標識符序列)以使得多工可能性可增加。The adaptor may contain more than one index tag (or identifier sequence) so that the multiplexing potential can be increased.
轉接子可包括雙股區及包括兩個非互補單股之區域。轉接子之雙股區可具有任何適合數目的鹼基對。較佳地,雙股區為藉由使兩個部分互補的多核苷酸股黏合而形成的典型地包括5個或更多個連續鹼基對的短雙股區。轉接子之此「雙股區(double-stranded region)」係指其中兩股黏合之區域且不暗示任何特定結構構形。在一些實例中,雙股區包括20個或更少個連續鹼基對,諸如10個或更少個或5個或更少個連續鹼基對。An adaptor can include a double-stranded region and a region including two non-complementary single strands. The double-stranded region of the adaptor can have any suitable number of base pairs. Preferably, the double-stranded region is a short double-stranded region typically comprising 5 or more contiguous base pairs formed by bonding two partially complementary polynucleotide strands. This "double-stranded region" of an adapter refers to the region where the two strands are bonded and does not imply any particular structural configuration. In some examples, the double-stranded region includes 20 or less contiguous base pairs, such as 10 or less or 5 or less contiguous base pairs.
雙股區之穩定性可藉由包括展現比標準華生-克里克(Watson-Crick)鹼基對更強的鹼基配對的非天然核苷酸而增加,且因此其長度可能減少。轉接子之兩股在雙股區中可100%互補。The stability of the double-stranded region can be increased by including non-natural nucleotides that exhibit stronger base pairing than standard Watson-Crick base pairs, and thus may be reduced in length. The two strands of the adapter are 100% complementary in the double-strand region.
當轉接子連接於目標多核苷酸時,非互補單股區可形成待定序的多核苷酸之5'端及3'端。術語「非互補單股區(non-complementary single stranded region)」係指轉接子之區域,其中形成轉接子之兩個多核苷酸股之序列展現使得兩股不能夠在用於PCR反應之標準黏合條件下完全彼此黏合的非互補性程度。When the adaptor is ligated to the target polynucleotide, the non-complementary single-stranded regions can form the 5' and 3' ends of the polynucleotide to be sequenced. The term "non-complementary single stranded region" refers to a region of an adaptor in which the two polynucleotide strands forming the adaptor exhibit sequences such that the two strands cannot be used in a PCR reaction. The degree of non-complementarity that fully adhere to each other under standard adhesion conditions.
非互補單股區由形成雙股區之相同兩個多核苷酸股的不同部分提供。單股部分之長度下限典型地將藉由例如提供用於引子延伸、PCR及/或定序的引子之結合的適合序列之功能來確定。理論上,不存在不匹配區之長度的上限,除了一般而言,最小化轉接子之總長度係有利的,例如,以便促進在一或多個連接步驟之後分離未結合轉接子與轉接子-目標構築體。因此,通常較佳地,轉接子之非互補單股區長度為50個或更少個連續核苷酸,諸如長度為40個或更少個、30個或更少個、或25個或更少個連續核苷酸。Non-complementary single-stranded regions are provided by different portions of the same two polynucleotide strands that form the double-stranded region. The lower limit on the length of the single-stranded portion will typically be determined by, for example, the function of providing suitable sequences for binding of primers for primer extension, PCR and/or sequencing. In theory, there is no upper limit to the length of the mismatched region, except that in general it is advantageous to minimize the overall length of the adaptor, for example, in order to facilitate separation of unbound adaptor from the adaptor after one or more ligation steps Splicer-target construct. Thus, it is generally preferred that the non-complementary single-stranded region of the adaptor is 50 or less contiguous nucleotides in length, such as 40 or less, 30 or less, or 25 or less in length Fewer consecutive nucleotides.
庫特異性索引標籤序列可位於單股區、雙股區中,或跨越轉接子之單股區及雙股區。較佳地,索引標籤序列處於轉接子之單股區中。The library-specific index tag sequence can be located in the single-stranded region, the double-stranded region, or spanning the single-stranded and double-stranded regions of the adaptor. Preferably, the index tag sequence is in the single-stranded region of the adaptor.
除索引標籤序列之外,轉接子可包括任何其他適合的序列。舉例而言,轉接子可包括通用延伸引子序列,其典型地位於轉接子及所得用於定序之多核苷酸之5'端或3'端。通用延伸引子序列可雜合至結合於固體基板表面之互補引子。互補引子包括游離3'端,自該端,聚合酶或其他適合酶可添加核苷酸以使用雜合庫多核苷酸作為模板來延伸序列,使得庫多核苷酸之反向股偶接至固體表面。此類延伸可為定序操作或群集擴增之一部分。In addition to the index tag sequence, the adaptor may include any other suitable sequence. For example, an adaptor can include a universal extension primer sequence, which is typically located at the 5' or 3' end of the adaptor and the resulting polynucleotide used for sequencing. Universal extension primer sequences can hybridize to complementary primers bound to the surface of the solid substrate. Complementary primers include the free 3' end from which a polymerase or other suitable enzyme can add nucleotides to extend the sequence using the hybrid library polynucleotide as a template such that the reverse strand of the library polynucleotide is coupled to the solid surface. Such extension can be part of a sequencing operation or cluster amplification.
在一些實例中,轉接子包括一或多個通用定序引子序列。通用定序引子序列可結合於定序引子以允許對索引標籤序列、目標序列或索引標籤序列及目標序列定序。In some examples, the adaptor includes one or more universal sequencing primer sequences. A universal sequencing primer sequence can be combined with a sequencing primer to allow sequencing of an index tag sequence, a target sequence, or an index tag sequence and a target sequence.
轉接子之精確核苷酸序列一般不為材料且可由使用者選擇,使得所要序列元件最終包括於衍生自轉接子之模板的庫之通用序列中,以例如為特定組通用延伸引子及/或定序引子提供結合位點。The exact nucleotide sequence of the adaptor is generally not material and can be selected by the user such that the desired sequence element is ultimately included in the universal sequence of the library derived from the adaptor's template, e.g., for a particular set of universal extension primers and/or Or sequencing primers provide binding sites.
轉接子寡核苷酸可含有核酸外切酶抗性修飾,諸如硫代磷酸酯鍵。Adaptor oligonucleotides may contain exonuclease resistance modifications, such as phosphorothioate linkages.
較佳地,轉接子連接於目標多肽之兩端以產生具有核苷酸的第一轉接子-目標-第二轉接子序列之多核苷酸。第一及第二轉接子可相同或不同。若第一及第二轉接子不同,則第一及第二轉接子之至少一者包括庫特異性標識符序列。Preferably, adaptors are attached to both ends of the target polypeptide to generate a polynucleotide having a first adaptor-target-second adaptor sequence of nucleotides. The first and second adaptors can be the same or different. If the first and second adaptors are different, at least one of the first and second adaptors includes a library-specific identifier sequence.
「第一轉接子-目標-第二轉接子序列(first adapter-target-second adapter sequence)」或「轉接子-目標-轉接子(adapter-target-adapter)」序列係指轉接子相對於彼此及目標之位向且不一定意謂序列可不包括額外序列,諸如連接序列。"first adapter-target-second adapter sequence" or "adapter-target-adapter" sequence means an adapter The orientation of the subunits relative to each other and the target does not necessarily mean that the sequences may not include additional sequences, such as linker sequences.
可以類似方式製備其他庫,其各自包括至少一種不同於來自其他庫之索引標籤序列或索引標籤序列的組合之庫特異性索引標籤序列或索引標籤序列的組合。Other libraries, each comprising at least one library-specific indexing tag sequence or combination of indexing tag sequences that are different from indexing tag sequences or combinations of indexing tag sequences from other libraries, can be prepared in a similar manner.
如本文所用,「連接(attached)」或「結合(bound)」在轉接子相對於目標序列的情況下可互換使用。如上文所描述,任何適合之過程可用於將轉接子連接於目標多核苷酸。舉例而言,轉接子可經由用接合酶接合;經由接合轉接子之一部分及經由延伸,諸如PCR添加轉接子之其他或剩餘部分,其中引子含有轉接子之其他或剩餘部分之組合;經由轉位併入轉接子的一部分及經由延伸,諸如PCR添加轉接子之其他或剩餘部分,其中引子含有轉接子之其他或剩餘部分;或其類似過程連接於目標。較佳地,連接之轉接子寡核苷酸共價鍵結至目標多核苷酸。As used herein, "attached" or "bound" are used interchangeably in the context of an adaptor relative to a target sequence. As described above, any suitable procedure can be used to ligate the adaptor to the target polynucleotide. For example, an adaptor can be ligated via ligation with a ligase; by ligating a portion of the adaptor and adding the other or remainder of the adaptor by extension, such as PCR, wherein the primer contains a combination of the other or remainder of the adaptor Incorporation of a portion of the adaptor via translocation and addition of the other or remaining portion of the adaptor via extension, such as PCR, wherein the primer contains the other or remaining portion of the adaptor; or a similar process ligated to the target. Preferably, the ligated adaptor oligonucleotide is covalently bonded to the target polynucleotide.
在轉接子連接於目標多核苷酸之後,所得多核苷酸可經歷清除過程,以藉由移除未併入之轉接子之至少一部分來增強轉接子-目標-轉接子多核苷酸之純度。可使用任何適合之清除過程,諸如電泳、尺寸排阻層析或其類似過程。在一些實例中,固相反向固定化(solid phase reverse immobilization,SPRI)順磁性珠粒可用於將轉接子-目標-轉接子多核苷酸與未連接轉接子分離。雖然此類過程可增強所得轉接子-目標-轉接子多核苷酸之純度,但可能會留下一些未連接之轉接子寡核苷酸。After the adaptor is ligated to the target polynucleotide, the resulting polynucleotide can undergo a scavenging process to enhance the adaptor-target-adapter polynucleotide by removing at least a portion of the unincorporated adaptor purity. Any suitable clearance process can be used, such as electrophoresis, size exclusion chromatography, or the like. In some examples, solid phase reverse immobilization (SPRI) paramagnetic beads can be used to separate adapter-target-adapter polynucleotides from unligated adapters. While such a process may enhance the purity of the resulting adaptor-target-adapter polynucleotides, some unligated adaptor oligonucleotides may be left behind.
用於擴增固定化轉接子-目標-轉接子分子之方法包括但不限於橋式擴增及動力排除。擴增可使用一或多種固定化引子進行。一或多種固定化引子可在平坦表面上平鋪(be a lawn)。Methods for amplifying immobilized adaptor-target-adapter molecules include, but are not limited to, bridge amplification and kinetic exclusion. Amplification can be performed using one or more immobilized primers. One or more immobilized primers can be a lawn on a flat surface.
如本文所用,術語「固相擴增(solid-phase amplification)」係指在固體支撐物上進行或與固體支撐物結合進行,使得所有或一部分擴增產物在其形成時固定於固體支撐物上的任何核酸擴增反應。特定言之,該術語涵蓋作為類似於標準溶液相擴增,除了正向擴增引子及反向擴增引子中之一或兩者固定於固體支撐物上之反應的固相聚合酶連鎖反應(polymerase chain reaction)(固相PCR)及固相等溫擴增。固相PCR涵蓋諸如乳液之系統,其中一種引子錨定至珠粒且另一種呈自由溶液形式,且群落形成於固相凝膠基質中,其中一種引子錨定至表面且一種呈自由溶液形式。As used herein, the term "solid-phase amplification" refers to being performed on or in conjunction with a solid support such that all or a portion of the amplification product is immobilized on the solid support as it is formed of any nucleic acid amplification reaction. Specifically, the term encompasses solid-phase polymerase chain reactions that are reactions analogous to standard solution-phase amplification, except that one or both of the forward and reverse amplimers are immobilized on a solid support ( polymerase chain reaction) (solid phase PCR) and solid isothermal amplification. Solid phase PCR encompasses systems such as emulsions, where one primer is anchored to the beads and the other is in free solution, and colonies are formed in a solid phase gel matrix, where one primer is anchored to the surface and one is in free solution.
在一些實例中,固體支撐物包括圖案化表面。「圖案化表面(patterned surface)」係指固體支撐物之暴露層中或上之不同區域的排列。術語流體槽「支撐物(support)」或「基板(substrate)」係指可在其上添加表面化學物質之支撐物或基板。術語「圖案化基板(patterned substrate)」係指中間或上面界定有凹陷之支撐物。術語「非圖案化基板(non-patterned substrate)」係指實質上平坦之支撐物。基板在本文中亦可被稱作「支撐物」、「圖案化支撐物(patterned support)」或「非圖案化支撐物(non-patterned support)」。支撐物可為晶圓、面板、矩形片、模具或任何其他適合之組態。支撐物一般為剛性的且不溶於水性液體中。支撐物可對用於改質凹陷之化學物質呈惰性。舉例而言,支撐物對於用於形成聚合物塗層、諸如將引子連接於已沈積之聚合物塗層等之化學物質可為惰性的。適合的支撐物之實例包括環氧矽氧烷、玻璃及改質或官能化玻璃、多面體寡聚倍半矽氧烷(polyhedral oligomeric silsequioxane,POSS)及其衍生物、塑膠(包括丙烯酸聚合物、聚苯乙烯及苯乙烯及其他材料之共聚物、聚丙烯、聚乙烯、聚丁烯、聚胺基甲酸酯、聚四氟乙烯(諸如來自Chemours之TEFLON®)、環狀烯烴/環烯烴聚合物(cyclo-olefin polymer,COP)(諸如來自Zeon之ZEONOR®)、聚醯亞胺等)、耐綸、陶瓷/陶瓷氧化物、二氧化矽、熔融二氧化矽或基於二氧化矽之材料、矽酸鋁、矽及改質矽(例如摻硼之p+矽)、氮化矽(Si3 N4 )、氧化矽(SiO2 )、五氧化二鉭(TaO5 )或其他氧化鉭(TaOx )、氧化鉿(HaO2 )、碳、金屬、無機玻璃或其類似物。支撐物亦可為玻璃或矽或基於矽的聚合物,諸如POSS材料,視情況在表面具有氧化鉭或另一陶瓷氧化物之塗層。POSS材料可為Kejagoas等人,《微電子工程改造(Microelectronic Engineering)》86 (2009) 776-668中所揭示之材料,該文獻以全文引用之方式併入本文中。In some examples, the solid support includes a patterned surface. "Patterned surface" refers to the arrangement of different regions in or on the exposed layer of a solid support. The term fluid cell "support" or "substrate" refers to a support or substrate on which surface chemistry can be added. The term "patterned substrate" refers to a support having depressions defined in the middle or thereon. The term "non-patterned substrate" refers to a substantially flat support. The substrate may also be referred to herein as a "support,""patternedsupport," or "non-patterned support." The supports can be wafers, panels, rectangular sheets, molds, or any other suitable configuration. Supports are generally rigid and insoluble in aqueous liquids. The support can be inert to the chemicals used to modify the depressions. For example, the support can be inert to chemicals used to form the polymer coating, such as attaching primers to the deposited polymer coating, and the like. Examples of suitable supports include epoxysiloxanes, glasses and modified or functionalized glasses, polyhedral oligomeric silsequioxanes (POSS) and derivatives thereof, plastics including acrylic polymers, polyhedral Styrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutene, polyurethane, polytetrafluoroethylene (such as TEFLON® from Chemours), cyclic olefins/cyclic olefin polymers (cyclo-olefin polymer, COP) (such as ZEONOR® from Zeon), polyimide, etc.), nylon, ceramic/ceramic oxides, silica, fused silica or silica-based materials, silicon Aluminum oxide, silicon and modified silicon (such as boron-doped p+ silicon), silicon nitride (Si 3 N 4 ), silicon oxide (SiO 2 ), tantalum pentoxide (TaO 5 ) or other tantalum oxides (TaO x ) , hafnium oxide (HaO 2 ), carbon, metal, inorganic glass or the like. The support can also be glass or silicon or a silicon-based polymer, such as a POSS material, optionally with a coating of tantalum oxide or another ceramic oxide on the surface. The POSS material may be the material disclosed in Kejagoas et al., Microelectronic Engineering 86 (2009) 776-668, which is incorporated herein by reference in its entirety.
在一實例中,凹陷可為孔,使得圖案化基板在其表面中包括孔之陣列。孔可為微孔或奈米孔。各孔之尺寸可藉由其容積、孔開口面積、深度及/或直徑界定特徵。舉例而言,區域中之一或多者可為存在一或多個擴增引子之部分。部分可由其中不存在擴增引子的間隙區域隔開。在一些實例中,圖案可為x-y形式之呈列及行的特徵。在一些實例中,圖案可為重複排列之部分及/或間隙區域。在一些實例中,圖案可為無規排列之部分及/或間隙區域。In one example, the recesses can be holes, such that the patterned substrate includes an array of holes in its surface. The pores can be micropores or nanopores. The size of each hole can be characterized by its volume, hole open area, depth, and/or diameter. For example, one or more of the regions can be the portion where one or more amplification primers are present. Portions can be separated by gap regions in which no amplification primers are present. In some examples, the pattern may be a feature of columns and rows in an x-y format. In some examples, the pattern may be a repeating arrangement of portions and/or interstitial regions. In some examples, the pattern may be randomly arranged portions and/or interstitial regions.
在一些實例中,固體支撐物在表面中包括孔或凹陷之陣列。此可使用多種技術來製造,該等技術包括但不限於光刻、衝壓技術、模製技術及微蝕刻技術。所用技術可視陣列基板之組成及形狀而定。In some examples, the solid support includes an array of holes or depressions in the surface. This can be fabricated using a variety of techniques including, but not limited to, photolithography, stamping techniques, molding techniques, and micro-etching techniques. The technique used may depend on the composition and shape of the array substrate.
圖案化表面中之特徵可為具有圖案化之共價連接凝膠,諸如聚(N-(5-疊氮基乙醯胺基戊基)丙烯醯胺-共-丙烯醯胺)(poly(N-(5-azidoacetamidylpentyl)acrylamide-co-acrylamide),PAZAM)之玻璃、矽、塑膠或其他適合固體支撐物上之孔陣列中的孔(例如微孔或奈米孔)。該過程產生用於定序之凝膠墊,其可在多次循環下在定序操作中保持穩定。聚合物與孔之共價連接有助於在多次使用期間在整個結構化基板使用壽命中維持結構化特徵中之凝膠。然而,在許多實例中,凝膠不必共價連接於孔。舉例而言,在一些條件下,不共價連接於結構化基板之任何部分的不含矽烷之丙烯醯胺可用作凝膠材料。Features in the patterned surface can be covalently linked gels with patterning, such as poly(N-(5-azidoacetamidopentyl)acrylamide-co-acrylamide) (poly(N- -(5-azidoacetamidylpentyl)acrylamide-co-acrylamide), PAZAM) holes in glass, silicon, plastic or other suitable hole arrays on solid supports (eg micropores or nanopores). This process produces a gel pad for sequencing that is stable during sequencing operations over multiple cycles. The covalent attachment of the polymer to the pores helps maintain the gel in the structured features over the lifetime of the structured substrate during multiple uses. However, in many instances the gel need not be covalently attached to the pores. For example, under some conditions, a silane-free acrylamide that is not covalently attached to any portion of the structured substrate can be used as a gel material.
在一些實例中,結構化基板可藉由以下方式製造:以孔(例如微孔或奈米孔)圖案化固體支撐物材料;用凝膠材料(例如PAZAM、SFA或其化學改質之變體,諸如SFA之疊氮化形式(疊氮基-SFA))塗佈圖案化支撐物;及例如經由化學或機械拋光來拋光經凝膠塗佈之支撐物,從而在孔中保留凝膠,但自孔之間的結構化基板之表面上的間隙區域將實質上所有之凝膠移除或不活化。引子核酸可連接於凝膠材料。隨後可使目標核酸(例如片段化人類基因體)之溶液與經拋光之基板接觸,使得個別目標核酸可經由與連接於凝膠材料之引子的相互作用接種個別孔;然而,由於凝膠材料之不存在或不活化,目標核酸將不佔據間隙區域。目標核酸之擴增將受限於孔,因為在間隙區域中凝膠之不存在或不活化防止生長核酸群落向外遷移。過程可調且利用習知微米或奈米製造方法,從而可方便地進行製造。In some examples, structured substrates can be fabricated by patterning a solid support material with pores (eg, micropores or nanopores); with gel materials (eg, PAZAM, SFA, or chemically modified variants thereof) , such as the azide form of SFA (azido-SFA)) coating patterned supports; and polishing the gel-coated supports, such as via chemical or mechanical polishing, thereby retaining the gel in the pores, but Substantially all of the gel will be removed or inactivated from the interstitial regions on the surface of the structured substrate between the wells. The primer nucleic acid can be attached to the gel material. A solution of target nucleic acid (eg, a fragmented human genome) can then be contacted with the polished substrate so that individual target nucleic acids can seed individual wells via interaction with primers attached to the gel material; however, due to the Absent or not activated, the target nucleic acid will not occupy the interstitial region. Amplification of the target nucleic acid will be limited to the pores because the absence or inactivation of the gel in the gap region prevents the outward migration of the growing nucleic acid population. The process is tunable and can be easily fabricated using conventional micro or nano fabrication methods.
所揭示之主題作為一實例包括「固相(solid-phase)」擴增方法,其中僅一種擴增引子固定化(另一引子存在於例如自由溶液中),在其他實例中,固體支撐物可具有固定化之正向及反向引子。一些實例包括固定於固體支撐物上之「複數個(plurality)」相同正向引子及/或「複數個」相同反向引子,因為擴增過程可包括過量引子以維持擴增。除非上下文另外指示,否則本文中對正向及反向引子之提及因此應解釋為涵蓋「複數個」此類引子。The disclosed subject matter includes, as an example, "solid-phase" amplification methods in which only one amplification primer is immobilized (the other primer is present, eg, in free solution), in other examples, a solid support can be Has immobilized forward and reverse primers. Some examples include "plurality" of the same forward primer and/or "plurality" of the same reverse primer immobilized on a solid support, as the amplification process may include excess primers to maintain amplification. References herein to forward and reverse primers should therefore be construed to encompass "plurality" of such primers unless context dictates otherwise.
任何給定擴增反應包括對待擴增之模板具特異性的至少一種類型之正向引子及至少一種類型之反向引子。然而,在某些實例中,正向及反向引子可包括具有相同序列之模板特異性部分,且可具有完全相同的核苷酸序列及結構(包括任何非核苷酸修飾)。換言之,有可能使用僅一種類型之引子進行固相擴增,且此類單引子方法涵蓋於本發明之範疇內。其他實例可使用含有相同模板特異性序列但在一些其他結構特徵方面不同的正向及反向引子。舉例而言,一種類型之引子可含有不存在於另一類型的引子中之非核苷酸修飾。Any given amplification reaction includes at least one type of forward primer and at least one type of reverse primer specific for the template to be amplified. However, in certain examples, the forward and reverse primers can include template-specific portions having the same sequence, and can have the exact same nucleotide sequence and structure (including any non-nucleotide modifications). In other words, it is possible to use only one type of primer for solid-phase amplification, and such single-primer methods are encompassed within the scope of the present invention. Other examples may use forward and reverse primers that contain the same template-specific sequence but differ in some other structural feature. For example, one type of primer can contain non-nucleotide modifications that are not present in another type of primer.
術語「群集(cluster)」及「群落(colony)」在本文中可互換用以指固體支撐物上之離散位點,其包括複數個相同的固定化核酸股及複數個相同的固定化互補核酸股。術語「聚類陣列(clustered array)」係指由此類群集或群落形成的陣列。在此上下文中,術語「陣列(array)」不應被理解為需要群集的有序排列。The terms "cluster" and "colony" are used interchangeably herein to refer to discrete sites on a solid support comprising a plurality of identical immobilized nucleic acid strands and a plurality of identical immobilized complementary nucleic acids share. The term "clustered array" refers to an array formed from such clusters or colonies. In this context, the term "array" should not be understood as requiring an ordered arrangement of clusters.
術語「固相」或「表面(surface)」用於意指其中引子連接於平坦表面之平坦陣列,該表面例如玻璃、二氧化矽或塑膠顯微鏡載片或類似流體槽裝置;珠粒,其中一或兩種引子連接於珠粒且珠粒擴增;或在珠粒已擴增之後,表面上珠粒之陣列。The terms "solid phase" or "surface" are used to mean a flat array in which the primers are attached to a flat surface, such as a glass, silica or plastic microscope slide or similar fluid cell device; beads, one of which Either two primers are attached to the beads and the beads are amplified; or an array of beads on the surface after the beads have been amplified.
可使用熱循環過程或溫度維持恆定,且使用試劑之改變來執行延伸及變性循環之過程來製備聚類陣列。在一實例中,等溫過程可有利地包括使用低溫。Cluster arrays can be prepared using a process of thermal cycling or maintaining a constant temperature, and using changes in reagents to perform cycles of extension and denaturation. In one example, an isothermal process may advantageously include the use of low temperatures.
應瞭解,本文所述或此項技術中通常已知之擴增方法中之任一者可與通用或目標特異性引子一起使用以擴增固定化DNA片段。適於擴增之方法包括但不限於聚合酶連鎖反應(PCR)、股置換擴增(strand displacement amplification,SDA)、轉錄介導擴增(transcription mediated amplification,TMA)及基於核酸序列之擴增(nucleic acid sequence based amplification,NASBA)。以上擴增方法可用於擴增一或多種相關核酸。舉例而言,包括多工PCR、SDA、TMA、NASBA及其類似方法之PCR可用於擴增固定化DNA片段。在一些實例中,特異性針對相關多核苷酸之引子包括於擴增反應中。It will be appreciated that any of the amplification methods described herein or generally known in the art can be used with universal or target specific primers to amplify immobilized DNA fragments. Methods suitable for amplification include, but are not limited to, polymerase chain reaction (PCR), strand displacement amplification (SDA), transcription mediated amplification (TMA), and nucleic acid sequence-based amplification ( nucleic acid sequence based amplification, NASBA). The above amplification methods can be used to amplify one or more related nucleic acids. For example, PCR including multiplex PCR, SDA, TMA, NASBA, and similar methods can be used to amplify immobilized DNA fragments. In some examples, primers specific for the polynucleotide of interest are included in the amplification reaction.
其他適於擴增多核苷酸之方法可包括寡核苷酸延伸及接合、滾環擴增(rolling circle amplification,RCA)或寡核苷酸接合分析(oligonucleotide ligation assay,OLA)技術。應瞭解,此等擴增方法可經設計以擴增固定化DNA片段。舉例而言,在一些實例中,擴增方法可包括接合探針擴增或寡核苷酸接合分析(OLA)反應,其含有特異性針對相關核酸之引子。在一些實例中,擴增方法可包括引子延伸-接合反應,其含有特異性針對相關核酸之引子。作為可經特定設計以擴增相關核酸之引子延伸及接合引子之非限制性實例,擴增可包括用於GoldenGate分析(Illumina公司,San Diego, CA)之引子。Other methods suitable for amplifying polynucleotides may include oligonucleotide extension and ligation, rolling circle amplification (RCA) or oligonucleotide ligation assay (OLA) techniques. It will be appreciated that such amplification methods can be designed to amplify immobilized DNA fragments. For example, in some examples, amplification methods can include ligation probe amplification or oligonucleotide ligation assay (OLA) reactions that contain primers specific for the nucleic acid of interest. In some examples, amplification methods can include primer extension-ligation reactions containing primers specific for the nucleic acid of interest. As a non-limiting example of primer extension and ligation primers that can be specifically designed to amplify a nucleic acid of interest, amplification can include primers for GoldenGate assays (Illumina, Inc., San Diego, CA).
可用於本發明方法中之例示性等溫擴增方法包括但不限於多重置換擴增(Multiple Displacement Amplification,MDA)或等溫股置換核酸擴增。可用於本發明中之其他非基於PCR方法包括例如股置換擴增(SDA)或超支化股置換擴增。等溫擴增方法可與股置換Phi 29聚合酶或Bst DNA聚合酶大片段一起使用,5'->3' exo-用於基因體DNA之隨機引子擴增。使用此等聚合酶利用其高持續力及股置換活性。高持續力允許聚合酶產生長度為10至20 kb之片段。如上文所闡述,較小片段可在等溫條件下使用具有低持續力及股置換活性之聚合酶,諸如Klenow聚合酶產生。Exemplary isothermal amplification methods that can be used in the methods of the invention include, but are not limited to, Multiple Displacement Amplification (MDA) or isothermal strand displacement nucleic acid amplification. Other non-PCR-based methods that can be used in the present invention include, for example, strand displacement amplification (SDA) or hyperbranched strand displacement amplification. Isothermal amplification methods can be used with strand displacement Phi 29 polymerase or Bst DNA polymerase large fragments, 5'->3' exo- for random primer amplification of genomic DNA. Use of these polymerases takes advantage of their high persistence and strand displacement activity. The high sustaining force allows the polymerase to generate fragments of 10 to 20 kb in length. As described above, smaller fragments can be generated under isothermal conditions using polymerases with low sustaining force and strand displacement activity, such as Klenow polymerase.
DNA聚合酶可包括已藉由結構同源性分類成鑑別為A、B、C、D、X、Y及RT之家族的聚合酶。家族A中之DNA聚合酶包括例如T7 DNA聚合酶、真核粒線體DNA聚合酶γ、大腸桿菌(E. coli)DNA Pol I(包括Klenow片段)、水生棲熱菌(Thermus aquaticus)Pol I及嗜熱脂肪芽孢桿菌(Bacillus stearothermophilus)Pol I。家族B中之DNA聚合酶包括例如真核DNA聚合酶a、6及E;DNA聚合酶C;T4 DNA聚合酶、Phi29 DNA聚合酶、嗜熱球菌屬(Thermococcus sp.)90 N-7古菌(archaeon)聚合酶(亦稱為9°N™)及其變體,諸如美國專利申請公開案第2016/0032377 A1號中所揭示之實例,及RB69噬菌體DNA聚合酶。家族C包括例如大腸桿菌DNA聚合酶III α次單元。家族D包括例如來源於古細菌(Archaea)之廣古菌門(Euryarchaeota)子域的聚合酶。家族X中之DNA聚合酶包括例如真核聚合酶Pol β、Pol σ、Pol λ及Pol μ以及釀酒酵母(S. cerevisiae)Pol4。家族Y中之DNA聚合酶包括例如Pol η、Pol ι、Pol κ、大腸桿菌Pol IV(DINB)及大腸桿菌Pol V(UmuD'2C)。DNA聚合酶之反轉錄酶(reverse transcriptase,RT)家族包括例如反轉錄病毒反轉錄酶及真核端粒酶。實例RNA聚合酶包括但不限於病毒RNA聚合酶,諸如T7 RNA聚合酶;真核RNA聚合酶,諸如RNA聚合酶I、RNA聚合酶II、RNA聚合酶III、RNA聚合酶IV及RNA聚合酶V;及古細菌RNA聚合酶。其他聚合酶亦包括在如本文中所提及之聚合酶當中,亦包括任何其他功能性聚合酶,包括具有與僅作為列舉非限制性實例提供的上文所提及聚合酶中之任一者相比經修飾之序列的聚合酶。DNA polymerases can include polymerases that have been classified by structural homology into families identified as A, B, C, D, X, Y, and RT. DNA polymerases in family A include, for example, T7 DNA polymerase, eukaryotic mitochondrial DNA polymerase gamma, E. coli DNA Pol I (including Klenow fragments), Thermus aquaticus Pol I and Bacillus stearothermophilus Pol I. Family B DNA polymerase of the eukaryotic DNA polymerases include, for example, a, 6, and E; DNA polymerase C; (. Thermococcus sp) T4 DNA polymerase, the Phi29 DNA polymerase, thermophilic Lactococcus 9 0 N-7 ancient archaeon polymerase (also known as 9°N™) and variants thereof, such as the examples disclosed in US Patent Application Publication No. 2016/0032377 A1, and RB69 phage DNA polymerase. Family C includes, for example, the E. coli DNA polymerase III alpha subunit. Family D includes, for example, polymerases derived from the Euryarchaeota subdomain of Archaea. DNA polymerases in family X include, for example, the eukaryotic polymerases Pol β, Pol σ, Pol λ and Pol μ and S. cerevisiae Pol4. DNA polymerases in family Y include, for example, Pol n, Pol 1, Pol κ, Escherichia coli Pol IV (DINB), and Escherichia coli Pol V (UmuD'2C). The reverse transcriptase (RT) family of DNA polymerases includes, for example, retroviral reverse transcriptases and eukaryotic telomerases. Example RNA polymerases include, but are not limited to, viral RNA polymerases, such as T7 RNA polymerase; eukaryotic RNA polymerases, such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V ; and Archaeal RNA polymerase. Other polymerases are also included among the polymerases as mentioned herein, as are any other functional polymerases, including those with any of the above-mentioned polymerases provided by way of non-limiting example only Polymerases compared to modified sequences.
在一些實例中,等溫擴增可使用動力排除擴增(KEA),亦稱為排除擴增(ExAmp)進行。本發明之核酸庫可使用包括以下步驟的方法製造:使擴增試劑反應以產生複數個擴增位點,該等位點各自包括來自已接種位點之個別目標核酸之擴增子的實質上純系群體。在一些實例中,擴增反應一直進行至產生足夠填充各別擴增位點之容量數目之擴增子為止。以此方式填充已接種位點之容量防止目標核酸落於該位點,從而在該位點產生擴增子的純系群體。在一些實例中,即使擴增位點在第二目標核酸到達該位點之前仍未填滿容量,仍可達成表觀純系性。在一些條件下,第一目標核酸之擴增可繼續進行至如此程度,使得製造足夠數目之拷貝以有效勝過或完全蓋過轉移至該位點之第二目標核酸之拷貝的產生。舉例而言,在對直徑小於500 nm之圓形特徵使用橋式擴增過程的實例中,已確定,在第一目標核酸之14個指數擴增循環之後,同一位點處來自第二目標核酸之污染所產生的污染性擴增子之數目將不足以不利地影響在Illumina定序平台上的合成定序(sequencing-by-synthesis)分析。In some examples, isothermal amplification can be performed using kinetic exclusion amplification (KEA), also known as exclusion amplification (ExAmp). Nucleic acid libraries of the invention can be made using methods comprising the steps of reacting amplification reagents to generate a plurality of amplification sites, each of the sites comprising substantial amounts of amplicons from individual target nucleic acids that have been seeded pure group. In some examples, the amplification reaction is performed until a sufficient number of amplicons are produced to fill the capacity of the respective amplification sites. Filling the capacity of the vaccinated site in this way prevents the target nucleic acid from falling at that site, thereby producing a clonal population of amplicons at that site. In some instances, apparent homogeneity can be achieved even if the amplification site does not fill capacity before the second target nucleic acid reaches the site. Under some conditions, the amplification of the first target nucleic acid can proceed to such an extent that a sufficient number of copies are made to effectively outweigh or completely override the production of copies of the second target nucleic acid transferred to the site. For example, in the example using a bridge amplification process for circular features less than 500 nm in diameter, it has been determined that after 14 cycles of exponential amplification of the first target nucleic acid, the same locus from the second target nucleic acid The number of contaminating amplicons produced by the contamination would be insufficient to adversely affect sequencing-by-synthesis analysis on the Illumina sequencing platform.
在一些實例中,當某一過程以足夠快的速率發生,從而有效地排除另一事件或過程發生時,即可發生動力排除。舉例而言,製造核酸陣列,其中陣列之位點自溶液隨機接種目標核酸,且在擴增過程中產生的目標核酸之拷貝填滿接種位點中之每一者的容量。根據本發明之動力排除方法,接種及擴增過程可在擴增速率超過接種速率的條件下同時進行。因此,在已接種第一目標核酸之位點處相對較快的拷貝製造速率將有效地排除第二核酸接種於該位點上進行擴增。In some instances, kinetic exclusion occurs when a process occurs at a rate fast enough to effectively exclude another event or process from occurring. For example, nucleic acid arrays are fabricated in which the sites of the array are randomly seeded with target nucleic acid from solution, and copies of the target nucleic acid produced during the amplification process fill the capacity of each of the seeded sites. According to the kinetic exclusion method of the present invention, the inoculation and amplification processes can be performed simultaneously under conditions where the amplification rate exceeds the inoculation rate. Thus, a relatively fast copy production rate at the site where the first nucleic acid of interest has been seeded will effectively preclude seeding of the second nucleic acid at that site for amplification.
動力排除可利用相對較慢的擴增起始速率(例如緩慢的製造目標核酸之第一拷貝之速率)相對於相對較快的製造目標核酸(或目標核酸之第一拷貝)之後續拷貝的速率。在上一段之實例中,因為相對較慢的目標核酸接種速率(例如,相對較慢的擴散或運輸)相對於相對較快的發生擴增以在位點填充核酸種子之拷貝的速率,所以發生動力排除。在另一實例中,因為接種於位點之目標核酸之第一拷貝的形成延遲(例如延遲或緩慢活化)相對於相對較快的填充位點之後續拷貝的製造速率,所以可發生動力排除。在此實例中,可在個別位點接種若干不同的目標核酸(例如,在擴增之前,在各位點可存在若干目標核酸)。然而,任何給定目標核酸之第一拷貝形成可隨機活化,使得第一拷貝之平均形成速率相比於後續拷貝之產生速率而言相對較慢。在此情況下,雖然可在個別位點接種若干不同目標核酸,但動力排除將使得彼等目標核酸中僅一者得以擴增。更特定言之,一旦第一目標核酸經活化以擴增,其拷貝將迅速填滿位點之容量,從而防止在該位點製造第二目標核酸之拷貝。Kinetic exclusion can take advantage of a relatively slow initiation rate of amplification (eg, a slow rate of making a first copy of a target nucleic acid) relative to a relatively fast rate of making subsequent copies of a target nucleic acid (or first copy of a target nucleic acid) . In the example of the previous paragraph, because the relatively slow rate of target nucleic acid seeding (eg, relatively slow diffusion or transport) relative to the relatively fast rate at which amplification occurs to populate the locus with copies of the nucleic acid seed, occurs Power out. In another example, kinetic exclusion can occur because the formation of a first copy of a target nucleic acid seeded at a site is delayed (eg, delayed or slowly activated) relative to the relatively faster rate of production of subsequent copies that fill the site. In this example, several different target nucleic acids may be seeded at individual sites (eg, several target nucleic acids may be present at each site prior to amplification). However, the formation of the first copy of any given target nucleic acid can be activated randomly such that the average rate of formation of the first copy is relatively slow compared to the rate of production of subsequent copies. In this case, although several different target nucleic acids can be seeded at individual sites, kinetic exclusion will allow only one of those target nucleic acids to be amplified. More specifically, once the first nucleic acid of interest is activated for amplification, its copies will rapidly fill the capacity of the site, preventing the making of copies of the second nucleic acid of interest at that site.
擴增試劑可包括促進擴增子形成且在一些情況下增加擴增子形成速率的另外組分。實例為重組酶。重組酶可藉由允許重複侵入/延伸而促進擴增子形成。更特定言之,重組酶可促進使用目標核酸作為擴增子形成之模板的,藉由聚合酶的目標核酸之侵入及藉由聚合酶的引子之延伸。此過程可以連鎖反應形式重複,在連鎖反應中,由各輪侵入/延伸產生之擴增子充當後一輪中之模板。因為不需要變性循環(例如經由加熱或化學變性),所以該過程可比標準PCR更迅速地進行。因此,由重組酶促進之擴增可等溫進行。一般期望在由重組酶促進之擴增試劑中包括ATP或其他核苷酸(或在一些情況下,其不可水解的類似物)來促進擴增。重組酶與單股結合(single stranded binding,SSB)蛋白質之混合物尤其適用,因為SSB可進一步促進擴增。用於重組酶促進之擴增的例示性調配物包括由TwistDx(英國劍橋(Cambridge, UK))作為TwistAmp套組市售的調配物。Amplification reagents can include additional components that promote amplicon formation and in some cases increase the rate of amplicon formation. Examples are recombinases. Recombinases can facilitate amplicon formation by allowing repeated invasion/extension. More specifically, recombinases can facilitate invasion of target nucleic acid by a polymerase and extension of primers by a polymerase using the target nucleic acid as a template for amplicon formation. This process can be repeated in a chain reaction in which amplicons resulting from each round of invasion/extension serve as templates in subsequent rounds. Because no denaturation cycles (eg, via heat or chemical denaturation) are required, the process can proceed more rapidly than standard PCR. Thus, recombinase-promoted amplification can be performed isothermally. It is generally desirable to include ATP or other nucleotides (or in some cases, non-hydrolyzable analogs thereof) in amplification reagents facilitated by recombinases to facilitate amplification. Mixtures of recombinase and single stranded binding (SSB) proteins are particularly useful, since SSB further facilitates amplification. Exemplary formulations for recombinase-promoted amplification include formulations marketed by TwistDx (Cambridge, UK) as the TwistAmp kit.
可包括於擴增試劑中以促進擴增子形成且在一些情況下增加擴增子形成速率的組分之另一實例為解旋酶。解旋酶可藉由允許擴增子形成之連鎖反應而促進擴增子形成。因為不需要變性循環(例如經由加熱或化學變性),所以該過程可比標準PCR更迅速地進行。因此,解旋酶促進之擴增可等溫進行。解旋酶與單股結合(SSB)蛋白質之混合物尤其適用,因為SSB可進一步促進擴增。用於解旋酶促進之擴增的例示性調配物包括由Biohelix(Beverly, MA)作為IsoAmp套組市售的調配物。Another example of a component that can be included in amplification reagents to facilitate amplicon formation, and in some cases increase the rate of amplicon formation, is a helicase. Helicases can facilitate amplicon formation by allowing a chain reaction of amplicon formation. Because no denaturation cycles (eg, via heat or chemical denaturation) are required, the process can proceed more rapidly than standard PCR. Thus, helicase-promoted amplification can be performed isothermally. Mixtures of helicase and single-stranded bound (SSB) proteins are particularly useful, as SSB further facilitates amplification. Exemplary formulations for helicase-promoted amplification include formulations marketed by Biohelix (Beverly, MA) as the IsoAmp kit.
可包括於擴增試劑中以促進擴增子形成且在一些情況下增加擴增子形成速率的組分之又一實例為起始結合蛋白(origin binding protein)。Yet another example of a component that can be included in amplification reagents to promote amplicon formation, and in some cases increase the rate of amplicon formation, is an origin binding protein.
本文所闡述之方法之優勢在於,其提供複數種目標核酸之快速且有效的並行偵測。因此,本發明提供能夠使用此項技術中已知之技術,諸如上述實例製備及偵測核酸之整合系統。因此,本發明之整合系統可包括能夠將擴增試劑及/或定序試劑遞送至一或多個固定化DNA片段的流體組件,該系統包括諸如泵、閥、儲集器、流體管線、溫度控制器及其類似物之組件。流體槽可組態及/或用於整合系統中以偵測目標核酸。如關於流體槽所例示,整合系統之流體組件中之一或多者可用於擴增方法及偵測方法。整合系統之流體組件中之一或多者可用於本文所闡述之擴增方法且用於在定序方法,諸如以上例示之定序方法中遞送定序試劑。如本文所用,術語「流體槽(flow cell)」欲意謂具有可進行反應之腔室(亦即流動通道)、將一或多種試劑遞送至該腔室中之入口及自該腔室移除一或多種試劑之出口的容器。在一些實例中,腔室使得能夠偵測腔室中存在之反應或信號。舉例而言,腔室可在腔室中包括一或多個允許光學偵測陣列之透明表面、光學標記分子或其類似物。如本文所用,「流動通道(flow channel)」或「流動通道區域(flow channel region)」可為兩個結合組件之間界定的區域,其可選擇性地接受液體樣品。在一些實例中,流動通道可界定於圖案化支撐物與蓋之間,且因此可與界定於圖案化支撐物中之一或多個凹陷流體連通。在其他實例中,流動通道可界定於非圖案化支撐物與蓋之間。其他實例可包括用於分離反應物之皿、盤或孔,包括用於交換試劑及其他反應組分之自動化流體系統(automated fluidic)。舉例而言,可使用多孔盤,包括例如96或384孔盤。An advantage of the method described herein is that it provides rapid and efficient parallel detection of multiple target nucleic acids. Accordingly, the present invention provides integrated systems capable of producing and detecting nucleic acids using techniques known in the art, such as the examples above. Thus, the integrated systems of the present invention may include fluidic components capable of delivering amplification reagents and/or sequencing reagents to one or more immobilized DNA fragments, including, for example, pumps, valves, reservoirs, fluidic lines, temperature Components of controllers and the like. Fluid cells can be configured and/or used in integrated systems to detect target nucleic acids. As exemplified with respect to the fluid cell, one or more of the fluid components of the integrated system can be used for amplification methods and detection methods. One or more of the fluidic components of the integrated system can be used in the amplification methods described herein and in the delivery of sequencing reagents in sequencing methods, such as those exemplified above. As used herein, the term "flow cell" is intended to mean having a chamber (ie, a flow channel) in which a reaction can take place, inlets for delivering one or more reagents into the chamber, and removal from the chamber A container for the outlet of one or more reagents. In some examples, the chamber enables detection of a reaction or signal present in the chamber. For example, the chamber may include one or more transparent surfaces in the chamber that allow for optical detection of the array, optically labeled molecules, or the like. As used herein, a "flow channel" or "flow channel region" can be a region defined between two binding components that can selectively accept a liquid sample. In some examples, a flow channel can be defined between the patterned support and the lid, and thus can be in fluid communication with one or more recesses defined in the patterned support. In other examples, flow channels can be defined between the non-patterned support and the lid. Other examples may include dishes, trays, or wells for separating reactants, including automated fluidic systems for exchanging reagents and other reaction components. For example, multi-well plates can be used, including, for example, 96- or 384-well plates.
或者,整合系統可包括個別的流體系統來進行擴增方法及進行偵測方法。能夠產生擴增核酸且亦測定核酸序列之整合定序系統之實例包括但不限於MiSeqTM 平台(Illumina公司,San Diego, CA)。Alternatively, the integrated system may include separate fluidic systems for performing the amplification method and performing the detection method. Examples of integrated sequencing systems capable of generating amplified nucleic acids and also determining nucleic acid sequences include, but are not limited to, the MiSeq ™ platform (Illumina, Inc., San Diego, CA).
適合引子之非限制性實例包括P5及/或P7引子,其用於由Illumina公司出售之商業流體槽之表面上,用於在HISEQX™、MISEQ™、MISEQDX™、MINISEQ™、NEXTSEQ™、NEXTSEQDX™、NOVASEQ™、GENOME ANALYZER™、ISEQ™、成像cBot(cBot with imaging,icBot)及其他儀器平台上定序。且包括與如上文所揭示之第一或第二引子對應或互補之核苷酸序列的模板多核苷酸之部分根據如上文所提及之SBS平台中所用的此類引子序列可具有例如與P5引子(包括AATGATACGGCGACCACCGAGATCTACAC之核苷酸序列)、P7引子(包括CAAGCAGAAGACGGCATACGAGAT之核苷酸序列)或兩者對應或互補之序列,或其他。Non-limiting examples of suitable primers include the P5 and/or P7 primers used on the surface of commercial fluid cells sold by Illumina, Inc. for HISEQX™, MISEQ™, MISEQDX™, MINISEQ™, NEXTSEQ™, NEXTSEQDX™ , NOVASEQ™, GENOME ANALYZER™, ISEQ™, cBot with imaging (icBot) and other instrument platforms. And the portion of the template polynucleotide comprising the nucleotide sequence corresponding to or complementary to the first or second primer as disclosed above may have, for example, the same as P5 according to such primer sequence as used in the SBS platform as mentioned above. Primer (including the nucleotide sequence of AATGATACGGCGACCACCGAGATCTACAC), P7 primer (including the nucleotide sequence of CAAGCAGAAGACGGCATACGAGAT) or sequences corresponding to or complementary to both, or others.
作為非限制性實例,基板可包括用於前述SBS或其他平台中之任一者中的基板,該等平台諸如用於對與表面連接多核苷酸雜合之經標記寡核苷酸自動化聚類及成像的平台,其亦可(但不必)為經裝備以用於執行SBS過程之定序態樣本身的平台。此類基板可為流體槽。As a non-limiting example, a substrate can include a substrate for use in any of the aforementioned SBS or other platforms, such as for automated clustering of labeled oligonucleotides hybridized to surface-linked polynucleotides and imaging platforms, which can also (but need not) be platforms equipped to perform the sequenced samples themselves of the SBS process. Such substrates may be fluid cells.
如本文所用,術語「凹陷(depression)」係指具有由圖案化支撐物表面的一或多個間隙區域完全包圍之表面開口的圖案化支撐物中之離散凹面特徵。凹陷在表面中其開口處可具有多種形狀中之任一者,包括例如圓形、橢圓形、正方形、多邊形、星形(具有任何數目的頂點)等。與表面正交獲得之凹陷的橫截面可為曲線、正方形、多邊形、雙曲線、圓錐形、角形等。舉例而言,凹陷可為孔。亦如本文所用,「官能化凹陷(functionalized depression)」係指引子連接之離散凹面特徵,在一些實例中,該等引子藉由聚合物(諸如PAZAM或類似聚合物)連接於凹陷表面。As used herein, the term "depression" refers to discrete concave features in a patterned support having surface openings that are fully surrounded by one or more interstitial regions of the patterned support surface. The recesses may have any of a variety of shapes at their openings in the surface, including, for example, circles, ovals, squares, polygons, stars (with any number of vertices), and the like. The cross-section of the depression obtained normal to the surface can be curvilinear, square, polygonal, hyperbolic, conical, angular, and the like. For example, the recesses can be holes. As also used herein, a "functionalized depression" refers to discrete concave features linked by primers, which in some instances are attached to the surface of the depression by a polymer such as PAZAM or similar polymers.
應理解,本文中所提供之範圍包括所陳述範圍及所陳述範圍內之任何值或子範圍。作為一實例,約100 nm至約1,000 nm之範圍應解譯為不僅包括約100 nm至約1,000 nm之明確敍述的界限值,且亦包括個別值,諸如約708 nm、約945.5 nm等,及子範圍,諸如約425 nm至約825 nm、約550 nm至約940 nm等。另外,當「約(about)」及/或「實質上(substantially)」用以描述值時,其意謂涵蓋所陳述值的較小變化(至多+/-10%)。 實施例It is to be understood that ranges provided herein include the stated range and any value or sub-range within the stated range. As an example, the range of about 100 nm to about 1,000 nm should be interpreted to include not only the expressly recited limit values of about 100 nm to about 1,000 nm, but also individual values, such as about 708 nm, about 945.5 nm, etc., and subranges, such as about 425 nm to about 825 nm, about 550 nm to about 940 nm, and the like. Additionally, when "about" and/or "substantially" are used to describe a value, it is meant to encompass minor variations (up to +/- 10%) of the stated value. Example
以下實施例意欲說明本發明之特定實施例,但決不意欲限制其範疇。The following examples are intended to illustrate specific embodiments of the present invention, but are in no way intended to limit its scope.
實施例1. 評價不同多核苷酸尺寸(人類350 bp、450 bp及550 bp及細菌350 bp及550 bp庫)及不同GC/AT含量(細菌庫)之線性。Example 1. Evaluation of linearity of different polynucleotide sizes (human 350 bp, 450 bp and 550 bp and bacterial 350 bp and 550 bp pools) and different GC/AT contents (bacterial pools).
方法:不同比率之不同庫(群體長度、GC/AT含量、人類或細菌庫)用於在HiSeqTM
X流體槽上聚類。標繪相對於DNA庫之比例量的強度。應用線性擬合且使用JMP軟體計算R2
。在icBot平台上對標識符序列之經螢光標記之寡核苷酸探針的聚類及雜合成像。用與用螢光標記Alexa 647標記之經螢光標記之寡核苷酸探針互補的兩種標識符序列中之任一者標記多核苷酸(探針1:/5Alex647N/CT ACA CAT AGA GGC ACA CTC或探針2:/5Alex647N/CT ACA CGT ACT GAC ACA CTC,可購自IDT)。將具有以下來自一種或另一群體(或庫)之多核苷酸濃度的溶液裝載至8個流體槽(FC)泳道上:
增益(40)及成像曝光時間(探針1 600 ms及探針2 600 ms至900 ms)、曝光次數(3)及探針培育時間(6分鐘)用於對在聚類之後的表面連接拷貝成像。Gain (40) and imaging exposure times (probe 1 600 ms and probe 2 600 ms to 900 ms), number of exposures (3), and probe incubation time (6 min) were used for surface-attached copies after clustering imaging.
在此實施例中,螢光強度用作群集擴增水準之讀數。重要的係看在此分析中偵測到之螢光強度是否與群集擴增水準/庫輸入量相關。此等兩個因素之良好/線性相關性確定分析(及此分析之資料分析方法)之基礎。In this example, the fluorescence intensity was used as a readout for the level of cluster expansion. It is important to see if the fluorescence intensity detected in this assay correlates with the level of cluster expansion/library input. A good/linear correlation between these two factors determines the basis for the analysis (and the data analysis method for this analysis).
結果result
人類350 bp庫:探針1及探針2具有類似線性,其中R2 約0.99展示庫輸入量(群集擴增)與信號強度之間的線性關係。來自探針2之資料展示於圖3中。使用來自富GC(例如,紅細菌)、貧GC(例如,仙人掌桿菌)或更長(550 bp)群體之多核苷酸的比例發現類似的線性關係,指示起始材料之濃度在聚類之後轉變為信號強度。350 bp human libraries: Probe 1 and Probe 2 has a similar linear, wherein R 2 is about 0.99 input display library (clusters amplification) and the linear relationship between signal intensity. Data from probe 2 is shown in FIG. 3 . Similar linear relationships were found using the proportions of polynucleotides from GC-rich (eg, Rhodobacter), GC-poor (eg, Bacteroides cactus), or longer (550 bp) populations, indicating that the concentration of starting material shifted after clustering is the signal strength.
實施例2:確定可偵測到多少百分比的DNA擴增差異的分析靈敏度。Example 2: Analytical sensitivity to determine what percentage of DNA amplification differences can be detected.
方法:具有10%差異之DNA輸入在HiSeqTM X流體槽上聚類。在探針1或探針2雜合後量測信號強度。Methods: DNA inputs with 10% variance were clustered on a HiSeq™ X fluidic cell. Signal intensities were measured after probe 1 or probe 2 hybridization.
結果result
來自紅細菌350 bp聚類之13個流體槽的資料概述於圖4中。JMP分析展示,40%、50%及60%之紅細菌350 bp庫之DNA庫輸入可由此分析分開,具有統計學意義(95%)。使用具有不同插入物尺寸或GC含量之庫,例如人類350 bp、450 bp及550 bp、紅細菌550 bp及仙人掌桿菌350 bp及550 bp庫發現類似分析靈敏度。Data from the 13 fluid cells of the 350 bp cluster of Rhodobacter are summarized in Figure 4. The JMP analysis showed that 40%, 50% and 60% of the DNA pool input of the Rhodobacter 350 bp pool could be separated by this analysis with statistical significance (95%). Similar assay sensitivities were found using libraries with different insert sizes or GC content, such as human 350 bp, 450 bp and 550 bp, Rhodobacter 550 bp and Cactobacillus 350 bp and 550 bp libraries.
應瞭解,前述概念及本文中更詳細地論述之額外概念的所有組合(限制條件為此類概念並非相互不相容)考慮為本文中所揭示之發明主題的一部分。特定而言,在本發明結尾處出現之所主張主題的全部組合考慮為本文中所揭示之發明主題的一部分,且可用以達成本文中所描述之益處及優勢。It should be understood that all combinations of the foregoing concepts and additional concepts discussed in greater detail herein (provided that such concepts are not mutually incompatible) are considered part of the inventive subject matter disclosed herein. In particular, all combinations of the claimed subject matter appearing at the conclusion of this disclosure are considered part of the inventive subject matter disclosed herein, and can be utilized to achieve the benefits and advantages described herein.
無none
當參考附圖閱讀以下實施方式時,本發明之此等及其他特徵、態樣及優勢將變得更好理解,其中:These and other features, aspects and advantages of the present invention will become better understood when reading the following embodiments with reference to the accompanying drawings, wherein:
[圖1]展示根據如本文所揭示之方法之一個實例之態樣的流程圖。[FIG. 1] A flowchart showing an aspect according to one example of a method as disclosed herein.
[圖2]展示根據本發明之態樣的方法之一個實例之要素的說明。[FIG. 2] An illustration showing elements of one example of a method according to an aspect of the present invention.
[圖3]為展示在一個實例中,根據本發明之態樣,自與以不同比例之裝載DNA為起始物質之多核苷酸拷貝雜合的螢光標記之寡核苷酸偵測到之平均強度之差異的圖式。[FIG. 3] is shown in one example, according to aspects of the present invention, detected from fluorescently labeled oligonucleotides hybridized to copies of polynucleotides starting with DNA loaded at different ratios Schema of the difference in mean intensity.
[圖4]為比較在一個實例中,在聚類程序中在以總DNA之40%、50%或60%裝載的多核苷酸聚類之後螢光偵測強度的圖式。[FIG. 4] is a graph comparing the fluorescence detection intensities after clustering of polynucleotides loaded at 40%, 50% or 60% of the total DNA in a clustering procedure in one example.
[圖5]展示根據如本文所揭示之方法之一個實例之態樣的流程圖。[FIG. 5] A flowchart showing an aspect according to one example of a method as disclosed herein.
Claims (22)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202063031230P | 2020-05-28 | 2020-05-28 | |
| US63/031,230 | 2020-05-28 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TW202146659A true TW202146659A (en) | 2021-12-16 |
Family
ID=76502835
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW110118675A TW202146659A (en) | 2020-05-28 | 2021-05-24 | Comparing copies of polynucleotides with different features |
Country Status (12)
| Country | Link |
|---|---|
| US (1) | US20210371908A1 (en) |
| EP (1) | EP4158059A1 (en) |
| JP (1) | JP2023528003A (en) |
| KR (1) | KR20230034941A (en) |
| CN (1) | CN115867672A (en) |
| AU (1) | AU2021280233A1 (en) |
| BR (1) | BR112022024008A2 (en) |
| CA (1) | CA3177270A1 (en) |
| IL (1) | IL297944A (en) |
| MX (1) | MX2022013817A (en) |
| TW (1) | TW202146659A (en) |
| WO (1) | WO2021242668A1 (en) |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060172314A1 (en) * | 2005-01-31 | 2006-08-03 | Song Min-Sun | Quantification of amplified nucleic acids |
| CA3209502A1 (en) * | 2008-09-16 | 2010-03-25 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
| US8182994B2 (en) * | 2009-09-15 | 2012-05-22 | Illumina Cambridge Limited | Centroid markers for image analysis of high denisty clusters in complex polynucleotide sequencing |
| EP2820174B1 (en) * | 2012-02-27 | 2019-12-25 | The University of North Carolina at Chapel Hill | Methods and uses for molecular tags |
| JP6093436B2 (en) * | 2013-03-12 | 2017-03-08 | 株式会社日立製作所 | Two-dimensional cell array device and apparatus for gene quantification and sequence analysis |
| EP2971070B2 (en) | 2013-03-14 | 2021-03-03 | Illumina, Inc. | Modified polymerases for improved incorporation of nucleotide analogues |
| US10465232B1 (en) * | 2015-10-08 | 2019-11-05 | Trace Genomics, Inc. | Methods for quantifying efficiency of nucleic acid extraction and detection |
| WO2017192974A1 (en) * | 2016-05-06 | 2017-11-09 | Regents Of The University Of Minnesota | Analytical standards and methods of using same |
| GB2572877A (en) * | 2016-11-18 | 2019-10-16 | Twist Bioscience Corp | Polynucleotide libraries having controlled stoichiometry and synthesis thereof |
| US20210110885A1 (en) * | 2017-03-20 | 2021-04-15 | Celula China Med-Technology Co., Ltd. | Method of correcting amplification bias in amplicon sequencing |
-
2021
- 2021-05-24 TW TW110118675A patent/TW202146659A/en unknown
- 2021-05-24 CA CA3177270A patent/CA3177270A1/en active Pending
- 2021-05-24 WO PCT/US2021/033834 patent/WO2021242668A1/en not_active Ceased
- 2021-05-24 JP JP2022572710A patent/JP2023528003A/en active Pending
- 2021-05-24 MX MX2022013817A patent/MX2022013817A/en unknown
- 2021-05-24 BR BR112022024008A patent/BR112022024008A2/en not_active Application Discontinuation
- 2021-05-24 CN CN202180038450.8A patent/CN115867672A/en active Pending
- 2021-05-24 IL IL297944A patent/IL297944A/en unknown
- 2021-05-24 EP EP21733288.1A patent/EP4158059A1/en active Pending
- 2021-05-24 US US17/328,151 patent/US20210371908A1/en active Pending
- 2021-05-24 AU AU2021280233A patent/AU2021280233A1/en active Pending
- 2021-05-24 KR KR1020227041061A patent/KR20230034941A/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| CA3177270A1 (en) | 2021-12-02 |
| US20210371908A1 (en) | 2021-12-02 |
| EP4158059A1 (en) | 2023-04-05 |
| BR112022024008A2 (en) | 2022-12-27 |
| WO2021242668A1 (en) | 2021-12-02 |
| CN115867672A (en) | 2023-03-28 |
| MX2022013817A (en) | 2023-03-03 |
| IL297944A (en) | 2023-01-01 |
| KR20230034941A (en) | 2023-03-10 |
| AU2021280233A1 (en) | 2022-12-08 |
| JP2023528003A (en) | 2023-07-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3872187B1 (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| US12247254B2 (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| US10975430B2 (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| TW202146659A (en) | Comparing copies of polynucleotides with different features | |
| US20210230689A1 (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40043984B (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40049316A (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40049316B (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40043984A (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40015459B (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40015459A (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40014781A (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40014781B (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries | |
| HK40029693A (en) | Compositions and methods for improving sample identification in indexed nucleic acid libraries |