US20180171360A1 - Methods for increasing cas9-mediated engineering efficiency - Google Patents
Methods for increasing cas9-mediated engineering efficiency Download PDFInfo
- Publication number
- US20180171360A1 US20180171360A1 US15/887,893 US201815887893A US2018171360A1 US 20180171360 A1 US20180171360 A1 US 20180171360A1 US 201815887893 A US201815887893 A US 201815887893A US 2018171360 A1 US2018171360 A1 US 2018171360A1
- Authority
- US
- United States
- Prior art keywords
- cas9
- nucleic acid
- target
- sgrna
- target nucleic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 151
- 230000001965 increasing effect Effects 0.000 title abstract description 16
- 230000001404 mediated effect Effects 0.000 title abstract description 15
- 101150038500 cas9 gene Proteins 0.000 title 1
- 108091033409 CRISPR Proteins 0.000 claims abstract description 421
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 249
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 224
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 224
- 102000040430 polynucleotide Human genes 0.000 claims description 196
- 108091033319 polynucleotide Proteins 0.000 claims description 196
- 239000002157 polynucleotide Substances 0.000 claims description 196
- 108020004414 DNA Proteins 0.000 claims description 166
- 108090000623 proteins and genes Proteins 0.000 claims description 163
- 210000004027 cell Anatomy 0.000 claims description 155
- 102000004169 proteins and genes Human genes 0.000 claims description 105
- 230000027455 binding Effects 0.000 claims description 67
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 64
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 52
- 125000006850 spacer group Chemical group 0.000 claims description 51
- 238000003776 cleavage reaction Methods 0.000 claims description 45
- 230000007017 scission Effects 0.000 claims description 45
- 102000053602 DNA Human genes 0.000 claims description 32
- 238000001890 transfection Methods 0.000 claims description 24
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 19
- 230000001580 bacterial effect Effects 0.000 claims description 17
- 238000000338 in vitro Methods 0.000 claims description 12
- 238000011144 upstream manufacturing Methods 0.000 claims description 12
- 210000004962 mammalian cell Anatomy 0.000 claims description 11
- 241000194020 Streptococcus thermophilus Species 0.000 claims description 9
- 241000588650 Neisseria meningitidis Species 0.000 claims description 7
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 7
- 239000002502 liposome Substances 0.000 claims description 7
- 210000005253 yeast cell Anatomy 0.000 claims description 6
- 238000004520 electroporation Methods 0.000 claims description 5
- 150000002632 lipids Chemical class 0.000 claims description 5
- 241000191967 Staphylococcus aureus Species 0.000 claims description 4
- 239000002105 nanoparticle Substances 0.000 claims description 4
- 230000003612 virological effect Effects 0.000 claims description 3
- 101100166144 Staphylococcus aureus cas9 gene Proteins 0.000 claims description 2
- 238000010361 transduction Methods 0.000 claims description 2
- 230000026683 transduction Effects 0.000 claims description 2
- 101100166147 Streptococcus thermophilus cas9 gene Proteins 0.000 claims 1
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 abstract description 75
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 abstract description 20
- 238000010362 genome editing Methods 0.000 abstract description 15
- 235000018102 proteins Nutrition 0.000 description 83
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 71
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 71
- 230000035772 mutation Effects 0.000 description 70
- 101710163270 Nuclease Proteins 0.000 description 68
- 230000000694 effects Effects 0.000 description 67
- 230000014509 gene expression Effects 0.000 description 59
- 239000013598 vector Substances 0.000 description 57
- 108091079001 CRISPR RNA Proteins 0.000 description 55
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 description 54
- 102000009524 Vascular Endothelial Growth Factor A Human genes 0.000 description 53
- 125000003729 nucleotide group Chemical group 0.000 description 51
- 108010081734 Ribonucleoproteins Proteins 0.000 description 45
- 102000004389 Ribonucleoproteins Human genes 0.000 description 45
- 239000002773 nucleotide Substances 0.000 description 45
- 108090000765 processed proteins & peptides Proteins 0.000 description 45
- 102000004196 processed proteins & peptides Human genes 0.000 description 44
- 229920001184 polypeptide Polymers 0.000 description 43
- 150000001413 amino acids Chemical class 0.000 description 41
- 238000003752 polymerase chain reaction Methods 0.000 description 41
- 235000001014 amino acid Nutrition 0.000 description 40
- 108091034117 Oligonucleotide Proteins 0.000 description 38
- 241000196324 Embryophyta Species 0.000 description 37
- 239000013615 primer Substances 0.000 description 36
- 238000012163 sequencing technique Methods 0.000 description 35
- 230000008439 repair process Effects 0.000 description 33
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 32
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 31
- 230000004048 modification Effects 0.000 description 31
- 238000012986 modification Methods 0.000 description 31
- 230000008685 targeting Effects 0.000 description 31
- 230000001105 regulatory effect Effects 0.000 description 30
- 230000000295 complement effect Effects 0.000 description 28
- 238000005516 engineering process Methods 0.000 description 28
- 108091028043 Nucleic acid sequence Proteins 0.000 description 27
- 238000013518 transcription Methods 0.000 description 27
- 238000003780 insertion Methods 0.000 description 26
- 230000037431 insertion Effects 0.000 description 26
- 230000035897 transcription Effects 0.000 description 25
- 108091093088 Amplicon Proteins 0.000 description 22
- 238000006243 chemical reaction Methods 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 21
- 230000000875 corresponding effect Effects 0.000 description 19
- 230000006780 non-homologous end joining Effects 0.000 description 19
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 18
- 230000003197 catalytic effect Effects 0.000 description 18
- 241000894006 Bacteria Species 0.000 description 17
- 102000004533 Endonucleases Human genes 0.000 description 16
- 108010042407 Endonucleases Proteins 0.000 description 16
- 239000011324 bead Substances 0.000 description 16
- 230000037361 pathway Effects 0.000 description 16
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 15
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 15
- 231100000219 mutagenic Toxicity 0.000 description 14
- 230000003505 mutagenic effect Effects 0.000 description 14
- 238000002474 experimental method Methods 0.000 description 13
- 239000013604 expression vector Substances 0.000 description 13
- 230000006870 function Effects 0.000 description 13
- 229910052739 hydrogen Inorganic materials 0.000 description 13
- 241000894007 species Species 0.000 description 13
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 12
- 238000012217 deletion Methods 0.000 description 12
- 230000037430 deletion Effects 0.000 description 12
- 238000012350 deep sequencing Methods 0.000 description 11
- 239000001257 hydrogen Substances 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 230000009261 transgenic effect Effects 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 10
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 230000010354 integration Effects 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- 239000003155 DNA primer Substances 0.000 description 8
- 108020005004 Guide RNA Proteins 0.000 description 8
- 241000238631 Hexapoda Species 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 230000009437 off-target effect Effects 0.000 description 8
- 239000000047 product Substances 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 7
- UZOVYGYOLBIAJR-UHFFFAOYSA-N 4-isocyanato-4'-methyldiphenylmethane Chemical compound C1=CC(C)=CC=C1CC1=CC=C(N=C=O)C=C1 UZOVYGYOLBIAJR-UHFFFAOYSA-N 0.000 description 7
- 108020001019 DNA Primers Proteins 0.000 description 7
- 230000033616 DNA repair Effects 0.000 description 7
- 108091081021 Sense strand Proteins 0.000 description 7
- 239000012190 activator Substances 0.000 description 7
- 230000003044 adaptive effect Effects 0.000 description 7
- 230000008970 bacterial immunity Effects 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 230000001939 inductive effect Effects 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 239000003550 marker Substances 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 6
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 6
- 238000010453 CRISPR/Cas method Methods 0.000 description 6
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 6
- -1 Cas9-like Proteins 0.000 description 6
- 230000007018 DNA scission Effects 0.000 description 6
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 108020004511 Recombinant DNA Proteins 0.000 description 6
- 241000700605 Viruses Species 0.000 description 6
- 230000000692 anti-sense effect Effects 0.000 description 6
- 229910052791 calcium Inorganic materials 0.000 description 6
- 239000011575 calcium Substances 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 6
- 238000001415 gene therapy Methods 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 6
- 238000010348 incorporation Methods 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 230000009438 off-target cleavage Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 230000009466 transformation Effects 0.000 description 6
- 238000010446 CRISPR interference Methods 0.000 description 5
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 5
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 5
- 102000029812 HNH nuclease Human genes 0.000 description 5
- 108060003760 HNH nuclease Proteins 0.000 description 5
- 108010078067 RNA Polymerase III Proteins 0.000 description 5
- 102000014450 RNA Polymerase III Human genes 0.000 description 5
- 230000006819 RNA synthesis Effects 0.000 description 5
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 5
- 238000000137 annealing Methods 0.000 description 5
- 239000006285 cell suspension Substances 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 230000003247 decreasing effect Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 229920002521 macromolecule Polymers 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- 230000005030 transcription termination Effects 0.000 description 5
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 4
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 4
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 4
- 108020004459 Small interfering RNA Proteins 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 230000004721 adaptive immunity Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000000903 blocking effect Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 238000005251 capillar electrophoresis Methods 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 230000010307 cell transformation Effects 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 230000030648 nucleus localization Effects 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000002741 site-directed mutagenesis Methods 0.000 description 4
- 239000004055 small Interfering RNA Substances 0.000 description 4
- 230000000153 supplemental effect Effects 0.000 description 4
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- 241000203069 Archaea Species 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 238000007400 DNA extraction Methods 0.000 description 3
- 108091029865 Exogenous DNA Proteins 0.000 description 3
- 101150038242 GAL10 gene Proteins 0.000 description 3
- 102100024637 Galectin-10 Human genes 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 229930182555 Penicillin Natural products 0.000 description 3
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 3
- 229920002873 Polyethylenimine Polymers 0.000 description 3
- 108010034634 Repressor Proteins Proteins 0.000 description 3
- 102000009661 Repressor Proteins Human genes 0.000 description 3
- 101001099217 Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) Triosephosphate isomerase Proteins 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- 108010058532 UTP-hexose-1-phosphate uridylyltransferase Proteins 0.000 description 3
- 102000006321 UTP-hexose-1-phosphate uridylyltransferase Human genes 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 230000008436 biogenesis Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000021615 conjugation Effects 0.000 description 3
- 238000007405 data analysis Methods 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 235000003869 genetically modified organism Nutrition 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 230000036039 immunity Effects 0.000 description 3
- 238000000126 in silico method Methods 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 229910001629 magnesium chloride Inorganic materials 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 238000012235 off-target genome editing Methods 0.000 description 3
- 229940049954 penicillin Drugs 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000010008 shearing Methods 0.000 description 3
- 229960005322 streptomycin Drugs 0.000 description 3
- 238000005382 thermal cycling Methods 0.000 description 3
- 238000009966 trimming Methods 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- 102100030379 Acyl-coenzyme A synthetase ACSM2A, mitochondrial Human genes 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 101710187578 Alcohol dehydrogenase 1 Proteins 0.000 description 2
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 241000195940 Bryophyta Species 0.000 description 2
- 101150069031 CSN2 gene Proteins 0.000 description 2
- 241000218631 Coniferophyta Species 0.000 description 2
- 230000008836 DNA modification Effects 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical class NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 101100054737 Homo sapiens ACSM2A gene Proteins 0.000 description 2
- 101100220044 Homo sapiens CD34 gene Proteins 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 241000699660 Mus musculus Species 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 102000011755 Phosphoglycerate Kinase Human genes 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 244000062793 Sorghum vulgare Species 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 238000000692 Student's t-test Methods 0.000 description 2
- 108010020764 Transposases Proteins 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 2
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000006664 bond formation reaction Methods 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000000205 computational method Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 101150055601 cops2 gene Proteins 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000009792 diffusion process Methods 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000005782 double-strand break Effects 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 238000003205 genotyping method Methods 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 210000003917 human chromosome Anatomy 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000001821 nucleic acid purification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 230000001124 posttranscriptional effect Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 235000004252 protein component Nutrition 0.000 description 2
- 108020001580 protein domains Proteins 0.000 description 2
- 102000005912 ran GTP Binding Protein Human genes 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000037425 regulation of transcription Effects 0.000 description 2
- 230000009712 regulation of translation Effects 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 238000002271 resection Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 239000013605 shuttle vector Substances 0.000 description 2
- 238000001542 size-exclusion chromatography Methods 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 238000011830 transgenic mouse model Methods 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 241000186045 Actinomyces naeslundii Species 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 235000016626 Agrimonia eupatoria Nutrition 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 102100023635 Alpha-fetoprotein Human genes 0.000 description 1
- 108010006591 Apoenzymes Proteins 0.000 description 1
- 101100365680 Arabidopsis thaliana SGT1B gene Proteins 0.000 description 1
- 241000512259 Ascophyllum nodosum Species 0.000 description 1
- 101100136076 Aspergillus oryzae (strain ATCC 42149 / RIB 40) pel1 gene Proteins 0.000 description 1
- 235000000832 Ayote Nutrition 0.000 description 1
- 102000052609 BRCA2 Human genes 0.000 description 1
- 108700020462 BRCA2 Proteins 0.000 description 1
- 108091032955 Bacterial small RNA Proteins 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 241001536303 Botryococcus braunii Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 101150008921 Brca2 gene Proteins 0.000 description 1
- XDTMQSROBMDMFD-UHFFFAOYSA-N C1CCCCC1 Chemical compound C1CCCCC1 XDTMQSROBMDMFD-UHFFFAOYSA-N 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 244000249214 Chlorella pyrenoidosa Species 0.000 description 1
- 235000007091 Chlorella pyrenoidosa Nutrition 0.000 description 1
- 101100417900 Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) rbr3A gene Proteins 0.000 description 1
- 241000243321 Cnidaria Species 0.000 description 1
- 108010060434 Co-Repressor Proteins Proteins 0.000 description 1
- 102000008169 Co-Repressor Proteins Human genes 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241001275954 Cortinarius caperatus Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 240000004244 Cucurbita moschata Species 0.000 description 1
- 235000009854 Cucurbita moschata Nutrition 0.000 description 1
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 1
- 241000272778 Cygnus atratus Species 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- 230000008265 DNA repair mechanism Effects 0.000 description 1
- 108010046331 Deoxyribodipyrimidine photo-lyase Proteins 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 101100125027 Dictyostelium discoideum mhsp70 gene Proteins 0.000 description 1
- 241000255601 Drosophila melanogaster Species 0.000 description 1
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 241000258955 Echinodermata Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 102000004678 Exoribonucleases Human genes 0.000 description 1
- 108010002700 Exoribonucleases Proteins 0.000 description 1
- 241000588088 Francisella tularensis subsp. novicida U112 Species 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 108010001515 Galectin 4 Proteins 0.000 description 1
- 102100039556 Galectin-4 Human genes 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 102100036669 Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Human genes 0.000 description 1
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 description 1
- 239000004471 Glycine Chemical class 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 108050008753 HNH endonucleases Proteins 0.000 description 1
- 102000000310 HNH endonucleases Human genes 0.000 description 1
- 101150031823 HSP70 gene Proteins 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 102000011787 Histone Methyltransferases Human genes 0.000 description 1
- 108010036115 Histone Methyltransferases Proteins 0.000 description 1
- 102000003893 Histone acetyltransferases Human genes 0.000 description 1
- 108090000246 Histone acetyltransferases Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 101150102264 IE gene Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 101710203526 Integrase Proteins 0.000 description 1
- 108010061833 Integrases Proteins 0.000 description 1
- 108010015268 Integration Host Factors Proteins 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 1
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 1
- 241000672132 Lactobacillus buchneri NRRL B-30929 Species 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- 241000432054 Listeria innocua Clip11262 Species 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 241000195947 Lycopodium Species 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 101710141347 Major envelope glycoprotein Proteins 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 241000196323 Marchantiophyta Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101100219625 Mus musculus Casd1 gene Proteins 0.000 description 1
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241001250129 Nannochloropsis gaditana Species 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 102000008763 Neurofilament Proteins Human genes 0.000 description 1
- 108010088373 Neurofilament Proteins Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 230000010718 Oxidation Activity Effects 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 238000009004 PCR Kit Methods 0.000 description 1
- 101150034686 PDC gene Proteins 0.000 description 1
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 1
- 241000432806 Pasteurella multocida subsp. multocida str. Pm70 Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 101000658568 Planomicrobium okeanokoites Type II restriction enzyme FokI Proteins 0.000 description 1
- 101710182846 Polyhedrin Proteins 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 241000985694 Polypodiopsida Species 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 101710149951 Protein Tat Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108091093078 Pyrimidine dimer Proteins 0.000 description 1
- 241000205156 Pyrococcus furiosus Species 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108010032060 RNA polymerase alpha subunit Proteins 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 108010068097 Rad51 Recombinase Proteins 0.000 description 1
- 102000002490 Rad51 Recombinase Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000589180 Rhizobium Species 0.000 description 1
- 102000003661 Ribonuclease III Human genes 0.000 description 1
- 108010057163 Ribonuclease III Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000593524 Sargassum patens Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 241000191963 Staphylococcus epidermidis Species 0.000 description 1
- 241001521783 Streptococcus mutans UA159 Species 0.000 description 1
- 241001633172 Streptococcus thermophilus LMD-9 Species 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 241000999858 Treponema denticola ATCC 35405 Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- 101710164125 Tubulin alpha-1 chain Proteins 0.000 description 1
- 108010075202 UDP-glucose 4-epimerase Proteins 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 239000005862 Whey Substances 0.000 description 1
- 102000007544 Whey Proteins Human genes 0.000 description 1
- 108010046377 Whey Proteins Proteins 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 230000006154 adenylylation Effects 0.000 description 1
- 230000010386 affect regulation Effects 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 108010026331 alpha-Fetoproteins Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 238000007845 assembly PCR Methods 0.000 description 1
- 210000003578 bacterial chromosome Anatomy 0.000 description 1
- 230000007940 bacterial gene expression Effects 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- 239000012148 binding buffer Substances 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000001369 bisulfite sequencing Methods 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 101150055766 cat gene Proteins 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000008876 conformational transition Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006114 demyristoylation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000027832 depurination Effects 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 101150052825 dnaK gene Proteins 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 230000002616 endonucleolytic effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 240000004308 marijuana Species 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 230000000116 mitigating effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 239000011807 nanoball Substances 0.000 description 1
- 210000005044 neurofilament Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 101150110245 ompC gene Proteins 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 101150040383 pel2 gene Proteins 0.000 description 1
- 101150050446 pelB gene Proteins 0.000 description 1
- 239000000816 peptidomimetic Chemical class 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 150000008298 phosphoramidates Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 235000015136 pumpkin Nutrition 0.000 description 1
- 239000013635 pyrimidine dimer Substances 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000007841 sequencing by ligation Methods 0.000 description 1
- 108091029842 small nuclear ribonucleic acid Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000037426 transcriptional repression Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010015969 trypsin resistant core repressor Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/30—Endoribonucleases active with either ribo- or deoxyribonucleic acids and producing 5'-phosphomonoesters (3.1.30)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPR]
Definitions
- the present invention relates to Type II CRISPR-Cas9 systems for use in increasing Cas9-mediated genome engineering efficiency by either decreasing the number of off-target nucleic acid double-stranded breaks, and/or enhancing homology-directed repair of a cleaved target nucleic acid.
- CRISPR Clustered regularly interspaced short palindromic repeats
- Cas9 Clustered regularly interspaced short palindromic repeats
- This system provides adaptive immunity against foreign DNA in bacteria (Barrangou, R., et al., “CRISPR provides acquired resistance against viruses in prokaryotes,” Science (2007) 315:1709-1712; Makarova, K. S., et al., “Evolution and classification of the CRISPR-Cas systems,” Nat Rev Microbiol (2011) 9:467-477; Garneau, J.
- RNA-guided Cas9 endonuclease specifically targets and cleaves DNA in a sequence-dependent manner
- Casiunas, G., et al. “Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria,” Proc Natl Acad Sci USA (2012) 109: E2579-E2586; Jinek, M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821; Sternberg, S.
- site-specific nucleases such as Cas9 can introduce double-stranded breaks in DNA in unintended and/or incorrect locations, termed “off-target effects.” Accordingly, methods to reduce or eliminate off-target DNA breaks are highly desirable.
- DNA double-stranded breaks can be repaired by, for example, non-homologous end joining (NHEJ) or homology-directed repair (HDR).
- NHEJ non-homologous end joining
- HDR homology-directed repair
- Faithful repair by HDR is inefficient at site-directed breaks of the target nucleic acid because other cellular mechanisms may result in the incorporation of nucleic acids at the site of a double-stranded break or a single-stranded nick. It is apparent there is a clear need to develop novel strategies that mitigate or eliminate off-target genome editing events and increase the efficiency of inserting new material into the sites cut by site-directed nucleases such as Cas9.
- the disclosure provides for a method for reducing off-targeting nuclease cleavage comprising: contacting a first complex comprising a catalytically active Cas9 and a guide RNA with a target nucleic acid; contacting a second complex comprising a catalytically inactive Cas9 (dCas9) and a guide RNA with an off-target nucleic acid; and cleaving the target nucleic acid with the first complex, wherein the second complex prevents the first complex from cleaving the off-target nucleic acid.
- the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Streptococcus pyogenes, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. pyogenes.
- the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Streptococcus thermophilus, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. thermophilus. In some embodiments, the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Staphylococcus aureus, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. aureus.
- the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Neisseria meningitidis, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from N. meningitidis.
- the catalytically inactive Cas9 comprises a mutation in one or both of its nuclease domains. In some embodiments, the dCas9 is at least 80% catalytically inactive compared to a wild-type Cas9.
- the first complex is capable of binding to the off-target nucleic acid. In some embodiments, the binding and/or cleavage of the first complex to the off-target nucleic acid is reduced by at least 30%. In some embodiments, the binding of the first complex to the off-target nucleic acid is reduced by at least 70%.
- the cleaving comprises introducing a double-stranded break. In some embodiments, the cleaving comprises introducing a single-stranded break.
- the target nucleic acid is DNA. In some embodiments, the target nucleic acid is double-stranded DNA.
- the disclosure provides for a composition
- a composition comprising: two site-directed polypeptides to Cas9, wherein the two site-directed polypeptides comprise a mutation in one of their nuclease domains, wherein the two site-directed polypeptides are configured to bind and cleave the same strand of a double-stranded target nucleic acid.
- the two site-directed polypeptides comprise at least 10% amino acid identity to a nuclease domain of Cas9 from S. pyogenes.
- the mutation comprises a D10A mutation. In some embodiments, the mutation comprises an H840A mutation. In some embodiments, the target nucleic acid is DNA.
- the two site-directed polypeptides are bound to the sense strand of the double-stranded target nucleic acid. In some embodiments, the two site-directed polypeptides are bound to the anti-sense strand of the double-stranded target nucleic acid.
- the composition further comprises a donor polynucleotide. In some embodiments, the donor polynucleotide is single-stranded. In some embodiments, the donor polynucleotide is double-stranded. In some embodiments, the donor polynucleotide is partially single-stranded and partially double-stranded.
- a method for reducing binding and/or cleavage of an off-target nucleic acid by a complex comprising a catalytically active Cas9 protein and a guide polynucleotide comprises: (a) contacting a first complex with a selected target nucleic acid, wherein said first complex comprises: (i) a catalytically active Cas9 protein and (ii) a first guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said selected target nucleic acid; and (b) contacting a second complex with an off-target nucleic acid, wherein said second complex comprises (i) a catalytically inactive Cas9 protein (dCas9 protein) that does not cleave the off-target nucleic acid and (ii) a second guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said off-target nucleic acid
- the catalytically active Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In certain embodiments, the catalytically active Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In additional embodiments of the method, the catalytically active Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein.
- the dCas9 protein comprises at least one mutation in one or more endonuclease domains to render the dCas9 protein catalytically inactive.
- the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes.
- the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes.
- the dCas9 protein is a S.
- the one or more mutations is in a RuvC-1 domain, such as a D1OA mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- the one or more mutations comprises a D10A mutation and a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutations in an orthologous Cas9 protein.
- the selected target nucleic acid is DNA, such as double-stranded DNA.
- the selected target nucleic acid is cleaved to provide a cleavage site and the method further comprises modifying the target nucleic acid, such as by inserting at least a portion of the donor polynucleotide at the cleavage site.
- the modifying comprises deleting one or more nucleotides at the cleavage cite.
- the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- a method for modifying a target nucleic acid comprising: contacting two complexes to the same strand of the target nucleic acid, wherein each of the two complexes comprises a site-directed polypeptide and a nucleic acid-targeting nucleic acid, wherein the two site-directed polypeptides comprise a mutation in one of their nuclease domains; and modifying the target nucleic acid.
- the nucleic acid-targeting nucleic acid from one of the two complexes targets a different location in the target nucleic acid than the nucleic acid-targeting nucleic acid from the other of the two complexes.
- the two site-directed polypeptides comprise at least 75% amino acid identity to Cas9 from S. pyogenes. In some embodiments, the two site-directed polypeptides comprise at least 10% amino acid identity to a nuclease domain of Cas9 from S. pyogenes. In some embodiments, the mutation comprises a D10A mutation. In some embodiments, the mutation comprises an H840A mutation. In some embodiments, the target nucleic acid is DNA.
- the two site-directed polypeptides are bound to the sense strand of the double-stranded target nucleic acid. In some embodiments, the two site-directed polypeptides are bound to the anti-sense strand of the double-stranded target nucleic acid. In some embodiments, the modifying comprises cleaving the same strand of the target nucleic acid. In some embodiments, the cleaving comprises a single-stranded break. In some embodiments, the method further comprises inserting a donor polynucleotide into the target nucleic acid. In some embodiments, the donor polynucleotide is single-stranded. In some embodiments, the donor polynucleotide is double-stranded. In some embodiments, the donor polynucleotide is partially single-stranded and partially double-stranded.
- the invention is directed to a method for cleaving a single strand of a target nucleic acid comprising contacting first and second complexes at spaced-apart locations on the same strand of a nucleic acid molecule.
- the first complex comprises (i) a first Cas9 protein with a mutation in an endonuclease domain thereof to render the Cas9 protein a nickase; and (ii) a first guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to a first target nucleic acid.
- the second complex comprises (i) a second Cas9 protein with a mutation in an endonuclease domain thereof, to render the Cas9 protein a nickase; and (ii) a second guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to a second target nucleic; wherein the first and second Cas9 proteins cleave a single strand of said nucleic acid molecule at the spaced-apart locations on the same strand, to render a single-stranded break.
- a second Cas9 protein with a mutation in an endonuclease domain thereof to render the Cas9 protein a nickase
- a second guide polynucleotide such as sgRNA
- the first Cas9 protein and/or the second Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes. In certain embodiments, the Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes. In additional embodiments of the method, the first Cas9 protein and/or the second Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein with a mutation in an endonuclease domain thereof, to render the orthologous Cas9 protein a nickase.
- the one or more mutations is in a RuvC-1 domain, such as a D1OA mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- the target nucleic acid is double-stranded DNA and the complexes bind to and cleave the anti-sense strand of the double-stranded DNA. In other embodiments, the target nucleic acid is double-stranded DNA and the complexes bind to and cleave the sense strand of the double-stranded DNA.
- the method further comprises modifying the target nucleic acid, such as by inserting at least a portion of the donor polynucleotide into the target nucleic acid at the single-stranded break.
- the donor polynucleotide is single-stranded.
- the inserting is done using homology-directed repair of the donor polynucleotide with the target nucleic acid.
- the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- a method for directed homology-directed repair (HDR) in a target nucleic acid comprises: (a) contacting a first complex with a first target nucleic acid, wherein said first complex comprises: (i) a catalytically active Cas9 protein and (ii) a first guide polynucleotide, such as a sgRNA, that comprises a spacer adapted to bind to said first target nucleic acid, wherein said first complex cleaves the first target nucleic acid; and (b) contacting a second complex with a second target nucleic acid, wherein said second complex comprises: (i) a first catalytically inactive Cas9 protein (dCas9 protein) that comprises at least one mutation in one or more endonuclease domains to render the dCas9 protein catalytically inactive such that the dCas9 protein does not cleave the second target nucleic acid, and (ii) a second catalytically inactive Cas9 protein
- the second target nucleic acid is upstream of the first target nucleic acid. In other embodiments, the second target nucleic acid is downstream of the first target nucleic acid.
- the 5′ end of the polynucleotide donor is associated with the second complex. In other embodiments, the 3′ end of the polynucleotide donor is associated with the second complex.
- the method further comprises: (c) contacting a third complex with a third target nucleic acid, wherein the third target nucleic acid is positioned downstream of the first target nucleic acid when the first target nucleic acid is downstream of the second target nucleic acid, or wherein the third target nucleic acid is positioned upstream of the first target nucleic acid when the first target nucleic acid is upstream of the second target nucleic acid, wherein said third complex comprises: (i) a second dCas9 protein that comprises at least one mutation in one or more endonuclease domains to render the second dCas9 protein catalytically inactive such that the second dCas9 protein does not cleave the third target nucleic acid, and (ii) a third guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said third target nucleic acid, and wherein the third complex comprises the other end of the polynucleot
- the 5′ end of the polynucleotide donor is associated with the second complex and the 3′ end of the polynucleotide donor is associated with the third complex. In other embodiments, the 3′ end of the polynucleotide donor is associated with the second complex and the 5′ end of the polynucleotide donor is associated with the third complex.
- the Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In certain embodiments, the Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In additional embodiments of the method, the Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein.
- the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In other embodiments, the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In additional embodiments, the dCas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein with at least one mutation in one or more endonuclease domains to render the orthologous Cas9 protein catalytically inactive. In certain embodiments, the one or more mutations is in a RuvC-1 domain, such as a D10A mutation, numbered relative to S.
- the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- the one or more mutations comprises a D10A mutation and a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutations in an orthologous Cas9 protein.
- the selected target nucleic acid is DNA, such as double-stranded DNA.
- the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- FIG. 1A and FIG. 1B present illustrative examples of Type II CRISPR-Cas9 associated RNAs.
- FIG. 1A shows a two-RNA component Type II CRISPR-Cas9 comprising a crRNA ( FIG. 1A, 101 ) and a tracrRNA ( FIG. 1A, 102 ), otherwise known as a dual-guide RNA.
- FIG. 1B illustrates the formation of base-pair hydrogen bonds between the crRNA and the tracrRNA to form secondary structure (see U.S. Published Patent Application No. 2014-0068797, published 6 Mar.
- FIG. 1B, 103 a spacer element
- FIG. 1B, 104 a first stem element comprising a lower stem element
- FIG. 1B, 104 a bulge element comprising unpaired nucleotides
- FIG. 1B, 106 an upper stem element
- FIG. 1B, 106 a nexus element
- FIG. 1B, 107 a second hairpin element comprising a second stem element
- FIG. 1B, 108 a third hairpin element comprising a third stem element
- the figures are not proportionally rendered nor are they to scale. The locations of indicators are approximate.
- FIG. 2 shows another example of a Type II CRISPR-Cas9 associated RNA.
- the figure illustrates a single-guide RNA (sgRNA) wherein the crRNA is covalently joined to the tracrRNA and forms a RNA polynucleotide secondary structure through base-pair hydrogen bonding (see, e.g., U.S. Published Patent Application No. 2014-0068797, published 6 Mar. 2014).
- the figure presents an overview of and nomenclature for secondary structural elements of a sgRNA of the S. pyogenes Cas9 including the following: a spacer element ( FIG. 2, 201 ); a first stem element comprising a lower stem element ( FIG.
- FIG. 2, 202 a bulge element comprising unpaired nucleotides ( FIG. 2, 205 ), and an upper stem element ( FIG. 2, 203 ); a loop element ( FIG. 2, 204 ) comprising unpaired nucleotides; (a first hairpin element comprises the first stem element and the loop element); a nexus element ( FIG. 2, 206 ); a second hairpin element comprising a second stem element ( FIG. 2, 207 ); and a third hairpin element comprising a third stem element ( FIG. 2, 208 ).
- the figure is not proportionally rendered nor is it to scale. The locations of indicators are approximate.
- FIG. 3A and FIG. 3B relate to structural information for a sgRNA/Cas protein complex and a Cas protein, respectively.
- FIG. 3A provides a model based on the crystal structure of S. pyogenes Cas9 (SpyCas9) in an active complex with sgRNA (Anders C., et al., “Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease,” Nature (2014) 513:569-573).
- FIG. 3A Helical domain
- FIG. 3A the Catalytic nuclease lobe
- FIG. 3A Catalytic nuclease lobe
- FIG. 3A sgRNA
- the relative location of the 3′ end of the sgRNA is indicated ( FIG. 3A , 3′ end sgRNA).
- the spacer RNA of the sgRNA is not visible because it is surrounded by the two protein lobes.
- the relative location of the 5′ end of the sgRNA FIG.
- FIG. 3A 5′ end sgRNA
- the spacer RNA of the sgRNA is located in the 5′ end region of the sgRNA.
- a cysteine residue ( FIG. 3A , WT SpyCas9 Cys) in wild type SpyCas9 is identified in the present disclosure as an available cross-linking site.
- the Catalytic nuclease lobe is shown as the lighter lobe wherein the relative positions of the RuvC ( FIG. 3A , RuvC; RNase H homologous domain) and HNH nuclease ( FIG. 3A , HNH; HNH nuclease homologous domain) domains are indicated.
- FIG. 3A CTD
- PAM protospacer adjacent motifs
- FIG. 3B presents a model of the domain arrangement of SpyCas9 relative to its primary sequence structure.
- three regions of the primary sequence correspond to the RuvC domain ( FIG. 3B , RuvC-I (amino acids 1-78), RuvC-II (amino acids 719-765), and RuvC-III (amino acids 926-1102)).
- One region corresponds to the Helical domain ( FIG.
- FIG. 3B Helical Domain (amino acids 79-718).
- One region corresponds to the HNH domain ( FIG. 3B , HNH (amino acids 766-925).
- One region corresponds to the CTD domain ( FIG. 3B , CTD (amino acids 1103-1368).
- FIG. 3B the regions of the primary sequence corresponding to the ⁇ -Helical lobe ( FIG. 3B , alpha-helical lobe) and the Nuclease domain lobe ( FIG. 3B , Nuclease domain lobe) are indicated with brackets.
- FIG. 4 depicts an exemplary embodiment of off-target binding and cleavage during genome engineering.
- a target nucleic acid FIG. 4 . 115
- a complex comprising a site-directed polypeptide (e.g., Cas9) ( FIG. 4, 105 ) and a nucleic acid-targeting nucleic acid (e.g., sgRNA or dual-guide RNA) ( FIG. 4, 110 ).
- the complex comprising the Cas9 binds to a target nucleic acid ( FIG. 4, 120 ).
- the complex binds to an off-target nucleic acid ( FIG. 4, 125 ).
- the Cas9 of the complex can cleave the target nucleic acid ( FIG. 4, 120 ) and the off-target nucleic acid, thereby generating off-target effects.
- FIG. 5 depicts an exemplary embodiment of a method of the disclosure for reducing off-target binding and cleavage events.
- a target nucleic acid ( FIG. 5, 215 ) is contacted with a complex comprising a site-directed polypeptide (e.g., an active Cas9) ( FIG. 5, 205 ) and a nucleic acid-targeting nucleic acid (e.g., sgRNA or dual-guide RNA) ( FIG. 5, 210 ).
- the complex binds to a target nucleic acid ( FIG. 5, 220 ).
- the complex comprising the Cas9 and sgRNA binds to an off-target nucleic acid ( FIG. 5, 225 ).
- Complexes comprising an engineered dCas9 protein ( FIG. 5, 235 ) and an engineered sgRNA ( FIG. 5, 236 ), can be introduced and contacted ( FIG. 5, 230 ) with the target nucleic acid.
- the dCas9 complexes can either displace or prevent the binding of complexes comprising active Cas9.
- the active Cas9 can cleave ( FIG. 5, 240 / 245 ) the target nucleic acid.
- the active Cas9 is prevented from cleaving the off-target nucleic acid because the dCas9 is preventing its binding and cleavage. In this way, off-target cleavage may be prevented.
- FIG. 6A , FIG. 6B , and FIG. 6C show the use of tandem Cas9 D10A nickases to excise a single-stranded region of DNA on the same strand of a target nucleic acid and insert a donor polynucleotide.
- FIG. 6A shows two D10A sgRNA/dCas9 complexes targeted to two spaced-apart positions on the sense strand of a target polynucleotide.
- FIG. 6B shows that a region on the sense strand between the targeted sites has been cleaved.
- FIG. 6C shows the insertion of the donor polynucleotide with overlapping flanking regions.
- FIG. 7A and FIG. 7B depict methods of increasing HDR using sgRNA/dCas9 and catalytically active sgRNA/Cas9 complexes.
- FIG. 7A shows a system using a single sgRNA/dCas9 complex tethered to a HDR polynucleotide donor adjacent to an active sgRNA/Cas9 complex to direct the donor to the site of the double-stranded break and to position the donor next to the cut site.
- FIG. 7A shows a system using a single sgRNA/dCas9 complex tethered to a HDR polynucleotide donor adjacent to an active sgRNA/Cas9 complex to direct the donor to the site of the double-stranded break and to position the donor next to the cut site.
- FIG. 7B shows a system using two spaced-apart sgRNA/dCas9 complexes and a catalytically active sgRNA/Cas9 complex positioned between the two catalytically inactive complexes, wherein the donor is positioned across the double-stranded break.
- FIG. 8 shows the effects of dCas9 nuclease blockers (dCas9-NBs) on VEGFA sgRNA/Cas9 on-target editing at the VEGFA locus.
- FIG. 9 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 off-target editing at a known VEGFA off-target locus on human chromosome 15.
- FIG. 10 shows the various embodiments of the experimental conditions used to position homology donor nucleotides near a targeted site for increasing HDR efficiency, as described in Example 5C.
- FIG. 11 shows potential donor configurations using tandem Cas9D10A as described in the examples.
- FIG. 12 shows a comparison of repair types using either Cas9 or Cas9D10A at Targets 3 and 4 (human CD34 locus) as described in the examples.
- Neg denotes a negative control which is either Cas9 or Cas9D10A only, without sgRNA.
- Cas9 protein refers to Type II CRISPR-Cas9 proteins (as described, e.g., in Chylinski, K., (2013) “The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems,” RNA Biol. 2013 10(5):726-737), including, but not limited to Cas9, Cas9-like, proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof.
- Cas9 protein refers to Cas9 wild-type proteins derived from Type II CRISPR-Cas9 systems, modifications of Cas9 proteins, variants of Cas9 proteins, Cas9 orthologs, and combinations thereof. Cas9 proteins can be derived from any of various bacterial species which genomes encode such proteins. Cas proteins for use in the present methods are described further below.
- wild-type “naturally-occurring” and “unmodified” are used herein to mean the typical (or most common) form, appearance, phenotype, or strain existing in nature; for example, the typical form of cells, organisms, characteristics, polynucleotides, proteins, macromolecular complexes, genes, RNAs, DNAs, or genomes as they occur in and can be isolated from a source in nature.
- the wild-type form, appearance, phenotype, or strain serve as the original parent before an intentional modification.
- mutant, variant, engineered, recombinant, and modified forms are not wild-type forms.
- engineered As used herein, the terms “engineered,” “genetically engineered,” “recombinant,” “modified,” and “non-naturally occurring” are interchangeable and indicate intentional human manipulation.
- nucleic acid As used herein, the terms “nucleic acid,” “nucleotide sequence,” “oligonucleotide,” and “polynucleotide” are interchangeable. All refer to a polymeric form of nucleotides.
- the nucleotides may be deoxyribonucleotides (DNA) or ribonucleotides (RNA), or analogs thereof, and they may be of any length.
- Polynucleotides may perform any function and may have any secondary structure and three-dimensional structure. The terms encompass known analogs of natural nucleotides and nucleotides that are modified in the base, sugar and/or phosphate moieties.
- a polynucleotide may comprise one modified nucleotide or multiple modified nucleotides. Examples of modified nucleotides include methylated nucleotides and nucleotide analogs. Nucleotide structure may be modified before or after a polymer is assembled. Following polymerization, polynucleotides may be additionally modified via, for example, conjugation with a labeling component or target-binding component. A nucleotide sequence may incorporate non-nucleotide components.
- nucleic acids comprising modified backbone residues or linkages, that (i) are synthetic, naturally occurring, and non-naturally occurring, and (ii) have similar binding properties as a reference polynucleotide (e.g., DNA or RNA).
- reference polynucleotide e.g., DNA or RNA
- analogs include, but are not limited to, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs), and morpholino structures.
- Polynucleotide sequences are displayed herein in the conventional 5′ to 3′ orientation.
- complementarity refers to the ability of a nucleic acid sequence to form hydrogen bond(s) with another nucleic acid sequence (e.g., through traditional Watson-Crick base pairing).
- a percent complementarity indicates the percentage of residues in a nucleic acid molecule that can form hydrogen bonds with a second nucleic acid sequence.
- the two sequences are perfectly complementary, i.e., all of a first polynucleotide's contiguous residues hydrogen bond with the same number of contiguous residues in a second polynucleotide.
- sequence identity generally refers to the percent identity of bases or amino acids determined by comparing a first polynucleotide or polypeptide to a second polynucleotide or polypeptide using algorithms having various weighting parameters. Sequence identity between two polypeptides or two polynucleotides can be determined using sequence alignment by various methods and computer programs (e.g., BLAST, CS-BLAST, FASTA, HMMER, L-ALIGN, etc.), available through the worldwide web at sites including GENBANK (ncbi.nlm.nih.gov/genbank/) and EMBL-EBI (ebi.ac.uk.). Sequence identity between two polynucleotides or two polypeptide sequences is generally calculated using the standard default parameters of the various methods or computer programs.
- a “stem-loop structure” or “stem-loop element” refers to a polynucleotide having a secondary structure that includes a region of nucleotides that are known or predicted to form a double-stranded region (the “stem element”) that is linked on one side by a region of predominantly single-stranded nucleotides (the “loop element”).
- the term “hairpin” element is also used herein to refer to stem-loop structures. Such structures are well known in the art.
- the base pairing may be exact. However, as is known in the art, a stem element does not require exact base pairing. Thus, the stem element may include one or more base mismatches or non-paired bases.
- the term “recombination” refers to a process of exchange of genetic information between two polynucleotides.
- HDR DNA repair that takes place in cells, for example, during repair of double-stranded and single-stranded breaks in DNA.
- HDR requires nucleotide sequence homology and uses a “donor template” (donor template DNA, polynucleotide donor, or oligonucleotide (used interchangably herein) to repair the sequence where the double-stranded break occurred (e.g., DNA target sequence). This results in the transfer of genetic information from, for example, the donor template DNA to the DNA target sequence.
- donor template donor template DNA, polynucleotide donor, or oligonucleotide (used interchangably herein)
- HDR may result in alteration of the DNA target sequence (e.g., insertion, deletion, mutation) if the donor template DNA sequence or oligonucleotide sequence differs from the DNA target sequence and part or all of the donor template DNA polynucleotide or oligonucleotide is incorporated into the DNA target sequence.
- an entire donor template DNA polynucleotide, a portion of the donor template DNA polynucleotide, or a copy of the donor polynucleotide is integrated at the site of the DNA target sequence.
- non-homologous end joining refers to the repair of double-stranded breaks in DNA by direct ligation of one end of the break to the other end of the break without a requirement for a donor template DNA. NHEJ in the absence of a donor template DNA often results in a small number of nucleotides randomly inserted or deleted at the site of the double-stranded break.
- vectors can be linear or circular. Vectors can integrate into a target genome of a host cell or replicate independently in a host cell. Vectors can comprise, for example, an origin of replication, a multicloning site, and/or a selectable marker.
- An expression vector typically comprises an expression cassette.
- Vectors and plasmids include, but are not limited to, integrating vectors, prokaryotic plasmids, eukaryotic plasmids, plant synthetic chromosomes, episomes, viral vectors, cosmids, and artificial chromosomes.
- expression cassette is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell.
- the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell.
- An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- expression cassette is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell.
- the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell.
- An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- a “targeting vector” is a recombinant DNA construct typically comprising tailored DNA arms homologous to genomic DNA that flanks critical elements of a target gene or target sequence. When introduced into a cell, the targeting vector integrates into the cell genome via homologous recombination. Elements of the target gene can be modified in a number of ways including deletions and/or insertions. A defective target gene can be replaced by a functional target gene, or in the alternative a functional gene can be knocked out.
- a targeting vector comprises a selection cassette comprising a selectable marker that is introduced into the target gene. Targeting regions adjacent or sometimes within a target gene can be used to affect regulation of gene expression.
- regulatory sequences As used herein, the terms “regulatory sequences,” “regulatory elements,” and “control elements” are interchangeable and refer to polynucleotide sequences that are upstream (5′ non-coding sequences), within, or downstream (3′ non-translated sequences) of a polynucleotide target to be expressed. Regulatory sequences influence, for example, the timing of transcription, amount or level of transcription, RNA processing or stability, and/or translation of the related structural nucleotide sequence.
- Regulatory sequences may include activator binding sequences, enhancers, introns, polyadenylation recognition sequences, promoters, repressor binding sequences, stem-loop structures, translational initiation sequences, translation leader sequences, transcription termination sequences, translation termination sequences, primer binding sites, and the like.
- operably linked refers to polynucleotide sequences or amino acid sequences placed into a functional relationship with one another.
- a promoter or enhancer is operably linked to a coding sequence if it regulates, or contributes to the modulation of, the transcription of the coding sequence.
- Operably linked DNA sequences encoding regulatory sequences are typically contiguous to the coding sequence.
- enhancers can function when separated from a promoter by up to several kilobases or more. Accordingly, some polynucleotide elements may be operably linked but not contiguous.
- the term “expression” refers to transcription of a polynucleotide from a DNA template, resulting in, for example, an mRNA or other RNA transcript (e.g., non-coding, such as structural or scaffolding RNAs).
- the term further refers to the process through which transcribed mRNA is translated into peptides, polypeptides, or proteins.
- Transcripts and encoded polypeptides may be referred to collectively as “gene product.” Expression may include splicing the mRNA in a eukaryotic cell, if the polynucleotide is derived from genomic DNA.
- the term “modulate” refers to a change in the quantity, degree or amount of a function.
- the methods disclosed herein may modulate Cas9-mediated targeting efficiency by decreasing or eliminating off-target cleavage, thereby enhancing cleavage at the target site, or may enhance HDR and decrease the likelihood of NHEJ events.
- modulating targeting may denote increasing desired targeting events and/or inhibiting off-target cleavage.
- modulating HDR can denote increasing HDR and/or decreasing NHEJ.
- Modulation can be assayed by determining any characteristic directly or indirectly affected by the expression of the target gene. Such characteristics include, e.g., changes in targeting efficiency, RNA or protein levels, protein activity, product levels, associated gene expression, or activity level of reporter genes. Thus, “modulation” of gene expression includes both gene activation and gene repression.
- amino acid refers to natural and synthetic (unnatural) amino acids, including amino acid analogs, modified amino acids, peptidomimetics, glycine, and D or L optical isomers.
- polypeptide As used herein, the terms “peptide,” “polypeptide,” and “protein” are interchangeable and refer to polymers of amino acids.
- a polypeptide may be of any length. It may be branched or linear, it may be interrupted by non-amino acids, and it may comprise modified amino acids.
- the terms may be used to refer to an amino acid polymer that has been modified through, for example, acetylation, disulfide bond formation, glycosylation, lipidation, phosphorylation, cross-linking, and/or conjugation (e.g., with a labeling component or ligand).
- Polypeptide sequences are displayed herein in the conventional N-terminal to C-terminal orientation.
- Polypeptides and polynucleotides can be made using routine techniques in the field of molecular biology (see, e.g., standard texts discussed above). Further, essentially any polypeptide or polynucleotide can be custom ordered from commercial sources.
- binding includes a non-covalent interaction between macromolecules (e.g., between a protein and a polynucleotide, between a polynucleotide and a polynucleotide, and between a protein and a protein). Such non-covalent interaction is also referred to as “associating” or “interacting” (e.g., when a first macromolecule interacts with a second macromolecule, the first macromolecule binds to second macromolecule in a non-covalent manner). Some portions of a binding interaction may be sequence-specific; however, all components of a binding interaction do not need to be sequence-specific, such as a protein's contacts with phosphate residues in a DNA backbone. Binding interactions can be characterized by a dissociation constant (Kd). “Affinity” refers to the strength of binding. An increased binding affinity is correlated with a lower Kd. An example of non-covalent binding is hydrogen bond formation between base pairs.
- Kd dissociation constant
- isolated can refer to a nucleic acid or polypeptide that, by the hand of a human, exists apart from its native environment and is therefore not a product of nature. Isolated means substantially pure. An isolated nucleic acid or polypeptide can exist in a purified form and/or can exist in a non-native environment such as, for example, in a recombinant cell.
- a “host cell” generally refers to a biological cell.
- a cell can be the basic structural, functional and/or biological unit of a living organism.
- a cell can originate from any organism having one or more cells. Examples of host cells include, but are not limited to: a prokaryotic cell, eukaryotic cell, a bacterial cell, an archaeal cell, a cell of a single-cell eukaryotic organism, a protozoa cell, a cell from a plant (e.g.
- Botryococcus braunii Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C.
- seaweeds e.g. kelp
- a fungal cell e.g., a yeast cell, a cell from a mushroom
- an animal cell e.g fruit fly, cnidarian, echinoderm, nematode, etc.
- a cell from a vertebrate animal e.g., fish, amphibian, reptile, bird, mammal
- a cell from a mammal e.g., a pig, a cow, a goat, a sheep, a rodent, a rat, a mouse, a non-human primate, a human, etc.
- a cell can be a stem cell or progenitor cell.
- transgenic organism refers to an organism comprising a recombinantly introduced polynucleotide.
- transgenic plant cell and “transgenic plant” are interchangeable and refer to a plant cell or a plant containing a recombinantly introduced polynucleotide. Included in the term transgenic plant is the progeny (any generation) of a transgenic plant or a seed such that the progeny or seed comprises a DNA sequence encoding a recombinantly introduced polynucleotide or a fragment thereof.
- the phrase “generating a transgenic plant cell or a plant” refers to using recombinant DNA methods and techniques to construct a vector for plant transformation to transform the plant cell or the plant and to generate the transgenic plant cell or the transgenic plant.
- a CRISPR Clustered Regularly Interspaced Short Palindromic Repeats
- CRISPR loci provide resistance to foreign invaders (e.g., virus, phage) in prokaryotes.
- foreign invaders e.g., virus, phage
- the CRISPR system can be thought to function as a type of immune system to help defend prokaryotes against foreign invaders.
- CRISPR locus function integration of new sequences into the locus, biogenesis of CRISPR RNA (crRNA), and silencing of foreign invader nucleic acid.
- a CRISPR locus includes a number of short repeating sequences referred to as “repeats.” Repeats can form hairpin structures and/or repeats can be unstructured single-stranded sequences. The repeats occur in clusters. Repeats frequently diverge between species. Repeats are regularly interspaced with unique intervening sequences, referred to as “spacers,” resulting in a repeat-spacer-repeat locus architecture. Spacers are identical to or have high homology with known foreign invader sequences.
- a spacer-repeat unit encodes a crisprRNA (crRNA).
- a crRNA refers to the mature form of the spacer-repeat unit.
- a crRNA comprises a “seed” sequence that is involved in targeting a target nucleic acid (e.g., possibly as a surveillance mechanism against foreign nucleic acid).
- a seed sequence is typically located towards the 5′ end of a crRNA (e.g. in the Cascade complex; for a description of the Cascade complex see, e.g., Jore, M. M. et al., “Structural basis for CRISPR RNA-guided DNA recognition by Cascade,” Nature Structural & Molecular Biology (2011) 18:529-536) or at the 3′ end of the spacer of a crRNA (e.g., in a Type II CRISPR-Cas9 system), directly adjacent to the first stem.
- a CRISPR locus comprises polynucleotide sequences encoding for CRISPR Associated Genes (Cas) genes.
- Cas genes are involved in the biogenesis and/or the interference stages of crRNA function.
- Cas genes display extreme sequence (e.g., primary sequence) divergence between species and homologues.
- Casl homologues can comprise less than 10% primary sequence identity between homologues.
- Some Cas genes comprise homologous secondary and/or tertiary structures. For example, despite extreme sequence divergence, many members of the Cas6 family of CRISPR proteins comprise a N-terminal ferredoxin-like fold.
- Cas genes are named according to the organism from which they are derived.
- Cas genes in Staphylococcus epidermidis can be referred to as Csm-type
- Cas genes in Streptococcus thermophilus can be referred to as Csn-type
- Cas genes in Pyrococcus furiosus can be referred to as Cmr-type.
- the integration stage of a CRISPR system refers to the ability of the CRISPR locus to integrate new spacers into the crRNA array upon being infected by a foreign invader. Acquisition of the foreign invader spacers can help confer immunity to subsequent attacks by the same foreign invader. Integration typically occurs at the leader end of the CRISPR locus. Cas proteins (e.g., Cas1 and Cas2) are involved in integration of new spacer sequences. Integration proceeds similarly for some types of CRISPR systems (e.g., Type I-III).
- Mature crRNAs are processed from a longer polycistronic CRISPR locus transcript (i.e., pre-crRNA array).
- a pre-crRNA array comprises a plurality of crRNAs.
- the repeats in the pre-crRNA array are recognized by Cas genes.
- Cas genes bind to the repeats and cleave the repeats. This action can liberate the plurality of crRNAs.
- crRNAs can be subjected to further events to produce the mature crRNA form such as trimming (e.g., with an exonuclease).
- a crRNA may comprise all, some, or none of the CRISPR repeat sequence.
- Interference refers to the stage in the CRISPR system that is functionally responsible for combating infection by a foreign invader.
- CRISPR interference follows a similar mechanism to RNA interference (RNAi: e.g., wherein a target RNA is targeted (e.g., hybridized) by a short interfering RNA (siRNA)), which results in target RNA degradation and/or destabilization.
- RNAi RNA interference
- CRISPR systems perform interference of a target nucleic acid by coupling crRNAs and Cas genes, thereby forming CRISPR ribonucleoproteins (crRNPs).
- crRNA of the crRNP guides the crRNP to foreign invader nucleic acid, (e.g., by recognizing the foreign invader nucleic acid through hybridization).
- Hybridized target foreign invader nucleic acid-crRNA units are subjected to cleavage by Cas proteins.
- Target nucleic acid interference typically requires a protospacer adjacent motif (PA
- CRISPR systems There are at least four types of CRISPR systems: Type I, Type II, Type III, and Type U. More than one CRISPR type system can be found in an organism. CRISPR systems can be complementary to each other, and/or can lend functional units in trans to facilitate CRISPR locus processing. Type II systems can be further subdivided into II-A (contains Csn2 locus) and II-B (contains Cas4 locus) and Type II-C (neither Csn2 nor Cas4, e.g. N. meningitides). Modifications of the components of CRISPR-Type II systems are extensively discussed in the present specification.
- crRNA biogenesis in a Type II CRISPR system comprises a trans-activating CRISPR RNA (tracrRNA).
- a tracrRNA is typically modified by endogenous RNaseIII.
- the tracrRNA hybridizes to a crRNA repeat in the pre-crRNA array.
- Endogenous RNaselll is recruited to cleave the pre-crRNA.
- Cleaved crRNAs are subjected to exoribonuclease trimming to produce the mature crRNA form (e.g., 5′ trimming).
- the tracrRNA typically remains hybridized to the crRNA.
- the tracrRNA and the crRNA associate with a site-directed polypeptide (e.g., Cas9).
- the crRNA of the crRNA-tracrRNA-Cas9 complex can guide the complex to a target nucleic acid to which the crRNA can hybridize. Hybridization of the crRNA to the target nucleic acid activates a wild-type, cognate Cas9 for target nucleic acid cleavage.
- Target nucleic acid in a Type II CRISPR system comprises a PAM.
- a PAM is essential to facilitate binding of a site-directed polypeptide (e.g., Cas9) to a target nucleic acid.
- Cas9 is an exemplary Type II CRISPR Cas protein.
- Cas9 is an endonuclease that can be programmed by the tracrRNA/crRNA to cleave, site-specifically, target DNA using two distinct endonuclease domains (HNH and RuvC/RNase H-like domains) (see U.S. Published Patent Application No. 2014-0068797, published 6 Mar. 2014; see also Jinek M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821), one for each strand of the DNA's double helix.
- HNH and RuvC/RNase H-like domains two distinct endonuclease domains
- FIG. 3A presents a model of the domain arrangement of SpyCas9 ( S. pyogenes Cas9) relative to its primary sequence structure.
- SpyCas9 S. pyogenes Cas9
- FIG. 1A Two RNA components of a Type II CRISPR-Cas9 system are illustrated in FIG. 1A .
- each CRISPR-Cas9 system comprises a tracrRNA and a crRNA.
- this requirement can be bypassed by using an engineered sgRNA, described more fully below, containing a designed hairpin that mimics the tracrRNA-crRNA complex (Jinek et al., 2012).
- Cas9 orthologs are known in the art as well as their associated tracrRNA and crRNA components (see, e.g., “Supplementary Table S2. List of bacterial strains with identified Cas9 orthologs,” Fonfara, Ines, et al., “Phylogeny of Cas9 Determines Functional Exchangeability of Dual-RNA and Cas9 among Orthologous Type II CRISPR/Cas Systems,” Nucleic Acids Research (2014) 42:2577-2590, including all Supplemental Data; Chylinski K., et al., “Classification and evolution of type II CRISPR-Cas systems,” Nucleic Acids Research (2014) 42:6091-6105, including all Supplemental Data.); Esvelt, K.
- orthogonal Cas9 proteins for RNA-guided gene regulation and editing,” Nature Methods (2013) 10:1116-1121.
- a number of orthogonal Cas9 proteins have been identified including Cas9 proteins from Neisseria meningitidis, Streptococcus thermophilus and Staphylococcus aureus.
- a Cas9 protein refers to a Cas9 protein derived from any species, subspecies or strain of bacteria that encodes Cas9, as well as variants and orthologs of the particular Cas9 in question.
- the Cas9 proteins can either be directly isolated and purified from bacteria, or synthetically or recombinantly produced, or typically delivered using a construct encoding the protein, including without limitation, naked DNA, plasmid DNA, a viral vector and mRNA for Cas9 expression.
- Cas9 polypeptides e.g., SEQ ID NO: 1-256, SEQ ID NOS: 795-1346, therein
- SEQ ID NO: 8 sequence of Cas9 from S. pyogenes
- Modifications and variants of Cas9 proteins are also discussed (e.g., ⁇ 504-608, therein).
- Non-limiting examples of Cas9 proteins include Cas9 proteins from S.
- pyogenes (GI:15675041); Listeria innocua Clip 11262 (GI:16801805); Streptococcus mutans UA159 (GI:24379809); Streptococcus thermophilus LMD-9 ( S. thermophilus A, GI:11662823; S. thermophilus B, GI:116627542); Lactobacillus buchneri NRRL B-30929 (GI:331702228); Treponema denticola ATCC 35405 (GI:42525843); Francisella novicida U112 (GI:118497352); Campylobacter jejuni subsp.
- Jejuni NCTC 11168 (GI:218563121); Pasteurella multocida subsp. multocida str. Pm70 (GI:218767588); Neisseria meningitidis Zs491 (GI:15602992) and Actinomyces naeslundii (GI:489880078).
- aspects of the present invention can be practiced by one of ordinary skill in the art following the guidance of the specification to use Type II CRISPR-Cas9 proteins and Cas-protein encoding polynucleotides, including, but not limited to Cas9, Cas9-like, proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof.
- the cognate RNA components of these Cas proteins can be manipulated and modified for use in the practice of the present invention by one of ordinary skill in the art following the guidance of the present specification.
- dCas9 is meant a nuclease-deactivated Cas9, also termed “catalytically inactive”, “catalytically dead Cas9” or “dead Cas9.” Such molecules lack all or a portion of endonuclease activity and can therefore be used to regulate genes in an RNA-guided manner (Jinek M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821).
- Targeting specificity is determined by complementary base-pairing of an sgRNA to the genomic locus and the protospacer adjacent motif (PAM).
- dCas9 can be used alone or in fusions to synthetically repress (CRISPRi) or activate (CRISPRa) gene expression.
- CRISPRi can work independently of host cellular machineries.
- only a dCas9 protein and a customized sgRNA designed with a complementary region to any gene of interest direct dCas9 to a chosen genomic location.
- dCas9 can be fused to a transcription factor, such as a repressor, and the fused Cas9-transcription factor can then work in concert with cellular machineries.
- the binding specificity is determined jointly by the complementary region on the sgRNA and a short DNA motif (protospacer adjacent motif or PAM) juxtaposed to the DNA complementary region, dependent on the species in question.
- PAM protospacer adjacent motif
- this sequence is NGG.
- dCas9 can be used by itself (whereby it represses transcription through steric hindrance). Taken together sgRNA and dCas9 provide a minimum system for gene-specific regulation in any organism.
- CRISPRa is carried out by dCas9-transcription factor (activator) fusions.
- Cas9 nickase is meant a Cas9 mutant that does not retain the ability to make double-stranded breaks in a target nucleic acid sequence, but maintains the ability to bind to and make a single-stranded break at a target site.
- Such a mutant will typically include a mutation in one, but not both of the Cas9 endonuclease domains (HNH and RuvC).
- HNH and RuvC Cas9 endonuclease domains
- an amino acid mutation at position D10A or H840A in Cas9, numbered relative to S. pyogenes can result in the inactivation of the nuclease catalytic activity and convert Cas9 to a nickase enzyme that makes single-stranded breaks at the target site.
- site-directed polypeptides such as meganucleases, TALE nucleases, Zinc-finger nucleases, MEGA-TALs and others known to one of skill in the art can be
- crRNA has a region of complementarity to a potential DNA target sequence ( FIG. 1A , the dark, 5′ region of the crRNA) and a second region that forms base-pair hydrogen bonds with the tracrRNA to form a secondary structure, typically to form at least a stem structure ( FIG. 1A , the light region extending to the 3′ end of the crRNA).
- the region of complementarity to the DNA target is the spacer.
- the tracrRNA and a crRNA interact through a number of base-pair hydrogen bonds to form secondary RNA structures, for example, as illustrated in FIG. 1B .
- tracrRNA/crRNA and Cas protein results in conformational change of the Cas protein that facilitates binding to DNA, endonuclease activities of the Cas protein, and crRNA-guided site-specific DNA cleavage by the endonuclease.
- the DNA target sequence is adjacent to a cognate protospacer adjacent motif (PAM).
- sgRNA typically refers to a single-guide RNA (i.e., a single, contiguous polynucleotide sequence) that essentially comprises a crRNA connected at its 3′ end to the 5′ end of a tracrRNA through a “loop” sequence (see, e.g., U.S. Published Patent Application No. 20140068797, published 6 Mar. 2014, incorporated herein by reference in its entirety).
- sgRNA interacts with a cognate Cas protein essentially as described for tracrRNA/crRNA polynucleotides, as discussed above. Similar to crRNA, sgRNA has a spacer, a region of complementarity to a potential DNA target sequence ( FIG.
- RNAs truncated single-guide RNAs (tru-sgRNAs) of approximately 17-18 nt. (See, e.g., Fu, Y. et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. (2014) 32:279-284).
- the term also encompasses functional miniature sgRNAs with expendable features removed, but that retain an essential and conserved module termed the “nexus” located in the portion of sgRNA that corresponds to tracrRNA (not crRNA). See, e.g, U.S. Published Patent Application No. 20140315985, published 23 Oct. 2014, incorporated herein by reference in its entirety; Briner et al., “Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality,” Molecular Cell (2014) 56:333-339.
- the nexus is located immediately downstream of (i.e., located in the 3′ direction from) the lower stem in Type II CRISPR-Cas9 systems.
- the nexus confers the binding of a sgRNA or a tracrRNA to its cognate Cas9 protein and confers an apoenzyme to haloenzyme conformational transition.
- a “spacer” or “spacer element” as used herein refers to the polynucleotide sequence that can specifically hybridize to a target nucleic acid sequence.
- the spacer element interacts with the target nucleic acid sequence through hydrogen bonding between complementary base pairs (i.e., paired bases).
- a spacer element binds to a selected DNA target sequence. Accordingly, the spacer element is a DNA target-binding sequence.
- the spacer element determines the location of Cas protein's site-specific binding and endonucleolytic cleavage.
- Spacer elements range from ⁇ 17- to ⁇ 84 nucleotides in length, depending on the Cas protein with which they are associated, and have an average length of 36 nucleotides (Marraffini, et al., “CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea,” Nature reviews Genetics (2010) 11:181-190).
- the spacer element typically comprises a “seed” sequence that is involved in targeting a target nucleic acid.
- the functional length for a spacer to direct specific cleavage is typically about 12-25 nucleotides. Variability of the functional length for a spacer element is known in the art (e.g., U.S. Published Patent Application No. 20140315985, published 23 Oct. 2014, incorporated herein by reference in its entirety).
- FIG. 3A provides a three-dimensional model based on the crystal structure of S. pyogenes Cas9 (SpyCas9) in an active complex with sgRNA.
- SpyCas9 S. pyogenes Cas9
- the relationship of the sgRNA to the Helical domain and the Catalytic domain is illustrated.
- the 3′ and 5′ ends of the sgRNA are indicated, as well as exposed portions of the sgRNA.
- the spacer RNA of the sgRNA is not visible because it is surrounded by the ⁇ -Helical lobe (Helical domain) and the Catalytic nuclease lobe (Catalytic domain).
- the spacer RNA of the sgRNA is located in the 5′ end region of the sgRNA.
- the RuvC and HNH nuclease domains when active, each cut a different DNA strand in target DNA.
- the C-terminal domain (CTD) is involved in recognition of proto
- Fonfara, et al. (“Phylogeny of Cas9 Determines Functional Exchangeability of Dual-RNA and Cas9 among Orthologous Type II CRISPR/Cas Systems,” Nucleic Acids Research (2014) 42:2577-2590, including all Supplemental Data, in particular Supplemental Figure S11) present the crRNA/tracrRNA sequences and secondary structures of eight Type II CRISPR-Cas9 systems.
- guide polynucleotide any polynucleotide that site-specifically guides Cas9 or dCas9 to a target, or off-target, nucleic acid.
- guide polynucleotides are known, such as but not limited to sgRNA (including miniature and truncated sgRNAs), dual-guide RNA, including but not limited to, crRNA/tracrRNA molecules, as described above, and the like.
- donor polynucleotide is meant a polynucleotide that can be directed to, and inserted into a target site of interest to modify the target nucleic acid. All or a portion of the donor polynucleotide can be inserted into the target nucleic acid.
- the donor polynucleotide is used for repair of the break in the target DNA sequence resulting in the transfer of genetic information (i.e., polynucleotide sequences) from the donor at the site or in close proximity of the break in the DNA. Accordingly, new genetic information (i.e., polynucleotide sequences) may be inserted or copied at a target DNA site.
- the donor polynucleotide can be double- or single-stranded DNA, RNA, a vector, plasmid, or the like.
- Non-symmetrical polynucleotide donors can also be used that are composed of two DNA oligonucleotides. They are partially complementary, and each can include a flanking region of homology.
- the donor can be used to insert or replace polynucleotide sequences in a target sequence, for example, to introduce a polynucleotide that encodes a protein or functional RNA (e.g., siRNA), to introduce a protein tag, to modify a regulatory sequence of a gene, or to introduce a regulatory sequence to a gene (e.g.
- a promoter e.g., an enhancer, an internal ribosome entry sequence, a start codon, a stop codon, a localization signal, or polyadenylation signal
- modify a nucleic acid sequence e.g., introduce a mutation
- Targeted DNA modifications using donor polynucleotides for large changes traditionally use plasmid-based donor templates that contain homology arms flanking the site of alteration. Each arm can vary in length, but is typically longer than about 100 bp, such as 100-1500 bp, e.g., 100 . . . 200 . . . 300 . . . 400 . . . 500 . . . 600 . . . 700 . . . 800 . . 900 . . . 1000 . . . 1500 bp or any integer between these values.
- these numbers can vary, depending on the size of the donor polynucleotide and the target polynucleotide.
- This method can be used to generate large modifications, including insertion of reporter genes such as fluorescent proteins or antibiotic resistance markers.
- reporter genes such as fluorescent proteins or antibiotic resistance markers.
- For transfection in cells, such as HEK cells approximately 100-1000 ng, e.g., 100 . . . 200 . . . 300 . . . 400 . . . 500 . . . 600 . . . 700 . . . 800 . . . 900 . . .
- a typical size donor plasmid e.g., approximately 5 kb
- a sgRNA/Cas9 vector can be used for one well in 24-well plate.
- Single-stranded and partially double-stranded oligonucleotides have been used in place of targeting plasmids for short modifications (e.g., less than 50 bp) within a defined locus without cloning.
- single-stranded oligonucleotides containing flanking sequences on each side that are homologous to the target region can be used, and can be oriented in either the sense or antisense direction relative to the target locus.
- the length of each arm can vary in length, but the length of at least one arm is typically longer than about 10 bases, such as from 10-150 bases, e.g., 10 . . . 20 . . .
- the length of at least one arm is 10 bases or more. In other embodiments, the length of at least one arm is 20 bases or more. In yet other embodiments, the length of at least one arm is 30 bases or more.
- the length of at least one arm is less than 100 bases. In further embodiments, the length of at least one arm is greater than 100 bases. In some embodiments, the length of at least one arm is zero bases.
- typically an oligonucleotide with around 100-150 bp total homology is used. The mutation is introduced in the middle, giving 50-75 bp homology arms for a donor designed to be symmetrical about the target site. In other cases, no homology arms are required, and the donor polynucleotide is inserted using non-homologous DNA repair mechanisms.
- the methods described herein are useful for increasing Cas9-mediated engineering efficiency by modulating off-target genome editing events, e.g., by decreasing the number of double-stranded breaks in DNA in unintended and/or incorrect locations.
- genome engineering systems such as those using zinc-finger nucleases (ZFNs), TALE-nucleases, and bacterially derived RNA-guided nucleases (e.g., the CRISPR-Cas9 system), have been used to target a protein to a specific genomic locus where it can induce a DNA double-stranded break.
- DNA double-stranded breaks can be repaired through either non homologous end joining (NHEJ) or homology-directed repair (HDR).
- NHEJ non homologous end joining
- HDR homology-directed repair
- Methods for increasing specificity and/or reducing off-target genomic events have included the use of shorter guide sequences with enhanced specificity (Fu, Y. et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. (2014) 32:279-284) and/or engineering Cas9 mutants that can use two independent targeting events to induce a double-stranded break (Ran, F. A, et al., “Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity,” Cell (2013) 154:1380-1389; Tsai, S.
- an embodiment of the present invention provides methods to mitigate off-target genome editing events in a cell population or in an in vitro biochemical reaction. Mitigation of such events can be performed by an engineered CRISPR-Cas9 system as described herein.
- the methods include at least two basic components: (1) a complex of a catalytically active Cas9 protein and a sgRNA that targets the intended nucleic acid target (sgRNA/Cas9 complex); and (2) a complex of a catalytically inactive Cas9 protein, termedCas9” herein and a sgRNA that targets off-target loci (sgRNA/dCas9 complex).
- the first component can be any site-directed catalytically active DNA endonuclease, such as but not limited to zinc-finger nucleases (ZFNs), TALE-nucleases, and the like.
- ZFNs zinc-finger nucleases
- TALE-nucleases and the like.
- An off-target nucleic acid can differ from a target nucleic acid by, e.g., at least 1-5, such as 1, 2, 3, 4, 5 nucleotides, or up to 10 or more nucleotides or any number of nucleotides within the stated ranges.
- the percent complementarity between an off-target nucleic acid locus (or surrounding genomic region) and an “on-target” nucleic acid-targeting nucleic acid can be, for example about 5% to about 100%, or any percentage between this range, more preferably in the range of 90-100%.
- Cas9 proteins A number of catalytically active Cas9 proteins are known in the art and, as explained above, a Cas9 protein for use herein can be derived from any bacterial species, subspecies or strain that encodes the same. Although the subject invention is exemplified using S. pyogenes Cas9, orthologs from other bacterial species will find use herein. The specificity of these Cas9 orthologs is well known. Also useful are proteins encoded by Cas9-like synthetic proteins, and variants and modifications thereof. As explained above, the sequences for hundreds of Cas9 proteins are known and any of these proteins will find use with the present methods. The appropriate Cas9 protein to use with a particular target nucleic acid can be readily determined by one of skill in the art.
- dCas9 proteins are also known and, as described above, these proteins can be made catalytically inactive by mutating the RuvC1 and/or HNH domains to eliminate nuclease function. This is typically accomplished by introducing point mutations in both of the two catalytic residues (D10A and H840A, numbered relative to S. pyogenes Cas9) of the gene encoding Cas9. In doing so, dCas9 is rendered unable to cleave double-stranded DNA but retains the ability to target DNA. Moreover, as with the Cas9 proteins, the dCas9 proteins can be derived from any bacterial species, subspecies or strain that encodes the same.
- dCas9 orthologs are selected based on the particular protospacer adjacent motif (PAM) sequences present on the target nucleic acid.
- PAM protospacer adjacent motif
- S. pyogenes Cas9 targets NGG sequences.
- dCas9 orthologs can be used to target these sequences to block Cas9 cleavage thereof and prevent off-target breaks.
- sgRNA is used as an exemplary guide polynucleotide, however, it will be recognized by one of skill in the art that other guide polynucleotides that site-specifically guide Cas9 or dCas9 to a target, or off-target, nucleic acid can be used.
- the sgRNA component of the complexes is responsible for targeting a particular nucleic acid target.
- the spacer region of the sgRNA includes the region of complementarity to the targeted nucleic acid sequence.
- the spacer is the polynucleotide sequence that can specifically hybridize to a target nucleic acid sequence.
- the spacer element interacts with the target nucleic acid sequence through hydrogen bonding between complementary base pairs.
- a spacer element binds to a selected nucleic acid target sequence. Accordingly, the spacer element is the DNA target-binding sequence.
- binding specificity is determined by both sgRNA-DNA base pairing and the PAM sequence juxtaposed to the DNA complementary region.
- a sgRNA/dCas9 complex is targeted to genomic loci similarly targeted by catalytically intact sgRNA/Cas9 complexes, and can stably bind DNA and subsequently block activity of proteins targeted to those loci.
- dCas9 can robustly impair binding and/or activity of endogenous transcription factors in eukaryotic cells.
- a sgRNA, complexed with Cas9 is directed to a genomic locus of interest to induce double-stranded breaks.
- the binding specificity is determined by both sgRNA-DNA base pairing and the PAM sequence juxtaposed to the DNA complementary region.
- Computational and/or experimental methods e.g., sequencing, in silico DNA alignment methods can be used to ascertain off-target nuclease activity (e.g., to determine the off-target loci). Such methods are described in detail below.
- Independently acting dCas9 proteins can be designed to target these off-target loci. These engineered dCas9 proteins can be deployed as site-specific nuclease “blockers” to obstruct catalytically intact sgRNA/Cas9 binding and nuclease activity.
- sgRNA/Cas9 and sgRNA/dCas9 blockers may be introduced, for example into a cell or tissue, at differing concentrations.
- sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced ata ratio of 1:2, 1:3, 1:4, 1:5, 1:6, 1:7, 1:8, 1:9, 1:10, 10:1, 9:1, 8:1, 7:1, 6:1, 5:1, 4:1, 3:1, or 2:1.
- all of these components i.e., sgRNA, Cas9, dCas9, etc.
- sgRNA components will complex with Cas9 and dCas9 when provided to a cell.
- cell lines such as but not limited to HEK293 cells, are commercially available that constitutively express S. pyogenes Cas9 as well as S. pyogenes Cas9-GFP fusions. In this instance, cells can be transfected without catalytically active Cas9 as such is provided by the host cell.
- sgRNA/Cas9 and sgRNA/dCas9 complexes may be introduced at differing time points.
- sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced at least 1 minute apart, 5 minutes apart, 10 minutes apart, 30 minutes apart, 1 hour apart, 5 hours apart, or 15 hours apart or more.
- sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced at most 1 minute apart, 5 minutes apart, 10 minutes apart, 30 minutes apart, 1 hour apart, 5 hours apart, or 15 hours apart or more.
- sgRNA/Cas9 complexes can be introduced before the sgRNA/dCas9 complexes.
- sgRNA/Cas9 complexes can be introduced after the sgRNA/dCas9 complexes.
- sgRNA/Cas9 complexes and sgRNA/dCas9 complexes may be differentially regulated (i.e. differentially expressed or stabilized) via exogenously supplied agents (e.g. inducible DNA promoters or inducible Cas9 proteins).
- sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced into a cell by a variety of means including transfection, transduction, electroporation, micelles and liposome delivery, lipid nanoparticles, viral delivery, protein injection, and the like, described more fully below.
- sgRNA/dCas9 complexes may be directed to genomic loci that partially overlap.
- these complexes can be directed to loci that overlap by at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- These complexes can be directed to loci that overlap by at most 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- sgRNA/dCas9 complexes can be directed adjacent to sites of observed off-target nuclease activity and Cas9 binding.
- these complexes can be directed to sites that are adjacent to a site of observed off-target activity by at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- Complexes can be directed to sites that are adjacent to a site of observed off-target activity by at most 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- sgRNA/dCas9 complexes may be used to “tile” a given locus for maximum nuclease blocking activity. In some instances, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more sgRNA/dCas9 complexes are used. In some instances, at most 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more complexes are used.
- the complexes can cover a locus. Complexes can cover at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100% of a locus. The complexes can cover at most 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100% of a locus.
- the blockers can reduce off-targeting binding of the active complexes by at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100%.
- the blockers can reduce off-targeting binding of the active complexes by at most 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100%.
- a sgRNA/dCas9 complex can reduce binding of a sgRNA/Cas9 complex to an off-target nucleic acid by any mechanism.
- the sgRNA/dCas9 complex can compete with the catalytically active complex for binding the off-target nucleic acid.
- the sgRNA/dCas9 complex can bind to the off-target nucleic acid, thereby creating steric hindrance for the sgRNA/Cas9 complex that prevents binding of the sgRNA/Cas9 complex to the off-target nucleic acid.
- the sgRNA/dCas9 complex can displace the sgRNA/Cas9 complex from the off-target nucleic acid.
- the sgRNA/dCas9 complex can inhibit the sgRNA/Cas9 complex from binding the off-target nucleic acid.
- the sgRNA/dCas9 complex can block the sgRNA/Cas9 complex from binding the off-target nucleic acid.
- a sgRNA/dCas9 complex can reduce off-target nucleic acid binding, cleavage and/or modification by a sgRNA/Cas9 complex by 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or any value within this range.
- a sgRNA/dCas9 complex can increase site-specific binding, and/or modification by 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or any value within this range.
- Computational methods for determining off-target nuclease activity with any of the methods described herein can comprise scanning the genomic sequence of a subject.
- the genomic sequence can be segmented in silico into a plurality of nucleic acid sequences.
- the segmented nucleic acid sequences can be aligned with the nucleic acid-targeting nucleic acid sequence.
- a sequence search algorithm can determine one or more off-target nucleic acid sequences by identifying segmented genomic sequences with alignments comprising a defined number of base-pair mismatches with the nucleic acid-targeting nucleic acid.
- the number of base-pair mismatches between a genomic sequence and a nucleic acid-targeting nucleic acid selected by an algorithm can be user-defined, for example, the algorithm can be programmed to identify off-target sequences with mismatches of up to five base pairs between the genomic sequence and the nucleic acid-targeting nucleic acid.
- In silico binding algorithms can be used to calculate binding and/or cleavage efficiency of each predicted off-target nucleic acid sequence by a site-directed polypeptide using a weighting scheme. These data can be used to calculate off-target activity for a given nucleic acid-targeting nucleic acid and/or site-directed polypeptide.
- Off-target binding activity can be determined by experimental methods.
- the experimental methods can comprise sequencing a nucleic acid sample contacted by a complex comprising a site-directed polypeptide and a nucleic acid-targeting nucleic acid.
- the contacted nucleic acid sample can be fixed or crosslinked to stabilize the protein-RNA-DNA complex.
- the complex comprising the site-directed polypeptide, the nucleic acid (e.g., target nucleic acid, off-target nucleic acid), and/or the nucleic acid-targeting nucleic acid can be captured from the nucleic acid sample with an affinity tag and/or capture agents. Nucleic acid purification techniques can be used to separate the target nucleic acid from the complex.
- Nucleic acid purification techniques can include spin column separation, precipitation, and electrophoresis.
- the nucleic acid can be prepared for sequencing analysis by shearing and ligation of adaptors. Preparation for sequencing analysis can include the generation of sequencing libraries of the eluted target nucleic acid.
- Sequence determination methods can include but are not limited to pyrosequencing (for example, as commercialized by 454 Life Sciences, Inc., Branford, Conn.); sequencing by ligation (for example, as commercialized in the SOLiDTM technology, Life Technology, Inc., Carlsbad, Calif.); sequencing by synthesis using modified nucleotides (such as commercialized in TruSeqTM and HiSegTM technology by Illumina, Inc., San Diego, Calif., HeliScopeTM by Helicos Biosciences Corporation, Cambridge, Mass., and PacBio RS by Pacific Biosciences of California, Inc., Menlo Park, Calif.), sequencing by ion detection technologies (Ion Torrent, Inc., South San Francisco, Calif.); sequencing of DNA nanoballs (Complete Genomics, Inc., Mountain View, Calif.); nanopore-based sequencing technologies (for example, as developed by Oxford Nanopore Technologies, LTD, Oxford, UK), capillary sequencing (e.g, such as commercialized in MegaBACE by Molecular Dynamics, Inc., Sunnyval
- sequencing is performed by microarray analysis, such as in SNP genotyping by binding.
- Sequencing analysis can determine the identity and frequency of an off-target binding site for a given nucleic acid-targeting nucleic acid, by counting the number of times a particular binding site is read.
- the library of sequenced nucleic acids can include target nucleic acids and off-target nucleic acids.
- Off-target binding activity can be determined by additional experimental methods.
- the experimental methods can comprise inserting a donor oligonucleotide into a cleaved site (Tsai, S. Q. et al., “GUIDE-seq enables genome wide profiling of off-target cleavage by CRISPR-Cas nucleases” Nature Biotech. (2015) 33:187-197).
- the genomic DNA is then fragmented, adapters are appended, and PCR is performed with primers complementary to the donor oligonucleotide and adapter sequences.
- the amplified sequences are sequenced and then mapped back to a reference genome.
- Cas9 and/or dCas9 proteins may be modified or fused to additional protein domains.
- the fused additional protein domains may enhance the ability to block, impair, or inactivate active Cas9 complexes.
- fusion proteins including a Cas9 or dCas9 protein include, but are not limited to a nuclease, a transposase, a methylase, a transcription factor repressor or activator domain (e.g., such as KRAB and VP16), co-repressor and co-activator domains, DNA methyl transferases, histone acetyltransferases, histone deacetylases, and DNA cleavage domains (e.g., a cleavage domain from the endonuclease FokI).
- a non-native sequence can confer new functions to the fusion protein.
- functions include, but are not limited to the following: methyltransferase activity, demethylase activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, glycosylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, sumoylating activity, desumoylating activity, ribosylation activity, deribosylation activity, myristoylation activity, remodelling activity, protease activity, oxidoreduct
- a donor polynucleotide is inserted into the target nucleic acid, when the target nucleic acid is cleaved.
- the methods can, for example, therefore be used to modify genomic DNA in a eukaryotic cell isolated from an organism. Further, the methods can also comprise contacting the nucleic acid target sequence in the genomic DNA with a donor polynucleotide wherein the modification comprises that at least a portion of the donor polynucleotide is integrated at the nucleic acid target sequence.
- Donor polynucleotide insertion can be performed by the homologous recombination machinery of the cell.
- the donor polynucleotide may comprise homology arms that are partially or fully complementary to the regions of the target nucleic acid outside of the break point.
- Donor polynucleotide insertion can also be performed by non-homologous DNA repair machinery of the cell, where no homology arms are required. A discussion of donor polynucleotides is presented more fully below.
- the donor polynucleotide can be tethered to the sgRNA/dCas9 complex to position it near the cleavage site targeted by the active sgRNA/Cas9 complex. See, FIG. 7A . In this way, homology directed repair, as described below, can be achieved at higher rates.
- FIG. 4 depicts an example of undesirable off-target binding and cleavage of a nuclease during genome engineering.
- a target nucleic acid 115 can be contacted with a complex comprising a site-directed polypeptide (e.g., Cas9) 105 and a nucleic acid-targeting nucleic acid (e.g., a sgRNA) 110 .
- the complex comprising the Cas9 105 and sgRNA 110 can bind to a target nucleic acid 120 .
- the complex comprising the Cas9 105 and sgRNA 110 can bind to an off-target nucleic acid 125 .
- the Cas9 of the complex can cleave 135 the target nucleic acid 120 and the off-target nucleic acid 125 , thereby generating off-target effects.
- FIG. 5 depicts an exemplary embodiment of reducing off-target binding and cleavage events using dCas9 blockers.
- a target nucleic acid 215 can be contacted with a complex comprising a site-directed polypeptide (e.g., Cas9) 205 and a nucleic acid-targeting nucleic acid (e.g., single-guide RNA) 210 .
- the complex comprising the Cas9 205 and sgRNA 210 can bind to a target nucleic acid 220 .
- the complex comprising the Cas9 205 and sgRNA 210 can bind to an off-target nucleic acid 225 .
- Complexes comprising an engineered dCas9 protein 235 and an engineered sgRNA 236 can be introduced and contacted 230 with the target nucleic acid.
- the dCas9 complexes can either displace or prevent the binding of complexes comprising active Cas9 205 .
- the active Cas9 205 can cleave 240 / 245 the target nucleic acid 220 .
- the active Cas9 205 may not cleave the off-target nucleic acid 225 because the dCas9 235 is preventing its binding and cleavage. In this way, off-target binding and cleavage may be prevented.
- the invention is directed to a method for increasing the efficiency of nucleic acid insertion by HDR or non-homologous repair mechanisms.
- multiple repair pathways can compete at site-directed DNA breaks. Such breaks can be repaired through, for example, non-homologous end-joining (NHEJ) or homology-directed repair (HDR).
- NHEJ non-homologous end-joining
- HDR homology-directed repair
- HDR relies on the presence of a donor polynucleotide, a piece of DNA that shares homology with sequences at or near a DNA break, that can be used to repair DNA breaks.
- a donor polynucleotide a piece of DNA that shares homology with sequences at or near a DNA break, that can be used to repair DNA breaks.
- the present invention provides for methods for using site-directed polypeptides (e.g., Cas9 nucleases) to create a substrate that will engage an alternative HDR pathway, similar to the single-strand annealing (SSA) branch of HDR, and will prevent competing DNA repair pathways, such as NHEJ, from repairing the break.
- site-directed polypeptides e.g., Cas9 nucleases
- Single-strand annealing is a process that is initiated when a break is introduced between two repetitive sequences oriented in the same direction.
- Four steps are generally necessary for the repair of breaks by SSA: (1) an end resection step which extends the repeated sequences and forms long 3′-ssDNA; (2) an annealing step in which the two repetitive sequences are annealed together forming a flap structure; (3) a second resection step in which the flap structures formed by the regions between the repeats are resected and; (4) ligation of the ends.
- HDR at DNA nicks occurs via a mechanism sometimes termed “alternative-HDR” that shares many of the same genetic dependencies of SSA such as inhibition by RAD51 and BRCA2.
- the inventors herein have developed an engineered CRISPR system by generating at least two single-stranded nicks on the same strand of a target double-stranded nucleic acid and providing a donor polynucleotide that can anneal to the non-nicked strand. This results in the accurate insertion of exogenous DNA with little background mutagenic end-joining.
- This method employs tandem Cas9 molecules that comprise one or more mutations that convert the catalytically active Cas9 molecules into nickases.
- the nickases are targeted to specific sites using sgRNAs designed to target two sites on the same strand in a double-stranded target nucleic acid, to generate two nicks (i.e., single-stranded breaks) on the targeted strand.
- any Cas9 molecule can be used, as described in detail above, so long as the Cas9 functions as a nickase. In some embodiments, this can be accomplished by introducing a point mutation in either of the two catalytic residues (D10A and H840A, numbered relative to S. pyogenes Cas9) of the gene encoding Cas9.
- An amino acid mutation at either position in Cas9 results in the inactivation of the nuclease catalytic activity and converts Cas9 to a nickase enzyme that makes single-stranded breaks at the target sites.
- the Cas9 double mutant with changes at amino acid positions D10A and H840A however, completely inactivates both the nuclease and nickase activities.
- Targeting specificity is determined by complementary base-pairing of a sgRNA to the genomic loci which include PAM sequences adjacent thereto.
- the nickases can comprise any mutation that enables the Cas9 to cleave only one strand of a double-stranded target nucleic acid.
- the Cas9 e.g., Cas9 from S. pyogenes
- the Cas9 can comprise a D10A mutation in one of its nuclease domains, or in a corresponding residue in an orthologous Cas9 to render the molecule a nickase.
- the Cas9 e.g., Cas9 from S. pyogenes
- Cas9 proteins are known and the Cas9 proteins can be derived from any bacterial species, subspecies or strain that encodes the same. Also useful are proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof.
- Cas9 orthologs are selected based on the particular protospacer adjacent motif (PAM) sequences present on the target nucleic acid.
- PAM protospacer adjacent motif
- S. pyogenes Cas9 targets NGG sequences.
- One of skill in the art can readily determine the particular Cas9 to mutate based on the particular specificity desired.
- both nickases used in the present methods should be paired such that nicks occur on the same strand.
- both nickases used can include a D10A mutation, or both can include a H840A mutation.
- One nickase can be a S. pyogenes Cas9 nickase and the other can be a nickase that targets a PAM with a different adjacent sequence than targeted by the S. pyogenes Cas9 nickase, such as a nickase designed from an orthologous Cas9 protein, so long as the same strand is nicked.
- the appropriate nickases for use in the present methods are therefore based on the nucleic acid target sequence and on a determination of PAM-adjacent sequences present at the desired cleavage sites. In this way, the method provides flexibility for single-stranded cleavage of the target nucleic acid.
- the nickases can cleave the sense strand of the double-stranded target nucleic acid or the anti-sense strand of the double-stranded target nucleic acid (e.g., DNA).
- the nickases can both cleave the same strand of the double-stranded target nucleic acid.
- the two nickases can be designed to cleave at a distance of at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 500, 1000, or 5000 or more bases away from each other.
- the two nickases can be designed to cleave at a distance of at most 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 500, 1000, or 5000 or more bases away from each other.
- the distance between the nicks will determine the length of the donor polynucleotide to be provided for insertion.
- a donor polynucleotide can be directed to, and inserted into a target site of interest to modify the target nucleic acid.
- Targeted DNA modifications using donor polynucleotides for large changes traditionally use plasmid-based donor templates that contain homology arms flanking the site of alteration. Each arm can vary in length, but is typically longer than about 200 bp for large insertion, the size of the arms depending on the size of the donor polynucleotide and the target polynucleotide, as explained in detail above.
- single-stranded oligonucleotides such as DNA oligonucleotides, partially double-stranded olignucleotides, nicked double-stranded donors, and the like, can be used in place of targeting plasmids.
- single-stranded oligonucleotides containing flanking sequences with homology in proximity to each nick can be used, and can be oriented in either the sense or antisense direction relative to the target locus.
- typically an oligonucleotide with around 100-150 bp total homology is used.
- Non-symmetrical polynucleotide donors can also be used that are composed of two DNA oligonucleotides. They are partially complementary, and each includes a flanking region of homology.
- the donor polynucleotide can have at least one arm with approximately 10 bases of homology to the target sequence.
- the donor polynucleotide can have at least one arm with less than 100 bases of homology to the target sequence.
- the donor can have more than 100 bases of homology to the target sequence.
- the donor can have homology arms of the same length.
- the donor can have homology arms of different lengths. In some cases, at least one of the homology arms is of zero length.
- a donor polynucleotide can be designed to anneal to the single-stranded gap that results from the nicks made by the two nickases.
- the donor polynucleotide can additionally comprise regions of homology with the sequences outside the breaks.
- the size of the regions of homology will be determined by the size of the target polynucleotide and can be at least 5, 10, 15, 20, 25, 30, 35 or more nucleotides in length, the size depending on the size of the donor polynucleotide and the target nucleic acid.
- the regions of homology can be at most 5, 10, 15, 20, 25, 30, 35 or more nucleotides in length.
- the donor polynucleotide can be single-stranded.
- the single-stranded donor polynucleotide can be inserted into the break created by the two tandem nickases.
- FIG. 6 depicts an exemplary embodiment of the present methods.
- two Cas9 D10A nickases are used in tandem to excise a single-stranded region of DNA on the same strand of a target double-stranded nucleic acid.
- two Cas9 nickases in this case S. pyogenes Cas9 nickases with D10A mutations in the HNH endonuclease domain
- S. pyogenes Cas9 nickases with D10A mutations in the HNH endonuclease domain are targeted to two spaced-apart positions on the sense strand of a target polynucleotide using two sgRNA/Cas9 nickase complexes.
- Targeting is accomplished using a spacer sequence present in the sgRNA that has been designed to specifically target a complementary region of in the target nucleic acid sequence. Binding specificity is determined by both sgRNA-DNA base pairing and the PAM, in this case, NGG, juxtaposed to the DNA complementary region (see, e.g., Mojica F. J. et al., “Short motif sequences determine the targets of the prokaryotic CRISPR defence system” Microbiology (2009) 155:733-740; Shah S. A. et al., “Protospacer recognition motifs: mixed identities and functional diversity” RNA Biology (2013) 10:891-899; Jinek M.
- the invention is directed to additional methods for increasing HDR.
- the current methodology for introducing a desired change into a gene includes transfecting, electroporating, or microinjecting a site-specific endonuclease and donor molecules into a cell or embryo and using passive diffusion to locate the donor molecules throughout the nucleus (Lin, S. et al. “Enhanced homology-directed human genome engineering by controlled timing of CRISPR/Cas9 delivery,” eLife (2014) Dec; doi: 10.7554/eLife.04766).
- this method of HDR typically has low efficiency.
- the methods described below position the donor molecule near the cut site to increase HDR efficiency.
- one or more sgRNA/dCas9 complexes are used, along with a catalytically active sgRNA/Cas9 complex.
- the one or more sgRNA/dCas9 complexes include a polynucleotide donor associated therewith to position the donor polynucleotide near a target site in order to increase HDR efficiency.
- the tethered dCas9 can position the donor molecule in an orientation that will increase the likelihood that the donor molecule will be incorporated into the target site through HDR, thereby introducing a desired change to the target sequence.
- the donor polynucleotide can be double- or single-stranded DNA, RNA, a vector, plasmid, or the like and can be used to transfer genetic information (i.e., polynucleotide sequences) from the donor at the site of the break in the target nucleic acid.
- the donor can be used to insert or replace polynucleotide sequences in a target sequence, for example, to introduce a polynucleotide that encodes a protein or functional RNA (e.g., siRNA), to introduce a protein tag, to modify a regulatory sequence of a gene, or to introduce a regulatory sequence to a gene (e.g.
- a promoter e.g., an enhancer, an internal ribosome entry sequence, a start codon, a stop codon, a localization signal, or polyadenylation signal
- modify a nucleic acid sequence e.g., introduce a mutation
- a single sgRNA/dCas9 complex can be used with the associated donor, as shown in FIG. 7A .
- two such complexes can be used to position the donor across the cut site as shown in FIG. 7B .
- the dCas9 and Cas9 molecules and guide polynucleotides used in the complexes can be any of those as described above.
- the complex can target nucleic acid either upstream or downstream of the nucleic acid targeted by the catalytically active sgRNA/Cas9 complex.
- a donor polynucleotide is associated with the sgRNA/dCas9 complex. In this way, the donor polynucleotide is brought into proximity with the cleaved target nucleic acid and HDR will insert at least a portion of the donor polynucleotide at the cleaved site.
- the second sgRNA/dCas9 complex is designed to target nucleic acid downstream of the catalytically active sgRNA/Cas9 complex when the first sgRNA/dCas9 targets nucleic acid upstream of the catalytically active sgRNA/Cas9 complex.
- the second sgRNA/dCas9 complex is designed to target nucleic acid upstream of the catalytically active sgRNA/Cas9 complex when the first sgRNA/dCas9 targets nucleic acid downstream of the catalytically active sgRNA/Cas9 complex.
- the target for the active sgRNA/Cas9 complex is in a position between the two inactive complexes.
- the 5′ end of the polynucleotide donor will be associated with one of the inactive sgRNA/dCas9 complexes and the 3′ end associated with the other of the inactive complexes such that the polynucleotide donor is positioned across the cleavage site for insertion using HDR.
- One of skill in the art can readily determine which end of the polynucleotide donor to associate with each complex based on the desired target.
- the donor is tethered to the complexes using methods well known in the art.
- the backbone of the sgRNA can be extended to include a region complementary to the donor molecule.
- the sgRNA in the sgRNA/dCas9 complex can include a number of extra nucleotides, e.g., 5-20, such as 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or even more, extra nucleotides at the 3′ end of the sgRNA that will bind in a complementary fashion to the 5′ or 3′ end of a single-stranded DNA donor polynucleotide.
- the donor polynucleotide will be positioned to interact with the sgRNA/Cas9-induced cut site and the cell's endogenous HDR machinery will incorporate the donor into the cleavage site.
- the sgRNA/dCas9 tethered donor polynucleotide is positioned upstream or downstream of the double-stranded break and is available at a higher local concentration for HDR.
- the various components can be provided to a cell or in vitro, for example, using expression cassettes encoding a Cas9, a dCas9, sgRNA; a donor polynucleotide, etc. These components can be present on a single cassette or multiple cassettes, in the same or different constructs.
- Expression cassettes typically comprise regulatory sequences that are involved in one or more of the following: regulation of transcription, post-transcriptional regulation, and regulation of translation.
- Expression cassettes can be introduced into a wide variety of organisms including bacterial cells, yeast cells, plant cells, and mammalian cells.
- Expression cassettes typically comprise functional regulatory sequences corresponding to the organism(s) into which they are being introduced.
- vectors including expression vectors, comprising polynucleotides coding for a Cas9, a dCas9, a sgRNA and/or a donor polynucleotide.
- Vectors useful for practicing the present invention include plasmids, viruses (including phage), and Integra table DNA fragments (i.e., fragments integratable into the host genome by homologous recombination).
- a vector replicates and functions independently of the host genome, or may, in some instances, integrate into the genome itself. Suitable replicating vectors will contain a replicon and control sequences derived from species compatible with the intended expression host cell.
- Transformed host cells are cells that have been transformed or transfected with the vectors constructed using recombinant DNA techniques
- Expression vectors for most host cells are commercially available. There are several commercial software products designed to facilitate selection of appropriate vectors and construction thereof, such as insect cell vectors for insect cell transformation and gene expression in insect cells, bacterial plasmids for bacterial transformation and gene expression in bacterial cells, yeast plasmids for cell transformation and gene expression in yeast and other fungi, mammalian vectors for mammalian cell transformation and gene expression in mammalian cells or mammals, viral vectors (including retroviral, lentiviral, and adenoviral vectors) for cell transformation and gene expression and methods to easily enable cloning of such polynucleotides.
- insect cell vectors for insect cell transformation and gene expression in insect cells bacterial plasmids for bacterial transformation and gene expression in bacterial cells
- yeast plasmids for cell transformation and gene expression in yeast and other fungi
- mammalian vectors for mammalian cell transformation and gene expression in mammalian cells or mammals
- viral vectors including retroviral, lentiviral
- SnapGeneTM (GSL Biotech LLC, Chicago, Ill.; snapgene.com/resources/plasmid_files/your_time_is_valuable/), for example, provides an extensive list of vectors, individual vector sequences, and vector maps, as well as commercial sources for many of the vectors.
- Expression cassettes typically comprise regulatory sequences that are involved in one or more of the following: regulation of transcription, post-transcriptional regulation, and regulation of translation.
- Expression cassettes can be introduced into a wide variety of organisms including bacterial cells, yeast cells, mammalian cells, and plant cells.
- Expression cassettes typically comprise functional regulatory sequences corresponding to the host cells or organism(s) into which they are being introduced.
- Expression vectors can also include polynucleotides encoding protein tags (e.g., poly-His tags, hemagglutinin tags, fluorescent protein tags, bioluminescent tags, nuclear localization tags). The coding sequences for such protein tags can be fused to the coding sequences or can be included in an expression cassette, for example, in a targeting vector.
- polynucleotides encoding one or more of the various components are operably linked to an inducible promoter, a repressible promoter, or a constitutive promoter.
- Vectors can be designed to simultaneously express a target-specific sgRNA using a U2 or U6 promoter, a Cas9 and/or dCas9, and if desired, a marker protein, for monitoring transfection efficiency and/or for further enriching/isolating transfected cells by flow cytometry.
- Vectors can be designed for expression of various components of the described methods in prokaryotic or eukaryotic cells.
- transcription can be in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
- Other RNA polymerase and promoter sequences can be used.
- Vectors can be introduced into and propagated in a prokaryote.
- Prokaryotic vectors are well known in the art.
- a prokaryotic vector comprises an origin of replication suitable for the target host cell (e.g., oriC derived from E. coli, pUC derived from pBR322, pSC101 derived from Salmonella ), 15A origin (derived from p15A) and bacterial artificial chromosomes).
- Vectors can include a selectable marker (e.g., genes encoding resistance for ampicillin, chloramphenicol, gentamicin, and kanamycin).
- ZeocinTM (Life Technologies, Grand Island, N.Y.) can be used as a selection in bacteria, fungi (including yeast), plants and mammalian cell lines. Accordingly, vectors can be designed that carry only one drug resistance gene for Zeocin for selection work in a number of organisms.
- Useful promoters are known for expression of proteins in prokaryotes, for example, T5, T7, Rhamnose (inducible), Arabinose (inducible), and PhoA (inducible). Further, T7 promoters are widely used in vectors that also encode the T7 RNA polymerase.
- Prokaryotic vectors can also include ribosome binding sites of varying strength, and secretion signals (e.g., mal, sec, tat, ompC, and pelB).
- vectors can comprise RNA polymerase promoters for the expression of sgRNAs.
- Prokaryotic RNA polymerase transcription termination sequences are also well known (e.g., transcription termination sequences from S. pyogenes ).
- Integrating vectors for stable transformation of prokaryotes are also known in the art (see, e.g., Heap, J. T., et al., “Integration of DNA into bacterial chromosomes from plasmids without a counter-selection marker,” Nucleic Acids Res. (2012) 40:e59).
- Expression of proteins in prokaryotes is typically carried out in Escherichia coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins.
- RNA polymerase promoters suitable for expression of the various components are available in prokaryotes (see, e.g., Jiang, Y., et al., “Multigene editing in the Escherichia coli genome via the CRISPR-Cas9 system,” Environ Microbiol. (2015) 81:2506-2514); Estrem, S. T., et al., (1999) “Bacterial promoter architecture: subsite structure of UP elements and interactions with the carboxy-terminal domain of the RNA polymerase alpha subunit,” Genes Dev. 15; 13(16):2134-47).
- a vector is a yeast expression vector comprising one or more components of the above-described methods.
- yeast expression vectors for expression in Saccharomyces cerivisae include, but are not limited to, the following: pYepSec1, pMFa, pJRY88, pYES2, and picZ.
- Methods for gene expression in yeast cells are known in the art (see, e.g., Methods in Enzymology, Volume 194, “Guide to Yeast Genetics and Molecular and Cell Biology, Part A,” (2004) Christine Guthrie and Gerald R. Fink (eds.), Elsevier Academic Press, San Diego, Calif.).
- promoters typically include, but are not limited to, promoters of genes encoding the following yeast proteins: alcohol dehydrogenase 1 (ADH1) or alcohol dehydrogenase 2 (ADH2), phosphoglycerate kinase (PGK), triose phosphate isomerase (TPI), glyceraldehyde-3-phosphate dehydrogenase (GAPDH; also known as TDH3, or triose phosphate dehydrogenase), galactose-1-phosphate uridyl-transferase (GALT), UDP-galactose epimerase (GAL10), cytochrome ci (CYC1), acid phosphatase (PHOS) and glycerol-3-phosphate dehydr
- ADH1 alcohol dehydrogenase 1
- ADH2 alcohol dehydrogenase 2
- PGK phosphoglycerate kinase
- TPI triose phosphate isomerase
- Hybrid promoters such as the ADH2/GAPDH, CYC1/GAL10 and the ADH2/GAPDH promoter (which is induced at low cellular-glucose concentrations, e.g., about 0.1 percent to about 0.2 percent) also may be used.
- suitable promoters include the thiamine-repressed nmtl promoter and the constitutive cytomegalovirus promoter in pTL2M.
- Yeast RNA polymerase III promoters e.g., promoters from 5S, U6 or RPR1 genes
- polymerase III termination sequences are known in the art (see, e.g., www.yeastgenome.org; Harismendy, O., et al., (2003) “Genome-wide location of yeast RNA polymerase III transcription machinery,” The EMBO Journal. 22(18):4738-4747.)
- upstream activation sequences may be used to enhance polypeptide expression.
- upstream activation sequences for expression in yeast include the UASs of genes encoding these proteins: CYC1, ADH2, GAL1, GALT, GAL10, and ADH2.
- transcription termination sequences for expression in yeast include the termination sequences of the ⁇ -factor, CYC1, GAPDH, and PGK genes. One or multiple termination sequences can be used.
- Suitable promoters, terminators, and coding regions may be cloned into E. coli -yeast shuttle vectors and transformed into yeast cells. These vectors allow strain propagation in both yeast and E. coli strains. Typically, the vector contains a selectable marker and sequences enabling autonomous replication or chromosomal integration in each host. Examples of plasmids typically used in yeast are the shuttle vectors pRS423, pRS424, pRS425, and pRS426 (American Type Culture Collection, Manassas, Va.). These plasmids contain a yeast 2 micron origin of replication, an E. coli replication origin (e.g., pMB1), and a selectable marker.
- yeast 2 micron origin of replication e.g., pMB1
- a selectable marker e.g., pMB1
- the various components can also be expressed in insects or insect cells.
- Suitable expression control sequences for use in such cells are well known in the art.
- it is desirable that the expression control sequence comprises a constitutive promoter.
- suitable strong promoters include, but are not limited to, the following: the baculovirus promoters for the piO, polyhedrin (polh), p 6.9, capsid, UAS (contains a Gal4 binding site), Ac5, cathepsin-like genes, the B.
- baculovirus promoters for the ie1, ie2, ieO, etl, 39K (aka pp31), and gp64 genes. If it is desired to increase the amount of gene expression from a weak promoter, enhancer elements, such as the baculovirus enhancer element, hr5, may be used in conjunction with the promoter.
- RNA polymerase III promoters are known in the art, for example, the U6 promoter.
- conserveed features of RNA polymerase III promoters in insects are also known (see, e.g., Hernandez, G., (2007) “Insect small nuclear RNA gene promoters evolve rapidly yet retain conserved features involved in determining promoter activity and RNA polymerase specificity,” Nucleic Acids Res. 2007 January; 35(1):21-34).
- the various components are incorporated into mammalian vectors for use in mammalian cells.
- mammalian vectors suitable for use with the systems of the present invention are commercially available (e.g., from Life Technologies, Grand Island, N.Y.; NeoBiolab, Cambridge, Mass.; Promega, Madison, Wis.; DNA2.0, Menlo Park, Calif.; Addgene, Cambridge, Mass.).
- Vectors derived from mammalian viruses can also be used for expressing the various components of the present methods in mammalian cells. These include vectors derived from viruses such as adenovirus, papovirus, herpesvirus, polyomavirus, cytomegalovirus, lentivirus, retrovirus, vaccinia and Simian Virus 40 (SV40) (see, e.g., Kaufman, R. J., (2000) “Overview of vector design for mammalian gene expression,” Molecular Biotechnology, Volume 16, Issue 2, pp 151-160; Cooray S., et al., (2012) “Retrovirus and lentivirus vector design and methods of cell conditioning,” Methods Enzymol. 507:29-57).
- viruses such as adenovirus, papovirus, herpesvirus, polyomavirus, cytomegalovirus, lentivirus, retrovirus, vaccinia and Simian Virus 40 (SV40)
- SV40 Simian Virus 40
- Regulatory sequences operably linked to the components can include activator binding sequences, enhancers, introns, polyadenylation recognition sequences, promoters, repressor binding sequences, stem-loop structures, translational initiation sequences, translation leader sequences, transcription termination sequences, translation termination sequences, primer binding sites, and the like.
- Commonly used promoters are constitutive mammalian promoters CMV, EF1a, SV40, PGK1 (mouse or human), Ubc, CAG, CaMKIIa, and beta-Act. and others known in the art (Khan, K. H. (2013) “Gene Expression in Mammalian Cells and its Applications,” Advanced Pharmaceutical Bulletin 3(2), 257-263).
- mammalian RNA polymerase III promoters including H1 and U6, can be used.
- a recombinant mammalian expression vector is capable of preferentially directing expression of the nucleic acid in a particular cell type (e.g., using tissue-specific regulatory elements to express a polynucleotide).
- tissue-specific regulatory elements are known in the art and include, but are not limited to, the albumin promoter, lymphoid-specific promoters, neuron-specific promoters (e.g., the neurofilament promoter), pancreas-specific promoters, mammary gland-specific promoters (e.g., milk whey promoter), and in particular promoters of T cell receptors and immunoglobulins.
- Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters and the alpha-fetoprotein promoter.
- HEK 293 Human embryonic kidney
- CHO Chiinese hamster ovary
- These cell lines can be transfected by standard methods (e.g., using calcium phosphate or polyethyleneimine (PEI), or electroporation).
- PKI polyethyleneimine
- Other typical mammalian cell lines include, but are not limited to: HeLa, U2OS, 549, HT1080, CAD, P19, NIH 3T3, L929, N2a, Human embryonic kidney 293 cells, MCF-7, Y79, SO-Rb50, Hep G2, DUKX-X11, J558L, and Baby hamster kidney (BHK) cells.
- polynucleotides e.g., an expression vector
- methods of introducing polynucleotides are known in the art and are typically selected based on the kind of host cell.
- Such methods include, for example, viral or bacteriophage infection, transfection, conjugation, electroporation, calcium phosphate precipitation, polyethyleneimine-mediated transfection, DEAE-dextran mediated transfection, protoplast fusion, lipofection, liposome-mediated transfection, particle gun technology, direct microinjection, and nanoparticle-mediated delivery.
- one aspect of the present invention provides methods of increasing Cas9-mediated genome engineering efficiency by either decreasing the number of off-target nucleic acid double-stranded breaks, and/or enhancing HDR of a cleaved target nucleic acid, thus modifying genomes using HDR.
- the present invention also includes methods of modulating in vitro or in vivo transcription using the various components and complexes described herein.
- a sgRNA/dCas protein complex can repress gene expression by interfering with transcription when the sgRNA directs DNA target binding of the complex to the promoter region of the gene.
- complexes to reduce transcription also includes complexes wherein the dCas protein is fused to a known down regulator of a target gene (e.g., a repressor polypeptide).
- a target gene e.g., a repressor polypeptide
- expression of a gene is under the control of regulatory sequences to which a repressor polypeptide can bind.
- a guide polynucleotide can direct DNA target binding of a repressor protein complex to the DNA sequences encoding the regulatory sequences or adjacent the regulatory sequences such that binding of the repressor protein complex brings the repressor protein into operable contact with the regulatory sequences.
- dCas9 can be fused to an activator polypeptide to activate or increase expression of a gene under the control of regulatory sequences to which an activator polypeptide can bind.
- Another method of the present invention is the use of sgRNA/dCas9 complexes in methods to isolate or purify regions of genomic DNA (gDNA).
- a dCas protein is fused to an epitope (e.g., a FLAG® epitope, Sigma Aldrich, St. Louis, MO) and a sgRNA directs DNA target binding of a sgRNA/dCas9 protein-epitope complex to DNA sequences within the region of genomic DNA to be isolated or purified.
- An affinity agent is used to bind the epitope and the associated gDNA bound to the sgRNA/dCas9 protein-epitope complex.
- a gene-therapy method uses the introduction of nucleic acid sequences into an organism or cells of an organism (e.g., patient) to achieve expression of components of the present invention to provide modification of a target function.
- cells from an organism may be engineered, ex vivo, by (i) introduction of vectors comprising expression cassettes expressing the various components, (ii) direct introduction of sgRNA and/or donor polynucleotides and Cas9 and/or dCas9 proteins, or (iii) introduction of combinations of these components.
- the engineered cells are provided to an organism (e.g., patient) to be treated.
- components of the present invention are delivered using nanoscale delivery systems, such as nanoparticles.
- liposomes and other particulate delivery systems can be used.
- vectors including the components of the present methods can be packaged in liposomes prior to delivery to the subject or to cells derived therefrom, such as described in U.S. Pat. Nos. 5,580,859; 5,549,127; 5,264,618; 5,703,055, all incorporated herein by reference in their entireties.
- Lipid encapsulation is generally accomplished using liposomes which are able to stably bind or entrap and retain nucleic acid.
- the methods described herein can also be used to generate non-human genetically modified organisms.
- expression cassettes comprising polynucleotide sequences of the various components, as well as a targeting vector are introduced into zygote cells to site-specifically introduce a selected polynucleotide sequence at a DNA target sequence in the genome to generate a modification of the genomic DNA.
- the selected polynucleotide sequence is present in the targeting vector.
- Modifications of the genomic DNA typically include, insertion of a polynucleotide sequence, deletion of a polynucleotide sequence, or mutation of a polynucleotide sequence, for example, gene correction, gene replacement, gene tagging, transgene insertion, gene disruption, gene mutation, mutation of gene regulatory sequences, and so on.
- the organism is a mouse.
- Generating transgenic mice involves five basic steps (Cho A., et al., “Generation of Transgenic Mice,” Current protocols in cell biology, (2009); CHAPTER.Unit-19.11): (1) purifying a transgenic construct (e.g., expression cassettes comprising the various components of the various methods described herein, as well as a targeting vector); (2) harvesting donor zygotes; (3) microinjecting the transgenic construct into the mouse zygote; (4) implanting the microinjected zygotes into pseudo-pregnant recipient mice; and (5) performing genotyping and analysis of the modification of the genomic DNA established in founder mice.
- a transgenic construct e.g., expression cassettes comprising the various components of the various methods described herein, as well as a targeting vector
- the organism is a plant.
- the components described herein are used to effect efficient, cost-effective gene editing and manipulation in plant cells. It is generally preferable to insert a functional recombinant DNA in a plant genome at a non-specific location. However, in certain instances, it may be useful to use site-specific integration to introduce a recombinant DNA construct into the genome.
- Recombinant vectors for use in plant are known in the art.
- the vectors can include, for example, scaffold attachment regions (SARs), origins of replication, and/or selectable markers.
- Methods and compositions for transforming plants by introducing a recombinant DNA construct into a plant genome includes any of a number of methods known in the art.
- One method for constructing transformed plants is microprojectile bombardment.
- Agrobacterium -mediated transformation is another method for constructing transformed plants.
- other non- Agrobacterium species e.g., Rhizobium
- Other transformation methods include electroporation, liposomes, transformation using pollen or viruses, chemicals that increase free DNA uptake, or free DNA delivery by means of microproj ectile bombardment.
- DNA constructs of the present invention may be introduced into the genome of a plant host using conventional transformation techniques that are well known to those skilled in the art (see, e.g., “Methods to Transfer Foreign Genes to Plants,” Y Narusaka, et al., cdn.intechopen.com/pdfs-wm/30876.pdf).
- the following examples 1-4 illustrate the use of a catalytically inactive Cas9 (i.e. “dead” Cas9 or dCas9) to reduce off-target nuclease activity in eukaryotic cells. Additionally, this example shows how one can identify a specific spacer sequence (for incorporation into a sgRNA or crRNA) that is effective at blocking nuclease off-target activity in eukaryotic cells.
- sgRNA or single-guide RNA it is understood by one skilled in the art that other guide polynucleotide systems, such as a crRNA/tracrRNA dual-guide system, present an alternative means of guiding dCas9 to the targeted site.
- sgRNA components of dCas9 nuclease-blocker dCas9-NB, i.e. a Cas9 lacking catalytic activity
- ribonucleoprotein (RNP) complexes also termed “sgRNA/dCas9 complex” herein
- Cas9 nuclease RNP complexes also termed “sgRNA/Cas9 complex” herein
- in vitro transcription e.g., T7 Quick High Yield RNA Synthesis Kit, New England Biolabs, Ipswich, Mass.
- PCR Polymerase Chain Reaction
- the PCR reaction to assemble the sgRNA DNA template proceeded as follows: Three “internal” DNA primers (C, D, E-O, Table 2) were present at a concentration of 2 nM each. Two “outer” DNA primers (A, B, Table 2) corresponding to the T7 promoter and the 3′ end of the RNA sequence were present at 640 nM to drive the amplification reaction.
- PCR reactions were performed using Kapa HiFi HotstartTM PCR kit (Kapa Biosystems, Inc., Wilmington, Mass.) as per manufacturer's recommendation.
- PCR assembly reactions were carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 35 cycles of 15 seconds at 98° C., 15 seconds at 62° C., 15 seconds at 72° C., and a final extension at 72° C. for 2 minutes.
- RNA synthesis Kit New England Biolabs, Ipswich, Mass.
- Transcription reactions were DNAse I-treated (New England Biolabs, Ipswich, Mass.).
- the quality of the transcribed RNA was checked by capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa).
- the Cas9 and dCas9-NB sgRNA component sequences were as follows:
- Protein components of Cas9 and dCas9-NB RNPs were expressed from bacterial expression vectors in E. coli (BL21 (DE3)) and purified using affinity, ion exchange and size exclusion chromatography according to methods described in Jinek et al., 2012.
- the coding sequence for S. pyogenes Cas9 included the two nuclear localization sequences (NLS) at the C-terminus.
- the dCas9 variant of NLS-tagged Cas9, in which active site residues from both nuclease domains were mutated was prepared by introducing mutations into the coding sequence of S.
- This example illustrates the use of a MiSeq Sequencer (Illumina, San Diego, Calif.) for deep sequencing analysis to evaluate and compare the DNA cleavage (as inferred from non-homologous end joining, or NHEJ) of selected Cas9 nuclease off-target sequences in the presence and absence of dCas9-NBs.
- Cas9 was directed by a specific sgRNA to a sequence (GGGTGGGGGGAGTTTGCTCCTGG, SEQ ID NO:82) within the human gene Vascular Endothelial Growth Factor A (VEGFA).
- dCas9 was directed towards an off-target, sequence (GGATGGAGGGAGTTTGCTCCTGG, SEQ ID NO:83) known to be targeted by Cas9 RNP nuclease off-target to prevent off-target cleavage as well as a sequence (GGGGCCACTAGGGACAGGATTGG, SEQ ID NO:84) within the control locus, Adeno-Associated Virus Integration Site 1 (AAVSJ).
- sgRNA corresponding to approximately 1-9 ⁇ g or approximately 25-250 pmol
- reaction buffer 20 mM HEPES, pH 7.5, 100 mM KCL, 5 mM MgCl 2 , 5% glycerol
- Cas9/dCas9-NB RNP complexes were transfected into K562 cells (ATCC, Manassas, Va.), using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.) and the following protocol: RNP complexes were dispensed in a 5 ⁇ L final volume into individual wells of a 96-well plate.
- K562 cells suspended in media were transferred from culture flask to a 50 mL conical, cells were then pelleted by centrifugation for 3 minutes at 200 ⁇ g, the culture medium aspirated and washed once with calcium and magnesium-free PBS.
- K562 cells were then pelleted by centrifugation for 3 minutes at 200 ⁇ g, the PBS aspirated and cell pellet was resuspended in 10 mL of calcium and magnesium-free PBS.
- K562 cells were counted using the Countess® II Automated Cell Counter (Life Technologies, Grand Island, N.Y.). 4.2 ⁇ 10 7 cells were transferred to a 50 ml tube and pelleted. The PBS was aspirated and the cells were resuspended in NucleofectorTM SF (Lonza, Allendale, N.J.) solution to a density of 1 ⁇ 10 7 cells/mL. 20 ⁇ L of the cell suspension were then added to individual wells containing 5 ⁇ L of RNP complexes and the entire volume was transferred to the wells of a 96-well NucleocuvetteTM Plate (Lonza, Allendale, N.J.).
- the plate was loaded onto the NucleofectorTM 96-well ShuttleTM (Lonza, Allendale, N.J.) and cells were nucleofected using the 96-FF-120 NucleofectorTM program (Lonza, Allendale, N.J.).
- Iscove's Modified Dulbecco's Media 80 ⁇ L Iscove's Modified Dulbecco's Media (IMDM, Life Technologies, Grand Island, N.Y.), supplemented with 10% FBS (Fisher Scientific, Pittsburgh, Pa.) and supplemented with penicillin and streptomycin (Life Technologies, Grand Island, N.Y.), was added to each well and 50 ⁇ L of the cell suspension was transferred to a 96-well cell culture plate containing 150 ⁇ L pre-warmed IMDM complete culture medium. The plate was then transferred to a tissue culture incubator and maintained at 37° C. in 5% CO 2 for approximately 48 hours.
- IMDM Iscove's Modified Dulbecco's Media
- FBS Fisher Scientific, Pittsburgh, Pa.
- penicillin and streptomycin Life Technologies, Grand Island, N.Y.
- Genomic DNA was isolated from K562 cells 48 hours after Cas9/dCas9-NB transfection using 50 ⁇ L QuickExtract DNA Extraction solution (Epicentre, Madison, Wis.) per well followed by incubation at 37° C. for 10 minutes. 50 ⁇ L water was added to the samples, and next they were incubated at 75° C. for 10 minutes and 95° C. for 5 minutes to stop the reaction. sgDNA was stored at ⁇ 20° C. until further processing.
- PCR reaction was diluted 1:100 in water.
- Target-specific primers are shown in Table 4:
- a second ‘barcoding’ PCR was set up using unique primers for each sample facilitating multiplex sequencing (oligonucleotides T and U in Table 2, where a unique 8 bp index sequence, denoted by “NNNNNNNN (SEQ ID NO:33)” allowed demultiplexing of each amplicon during sequence analysis).
- the second PCR was performed using Q5 Hot Start High-Fidelity 2 ⁇ Master Mix (New England Biolabs, Ipswich, Mass.) at 1 ⁇ concentration, primers at 0.5 ⁇ M each, 1 ⁇ L of 1:100 diluted first PCR, in a final volume of 10 ⁇ L and amplified 98° C. for 1 minute, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 min. PCR reactions were pooled into a single microfuge tube for SPRIselect bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing.
- SPRIselect bead Beckman Coulter, Pasadena, Calif.
- the microfuge tube was spun in a microcentrifuge to collect the contents of the tube, and was then returned to the magnet, incubated until solution had cleared, and the supernatant containing the purified amplicons were dispensed into a clean microfuge tube.
- the purified amplicon library was quantified using the NanodropTM 2000 system (Thermo Fisher Scientific, Wilmington, Del.).
- the amplicon library was normalized to 4 nM concentration as calculated from optical absorbance at 260 nm (NanodropTM, Thermo Fisher Scientific, Wilmington, Del.) and size of the amplicons. Library was analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2, 300 Cycles (Illumina, San Diego, Calif.), with two 151-cycle paired-end run plus two eight-cycle index reads.
- FIG. 8 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 on-target editing at the VEGFA locus.
- FIG. 9 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 off-target editing at a known VEGFA off-target locus on human chromosome 15.
- the deep sequencing analysis described in this example can be practiced by one of ordinary skill in the art with other Cas9/dCas9 RNP complexes (i.e. assembled with distinct sgRNAs and/or distinct ratios of Cas9, dCas9, and sgRNA).
- This example illustrates the method through which off-target Cas9 nuclease sites may be identified.
- the method presented here is adapted from Tsai et. al., “GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases.,” Nat Biotechnol., 2015 February; 33(2):187-97.
- a given locus in a genome of interest i.e. a human genome
- a 20 base pair target-site, followed by an NGG protospacer adjacent motif (PAM), is selected for nuclease targeting.
- PAM NGG protospacer adjacent motif
- Oligos are obtained (Integrated DNA Technologies, Coralville, Iowa) for generating a blunt, double-stranded oligodeoxynucleotide (dsODN) that will be utilized for the GUIDE-Seq method.
- the dsODN contains phosphothiorate linkages at the 5′ ends of both DNA strands.
- the dsODN is assembled by incubating the two oligos in annealing buffer (i.e. 10 mM Tris, pH 7.5-8.0, 50 mM NaCl, 1 mM EDTA) for 3 min at 95° C. and allowing the oligos to cool to RT.
- annealing buffer i.e. 10 mM Tris, pH 7.5-8.0, 50 mM NaCl, 1 mM EDTA
- Cells from a species of interest are procured from a commercial repository (i.e. ATCC, DSMZ). Cells are grown to an appropriate density for transfection. Cells are transfected with an sgRNAs/Cas9 protein complex and the DNA donor oligo via methods known to those skilled in the art (i.e. nucleofection or lipid transfection of DNA plasmid encoding RNP components as well as dsODN).
- gDNA is harvested 48hrs after cell transfection and purified using Agencourt DNAdvance (Beckman Coulter, Pasadena, Calif.). Purified gDNA is fragmented with methods known to those skilled in the art (i.e. mechanical shearing via sonication or enzymatic shearing with NEBfragmentase, (New England Biolabs, Ipswich, Mass.)) to an average length of 500 base pairs, then end-repaired, A-tailed and ligated to adapters. PCR with primers complementary to the dsODN tag and illumina sequencing adapter sequences (Illumina, San Diego, Calif.), respectively, are used for target-enrichment.
- Target-enriched library is sequenced using MiSeq Sequencer (Illumina, San Diego, Calif.). Reads are mapped back to the respective species' genome and read coverage is calculated using BedTools (bedtools.readthedocs.org/en/latest/). Integrative Genomics Viewer (IGV, broadinstitute.org/igv/) is used to map the starting (5′) and ending (3′) position of reads, and peaks are called using MACS2 (pypi.python.org/pypi/MACS2). The Sequencing data is used to confirm that a putative genomic locus is a candidate off-target sequence. Following the guidance of the present examples, the identification of novel off-target loci can be practiced by one of ordinary skill in the art.
- Tru-gRNAs Truncated Single-Guide RNAs
- dCas9-NBs may be assembled with truncated guides.
- the method presented here is adapted from Fu Y et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. 2014 March; 32(3):279-84.
- Truncated single-guide RNAs (tru-sgRNAs) of 17-18nt have been shown to possess increased specificity relative to 20 nt sgRNAs.
- a dCas9-NB assembled with a tru-sgRNA may be targeted directly to a genomic motif and PAM of an off-target locus to reduce off-target editing while having minimal inhibition of on-target editing.
- a given off-target genomic locus i.e. spacer sequence
- a tru-sgRNA is designed to target said off-target location in the genome.
- the tru-sgRNA/dCas9 RNP may target a sequence contained entirely within the off-target motif, or it may target a sequence partially overlapping with the off-target motif.
- dCas9 is assembled with a short (i.e. 17 nt) tru-sgRNA and Cas9 is assembled with a sgRNA (i.e. 20 nt) to produce functional RNPs.
- RNA components are transcribed from DNA templates incorporating a T7 promoter at the 5′ end as described in the Experimental section herein.
- dCas9 (D10A, H840A) and Cas9 proteins are recombinantly expressed in E. coli.
- RNPs are assembled by incubating protein and RNA components together at 37° C. for 10 minutes.
- Cells from species of interest are procured from a commercial repository (i.e. ATCC, DSMZ). Cells are grown to a level of confluency that enables transfection. Tru-sgRNAs complexed with dCas9 are mixed with sgRNAs assembled with Cas9. Next, the mixture is transfected into a cell line of interest using methods known to those skilled in the art (i.e. nucleofection or lipid transfection) as described in Example 1 herein.
- gDNA is then harvested 48 hours later using Quick Extract (Epicentre, Madison, Wis.) per the manufacturer's instructions.
- Two rounds of PCR, as described in Example 1 herein, are used to amplify and barcode the genomic region targeted by the tru-sgRNA/dCas9-NB.
- Adapter oligos and dimers are removed by performing SPRlselect bead (Beckman Coulter, Pasadena, Calif.)-based clean up of the sequencing library. Sequencing library concentration is determined by the NanodropTM 2000 system (Thermo Scientific, Wilmington Del.).
- the library is analyzed on MiSeq Sequencer as follows:
- This system consists of a site-specific endonuclease (e.g, Cas9 complexed with a sgRNA) that targets a DNA target sequence of interest), and one or more catalytically inactive dCas9 molecules complexed with a sgRNA that targets DNA sequences adjacent to the cut site (See FIGS. 7A and 7B ).
- dCas9 molecules are also tethered to a HDR molecule (i.e. dsDNA, ssDNA, RNA, a plasmid, or the like).
- the tethered dCas9 is used to position the donor molecule in an orientation that will increase the likelihood that the donor molecule will be incorporated into the target site through HDR, thereby introducing a desired change to the target sequence.
- Oligonucleotides are ordered from manufacturers (e.g., Integrated DNA Technologies, Coralville, Iowa; or Eurofins, Germany). sgRNA transcription constructs are assembled by polymerase chain reaction (PCR).
- the primers for sgRNA transcription constructs consist of a primer containing a 5′ T7 promoter sequence, a primer containing a unique spacer sequence, primers containing the sgRNA TRCR backbone, and a reverse primer that may contain a complementary sequence to the homology donor for tethering the donor to the 3′ end of the sgRNA.
- T7 sgRNA transcription constructs are PCR-amplified.
- Two outer primers forward oligo contains T7 promoter oligo; reverse oligos contain 3′ end of sgRNA backbone or homology donor complementary sequence for tethering
- forward oligo contains T7 promoter oligo; reverse oligos contain 3′ end of sgRNA backbone or homology donor complementary sequence for tethering
- PCR reaction is performed using Q5 Hot Start High-Fidelity 2 ⁇ Master Mix (New England Biolabs, Ipswich, MA) following manufacturer's recommendations.
- PCR T7 sgRNA transcription construct assembly PCR is carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 29 cycles of 98° C. for 20 seconds, 62° C. for 20 seconds, and 72° C. for 15 seconds, followed by a final extension of 72° C. for 2 minutes.
- DNA constructs are evaluated by capillary electrophoresis (Fragment Analyzer, Advanced Analytic
- RNA components are produced through in vitro transcription (T7 Quick High Yield RNA Synthesis Kit, New England Biolabs, Ipswich, MA) from a double-stranded DNA template.
- the RNA is then treated with DNase I (New England Biolabs, Ipswich, MA) to remove any double-stranded DNA and incubated at 37° C. for 10 minutes.
- 0.5 M EDTA is then added to the transcription reactions and incubated at 75° C. for 10 minutes to inactivate the DNase I.
- Homology donors are ordered as single-stranded DNA oligos of approximately 90 nucleotides in length.
- the homology donors are complementary to the coding sequence and are designed to be centered on the cut site with the PAM replaced with a EcoR1 restriction enzyme site and homology arms of approximately 42 nucleotides in length matching the target sequence.
- S. pyogenes catalytically active Cas9 and catalytically inactive dCas9 are C-terminally tagged with two nuclear localization sequences (NLS) and recombinantly expressed in E. coli. All sgRNA and tethered sgRNA are incubated for 2 minutes at 95° C., removed from the thermal cycler and allowed to equilibrate to room temperature.
- Cas9 Ribonucleoprotein (RNP) complexes are set up in triplicate with 2 ⁇ M Cas9 or 2 ⁇ M dCas9, 6 ⁇ M sgRNA or 6 ⁇ M tethered sgRNA and 2 ⁇ M donor oligo in binding buffer (20 mM HEPES, 100 mM KCl, 5 mM MgCl 2 , 1 mM DTT, and 5% glycerol at pH 7.4) in a volume of 6 ⁇ l.
- the RNPs are then allowed to bind at 37° C. for 10 minutes. After annealing, the Cas9 RNP and dCas9 RNP-donor tethers can be combined to a final volume of 12 ⁇ l.
- K562 cells (ATCC, Manassas, Va.) are cultured in suspension in IMDM medium supplemented with 10% FBS and 1% penicillin and streptomycin at 37° C. with 100% humidity. K562 cells are transfected using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.). RNPs and dCas9 RNPs are arranged in a 96-well plate with 2 ⁇ l of Cas9 RNP or 4 ⁇ l of Cas9 and dCas9 RNP combined. K562 cells are transferred to a 50 ml conical centrifuge tube and centrifuged at 200 ⁇ G for 3 minutes.
- the media is aspirated and the cell pellet washed in calcium and magnesium-free PBS.
- the cells are centrifuged once more and then resuspended in Nucleofector SF buffer (Lonza, Allendale, N.J.) at a concentration of 1 ⁇ 10 7 cells/ml. 20 ⁇ l of this cell suspension is added to the RNP in the 96 well plate, mixed, and then the entire volume is transferred to a 96-well NucleocuvetteTM Plate (Lonza, Allendale, N.J.).
- the plate is then loaded into the NucleofectorTM 96-well ShuttleTM (Lonza, Allendale, N.J.) and cells are nucleofected using the 96-FF-120 NucleofectorTM program (Lonza, Allendale, N.J.).
- 80 ⁇ l of complete IMDM medium is added to each well of the 96-well NucleocuvetteTM Plate.
- the entire contents of the well are then transferred to a 96-well tissue culture plate containing 100 ⁇ l of complete IMDM medium.
- the cells are cultured at 37° C. with 100% humidity conditions for 48 hours.
- the K562 cells are centrifuged at 500 ⁇ G for 5 minutes and the medium is removed. The cells are washed 1 time in calcium and magnesium-free PBS. The cell pellets are then resuspended in 50 ⁇ l of QuickExtract DNA Extraction solutions (Epicentre, Madison, Wis.). The gDNA samples obtained are then incubated at 37° C. for 10 minutes, 65° C. for 6 minutes and 95° C. for 3 minutes to stop the reaction. gDNA samples are then diluted with 50 ⁇ l of water and stored at ⁇ 20° C.
- This gDNA is PCR-amplified using Q5 Hot Start High-Fidelity 2 ⁇ Master Mix (New England Biolabs, Ipswich, Mass.) at 1 ⁇ concentration, primers at 0.5 ⁇ M each, 3.75 ⁇ L of gDNA in a final volume of 10 L and amplified 98° C. for 1 minutes, 35 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 min. PCR reaction was diluted 1:100 in water.
- a second “barcoding” PCR is set up using unique primers for each sample, facilitating multiplex sequencing.
- the second PCR is performed using Q5 Hot Start High-Fidelity 2 ⁇ Master Mix (New England Biolabs, Ipswich, Mass.) at 1 ⁇ concentration, primers at 0.5 ⁇ M each, 1 ⁇ L of 1:100 diluted first PCR, in a final volume of 10 ⁇ L and amplified 98° C. for 1 minutes, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes.
- Q5 Hot Start High-Fidelity 2 ⁇ Master Mix New England Biolabs, Ipswich, Mass.
- PCR reactions are pooled into a single microfuge tube for SPRIselectTM bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing.
- the microfuge tube is then removed from the magnetic stand and 0.25 ⁇ volumes of Qiagen EB buffer (Qiagen, Venlo, Netherlands) was added to bead, mixed vigorously, and incubated for 2 minutes at RT.
- the microfuge tube is returned to the magnet, incubated until solution has cleared and supernatant containing the purified amplicons is dispensed into a clean microfuge tube.
- the purified amplicon library is quantified using the NanodropTM 2000 system (Thermo Fisher Scientific, Wilmington Del.) and library quality analyzed using the Fragment AnalyzerTM system (Advanced Analytical Technologies, Inc., Ames, Iowa) and the DNF-910 dsDNA Reagent KitTM (Advanced Analytical Technologies, Inc. Ames, Iowa).
- the amplicon library is normalized to 4 nmolar concentration as calculated from Nanodrop values and size of the amplicons.
- the library is analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2TM, 300 Cycles (Illumina, San Diego, Calif.), with two 151-cycle paired-end run plus two eight-cycle index reads.
- the identity of products in the sequencing data is analyzed based upon the index barcode sequences adapted onto the amplicons in the second round of PCR.
- a computational script is used to process the MiSeq data by executing the following tasks:
- Aligned reads are compared to expected wild type target locus sequence. Reads not aligning to any part of the target locus are discarded.
- Reads matching wild-type target sequence are tallied.
- Reads with indels are categorized by indel type and tallied.
- This data is then analyzed to determine if sgRNA/dCas9 tethered donor polynucleotides increase HDR efficiency compared to passively diffused donor polynucleotides.
- Cas9D10A Cas9 nickase mutant where one nuclease domain is inactivated
- HDR preferentially homology-directed repair
- Cas9D10A is used with two specific, single-guide RNAs (sgRNAs) that deliver the nickase to two sites on the same strand 30-60 nucleotides apart. Spacer sequences were chosen from available sequences in human genomic DNA so that each of the two sgRNAs would target Cas9 to a location on either side of the desired region for modification.
- sgRNA components of Cas9D10A Ribonucleoprotein (RNP) complexes also termed “sgRNA/Cas9 nickase complexes” herein
- Cas9D10A Ribonucleoprotein (RNP) complexes also termed “sgRNA/Cas9 nickase complexes” herein
- catalytically active Cas9 nuclease RNP complexes also termed “sgRNA/Cas9 complexes” herein
- PCR Polymerase Chain Reaction
- the DNA primers were present at a concentration of 2 nM each.
- Two outer DNA primers corresponding to the T7 promoter (forward primer: Oligonucleotide A, Table 1), and the 3′ end of the RNA sequence (reverse primers: Oligonucleotides C, Table 1) were used at 640 nM to drive the amplification reaction.
- PCR reactions were performed using Kapa HiFiHotstart PCRTM kit (Kapa Biosystems, Inc., Wilmington, Mass.) as per manufacturer's recommendation.
- PCR assembly reactions were carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 35 cycles of 15 seconds at 98° C., 15 seconds at 62° C., 15 seconds at 72° C., and a final extension at 72° C. for 2 min.
- the quantity of PCR product was determined using capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa).
- RNA synthesis Kit New England Biolabs, Ipswich, Mass.
- Transcription reactions were DNAse I treated (New England Biolabs, Ipswich, Mass.).
- the quality of the transcribed RNA was checked by capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa). Protein components of RNPs were expressed from bacterial expression vectors in E.
- Cas9 and/or Cas9D10A/Cas9H840A RNPs can be applied to the production of other Cas9 and/or Cas9D10A/Cas9H840A RNPs as described herein.
- the coding sequence for S. pyogenes Cas9 included the two nuclear localization sequences (NLS) at the C-terminus.
- Cas9D10A or Cas9H840A nickase variants of NLS-tagged Cas9, where an active site residue of either nuclease domain is mutated were prepared by introducing mutations into the coding sequence of S. pyogenes Cas9 by site directed mutagenesis (e.g. Q5 Site-directed Mutagenesis Kit, New England Biolabs, Ipswich, Mass.).
- Example DNA repair types can include mutagenic end-joining pathways such as non-homologous end joining (NHEJ) or insertion of material from a donor sequence by homology directed repair (HDR).
- NHEJ non-homologous end joining
- HDR homology directed repair
- Cas9 and Cas9D10A were directed to the human gene CD34 at five independent sites by specific sgRNAs.
- Cas9 and Cas9D10A RNPs To assemble Cas9 and Cas9D10A RNPs, 1.36 ⁇ l of sgRNA (corresponding to approximately 1-5 ⁇ g) were incubated for 2 minutes at 95° C. then allowed to equilibrate to room temperature for approximately 5 minutes. Subsequently, Cas9 and Cas9D10A were mixed with a corresponding sgRNA to form RNPs in reaction buffer (20 mM HEPES, pH 7.5, 100 mM KCL, 5 mM MgCl 2 , 5% glycerol). 20 pmols of Cas9 or Cas9D10A were combined with the target sgRNA and functional RNPs were assembled by incubating at 37° C. for 10 minutes. Finally, 20 pmols of Cas9 or Cas9D10A RNP was combined with 100 pmols of DNA donor oligonucleotide template for HDR prior to transfection into cells. Experiments were performed in triplicate
- Cas9/Cas9D10ARNP complexes were transfected into K562 cells (ATCC, Manassas, Va.), using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.) and the following protocol: RNP and RNP plus donor complexes were dispensed in a 2-3 ⁇ L final volume into individual wells of a 96-well plate.
- K562 cells suspended in media were transferred from culture flask to a 50 mL conical, cells were then pelleted by centrifugation for 3 minutes at 200 ⁇ g, the culture medium aspirated and washed once with calcium and magnesium-free PBS.
- K562 cells were then pelleted by centrifugation for 3 minutes at 200 ⁇ g, the PBS aspirated and cell pellet was resuspended in 10 mL of calcium and magnesium-free PBS.
- K562 cells were counted using the Countess® II Automated Cell CounterTM (Life Technologies, Grand Island, N.Y.). 2.2 ⁇ 10 7 cells were transferred to a 50 ml tube and pelleted. The PBS was aspirated and the cells were resuspended in NucleofectorTM SF (Lonza, Allendale, N.J.) solution to a density of 1 ⁇ 10 7 cells/mL. 20 ⁇ L of the cell suspension are then added to individual wells containing 2-3 ⁇ L of RNP and RNP plus Donor complexes and the entire volume was transferred to the wells of a 96-well NucleocuvetteTM Plate (Lonza, Allendale, N.J.).
- the plate was loaded onto the NucleofectorTM 96-well ShuttleTM (Lonza, Allendale, N.J.) and cells were nucleofected using the 96-FF-120 NucleofectorTM program (Lonza, Allendale, N.J.).
- Iscove's Modified Dulbecco's Media 80 ⁇ L Iscove's Modified Dulbecco's Media (IMDM, Life Technologies, Grand Island, N.Y.), supplemented with 10% FBS (Fisher Scientific, Pittsburgh, Pa.) and supplemented with penicillin and streptomycin (Life Technologies, Grand Island, N.Y.), was added to each well and 50 ⁇ L of the cell suspension was transferred to a 96-well cell culture plate containing 150 ⁇ L pre-warmed IMDM complete culture medium. The plate was then transferred to a tissue culture incubator and maintained at 37° C. in 5% CO 2 for approximately 48 hours.
- IMDM Iscove's Modified Dulbecco's Media
- FBS Fisher Scientific, Pittsburgh, Pa.
- penicillin and streptomycin Life Technologies, Grand Island, N.Y.
- Genomic DNA was isolated from K562 cells 48 hours after Cas9/Cas9D10A transfection using 50 ⁇ L QuickExtract DNA Extraction solution (Epicentre, Madison, Wis.) per well followed by incubation at 37° C. for 10 minutes. 50 ⁇ L water was added to the samples, and next they were incubated at 75° C. for 10 minutes and 95° C. for 5 minutes to stop the reaction. gDNA was stored at ⁇ 80° C. until further processing.
- a first PCR was performed using Q5 Hot Start High-Fidelity 2 ⁇ Master MixTM (New England Biolabs, Ipswich, Mass.) at 1 ⁇ concentration, primers at 0.5 ⁇ M each, 3.75 ⁇ L of gDNA in a final volume of 10 L and amplified 98° C. for 1 minute, 35 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes.
- PCR reaction was diluted 1:100 in water.
- Target-specific primers are shown in Table 8.
- a second ‘barcoding’ PCR was set up using unique primers for each sample facilitating multiplex sequencing. Primer pairs are shown in Table 9.
- the second PCR was performed using Q5 Hot Start High-Fidelity 2 ⁇ Master MixTM (New England Biolabs, Ipswich, Mass.) at 1 ⁇ concentration, primers at 0.5 ⁇ M each, 1 ⁇ L of 1:100 diluted first PCR, in a final volume of 10 ⁇ L and amplified 98° C. for 1 minute, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes. PCR reactions were pooled into a single microfuge tube for SPRIselectTM bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing.
- the microfuge tube was spun in a microcentrifuge to collect the contents of the tube, and was then returned to the magnet, incubated until the solution had cleared, and the supernatant containing the purified amplicons were dispensed into a clean microfuge tube.
- the purified amplicon library was quantified using the NanodropTM 2000 system (Thermo Scientific, Wilmington Del.).
- the amplicon library was normalized to 4 nmolar concentration as calculated from Nanodrop values and size of the amplicons.
- the library was analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2TM, 300 Cycles (Illumina, San Diego), with two 151-cycle paired-end run plus two eight-cycle index reads.
- Indel structures were compared between sequence data that was generated from cells transfected with wild-type Cas9 RNP or Cas9 RNP+Donor, for each of the individual targets, and for Cas9D10A RNP and Cas9D10A RNP+Donor for each of the pairs of targets.
- the experimental data demonstrated that cells transfected with Cas9 RNP exhibited a number of classes of mutant edits.
- Cas9 RNP+Donor showed a similar spectrum of mutant edits and donor-dependent edits, whereas cells transfected with Cas9D10A RNP only, showed no evidence of editing but Cas9D10A RNP+Donor demonstrated similar levels of donor insertion to the Cas9 RNP+Donor, but with no measurable mutant edits that could not be attributed to incorporation of the donor sequence.
- FIG. 12 shows a comparison of repair types using either Cas9 or Cas9D10A at Targets 3 and 4 (human CD34 locus).
- Cas9 RNP complexed with sgRNA was used to target either CD34 Target 3 or CD34 Target 4.
- Cas9D10A RNPs complexed with sgRNA were used to target CD34 Target 3 and Target 4.
- Negative controls were Cas9 or Cas9D10A only, without sgRNA. The distribution of repair is shown by the bars.
- Cas9 RNP displayed only mutagenic repair.
- Cas9 RNP+Donor demonstrated mutagenic repair and HDR
- Cas9D10A RNP showed barely detectable mutagenic repair.
- Cas9D10A RNP+Donor demonstrated HDR edits with barely detectable mutagenic repair.
- the deep sequencing analysis described in this example can be practiced by one of ordinary skill in the art with other Cas9/Cas9D10A RNP complexes (i.e. assembled with distinct sgRNAs and distinct ratios of Cas9/Cas9D10A and donor oligonucleotide templates).
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage preferentially HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells.
- spacer sequences for the two sgRNA sequences are chosen to vary the length of the deletion around the desired target site. Sequences are chosen such that the paired nickases are targeted to two sites on the same strand varying the distance between two sites in a range from 20 to 2000 nucleotides apart.
- Donor polynucleotides are designed with different lengths and positions relative to the locations of the spacer sequences and tested in combination with each pair of Cas9 nickase sgRNPs. Using the methods described in Examples 6 and 7, experiments are conducted to measure the frequency and type of DNA repair that takes place with each combination of paired nickases. Data are analyzed to identify the combination of nickase sgRNPs and donor polynucleotide that leads to the highest frequency of HDR with the lowest frequency of mutant editing.
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage preferentially HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells.
- either paired Cas9D10A or paired Cas9H840A are used with two specific, sgRNAs that deliver the paired nickases to two sites on the same strand 20-2000 nucleotides apart.
- Donor oligonucleotides are designed to deliver specific nucleotide insertions or deletions at the desired site ( FIG. 11 ).
- Experiments are carried out varying spacing between nickases and varying donor sequence and length as described in Example 8 to identify the combination of reagents leading to the highest frequency of HDR and lowest frequency of mutagenic repair to introduce the intended modification at the desired site.
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage exclusively HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells.
- either paired Cas9D10A or paired Cas9H840A can be used in tandem complexed with two specific sgRNAs that deliver the paired nickases to two sites on the same strand 20-2000 nucleotides apart.
- the donor oligonucleotides are provided in different orientations and/or lengths to deliver specific nucleotide insertions or deletions between two target Cas9-nickase sites in human primary cells for therapeutic advantage.
- This example illustrates the use of pairs of Cas9 nickase mutants to engage preferentially homology-directed repair pathways and block mutagenic repair pathways at break sites in eukaryotic cells.
- Cas9D10A and Cas9H840A are used in combination with two specific sgRNAs that deliver the paired nickases to two sites resulting in nicking of the same strand 20-2000 nucleotides apart.
- the sgRNAs paired with Cas9D10A must be chosen to target protospacer sequences and PAM sequences on one strand.
- sgRNAs paired with Cas9H840A must be chosen to target protospacer sequences and PAM sequences on the opposite strand to the Cas9D10A sgRNAs to ensure that the same strand is nicked twice.
- sgRNPs are assembled separately for each nickase mutant by combining the protein with the selected sgRNA.
- Donor oligonucleotides are designed to deliver specific nucleotide insertions or deletions at the desired site ( FIG. 11 ) and synthesized by an oligonucleotide manufacturer (e.g. Integrated DNA Technologies, Coralville, Iowa).
- Cas9D10a-sgRNPs are mixed with Cas9H840A-sgRNPs before transfection and the pair of nickases targeting the same strand transfected together with the donor oligonucleotide using methods described in above examples. Experiments are carried out varying spacing between nickases and varying donor sequence and length as described in Example 8 to identify the combination of reagents leading to the highest frequency of HDR and lowest frequency of mutagenic repair to introduce the intended modification at the desired site.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Mycology (AREA)
- Cell Biology (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
- This application claims the benefit under 35 U.S.C. § 119(e)(1) of U.S. Provisional Application Nos. 62/042,358, filed Aug. 27, 2014 and 62/047,495, filed Sep. 8, 2014, each of which applications is incorporated herein by reference in its entirety.
- The present invention relates to Type II CRISPR-Cas9 systems for use in increasing Cas9-mediated genome engineering efficiency by either decreasing the number of off-target nucleic acid double-stranded breaks, and/or enhancing homology-directed repair of a cleaved target nucleic acid.
- Clustered regularly interspaced short palindromic repeats (CRISPR) and associated Cas9 proteins constitute the CRISPR-Cas9 system. This system provides adaptive immunity against foreign DNA in bacteria (Barrangou, R., et al., “CRISPR provides acquired resistance against viruses in prokaryotes,” Science (2007) 315:1709-1712; Makarova, K. S., et al., “Evolution and classification of the CRISPR-Cas systems,” Nat Rev Microbiol (2011) 9:467-477; Garneau, J. E., et al., “The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA,” Nature (2010) 468:67-71; Sapranauskas, R., et al., “The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli,” Nucleic Acids Res (2011) 39: 9275-9282).
- The RNA-guided Cas9 endonuclease specifically targets and cleaves DNA in a sequence-dependent manner (Gasiunas, G., et al., “Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria,” Proc Natl Acad Sci USA (2012) 109: E2579-E2586; Jinek, M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821; Sternberg, S. H., et al., “DNA interrogation by the CRISPR RNA-guided endonuclease Cas9,” Nature (2014) 507:62; Deltcheva, E., et al., “CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III,” Nature (2011) 471:602-607), and has been widely used for programmable genome editing in a variety of organisms and model systems (Cong, L., et al., “Multiplex genome engineering using CRISPR/Cas systems,” Science (2013) 339:819-823; Jiang, W., et al., “RNA-guided editing of bacterial genomes using CRISPR-Cas systems,” Nat. Biotechnol. (2013) 31: 233-239; Sander, J. D. & Joung, J. K., “CRISPR-Cas systems for editing, regulating and targeting genomes,” Nature Biotechnol. (2014) 32:347-355).
- Jinek, M., et al., (“A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-21) showed that in a subset of CRISPR-associated (Cas) systems, the mature crRNA that is base-paired to trans-activating crRNA (tracrRNA) forms a two-part RNA structure, also called “dual-guide,” that directs the CRISPR-associated protein Cas9 to introduce double-stranded breaks in target DNA. At sites complementary to the crRNA-guide (spacer) sequence, the Cas9 HNH nuclease domain cleaves the complementary strand and the Cas9 RuvC-like domain cleaves the non-complementary strand. Dual-crRNA/tracrRNA molecules were engineered into single-chain crRNA/tracrRNA molecules. These single-chain crRNA/tracrRNA directed target sequence-specific Cas9 double-strand DNA cleavage.
- However, site-specific nucleases such as Cas9 can introduce double-stranded breaks in DNA in unintended and/or incorrect locations, termed “off-target effects.” Accordingly, methods to reduce or eliminate off-target DNA breaks are highly desirable.
- Additionally, DNA double-stranded breaks can be repaired by, for example, non-homologous end joining (NHEJ) or homology-directed repair (HDR). Faithful repair by HDR is inefficient at site-directed breaks of the target nucleic acid because other cellular mechanisms may result in the incorporation of nucleic acids at the site of a double-stranded break or a single-stranded nick. It is apparent there is a clear need to develop novel strategies that mitigate or eliminate off-target genome editing events and increase the efficiency of inserting new material into the sites cut by site-directed nucleases such as Cas9.
- In one aspect, the disclosure provides for a method for reducing off-targeting nuclease cleavage comprising: contacting a first complex comprising a catalytically active Cas9 and a guide RNA with a target nucleic acid; contacting a second complex comprising a catalytically inactive Cas9 (dCas9) and a guide RNA with an off-target nucleic acid; and cleaving the target nucleic acid with the first complex, wherein the second complex prevents the first complex from cleaving the off-target nucleic acid. In some embodiments, the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Streptococcus pyogenes, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. pyogenes.
- In some embodiments, the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Streptococcus thermophilus, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. thermophilus. In some embodiments, the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Staphylococcus aureus, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from S. aureus. In some embodiments, the active Cas9 comprises at least 25% amino acid identity to the HNH and RuvC active site motifs of a Cas9 from Neisseria meningitidis, such as at least 50%, 75%, 95%, 99% and complete amino acid identity, or any percentage between 25% and 100%, to a Cas9 from N. meningitidis.
- In some embodiments, the catalytically inactive Cas9 comprises a mutation in one or both of its nuclease domains. In some embodiments, the dCas9 is at least 80% catalytically inactive compared to a wild-type Cas9.
- In some embodiments, the first complex is capable of binding to the off-target nucleic acid. In some embodiments, the binding and/or cleavage of the first complex to the off-target nucleic acid is reduced by at least 30%. In some embodiments, the binding of the first complex to the off-target nucleic acid is reduced by at least 70%.
- In some embodiments, the cleaving comprises introducing a double-stranded break. In some embodiments, the cleaving comprises introducing a single-stranded break. In some embodiments, the target nucleic acid is DNA. In some embodiments, the target nucleic acid is double-stranded DNA.
- In another aspect, the disclosure provides for a composition comprising: two site-directed polypeptides to Cas9, wherein the two site-directed polypeptides comprise a mutation in one of their nuclease domains, wherein the two site-directed polypeptides are configured to bind and cleave the same strand of a double-stranded target nucleic acid.
- In some embodiments, the two site-directed polypeptides comprise at least 10% amino acid identity to a nuclease domain of Cas9 from S. pyogenes.
- In some embodiments, the mutation comprises a D10A mutation. In some embodiments, the mutation comprises an H840A mutation. In some embodiments, the target nucleic acid is DNA.
- In some embodiments, the two site-directed polypeptides are bound to the sense strand of the double-stranded target nucleic acid. In some embodiments, the two site-directed polypeptides are bound to the anti-sense strand of the double-stranded target nucleic acid. In some embodiments, the composition further comprises a donor polynucleotide. In some embodiments, the donor polynucleotide is single-stranded. In some embodiments, the donor polynucleotide is double-stranded. In some embodiments, the donor polynucleotide is partially single-stranded and partially double-stranded.
- In another embodiment, a method for reducing binding and/or cleavage of an off-target nucleic acid by a complex comprising a catalytically active Cas9 protein and a guide polynucleotide, is provided. The method comprises: (a) contacting a first complex with a selected target nucleic acid, wherein said first complex comprises: (i) a catalytically active Cas9 protein and (ii) a first guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said selected target nucleic acid; and (b) contacting a second complex with an off-target nucleic acid, wherein said second complex comprises (i) a catalytically inactive Cas9 protein (dCas9 protein) that does not cleave the off-target nucleic acid and (ii) a second guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said off-target nucleic acid, thereby reducing binding and/or cleavage by said first complex of said off-target nucleic acid.
- In other embodiments, the catalytically active Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In certain embodiments, the catalytically active Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In additional embodiments of the method, the catalytically active Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein.
- In further embodiments, the dCas9 protein comprises at least one mutation in one or more endonuclease domains to render the dCas9 protein catalytically inactive. In some embodiments, the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In other embodiments, the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In additional embodiments, the dCas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein with at least one mutation in one or more endonuclease domains to render the orthologous Cas9 protein catalytically inactive. In certain embodiments, the one or more mutations is in a RuvC-1 domain, such as a D1OA mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein. In other embodiments, the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein. In additional embodiments, the one or more mutations comprises a D10A mutation and a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutations in an orthologous Cas9 protein.
- In additional embodiments, the selected target nucleic acid is DNA, such as double-stranded DNA.
- In further embodiments, the selected target nucleic acid is cleaved to provide a cleavage site and the method further comprises modifying the target nucleic acid, such as by inserting at least a portion of the donor polynucleotide at the cleavage site. In other embodiments, the modifying comprises deleting one or more nucleotides at the cleavage cite.
- In additional embodiments, the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- In another embodiment, a method for modifying a target nucleic acid is provided comprising: contacting two complexes to the same strand of the target nucleic acid, wherein each of the two complexes comprises a site-directed polypeptide and a nucleic acid-targeting nucleic acid, wherein the two site-directed polypeptides comprise a mutation in one of their nuclease domains; and modifying the target nucleic acid. In some embodiments, the nucleic acid-targeting nucleic acid from one of the two complexes targets a different location in the target nucleic acid than the nucleic acid-targeting nucleic acid from the other of the two complexes.
- In some embodiments, the two site-directed polypeptides comprise at least 75% amino acid identity to Cas9 from S. pyogenes. In some embodiments, the two site-directed polypeptides comprise at least 10% amino acid identity to a nuclease domain of Cas9 from S. pyogenes. In some embodiments, the mutation comprises a D10A mutation. In some embodiments, the mutation comprises an H840A mutation. In some embodiments, the target nucleic acid is DNA.
- In some embodiments, the two site-directed polypeptides are bound to the sense strand of the double-stranded target nucleic acid. In some embodiments, the two site-directed polypeptides are bound to the anti-sense strand of the double-stranded target nucleic acid. In some embodiments, the modifying comprises cleaving the same strand of the target nucleic acid. In some embodiments, the cleaving comprises a single-stranded break. In some embodiments, the method further comprises inserting a donor polynucleotide into the target nucleic acid. In some embodiments, the donor polynucleotide is single-stranded. In some embodiments, the donor polynucleotide is double-stranded. In some embodiments, the donor polynucleotide is partially single-stranded and partially double-stranded.
- In another embodiment, the invention is directed to a method for cleaving a single strand of a target nucleic acid comprising contacting first and second complexes at spaced-apart locations on the same strand of a nucleic acid molecule. The first complex comprises (i) a first Cas9 protein with a mutation in an endonuclease domain thereof to render the Cas9 protein a nickase; and (ii) a first guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to a first target nucleic acid. The second complex comprises (i) a second Cas9 protein with a mutation in an endonuclease domain thereof, to render the Cas9 protein a nickase; and (ii) a second guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to a second target nucleic; wherein the first and second Cas9 proteins cleave a single strand of said nucleic acid molecule at the spaced-apart locations on the same strand, to render a single-stranded break.
- In some embodiments, the first Cas9 protein and/or the second Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes. In certain embodiments, the Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes. In additional embodiments of the method, the first Cas9 protein and/or the second Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein with a mutation in an endonuclease domain thereof, to render the orthologous Cas9 protein a nickase. In certain embodiments, the one or more mutations is in a RuvC-1 domain, such as a D1OA mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein. In other embodiments, the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein.
- In further embodiments, the target nucleic acid is double-stranded DNA and the complexes bind to and cleave the anti-sense strand of the double-stranded DNA. In other embodiments, the target nucleic acid is double-stranded DNA and the complexes bind to and cleave the sense strand of the double-stranded DNA.
- In additional embodiments, the method further comprises modifying the target nucleic acid, such as by inserting at least a portion of the donor polynucleotide into the target nucleic acid at the single-stranded break. In certain embodiments, the donor polynucleotide is single-stranded. In further embodiments, the inserting is done using homology-directed repair of the donor polynucleotide with the target nucleic acid.
- In additional embodiments, the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- In yet further embodiments, a method for directed homology-directed repair (HDR) in a target nucleic acid is provided. The method comprises: (a) contacting a first complex with a first target nucleic acid, wherein said first complex comprises: (i) a catalytically active Cas9 protein and (ii) a first guide polynucleotide, such as a sgRNA, that comprises a spacer adapted to bind to said first target nucleic acid, wherein said first complex cleaves the first target nucleic acid; and (b) contacting a second complex with a second target nucleic acid, wherein said second complex comprises: (i) a first catalytically inactive Cas9 protein (dCas9 protein) that comprises at least one mutation in one or more endonuclease domains to render the dCas9 protein catalytically inactive such that the dCas9 protein does not cleave the second target nucleic acid, and (ii) a second guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said second target nucleic acid, wherein the second complex comprises one end of a polynucleotide donor associated therewith and configured in proximity to the cleaved first target nucleic acid; wherein at least a portion of the polynucleotide donor is inserted into the first target nucleic acid via HDR.
- In certain embodiments, the second target nucleic acid is upstream of the first target nucleic acid. In other embodiments, the second target nucleic acid is downstream of the first target nucleic acid.
- In certain embodiments of the above method above, the 5′ end of the polynucleotide donor is associated with the second complex. In other embodiments, the 3′ end of the polynucleotide donor is associated with the second complex.
- In additional embodiments, the method further comprises: (c) contacting a third complex with a third target nucleic acid, wherein the third target nucleic acid is positioned downstream of the first target nucleic acid when the first target nucleic acid is downstream of the second target nucleic acid, or wherein the third target nucleic acid is positioned upstream of the first target nucleic acid when the first target nucleic acid is upstream of the second target nucleic acid, wherein said third complex comprises: (i) a second dCas9 protein that comprises at least one mutation in one or more endonuclease domains to render the second dCas9 protein catalytically inactive such that the second dCas9 protein does not cleave the third target nucleic acid, and (ii) a third guide polynucleotide, such as sgRNA, that comprises a spacer adapted to bind to said third target nucleic acid, and wherein the third complex comprises the other end of the polynucleotide donor associated with the second complex. In certain embodiments, the 5′ end of the polynucleotide donor is associated with the second complex and the 3′ end of the polynucleotide donor is associated with the third complex. In other embodiments, the 3′ end of the polynucleotide donor is associated with the second complex and the 5′ end of the polynucleotide donor is associated with the third complex.
- In other embodiments, the Cas9 protein comprises at least 75% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In certain embodiments, the Cas9 protein comprises at least 95% amino acid identity to a Cas9 from S. pyogenes, with the proviso that the Cas9 protein retains catalytic activity. In additional embodiments of the method, the Cas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein.
- In further embodiments, the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In other embodiments, the dCas9 protein comprises at least 75% amino acid identity to a Cas9 protein from S. pyogenes. In additional embodiments, the dCas9 protein is a S. pyogenes Cas9 protein or an orthologous Cas9 protein with at least one mutation in one or more endonuclease domains to render the orthologous Cas9 protein catalytically inactive. In certain embodiments, the one or more mutations is in a RuvC-1 domain, such as a D10A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein. In other embodiments, the one or more mutations is in the HNH domain, such as a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutation in an orthologous Cas9 protein. In additional embodiments, the one or more mutations comprises a D10A mutation and a H840A mutation, numbered relative to S. pyogenes Cas9, or the corresponding mutations in an orthologous Cas9 protein.
- In additional embodiments, the selected target nucleic acid is DNA, such as double-stranded DNA.
- In further embodiments, the method is performed in a cell, such as a eukaryotic cell, or in vitro.
- These aspects and other embodiments of the methods for increasing Cas9-mediated engineering efficiency and/or HDR repair will readily occur to those of ordinary skill in the art in view of the disclosure herein.
- All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.
-
FIG. 1A andFIG. 1B present illustrative examples of Type II CRISPR-Cas9 associated RNAs.FIG. 1A shows a two-RNA component Type II CRISPR-Cas9 comprising a crRNA (FIG. 1A, 101 ) and a tracrRNA (FIG. 1A, 102 ), otherwise known as a dual-guide RNA.FIG. 1B illustrates the formation of base-pair hydrogen bonds between the crRNA and the tracrRNA to form secondary structure (see U.S. Published Patent Application No. 2014-0068797, published 6 Mar. 2014; see also Jinek M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821). The figures present an overview of and nomenclature for secondary structural elements of the crRNA and tracrRNA of the S. pyogenes Cas9 including the following: a spacer element (FIG. 1B, 103 ); a first stem element comprising a lower stem element (FIG. 1B, 104 ), a bulge element comprising unpaired nucleotides (FIG. 1B, 105 ), and an upper stem element (FIG. 1B, 106 ); a nexus element (FIG. 1B, 107 ); a second hairpin element comprising a second stem element (FIG. 1B, 108 ); and a third hairpin element comprising a third stem element (FIG. 1B, 109 ). The figures are not proportionally rendered nor are they to scale. The locations of indicators are approximate. -
FIG. 2 shows another example of a Type II CRISPR-Cas9 associated RNA. The figure illustrates a single-guide RNA (sgRNA) wherein the crRNA is covalently joined to the tracrRNA and forms a RNA polynucleotide secondary structure through base-pair hydrogen bonding (see, e.g., U.S. Published Patent Application No. 2014-0068797, published 6 Mar. 2014). The figure presents an overview of and nomenclature for secondary structural elements of a sgRNA of the S. pyogenes Cas9 including the following: a spacer element (FIG. 2, 201 ); a first stem element comprising a lower stem element (FIG. 2, 202 ), a bulge element comprising unpaired nucleotides (FIG. 2, 205 ), and an upper stem element (FIG. 2, 203 ); a loop element (FIG. 2, 204 ) comprising unpaired nucleotides; (a first hairpin element comprises the first stem element and the loop element); a nexus element (FIG. 2, 206 ); a second hairpin element comprising a second stem element (FIG. 2, 207 ); and a third hairpin element comprising a third stem element (FIG. 2, 208 ). (See, e.g.,FIGS. 1 and 3 of Briner, A. E., et al., “Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality,” Molecular Cell (2014) 56:333-339.) The figure is not proportionally rendered nor is it to scale. The locations of indicators are approximate. -
FIG. 3A andFIG. 3B relate to structural information for a sgRNA/Cas protein complex and a Cas protein, respectively.FIG. 3A provides a model based on the crystal structure of S. pyogenes Cas9 (SpyCas9) in an active complex with sgRNA (Anders C., et al., “Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease,” Nature (2014) 513:569-573). Structural studies of the SpyCas9 showed that the protein exhibits a bi-lobed architecture comprising the Catalytic nuclease lobe and the a-Helical lobe of the enzyme (See Jinek M., et al., “Structures of Cas9 endonucleases reveal RNA-mediated conformational activation,” Science (2014) 343:1247997; Anders C., et al., “Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease,” Nature (2014) 513:569-573). InFIG. 3A , the α-Helical lobe (FIG. 3A , Helical domain) is shown as the darker lobe; the Catalytic nuclease lobe (FIG. 3A , Catalytic nuclease lobe) is shown in a light grey and the sgRNA backbone is shown in black (FIG. 3A , sgRNA). The relative location of the 3′ end of the sgRNA is indicated (FIG. 3A , 3′ end sgRNA). The spacer RNA of the sgRNA is not visible because it is surrounded by the two protein lobes. The relative location of the 5′ end of the sgRNA (FIG. 3A , 5′ end sgRNA) is indicated and the spacer RNA of the sgRNA is located in the 5′ end region of the sgRNA. A cysteine residue (FIG. 3A , WT SpyCas9 Cys) in wild type SpyCas9 is identified in the present disclosure as an available cross-linking site. InFIG. 3A , the Catalytic nuclease lobe is shown as the lighter lobe wherein the relative positions of the RuvC (FIG. 3A , RuvC; RNase H homologous domain) and HNH nuclease (FIG. 3A , HNH; HNH nuclease homologous domain) domains are indicated. The RuvC and HNH nuclease domains, when active, each cut a different DNA strand in target DNA. The C-terminal domain (FIG. 3A , CTD) is involved in recognition of protospacer adjacent motifs (PAM) in target DNA.FIG. 3B presents a model of the domain arrangement of SpyCas9 relative to its primary sequence structure. InFIG. 3B , three regions of the primary sequence correspond to the RuvC domain (FIG. 3B , RuvC-I (amino acids 1-78), RuvC-II (amino acids 719-765), and RuvC-III (amino acids 926-1102)). One region corresponds to the Helical domain (FIG. 3B , Helical Domain (amino acids 79-718). One region corresponds to the HNH domain (FIG. 3B , HNH (amino acids 766-925). One region corresponds to the CTD domain (FIG. 3B , CTD (amino acids 1103-1368). InFIG. 3B , the regions of the primary sequence corresponding to the α-Helical lobe (FIG. 3B , alpha-helical lobe) and the Nuclease domain lobe (FIG. 3B , Nuclease domain lobe) are indicated with brackets. -
FIG. 4 depicts an exemplary embodiment of off-target binding and cleavage during genome engineering. In this embodiment, a target nucleic acid (FIG. 4 . 115) is contacted with a complex comprising a site-directed polypeptide (e.g., Cas9) (FIG. 4, 105 ) and a nucleic acid-targeting nucleic acid (e.g., sgRNA or dual-guide RNA) (FIG. 4, 110 ). The complex comprising the Cas9 binds to a target nucleic acid (FIG. 4, 120 ). In some instances, the complex binds to an off-target nucleic acid (FIG. 4, 125 ). In a cleavage step (FIG. 4, 130 ), the Cas9 of the complex can cleave the target nucleic acid (FIG. 4, 120 ) and the off-target nucleic acid, thereby generating off-target effects. -
FIG. 5 depicts an exemplary embodiment of a method of the disclosure for reducing off-target binding and cleavage events. A target nucleic acid (FIG. 5, 215 ) is contacted with a complex comprising a site-directed polypeptide (e.g., an active Cas9) (FIG. 5, 205 ) and a nucleic acid-targeting nucleic acid (e.g., sgRNA or dual-guide RNA) (FIG. 5, 210 ). The complex binds to a target nucleic acid (FIG. 5, 220 ). In some instances, the complex comprising the Cas9 and sgRNA binds to an off-target nucleic acid (FIG. 5, 225 ). Complexes comprising an engineered dCas9 protein (FIG. 5, 235 ) and an engineered sgRNA (FIG. 5, 236 ), can be introduced and contacted (FIG. 5, 230 ) with the target nucleic acid. The dCas9 complexes can either displace or prevent the binding of complexes comprising active Cas9. The active Cas9 can cleave (FIG. 5, 240 /245) the target nucleic acid. The active Cas9 is prevented from cleaving the off-target nucleic acid because the dCas9 is preventing its binding and cleavage. In this way, off-target cleavage may be prevented. -
FIG. 6A ,FIG. 6B , andFIG. 6C show the use of tandem Cas9 D10A nickases to excise a single-stranded region of DNA on the same strand of a target nucleic acid and insert a donor polynucleotide.FIG. 6A shows two D10A sgRNA/dCas9 complexes targeted to two spaced-apart positions on the sense strand of a target polynucleotide.FIG. 6B shows that a region on the sense strand between the targeted sites has been cleaved.FIG. 6C shows the insertion of the donor polynucleotide with overlapping flanking regions. -
FIG. 7A andFIG. 7B depict methods of increasing HDR using sgRNA/dCas9 and catalytically active sgRNA/Cas9 complexes.FIG. 7A shows a system using a single sgRNA/dCas9 complex tethered to a HDR polynucleotide donor adjacent to an active sgRNA/Cas9 complex to direct the donor to the site of the double-stranded break and to position the donor next to the cut site.FIG. 7B shows a system using two spaced-apart sgRNA/dCas9 complexes and a catalytically active sgRNA/Cas9 complex positioned between the two catalytically inactive complexes, wherein the donor is positioned across the double-stranded break. -
FIG. 8 shows the effects of dCas9 nuclease blockers (dCas9-NBs) on VEGFA sgRNA/Cas9 on-target editing at the VEGFA locus. -
FIG. 9 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 off-target editing at a known VEGFA off-target locus onhuman chromosome 15. -
FIG. 10 shows the various embodiments of the experimental conditions used to position homology donor nucleotides near a targeted site for increasing HDR efficiency, as described in Example 5C. -
FIG. 11 shows potential donor configurations using tandem Cas9D10A as described in the examples. -
FIG. 12 shows a comparison of repair types using either Cas9 or Cas9D10A atTargets 3 and 4 (human CD34 locus) as described in the examples. Neg denotes a negative control which is either Cas9 or Cas9D10A only, without sgRNA. The distribution of repair is denoted by the bars in the figure. Solid bars=unedited; hatched bars=mutagenic repair; stippled bars=HDR. - It is to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. As used in this specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a sgRNA/dCas9 complex” includes one or more such complexes, reference to “a sgRNA/Cas9 complex” includes one or more such complexes, reference to “a mutation” includes one or more mutations, and the like. It is also to be understood that when reference is made to an embodiment using a sgRNA to target Cas9 or dCas9 to a target site, one skilled in the art can use an alternative embodiment of the invention based on the use of a dual-guide RNA (e.g. crRNA/tracrRNA) in place of the sgRNA.
- Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although other methods and materials similar, or equivalent, to those described herein can be used in the practice of the present invention, preferred materials and methods are described herein.
- In view of the teachings of the present specification, one of ordinary skill in the art can apply conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics, and recombinant polynucleotides, as taught, for example, by the following standard texts: Antibodies: A Laboratory Manual, Second edition, E. A. Greenfield, 2014, Cold Spring Harbor Laboratory Press, ISBN 978-1-936113-81-1; Culture of Animal Cells: A Manual of Basic Technique and Specialized Applications, 6th Edition, R. I. Freshney, 2010, Wiley-Blackwell, ISBN 978-0-470-52812-9; Transgenic Animal Technology, Third Edition: A Laboratory Handbook, 2014, C. A. Pinkert, Elsevier, ISBN 978-0124104907; The Laboratory Mouse, Second Edition, 2012, H. Hedrich, Academic Press, ISBN 978-0123820082; Manipulating the Mouse Embryo: A Laboratory Manual, 2013, R. Behringer, et al., Cold Spring Harbor Laboratory Press, ISBN 978-1936113019; PCR 2: A Practical Approach, 1995, M. J. McPherson, et al., IRL Press, ISBN 978-0199634248; Methods in Molecular Biology (Series), J. M. Walker, ISSN 1064-3745, Humana Press; RNA: A Laboratory Manual, 2010, D. C. Rio, et al., Cold Spring Harbor Laboratory Press, ISBN 978-0879698911; Methods in Enzymology (Series), Academic Press; Molecular Cloning: A Laboratory Manual (Fourth Edition), 2012, M. R. Green, et al., Cold Spring Harbor Laboratory Press, ISBN 978-1605500560; Bioconjugate Techniques, Third Edition, 2013, G. T. Hermanson, Academic Press, ISBN 978-0123822390; Methods in Plant Biochemistry and Molecular Biology, 1997, W. V. Dashek, CRC Press, ISBN 978-0849394805; Plant Cell Culture Protocols (Methods in Molecular Biology), 2012, V. M. Loyola-Vargas, et al., Humana Press, ISBN 978-1617798177; Plant Transformation Technologies, 2011, C. N. Stewart, et al., Wiley-Blackwell, ISBN 978-0813821955; Recombinant Proteins from Plants (Methods in Biotechnology), 2010, C. Cunningham, et al., Humana Press, ISBN 978-1617370212; Plant Genomics: Methods and Protocols (Methods in Molecular Biology), 2009, D. J. Somers, et al., Humana Press, ISBN 978-1588299970; Plant Biotechnology: Methods in Tissue Culture and Gene Transfer, 2008, R. Keshavachandran, et al., Orient Blackswan, ISBN 978-8173716164.
- The term “Cas9 protein” as used herein refers to Type II CRISPR-Cas9 proteins (as described, e.g., in Chylinski, K., (2013) “The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems,” RNA Biol. 2013 10(5):726-737), including, but not limited to Cas9, Cas9-like, proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof. The term “Cas9 protein” as used herein refers to Cas9 wild-type proteins derived from Type II CRISPR-Cas9 systems, modifications of Cas9 proteins, variants of Cas9 proteins, Cas9 orthologs, and combinations thereof. Cas9 proteins can be derived from any of various bacterial species which genomes encode such proteins. Cas proteins for use in the present methods are described further below.
- The terms “wild-type,” “naturally-occurring” and “unmodified” are used herein to mean the typical (or most common) form, appearance, phenotype, or strain existing in nature; for example, the typical form of cells, organisms, characteristics, polynucleotides, proteins, macromolecular complexes, genes, RNAs, DNAs, or genomes as they occur in and can be isolated from a source in nature. The wild-type form, appearance, phenotype, or strain serve as the original parent before an intentional modification. Thus, mutant, variant, engineered, recombinant, and modified forms are not wild-type forms.
- As used herein, the terms “engineered,” “genetically engineered,” “recombinant,” “modified,” and “non-naturally occurring” are interchangeable and indicate intentional human manipulation.
- As used herein, the terms “nucleic acid,” “nucleotide sequence,” “oligonucleotide,” and “polynucleotide” are interchangeable. All refer to a polymeric form of nucleotides. The nucleotides may be deoxyribonucleotides (DNA) or ribonucleotides (RNA), or analogs thereof, and they may be of any length. Polynucleotides may perform any function and may have any secondary structure and three-dimensional structure. The terms encompass known analogs of natural nucleotides and nucleotides that are modified in the base, sugar and/or phosphate moieties. Analogs of a particular nucleotide have the same base-pairing specificity (e.g., an analog of A base pairs with T). A polynucleotide may comprise one modified nucleotide or multiple modified nucleotides. Examples of modified nucleotides include methylated nucleotides and nucleotide analogs. Nucleotide structure may be modified before or after a polymer is assembled. Following polymerization, polynucleotides may be additionally modified via, for example, conjugation with a labeling component or target-binding component. A nucleotide sequence may incorporate non-nucleotide components. The terms also encompass nucleic acids comprising modified backbone residues or linkages, that (i) are synthetic, naturally occurring, and non-naturally occurring, and (ii) have similar binding properties as a reference polynucleotide (e.g., DNA or RNA). Examples of such analogs include, but are not limited to, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs), and morpholino structures.
- Polynucleotide sequences are displayed herein in the conventional 5′ to 3′ orientation.
- As used herein, the term “complementarity” refers to the ability of a nucleic acid sequence to form hydrogen bond(s) with another nucleic acid sequence (e.g., through traditional Watson-Crick base pairing). A percent complementarity indicates the percentage of residues in a nucleic acid molecule that can form hydrogen bonds with a second nucleic acid sequence. When two polynucleotide sequences have 100% complementarity, the two sequences are perfectly complementary, i.e., all of a first polynucleotide's contiguous residues hydrogen bond with the same number of contiguous residues in a second polynucleotide.
- As used herein, the term “sequence identity” generally refers to the percent identity of bases or amino acids determined by comparing a first polynucleotide or polypeptide to a second polynucleotide or polypeptide using algorithms having various weighting parameters. Sequence identity between two polypeptides or two polynucleotides can be determined using sequence alignment by various methods and computer programs (e.g., BLAST, CS-BLAST, FASTA, HMMER, L-ALIGN, etc.), available through the worldwide web at sites including GENBANK (ncbi.nlm.nih.gov/genbank/) and EMBL-EBI (ebi.ac.uk.). Sequence identity between two polynucleotides or two polypeptide sequences is generally calculated using the standard default parameters of the various methods or computer programs.
- As used herein a “stem-loop structure” or “stem-loop element” refers to a polynucleotide having a secondary structure that includes a region of nucleotides that are known or predicted to form a double-stranded region (the “stem element”) that is linked on one side by a region of predominantly single-stranded nucleotides (the “loop element”). The term “hairpin” element is also used herein to refer to stem-loop structures. Such structures are well known in the art. The base pairing may be exact. However, as is known in the art, a stem element does not require exact base pairing. Thus, the stem element may include one or more base mismatches or non-paired bases.
- As used herein, the term “recombination” refers to a process of exchange of genetic information between two polynucleotides.
- As used herein, the term “homology-directed repair” or “HDR” refers to DNA repair that takes place in cells, for example, during repair of double-stranded and single-stranded breaks in DNA. HDR requires nucleotide sequence homology and uses a “donor template” (donor template DNA, polynucleotide donor, or oligonucleotide (used interchangably herein) to repair the sequence where the double-stranded break occurred (e.g., DNA target sequence). This results in the transfer of genetic information from, for example, the donor template DNA to the DNA target sequence. HDR may result in alteration of the DNA target sequence (e.g., insertion, deletion, mutation) if the donor template DNA sequence or oligonucleotide sequence differs from the DNA target sequence and part or all of the donor template DNA polynucleotide or oligonucleotide is incorporated into the DNA target sequence. In some embodiments, an entire donor template DNA polynucleotide, a portion of the donor template DNA polynucleotide, or a copy of the donor polynucleotide is integrated at the site of the DNA target sequence.
- As used herein the term “non-homologous end joining” or “NHEJ” refers to the repair of double-stranded breaks in DNA by direct ligation of one end of the break to the other end of the break without a requirement for a donor template DNA. NHEJ in the absence of a donor template DNA often results in a small number of nucleotides randomly inserted or deleted at the site of the double-stranded break.
- Alternative mechanisms of DNA insertion that do not require sequence homology between the donor and the target sequence can also be used for nucleic acid insertion. These mechanisms involve various components of the cellular DNA repair machinery and it is to be understood that the scope of the invention is not bound by the use of any particular mechanism for insertion of nucleic acid after target nucleic acid is cut or nicked by a site-specific polynucleotide.
- The terms “vector” and “plasmid” are used interchangeably and as used herein refer to a polynucleotide vehicle to introduce genetic material into a cell. Vectors can be linear or circular. Vectors can integrate into a target genome of a host cell or replicate independently in a host cell. Vectors can comprise, for example, an origin of replication, a multicloning site, and/or a selectable marker. An expression vector typically comprises an expression cassette. Vectors and plasmids include, but are not limited to, integrating vectors, prokaryotic plasmids, eukaryotic plasmids, plant synthetic chromosomes, episomes, viral vectors, cosmids, and artificial chromosomes. As used herein the term “expression cassette” is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell. For example, the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell. An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- As used herein the term “expression cassette” is a polynucleotide construct, generated recombinantly or synthetically, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell. For example, the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell. An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.
- As used herein a “targeting vector” is a recombinant DNA construct typically comprising tailored DNA arms homologous to genomic DNA that flanks critical elements of a target gene or target sequence. When introduced into a cell, the targeting vector integrates into the cell genome via homologous recombination. Elements of the target gene can be modified in a number of ways including deletions and/or insertions. A defective target gene can be replaced by a functional target gene, or in the alternative a functional gene can be knocked out. Optionally a targeting vector comprises a selection cassette comprising a selectable marker that is introduced into the target gene. Targeting regions adjacent or sometimes within a target gene can be used to affect regulation of gene expression.
- As used herein, the terms “regulatory sequences,” “regulatory elements,” and “control elements” are interchangeable and refer to polynucleotide sequences that are upstream (5′ non-coding sequences), within, or downstream (3′ non-translated sequences) of a polynucleotide target to be expressed. Regulatory sequences influence, for example, the timing of transcription, amount or level of transcription, RNA processing or stability, and/or translation of the related structural nucleotide sequence. Regulatory sequences may include activator binding sequences, enhancers, introns, polyadenylation recognition sequences, promoters, repressor binding sequences, stem-loop structures, translational initiation sequences, translation leader sequences, transcription termination sequences, translation termination sequences, primer binding sites, and the like.
- As used herein the term “operably linked” refers to polynucleotide sequences or amino acid sequences placed into a functional relationship with one another. For instance, a promoter or enhancer is operably linked to a coding sequence if it regulates, or contributes to the modulation of, the transcription of the coding sequence. Operably linked DNA sequences encoding regulatory sequences are typically contiguous to the coding sequence. However, enhancers can function when separated from a promoter by up to several kilobases or more. Accordingly, some polynucleotide elements may be operably linked but not contiguous.
- As used herein, the term “expression” refers to transcription of a polynucleotide from a DNA template, resulting in, for example, an mRNA or other RNA transcript (e.g., non-coding, such as structural or scaffolding RNAs). The term further refers to the process through which transcribed mRNA is translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be referred to collectively as “gene product.” Expression may include splicing the mRNA in a eukaryotic cell, if the polynucleotide is derived from genomic DNA.
- As used herein the term “modulate” refers to a change in the quantity, degree or amount of a function. For example, the methods disclosed herein may modulate Cas9-mediated targeting efficiency by decreasing or eliminating off-target cleavage, thereby enhancing cleavage at the target site, or may enhance HDR and decrease the likelihood of NHEJ events. Accordingly, the term “modulating targeting” may denote increasing desired targeting events and/or inhibiting off-target cleavage. Similarly, “modulating HDR” can denote increasing HDR and/or decreasing NHEJ.
- Modulation can be assayed by determining any characteristic directly or indirectly affected by the expression of the target gene. Such characteristics include, e.g., changes in targeting efficiency, RNA or protein levels, protein activity, product levels, associated gene expression, or activity level of reporter genes. Thus, “modulation” of gene expression includes both gene activation and gene repression.
- As used herein, the term “amino acid” refers to natural and synthetic (unnatural) amino acids, including amino acid analogs, modified amino acids, peptidomimetics, glycine, and D or L optical isomers.
- As used herein, the terms “peptide,” “polypeptide,” and “protein” are interchangeable and refer to polymers of amino acids. A polypeptide may be of any length. It may be branched or linear, it may be interrupted by non-amino acids, and it may comprise modified amino acids. The terms may be used to refer to an amino acid polymer that has been modified through, for example, acetylation, disulfide bond formation, glycosylation, lipidation, phosphorylation, cross-linking, and/or conjugation (e.g., with a labeling component or ligand). Polypeptide sequences are displayed herein in the conventional N-terminal to C-terminal orientation.
- Polypeptides and polynucleotides can be made using routine techniques in the field of molecular biology (see, e.g., standard texts discussed above). Further, essentially any polypeptide or polynucleotide can be custom ordered from commercial sources.
- The term “binding” as used herein includes a non-covalent interaction between macromolecules (e.g., between a protein and a polynucleotide, between a polynucleotide and a polynucleotide, and between a protein and a protein). Such non-covalent interaction is also referred to as “associating” or “interacting” (e.g., when a first macromolecule interacts with a second macromolecule, the first macromolecule binds to second macromolecule in a non-covalent manner). Some portions of a binding interaction may be sequence-specific; however, all components of a binding interaction do not need to be sequence-specific, such as a protein's contacts with phosphate residues in a DNA backbone. Binding interactions can be characterized by a dissociation constant (Kd). “Affinity” refers to the strength of binding. An increased binding affinity is correlated with a lower Kd. An example of non-covalent binding is hydrogen bond formation between base pairs.
- As used herein, the term “isolated” can refer to a nucleic acid or polypeptide that, by the hand of a human, exists apart from its native environment and is therefore not a product of nature. Isolated means substantially pure. An isolated nucleic acid or polypeptide can exist in a purified form and/or can exist in a non-native environment such as, for example, in a recombinant cell.
- As used herein, a “host cell” generally refers to a biological cell. A cell can be the basic structural, functional and/or biological unit of a living organism. A cell can originate from any organism having one or more cells. Examples of host cells include, but are not limited to: a prokaryotic cell, eukaryotic cell, a bacterial cell, an archaeal cell, a cell of a single-cell eukaryotic organism, a protozoa cell, a cell from a plant (e.g. cells from plant crops, fruits, vegetables, grains, soy bean, corn, maize, wheat, seeds, tomatoes, rice, cassava, sugarcane, sunflower, sorghum, millet, alfalfa, oil-producing Brassica (for example, but not limited to, oilseed rape/canola), pumpkin, hay, potatoes, cotton, cannabis, tobacco, flowering plants, conifers, gymnosperms, ferns, clubmosses, hornworts, liverworts, mosses), an algal cell, (e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C. Agardh, and the like), seaweeds (e.g. kelp), a fungal cell (e.g., a yeast cell, a cell from a mushroom), an animal cell, a cell from an invertebrate animal (e.g fruit fly, cnidarian, echinoderm, nematode, etc.), a cell from a vertebrate animal (e.g., fish, amphibian, reptile, bird, mammal), a cell from a mammal (e.g., a pig, a cow, a goat, a sheep, a rodent, a rat, a mouse, a non-human primate, a human, etc.). Further, a cell can be a stem cell or progenitor cell.
- As used herein, the term “transgenic organism” refers to an organism comprising a recombinantly introduced polynucleotide.
- As used herein, the terms “transgenic plant cell” and “transgenic plant” are interchangeable and refer to a plant cell or a plant containing a recombinantly introduced polynucleotide. Included in the term transgenic plant is the progeny (any generation) of a transgenic plant or a seed such that the progeny or seed comprises a DNA sequence encoding a recombinantly introduced polynucleotide or a fragment thereof.
- As used herein, the phrase “generating a transgenic plant cell or a plant” refers to using recombinant DNA methods and techniques to construct a vector for plant transformation to transform the plant cell or the plant and to generate the transgenic plant cell or the transgenic plant.
- A CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a genomic locus found in the genomes of many prokaryotes (e.g., bacteria and archaea). CRISPR loci provide resistance to foreign invaders (e.g., virus, phage) in prokaryotes. In this way, the CRISPR system can be thought to function as a type of immune system to help defend prokaryotes against foreign invaders. There are three stages of CRISPR locus function: integration of new sequences into the locus, biogenesis of CRISPR RNA (crRNA), and silencing of foreign invader nucleic acid.
- A CRISPR locus includes a number of short repeating sequences referred to as “repeats.” Repeats can form hairpin structures and/or repeats can be unstructured single-stranded sequences. The repeats occur in clusters. Repeats frequently diverge between species. Repeats are regularly interspaced with unique intervening sequences, referred to as “spacers,” resulting in a repeat-spacer-repeat locus architecture. Spacers are identical to or have high homology with known foreign invader sequences. A spacer-repeat unit encodes a crisprRNA (crRNA). A crRNA refers to the mature form of the spacer-repeat unit. A crRNA comprises a “seed” sequence that is involved in targeting a target nucleic acid (e.g., possibly as a surveillance mechanism against foreign nucleic acid). A seed sequence is typically located towards the 5′ end of a crRNA (e.g. in the Cascade complex; for a description of the Cascade complex see, e.g., Jore, M. M. et al., “Structural basis for CRISPR RNA-guided DNA recognition by Cascade,” Nature Structural & Molecular Biology (2011) 18:529-536) or at the 3′ end of the spacer of a crRNA (e.g., in a Type II CRISPR-Cas9 system), directly adjacent to the first stem.
- A CRISPR locus comprises polynucleotide sequences encoding for CRISPR Associated Genes (Cas) genes. Cas genes are involved in the biogenesis and/or the interference stages of crRNA function. Cas genes display extreme sequence (e.g., primary sequence) divergence between species and homologues. For example, Casl homologues can comprise less than 10% primary sequence identity between homologues. Some Cas genes comprise homologous secondary and/or tertiary structures. For example, despite extreme sequence divergence, many members of the Cas6 family of CRISPR proteins comprise a N-terminal ferredoxin-like fold. Cas genes are named according to the organism from which they are derived. For example, Cas genes in Staphylococcus epidermidis can be referred to as Csm-type, Cas genes in Streptococcus thermophilus can be referred to as Csn-type, and Cas genes in Pyrococcus furiosus can be referred to as Cmr-type.
- The integration stage of a CRISPR system refers to the ability of the CRISPR locus to integrate new spacers into the crRNA array upon being infected by a foreign invader. Acquisition of the foreign invader spacers can help confer immunity to subsequent attacks by the same foreign invader. Integration typically occurs at the leader end of the CRISPR locus. Cas proteins (e.g., Cas1 and Cas2) are involved in integration of new spacer sequences. Integration proceeds similarly for some types of CRISPR systems (e.g., Type I-III).
- Mature crRNAs are processed from a longer polycistronic CRISPR locus transcript (i.e., pre-crRNA array). A pre-crRNA array comprises a plurality of crRNAs. The repeats in the pre-crRNA array are recognized by Cas genes. Cas genes bind to the repeats and cleave the repeats. This action can liberate the plurality of crRNAs. crRNAs can be subjected to further events to produce the mature crRNA form such as trimming (e.g., with an exonuclease). A crRNA may comprise all, some, or none of the CRISPR repeat sequence.
- Interference refers to the stage in the CRISPR system that is functionally responsible for combating infection by a foreign invader. CRISPR interference follows a similar mechanism to RNA interference (RNAi: e.g., wherein a target RNA is targeted (e.g., hybridized) by a short interfering RNA (siRNA)), which results in target RNA degradation and/or destabilization. CRISPR systems perform interference of a target nucleic acid by coupling crRNAs and Cas genes, thereby forming CRISPR ribonucleoproteins (crRNPs). crRNA of the crRNP guides the crRNP to foreign invader nucleic acid, (e.g., by recognizing the foreign invader nucleic acid through hybridization). Hybridized target foreign invader nucleic acid-crRNA units are subjected to cleavage by Cas proteins. Target nucleic acid interference typically requires a protospacer adjacent motif (PAM) in a target nucleic acid.
- There are at least four types of CRISPR systems: Type I, Type II, Type III, and Type U. More than one CRISPR type system can be found in an organism. CRISPR systems can be complementary to each other, and/or can lend functional units in trans to facilitate CRISPR locus processing. Type II systems can be further subdivided into II-A (contains Csn2 locus) and II-B (contains Cas4 locus) and Type II-C (neither Csn2 nor Cas4, e.g. N. meningitides). Modifications of the components of CRISPR-Type II systems are extensively discussed in the present specification.
- crRNA biogenesis in a Type II CRISPR system comprises a trans-activating CRISPR RNA (tracrRNA). A tracrRNA is typically modified by endogenous RNaseIII. The tracrRNA hybridizes to a crRNA repeat in the pre-crRNA array. Endogenous RNaselll is recruited to cleave the pre-crRNA. Cleaved crRNAs are subjected to exoribonuclease trimming to produce the mature crRNA form (e.g., 5′ trimming). The tracrRNA typically remains hybridized to the crRNA. The tracrRNA and the crRNA associate with a site-directed polypeptide (e.g., Cas9). The crRNA of the crRNA-tracrRNA-Cas9 complex can guide the complex to a target nucleic acid to which the crRNA can hybridize. Hybridization of the crRNA to the target nucleic acid activates a wild-type, cognate Cas9 for target nucleic acid cleavage. Target nucleic acid in a Type II CRISPR system comprises a PAM. In some embodiments, a PAM is essential to facilitate binding of a site-directed polypeptide (e.g., Cas9) to a target nucleic acid.
- Cas9 is an exemplary Type II CRISPR Cas protein. Cas9 is an endonuclease that can be programmed by the tracrRNA/crRNA to cleave, site-specifically, target DNA using two distinct endonuclease domains (HNH and RuvC/RNase H-like domains) (see U.S. Published Patent Application No. 2014-0068797, published 6 Mar. 2014; see also Jinek M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821), one for each strand of the DNA's double helix. RuvC and HNH together produce double-stranded breaks (DSBs), and separately can produce single-stranded breaks.
FIG. 3A presents a model of the domain arrangement of SpyCas9 (S. pyogenes Cas9) relative to its primary sequence structure. Two RNA components of a Type II CRISPR-Cas9 system are illustrated inFIG. 1A . Typically each CRISPR-Cas9 system comprises a tracrRNA and a crRNA. However, this requirement can be bypassed by using an engineered sgRNA, described more fully below, containing a designed hairpin that mimics the tracrRNA-crRNA complex (Jinek et al., 2012). Base-pairing between the sgRNA and target DNA causes double-stranded breaks (DSBs) due to the endonuclease activity of Cas9. Binding specificity is determined by both sgRNA-DNA base pairing and a short DNA motif (protospacer adjacent motif [PAM] sequence: NGG) juxtaposed to the DNA complementary region (Marraffini L A, Sontheimer E J. “CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea,” Nat Rev Genet., 2010; 11:181-190). Thus, the CRISPR system only requires a minimal set of two molecules—the Cas9 protein and the sgRNA. - A large number of Cas9 orthologs are known in the art as well as their associated tracrRNA and crRNA components (see, e.g., “Supplementary Table S2. List of bacterial strains with identified Cas9 orthologs,” Fonfara, Ines, et al., “Phylogeny of Cas9 Determines Functional Exchangeability of Dual-RNA and Cas9 among Orthologous Type II CRISPR/Cas Systems,” Nucleic Acids Research (2014) 42:2577-2590, including all Supplemental Data; Chylinski K., et al., “Classification and evolution of type II CRISPR-Cas systems,” Nucleic Acids Research (2014) 42:6091-6105, including all Supplemental Data.); Esvelt, K. M., et al., “Orthogonal Cas9 proteins for RNA-guided gene regulation and editing,” Nature Methods (2013) 10:1116-1121). A number of orthogonal Cas9 proteins have been identified including Cas9 proteins from Neisseria meningitidis, Streptococcus thermophilus and Staphylococcus aureus.
- As used herein, “a Cas9 protein” refers to a Cas9 protein derived from any species, subspecies or strain of bacteria that encodes Cas9, as well as variants and orthologs of the particular Cas9 in question. The Cas9 proteins can either be directly isolated and purified from bacteria, or synthetically or recombinantly produced, or typically delivered using a construct encoding the protein, including without limitation, naked DNA, plasmid DNA, a viral vector and mRNA for Cas9 expression.
- Variants and modifications of Cas9 proteins are known in the art. U.S. Published Patent Application 20140273226, published
Sep 18, 2014, incorporated herein by reference in its entirety, discusses the S. pyogenes Cas9 gene, Cas9 protein, and variants of the Cas9 protein including host-specific codon optimized Cas9 coding sequences (e.g., ¶¶0129-0137 therein) and Cas9 fusion proteins (e.g., ¶¶233-240 therein). U.S. Published Patent Application 20140315985, published Oct. 23, 2014, incorporated herein in its entirety, teaches a large number of exemplary wild-type Cas9 polypeptides (e.g., SEQ ID NO: 1-256, SEQ ID NOS: 795-1346, therein) including the sequence of Cas9 from S. pyogenes (SEQ ID NO: 8, therein). Modifications and variants of Cas9 proteins are also discussed (e.g., ¶¶504-608, therein). Non-limiting examples of Cas9 proteins include Cas9 proteins from S. pyogenes (GI:15675041); Listeria innocua Clip 11262 (GI:16801805); Streptococcus mutans UA159 (GI:24379809); Streptococcus thermophilus LMD-9 (S. thermophilus A, GI:11662823; S. thermophilus B, GI:116627542); Lactobacillus buchneri NRRL B-30929 (GI:331702228); Treponema denticola ATCC 35405 (GI:42525843); Francisella novicida U112 (GI:118497352); Campylobacter jejuni subsp. Jejuni NCTC 11168 (GI:218563121); Pasteurella multocida subsp. multocida str. Pm70 (GI:218767588); Neisseria meningitidis Zs491 (GI:15602992) and Actinomyces naeslundii (GI:489880078). - Aspects of the present invention can be practiced by one of ordinary skill in the art following the guidance of the specification to use Type II CRISPR-Cas9 proteins and Cas-protein encoding polynucleotides, including, but not limited to Cas9, Cas9-like, proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof. The cognate RNA components of these Cas proteins can be manipulated and modified for use in the practice of the present invention by one of ordinary skill in the art following the guidance of the present specification.
- By “dCas9” is meant a nuclease-deactivated Cas9, also termed “catalytically inactive”, “catalytically dead Cas9” or “dead Cas9.” Such molecules lack all or a portion of endonuclease activity and can therefore be used to regulate genes in an RNA-guided manner (Jinek M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science (2012) 337:816-821). This is accomplished by introducing mutations that inactivate Cas9 nuclease function and is typically accomplished by mutating both of the two catalytic residues (D10A in the RuvC-1 domain, and H840A in the HNH domain, numbered relative to S. pyogenes Cas9) of the gene encoding Cas9. It is understood that mutation of other catalytic residues to reduce activity of either or both of the nuclease domains can also be carried out by one skilled in the art. In doing so, dCas9 is unable to cleave dsDNA but retains the ability to target DNA. The Cas9 double mutant with changes at amino acid positions D10A and H840A completely inactivates both the nuclease and nickase activities. Targeting specificity is determined by complementary base-pairing of an sgRNA to the genomic locus and the protospacer adjacent motif (PAM).
- dCas9 can be used alone or in fusions to synthetically repress (CRISPRi) or activate (CRISPRa) gene expression. CRISPRi can work independently of host cellular machineries. In some embodiments, only a dCas9 protein and a customized sgRNA designed with a complementary region to any gene of interest direct dCas9 to a chosen genomic location. In other embodiments, dCas9 can be fused to a transcription factor, such as a repressor, and the fused Cas9-transcription factor can then work in concert with cellular machineries. The binding specificity is determined jointly by the complementary region on the sgRNA and a short DNA motif (protospacer adjacent motif or PAM) juxtaposed to the DNA complementary region, dependent on the species in question. (see, e.g., Anders C., et al., “Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease,” Nature (2014) 513:569-573). In the case of S. pyogenes, this sequence is NGG. To achieve transcriptional repression, dCas9 can be used by itself (whereby it represses transcription through steric hindrance). Taken together sgRNA and dCas9 provide a minimum system for gene-specific regulation in any organism. (Qi, L. S., et al., “Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression” Cell (2013) 152:1173-1183). CRISPRa is carried out by dCas9-transcription factor (activator) fusions.
- By a “Cas9 nickase” is meant a Cas9 mutant that does not retain the ability to make double-stranded breaks in a target nucleic acid sequence, but maintains the ability to bind to and make a single-stranded break at a target site. Such a mutant will typically include a mutation in one, but not both of the Cas9 endonuclease domains (HNH and RuvC). Thus, an amino acid mutation at position D10A or H840A in Cas9, numbered relative to S. pyogenes, can result in the inactivation of the nuclease catalytic activity and convert Cas9 to a nickase enzyme that makes single-stranded breaks at the target site. It is to be understood that other site-directed polypeptides such as meganucleases, TALE nucleases, Zinc-finger nucleases, MEGA-TALs and others known to one of skill in the art can be used in alternative embodiments.
- crRNA has a region of complementarity to a potential DNA target sequence (
FIG. 1A , the dark, 5′ region of the crRNA) and a second region that forms base-pair hydrogen bonds with the tracrRNA to form a secondary structure, typically to form at least a stem structure (FIG. 1A , the light region extending to the 3′ end of the crRNA). The region of complementarity to the DNA target is the spacer. The tracrRNA and a crRNA interact through a number of base-pair hydrogen bonds to form secondary RNA structures, for example, as illustrated inFIG. 1B . Complex formation between tracrRNA/crRNA and Cas protein results in conformational change of the Cas protein that facilitates binding to DNA, endonuclease activities of the Cas protein, and crRNA-guided site-specific DNA cleavage by the endonuclease. For a Cas protein/tracrRNA/crRNA complex to cleave a DNA target sequence, the DNA target sequence is adjacent to a cognate protospacer adjacent motif (PAM). - The term “sgRNA” typically refers to a single-guide RNA (i.e., a single, contiguous polynucleotide sequence) that essentially comprises a crRNA connected at its 3′ end to the 5′ end of a tracrRNA through a “loop” sequence (see, e.g., U.S. Published Patent Application No. 20140068797, published 6 Mar. 2014, incorporated herein by reference in its entirety). sgRNA interacts with a cognate Cas protein essentially as described for tracrRNA/crRNA polynucleotides, as discussed above. Similar to crRNA, sgRNA has a spacer, a region of complementarity to a potential DNA target sequence (
FIG. 2, 201 ), adjacent a second region that forms base-pair hydrogen bonds that form a secondary structure, typically a stem structure (FIG. 2, 202, 203, 204, 205 ). The term includes truncated single-guide RNAs (tru-sgRNAs) of approximately 17-18 nt. (See, e.g., Fu, Y. et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. (2014) 32:279-284). The term also encompasses functional miniature sgRNAs with expendable features removed, but that retain an essential and conserved module termed the “nexus” located in the portion of sgRNA that corresponds to tracrRNA (not crRNA). See, e.g, U.S. Published Patent Application No. 20140315985, published 23 Oct. 2014, incorporated herein by reference in its entirety; Briner et al., “Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality,” Molecular Cell (2014) 56:333-339. The nexus is located immediately downstream of (i.e., located in the 3′ direction from) the lower stem in Type II CRISPR-Cas9 systems. An example of the relative location of the nexus is illustrated in the sgRNA shown inFIG. 2 . The nexus confers the binding of a sgRNA or a tracrRNA to its cognate Cas9 protein and confers an apoenzyme to haloenzyme conformational transition. - With reference to a crRNA or sgRNA, a “spacer” or “spacer element” as used herein refers to the polynucleotide sequence that can specifically hybridize to a target nucleic acid sequence. The spacer element interacts with the target nucleic acid sequence through hydrogen bonding between complementary base pairs (i.e., paired bases). A spacer element binds to a selected DNA target sequence. Accordingly, the spacer element is a DNA target-binding sequence. The spacer element determines the location of Cas protein's site-specific binding and endonucleolytic cleavage. Spacer elements range from ˜17- to ˜84 nucleotides in length, depending on the Cas protein with which they are associated, and have an average length of 36 nucleotides (Marraffini, et al., “CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea,” Nature reviews Genetics (2010) 11:181-190). In a Type II CRISPR-Cas9 system the spacer element typically comprises a “seed” sequence that is involved in targeting a target nucleic acid. For example, for SpyCas9, the functional length for a spacer to direct specific cleavage is typically about 12-25 nucleotides. Variability of the functional length for a spacer element is known in the art (e.g., U.S. Published Patent Application No. 20140315985, published 23 Oct. 2014, incorporated herein by reference in its entirety).
-
FIG. 3A provides a three-dimensional model based on the crystal structure of S. pyogenes Cas9 (SpyCas9) in an active complex with sgRNA. The relationship of the sgRNA to the Helical domain and the Catalytic domain is illustrated. The 3′ and 5′ ends of the sgRNA are indicated, as well as exposed portions of the sgRNA. The spacer RNA of the sgRNA is not visible because it is surrounded by the α-Helical lobe (Helical domain) and the Catalytic nuclease lobe (Catalytic domain). The spacer RNA of the sgRNA is located in the 5′ end region of the sgRNA. The RuvC and HNH nuclease domains, when active, each cut a different DNA strand in target DNA. The C-terminal domain (CTD) is involved in recognition of protospacer adjacent motifs (PAMs) in target DNA. - U.S. Published Patent Application No. 20140315985, published 23 Oct. 2014, incorporated herein by reference in its entirety; and Briner et al., “Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality,” Molecular Cell (2014) 56:333-339, disclose consensus sequences and secondary structures of predicted sgRNAs for several sgRNA/Cas9 families. The general arrangement of secondary structures in the predicted sgRNAs up to and including the nexus are presented in
FIG. 2 herein which presents an overview of and nomenclature for elements of the sgRNA of the S. pyogenes Cas9. Relative toFIG. 2 , there is variation in the number and arrangement of stem structures located 3′ of the nexus in the sgRNAs of U.S. Published Patent Application No. 2014-0315985 and Briner, et al. Ran et al., “In vivo genome editing using Staphylococcus aureus Cas9,” Nature (2015) 520:186-191, including all extended data) present the crRNA/tracrRNA sequences and secondary structures of eight Type II CRISPR-Cas9 systems (see Extended DataFIG. 1 of Ran, et al.). Further, Fonfara, et al., (“Phylogeny of Cas9 Determines Functional Exchangeability of Dual-RNA and Cas9 among Orthologous Type II CRISPR/Cas Systems,” Nucleic Acids Research (2014) 42:2577-2590, including all Supplemental Data, in particular Supplemental Figure S11) present the crRNA/tracrRNA sequences and secondary structures of eight Type II CRISPR-Cas9 systems. - By “guide polynucleotide” is meant any polynucleotide that site-specifically guides Cas9 or dCas9 to a target, or off-target, nucleic acid. Many such guide polynucleotides are known, such as but not limited to sgRNA (including miniature and truncated sgRNAs), dual-guide RNA, including but not limited to, crRNA/tracrRNA molecules, as described above, and the like.
- By “donor polynucleotide” is meant a polynucleotide that can be directed to, and inserted into a target site of interest to modify the target nucleic acid. All or a portion of the donor polynucleotide can be inserted into the target nucleic acid. The donor polynucleotide is used for repair of the break in the target DNA sequence resulting in the transfer of genetic information (i.e., polynucleotide sequences) from the donor at the site or in close proximity of the break in the DNA. Accordingly, new genetic information (i.e., polynucleotide sequences) may be inserted or copied at a target DNA site. The donor polynucleotide can be double- or single-stranded DNA, RNA, a vector, plasmid, or the like. Non-symmetrical polynucleotide donors can also be used that are composed of two DNA oligonucleotides. They are partially complementary, and each can include a flanking region of homology. The donor can be used to insert or replace polynucleotide sequences in a target sequence, for example, to introduce a polynucleotide that encodes a protein or functional RNA (e.g., siRNA), to introduce a protein tag, to modify a regulatory sequence of a gene, or to introduce a regulatory sequence to a gene (e.g. a promoter, an enhancer, an internal ribosome entry sequence, a start codon, a stop codon, a localization signal, or polyadenylation signal), to modify a nucleic acid sequence (e.g., introduce a mutation), and the like.
- Targeted DNA modifications using donor polynucleotides for large changes (e.g., more than 100 bp insertions or deletions) traditionally use plasmid-based donor templates that contain homology arms flanking the site of alteration. Each arm can vary in length, but is typically longer than about 100 bp, such as 100-1500 bp, e.g., 100 . . . 200 . . . 300 . . . 400 . . . 500 . . . 600 . . . 700 . . . 800 . . . 900 . . . 1000 . . . 1500 bp or any integer between these values. However, these numbers can vary, depending on the size of the donor polynucleotide and the target polynucleotide. This method can be used to generate large modifications, including insertion of reporter genes such as fluorescent proteins or antibiotic resistance markers. For transfection in cells, such as HEK cells, approximately 100-1000 ng, e.g., 100 . . . 200 . . . 300 . . . 400 . . . 500 . . . 600 . . . 700 . . . 800 . . . 900 . . . 1000 ng or any integer between these values, of a typical size donor plasmid (e.g., approximately 5 kb) containing a sgRNA/Cas9 vector, can be used for one well in 24-well plate. (See, e.g., Yang et al., “One Step Generation of Mice Carrying Reporter and Conditional Alleles by CRISPR/Cas-Mediated Genome Engineering” Cell (2013) 154:1370-1379).
- Single-stranded and partially double-stranded oligonucleotides, such as DNA oligonucleotides, have been used in place of targeting plasmids for short modifications (e.g., less than 50 bp) within a defined locus without cloning. To achieve high HDR efficiencies, single-stranded oligonucleotides containing flanking sequences on each side that are homologous to the target region can be used, and can be oriented in either the sense or antisense direction relative to the target locus. The length of each arm can vary in length, but the length of at least one arm is typically longer than about 10 bases, such as from 10-150 bases, e.g., 10 . . . 20 . . . 30 . . . 40 . . . 50 . . . 60 . . . 70 . . . 80 . . . 90 . . . 100 . . . 110 . . . 120 . . . 130 . . . 140 . . . 150, or any integer within these ranges. However, these numbers can vary, depending on the size of the donor polynucleotide and the target polynucleotide. In a preferred embodiment, the length of at least one arm is 10 bases or more. In other embodiments, the length of at least one arm is 20 bases or more. In yet other embodiments, the length of at least one arm is 30 bases or more. In some embodiments, the length of at least one arm is less than 100 bases. In further embodiments, the length of at least one arm is greater than 100 bases. In some embodiments, the length of at least one arm is zero bases. For single-stranded DNA oligonucleotide design, typically an oligonucleotide with around 100-150 bp total homology is used. The mutation is introduced in the middle, giving 50-75 bp homology arms for a donor designed to be symmetrical about the target site. In other cases, no homology arms are required, and the donor polynucleotide is inserted using non-homologous DNA repair mechanisms.
- In one embodiment, the methods described herein are useful for increasing Cas9-mediated engineering efficiency by modulating off-target genome editing events, e.g., by decreasing the number of double-stranded breaks in DNA in unintended and/or incorrect locations. In particular, genome engineering systems, such as those using zinc-finger nucleases (ZFNs), TALE-nucleases, and bacterially derived RNA-guided nucleases (e.g., the CRISPR-Cas9 system), have been used to target a protein to a specific genomic locus where it can induce a DNA double-stranded break. DNA double-stranded breaks can be repaired through either non homologous end joining (NHEJ) or homology-directed repair (HDR). NHEJ can result in imperfect repair and the addition or deletion of several bases, whereas HDR can be utilized to insert rationally designed exogenous DNA sequences. These methods can sometimes result in off-target nuclease activity as described above.
- Methods for increasing specificity and/or reducing off-target genomic events have included the use of shorter guide sequences with enhanced specificity (Fu, Y. et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. (2014) 32:279-284) and/or engineering Cas9 mutants that can use two independent targeting events to induce a double-stranded break (Ran, F. A, et al., “Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity,” Cell (2013) 154:1380-1389; Tsai, S. Q., et al., “Dimeric CRISPR RNA-guided FokI nucleases for highly specific genome editing,” Nature Biotech. (2014) 32:569-576). However, these strategies may reduce the efficiency of on-target genome editing, constrain targeting capabilities, or still result in “off-target” nuclease activity.
- Accordingly, an embodiment of the present invention provides methods to mitigate off-target genome editing events in a cell population or in an in vitro biochemical reaction. Mitigation of such events can be performed by an engineered CRISPR-Cas9 system as described herein. The methods include at least two basic components: (1) a complex of a catalytically active Cas9 protein and a sgRNA that targets the intended nucleic acid target (sgRNA/Cas9 complex); and (2) a complex of a catalytically inactive Cas9 protein, termed “dCas9” herein and a sgRNA that targets off-target loci (sgRNA/dCas9 complex). In some embodiments, rather than a sgRNA/Cas9 complex, the first component can be any site-directed catalytically active DNA endonuclease, such as but not limited to zinc-finger nucleases (ZFNs), TALE-nucleases, and the like.
- An off-target nucleic acid can differ from a target nucleic acid by, e.g., at least 1-5, such as 1, 2, 3, 4, 5 nucleotides, or up to 10 or more nucleotides or any number of nucleotides within the stated ranges.
- The percent complementarity between an off-target nucleic acid locus (or surrounding genomic region) and an “on-target” nucleic acid-targeting nucleic acid can be, for example about 5% to about 100%, or any percentage between this range, more preferably in the range of 90-100%.
- A number of catalytically active Cas9 proteins are known in the art and, as explained above, a Cas9 protein for use herein can be derived from any bacterial species, subspecies or strain that encodes the same. Although the subject invention is exemplified using S. pyogenes Cas9, orthologs from other bacterial species will find use herein. The specificity of these Cas9 orthologs is well known. Also useful are proteins encoded by Cas9-like synthetic proteins, and variants and modifications thereof. As explained above, the sequences for hundreds of Cas9 proteins are known and any of these proteins will find use with the present methods. The appropriate Cas9 protein to use with a particular target nucleic acid can be readily determined by one of skill in the art.
- dCas9 proteins are also known and, as described above, these proteins can be made catalytically inactive by mutating the RuvC1 and/or HNH domains to eliminate nuclease function. This is typically accomplished by introducing point mutations in both of the two catalytic residues (D10A and H840A, numbered relative to S. pyogenes Cas9) of the gene encoding Cas9. In doing so, dCas9 is rendered unable to cleave double-stranded DNA but retains the ability to target DNA. Moreover, as with the Cas9 proteins, the dCas9 proteins can be derived from any bacterial species, subspecies or strain that encodes the same. Also useful are proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof. In one embodiment, dCas9 orthologs are selected based on the particular protospacer adjacent motif (PAM) sequences present on the target nucleic acid. For example, S. pyogenes Cas9 targets NGG sequences. However, if other PAM sequences are present, dCas9 orthologs can be used to target these sequences to block Cas9 cleavage thereof and prevent off-target breaks.
- In the following embodiments, sgRNA is used as an exemplary guide polynucleotide, however, it will be recognized by one of skill in the art that other guide polynucleotides that site-specifically guide Cas9 or dCas9 to a target, or off-target, nucleic acid can be used. The sgRNA component of the complexes is responsible for targeting a particular nucleic acid target. In particular, the spacer region of the sgRNA includes the region of complementarity to the targeted nucleic acid sequence. Thus, the spacer is the polynucleotide sequence that can specifically hybridize to a target nucleic acid sequence. The spacer element interacts with the target nucleic acid sequence through hydrogen bonding between complementary base pairs. A spacer element binds to a selected nucleic acid target sequence. Accordingly, the spacer element is the DNA target-binding sequence.
- Thus, binding specificity is determined by both sgRNA-DNA base pairing and the PAM sequence juxtaposed to the DNA complementary region.
- Thus, in an aspect of the present invention, a sgRNA/dCas9 complex is targeted to genomic loci similarly targeted by catalytically intact sgRNA/Cas9 complexes, and can stably bind DNA and subsequently block activity of proteins targeted to those loci. In this way, dCas9 can robustly impair binding and/or activity of endogenous transcription factors in eukaryotic cells.
- In an exemplary embodiment, a sgRNA, complexed with Cas9 (sgRNA/Cas9 complex) is directed to a genomic locus of interest to induce double-stranded breaks. The binding specificity is determined by both sgRNA-DNA base pairing and the PAM sequence juxtaposed to the DNA complementary region. Computational and/or experimental methods (e.g., sequencing, in silico DNA alignment methods can be used to ascertain off-target nuclease activity (e.g., to determine the off-target loci). Such methods are described in detail below. Independently acting dCas9 proteins can be designed to target these off-target loci. These engineered dCas9 proteins can be deployed as site-specific nuclease “blockers” to obstruct catalytically intact sgRNA/Cas9 binding and nuclease activity.
- sgRNA/Cas9 and sgRNA/dCas9 blockers may be introduced, for example into a cell or tissue, at differing concentrations. For example, sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced ata ratio of 1:2, 1:3, 1:4, 1:5, 1:6, 1:7, 1:8, 1:9, 1:10, 10:1, 9:1, 8:1, 7:1, 6:1, 5:1, 4:1, 3:1, or 2:1. Additionally, all of these components, i.e., sgRNA, Cas9, dCas9, etc. may be provided separately, e.g., as separately in vitro assembled complexes, using separate DNA or RNA constructs, or together, in a single construct, or in any combination. Typically, the sgRNA components will complex with Cas9 and dCas9 when provided to a cell. Additionally, cell lines such as but not limited to HEK293 cells, are commercially available that constitutively express S. pyogenes Cas9 as well as S. pyogenes Cas9-GFP fusions. In this instance, cells can be transfected without catalytically active Cas9 as such is provided by the host cell.
- sgRNA/Cas9 and sgRNA/dCas9 complexes may be introduced at differing time points. For example, sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced at least 1 minute apart, 5 minutes apart, 10 minutes apart, 30 minutes apart, 1 hour apart, 5 hours apart, or 15 hours apart or more. sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced at most 1 minute apart, 5 minutes apart, 10 minutes apart, 30 minutes apart, 1 hour apart, 5 hours apart, or 15 hours apart or more. sgRNA/Cas9 complexes can be introduced before the sgRNA/dCas9 complexes. sgRNA/Cas9 complexes can be introduced after the sgRNA/dCas9 complexes. sgRNA/Cas9 complexes and sgRNA/dCas9 complexes may be differentially regulated (i.e. differentially expressed or stabilized) via exogenously supplied agents (e.g. inducible DNA promoters or inducible Cas9 proteins).
- sgRNA/Cas9 and sgRNA/dCas9 complexes can be introduced into a cell by a variety of means including transfection, transduction, electroporation, micelles and liposome delivery, lipid nanoparticles, viral delivery, protein injection, and the like, described more fully below.
- sgRNA/dCas9 complexes may be directed to genomic loci that partially overlap. For example, these complexes can be directed to loci that overlap by at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides. These complexes can be directed to loci that overlap by at most 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- sgRNA/dCas9 complexes can be directed adjacent to sites of observed off-target nuclease activity and Cas9 binding. For example, these complexes can be directed to sites that are adjacent to a site of observed off-target activity by at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides. Complexes can be directed to sites that are adjacent to a site of observed off-target activity by at most 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, or 35 or more nucleotides.
- Multiple sgRNA/dCas9 complexes may be used to “tile” a given locus for maximum nuclease blocking activity. In some instances, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more sgRNA/dCas9 complexes are used. In some instances, at most 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more complexes are used. The complexes can cover a locus. Complexes can cover at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100% of a locus. The complexes can cover at most 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100% of a locus.
- The blockers can reduce off-targeting binding of the active complexes by at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100%. The blockers can reduce off-targeting binding of the active complexes by at most 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100%.
- Without wishing to be bound by a particular theory, a sgRNA/dCas9 complex can reduce binding of a sgRNA/Cas9 complex to an off-target nucleic acid by any mechanism. For example, the sgRNA/dCas9 complex can compete with the catalytically active complex for binding the off-target nucleic acid. The sgRNA/dCas9 complex can bind to the off-target nucleic acid, thereby creating steric hindrance for the sgRNA/Cas9 complex that prevents binding of the sgRNA/Cas9 complex to the off-target nucleic acid. The sgRNA/dCas9 complex can displace the sgRNA/Cas9 complex from the off-target nucleic acid. The sgRNA/dCas9 complex can inhibit the sgRNA/Cas9 complex from binding the off-target nucleic acid. The sgRNA/dCas9 complex can block the sgRNA/Cas9 complex from binding the off-target nucleic acid.
- A sgRNA/dCas9 complex can reduce off-target nucleic acid binding, cleavage and/or modification by a sgRNA/Cas9 complex by 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or any value within this range. Conversely, a sgRNA/dCas9 complex can increase site-specific binding, and/or modification by 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or any value within this range.
- Computational methods for determining off-target nuclease activity with any of the methods described herein can comprise scanning the genomic sequence of a subject. The genomic sequence can be segmented in silico into a plurality of nucleic acid sequences. The segmented nucleic acid sequences can be aligned with the nucleic acid-targeting nucleic acid sequence. A sequence search algorithm can determine one or more off-target nucleic acid sequences by identifying segmented genomic sequences with alignments comprising a defined number of base-pair mismatches with the nucleic acid-targeting nucleic acid. The number of base-pair mismatches between a genomic sequence and a nucleic acid-targeting nucleic acid selected by an algorithm can be user-defined, for example, the algorithm can be programmed to identify off-target sequences with mismatches of up to five base pairs between the genomic sequence and the nucleic acid-targeting nucleic acid. In silico binding algorithms can be used to calculate binding and/or cleavage efficiency of each predicted off-target nucleic acid sequence by a site-directed polypeptide using a weighting scheme. These data can be used to calculate off-target activity for a given nucleic acid-targeting nucleic acid and/or site-directed polypeptide.
- Off-target binding activity can be determined by experimental methods. In one non-limiting example, the experimental methods can comprise sequencing a nucleic acid sample contacted by a complex comprising a site-directed polypeptide and a nucleic acid-targeting nucleic acid. The contacted nucleic acid sample can be fixed or crosslinked to stabilize the protein-RNA-DNA complex. The complex comprising the site-directed polypeptide, the nucleic acid (e.g., target nucleic acid, off-target nucleic acid), and/or the nucleic acid-targeting nucleic acid can be captured from the nucleic acid sample with an affinity tag and/or capture agents. Nucleic acid purification techniques can be used to separate the target nucleic acid from the complex. Nucleic acid purification techniques can include spin column separation, precipitation, and electrophoresis. The nucleic acid can be prepared for sequencing analysis by shearing and ligation of adaptors. Preparation for sequencing analysis can include the generation of sequencing libraries of the eluted target nucleic acid.
- Sequence determination methods can include but are not limited to pyrosequencing (for example, as commercialized by 454 Life Sciences, Inc., Branford, Conn.); sequencing by ligation (for example, as commercialized in the SOLiDTM technology, Life Technology, Inc., Carlsbad, Calif.); sequencing by synthesis using modified nucleotides (such as commercialized in TruSeq™ and HiSeg™ technology by Illumina, Inc., San Diego, Calif., HeliScopeTM by Helicos Biosciences Corporation, Cambridge, Mass., and PacBio RS by Pacific Biosciences of California, Inc., Menlo Park, Calif.), sequencing by ion detection technologies (Ion Torrent, Inc., South San Francisco, Calif.); sequencing of DNA nanoballs (Complete Genomics, Inc., Mountain View, Calif.); nanopore-based sequencing technologies (for example, as developed by Oxford Nanopore Technologies, LTD, Oxford, UK), capillary sequencing (e.g, such as commercialized in MegaBACE by Molecular Dynamics, Inc., Sunnyvale, Calif.), electronic sequencing, single molecule sequencing (e.g., such as commercialized in SMRT™ technology by Pacific Biosciences, Menlo Park, Calif.), droplet microfluidic sequencing, sequencing by hybridization (such as commercialized by Affymetrix, Santa Clara, Calif.), bisulfite sequencing, and other known highly parallelized sequencing methods.
- In some aspects, sequencing is performed by microarray analysis, such as in SNP genotyping by binding. Sequencing analysis can determine the identity and frequency of an off-target binding site for a given nucleic acid-targeting nucleic acid, by counting the number of times a particular binding site is read. The library of sequenced nucleic acids can include target nucleic acids and off-target nucleic acids.
- Off-target binding activity can be determined by additional experimental methods. The experimental methods can comprise inserting a donor oligonucleotide into a cleaved site (Tsai, S. Q. et al., “GUIDE-seq enables genome wide profiling of off-target cleavage by CRISPR-Cas nucleases” Nature Biotech. (2015) 33:187-197). The genomic DNA is then fragmented, adapters are appended, and PCR is performed with primers complementary to the donor oligonucleotide and adapter sequences. The amplified sequences are sequenced and then mapped back to a reference genome. Other experimental methods rely on exploiting double-stranded break induced translocations of genomic DNA to experimentally induce (via the creation of double-stranded breaks) genomic “bait” sites (Frock, R. L. et. al. “Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases” Nature Biotech. (2015) 33:179-186). Genomic DNA is subsequently fragmented, adapters are appended, and PCR is performed with primers complementary to the known “bait” site and adapter sequence. The amplified sequences are sequenced and then mapped back to a reference genome
- In some embodiments, Cas9 and/or dCas9 proteins may be modified or fused to additional protein domains. The fused additional protein domains may enhance the ability to block, impair, or inactivate active Cas9 complexes. Examples of fusion proteins including a Cas9 or dCas9 protein include, but are not limited to a nuclease, a transposase, a methylase, a transcription factor repressor or activator domain (e.g., such as KRAB and VP16), co-repressor and co-activator domains, DNA methyl transferases, histone acetyltransferases, histone deacetylases, and DNA cleavage domains (e.g., a cleavage domain from the endonuclease FokI). In some embodiments, a non-native sequence can confer new functions to the fusion protein. Such functions include, but are not limited to the following: methyltransferase activity, demethylase activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, glycosylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, sumoylating activity, desumoylating activity, ribosylation activity, deribosylation activity, myristoylation activity, remodelling activity, protease activity, oxidoreductase activity, transferase activity, hydrolase activity, lyase activity, isomerase activity, synthase activity, synthetase activity, demyristoylation activity, and any combinations thereof.
- In some instances, a donor polynucleotide is inserted into the target nucleic acid, when the target nucleic acid is cleaved. The methods can, for example, therefore be used to modify genomic DNA in a eukaryotic cell isolated from an organism. Further, the methods can also comprise contacting the nucleic acid target sequence in the genomic DNA with a donor polynucleotide wherein the modification comprises that at least a portion of the donor polynucleotide is integrated at the nucleic acid target sequence.
- Donor polynucleotide insertion can be performed by the homologous recombination machinery of the cell. The donor polynucleotide may comprise homology arms that are partially or fully complementary to the regions of the target nucleic acid outside of the break point. Donor polynucleotide insertion can also be performed by non-homologous DNA repair machinery of the cell, where no homology arms are required. A discussion of donor polynucleotides is presented more fully below.
- In an embodiment, the donor polynucleotide can be tethered to the sgRNA/dCas9 complex to position it near the cleavage site targeted by the active sgRNA/Cas9 complex. See,
FIG. 7A . In this way, homology directed repair, as described below, can be achieved at higher rates. - One particular embodiment of the methods described herein is illustrated in
FIGS. 4 and 5 .FIG. 4 depicts an example of undesirable off-target binding and cleavage of a nuclease during genome engineering. A targetnucleic acid 115 can be contacted with a complex comprising a site-directed polypeptide (e.g., Cas9) 105 and a nucleic acid-targeting nucleic acid (e.g., a sgRNA) 110. The complex comprising theCas9 105 andsgRNA 110 can bind to a targetnucleic acid 120. In some instances, the complex comprising theCas9 105 andsgRNA 110 can bind to an off-targetnucleic acid 125. In acleavage step 130, the Cas9 of the complex can cleave 135 the targetnucleic acid 120 and the off-targetnucleic acid 125, thereby generating off-target effects. -
FIG. 5 depicts an exemplary embodiment of reducing off-target binding and cleavage events using dCas9 blockers. A targetnucleic acid 215 can be contacted with a complex comprising a site-directed polypeptide (e.g., Cas9) 205 and a nucleic acid-targeting nucleic acid (e.g., single-guide RNA) 210. The complex comprising theCas9 205 andsgRNA 210 can bind to a targetnucleic acid 220. In some instances, the complex comprising theCas9 205 andsgRNA 210 can bind to an off-targetnucleic acid 225. Complexes comprising an engineereddCas9 protein 235 and an engineeredsgRNA 236 can be introduced and contacted 230 with the target nucleic acid. The dCas9 complexes can either displace or prevent the binding of complexes comprisingactive Cas9 205. Theactive Cas9 205 can cleave 240/245 the targetnucleic acid 220. Theactive Cas9 205 may not cleave the off-targetnucleic acid 225 because thedCas9 235 is preventing its binding and cleavage. In this way, off-target binding and cleavage may be prevented. - In another embodiment, the invention is directed to a method for increasing the efficiency of nucleic acid insertion by HDR or non-homologous repair mechanisms. As explained above, multiple repair pathways can compete at site-directed DNA breaks. Such breaks can be repaired through, for example, non-homologous end-joining (NHEJ) or homology-directed repair (HDR). NHEJ can result in imperfect repair and the addition or deletion of one or more bases, whereas HDR can be utilized to insert rationally designed exogenous DNA sequences. Repair of a double-strand break (DSB) in the presence of a donor polynucleotide results in a portion of breaks faithfully repaired by HDR and a portion of breaks where another less reliable repair pathway, such as NHEJ, is engaged, resulting in mixed repair outcomes. Alternative repair pathways for insertion of DNA using non-homologous mechanisms can also result in the insertion of donor DNA at the break site.
- HDR relies on the presence of a donor polynucleotide, a piece of DNA that shares homology with sequences at or near a DNA break, that can be used to repair DNA breaks. Without wishing to be bound by any particular theory or mechanism, in some embodiments, the present invention provides for methods for using site-directed polypeptides (e.g., Cas9 nucleases) to create a substrate that will engage an alternative HDR pathway, similar to the single-strand annealing (SSA) branch of HDR, and will prevent competing DNA repair pathways, such as NHEJ, from repairing the break.
- Single-strand annealing (SSA) is a process that is initiated when a break is introduced between two repetitive sequences oriented in the same direction. Four steps are generally necessary for the repair of breaks by SSA: (1) an end resection step which extends the repeated sequences and forms long 3′-ssDNA; (2) an annealing step in which the two repetitive sequences are annealed together forming a flap structure; (3) a second resection step in which the flap structures formed by the regions between the repeats are resected and; (4) ligation of the ends. HDR at DNA nicks occurs via a mechanism sometimes termed “alternative-HDR” that shares many of the same genetic dependencies of SSA such as inhibition by RAD51 and BRCA2.
- The inventors herein have developed an engineered CRISPR system by generating at least two single-stranded nicks on the same strand of a target double-stranded nucleic acid and providing a donor polynucleotide that can anneal to the non-nicked strand. This results in the accurate insertion of exogenous DNA with little background mutagenic end-joining.
- This method employs tandem Cas9 molecules that comprise one or more mutations that convert the catalytically active Cas9 molecules into nickases. The nickases are targeted to specific sites using sgRNAs designed to target two sites on the same strand in a double-stranded target nucleic acid, to generate two nicks (i.e., single-stranded breaks) on the targeted strand.
- Any Cas9 molecule can be used, as described in detail above, so long as the Cas9 functions as a nickase. In some embodiments, this can be accomplished by introducing a point mutation in either of the two catalytic residues (D10A and H840A, numbered relative to S. pyogenes Cas9) of the gene encoding Cas9. An amino acid mutation at either position in Cas9 results in the inactivation of the nuclease catalytic activity and converts Cas9 to a nickase enzyme that makes single-stranded breaks at the target sites. The Cas9 double mutant with changes at amino acid positions D10A and H840A, however, completely inactivates both the nuclease and nickase activities. Targeting specificity is determined by complementary base-pairing of a sgRNA to the genomic loci which include PAM sequences adjacent thereto.
- The nickases can comprise any mutation that enables the Cas9 to cleave only one strand of a double-stranded target nucleic acid. For example, as explained above, the Cas9 (e.g., Cas9 from S. pyogenes) can comprise a D10A mutation in one of its nuclease domains, or in a corresponding residue in an orthologous Cas9 to render the molecule a nickase. The Cas9 (e.g., Cas9 from S. pyogenes) can comprise a H840A mutation in one of its nuclease domains, or a corresponding residue in an orthologous Cas9 to render the molecule a nickase.
- Accordingly, any Cas9 molecule that has nickase activity and only makes single-stranded breaks can be used. As explained above, Cas9 proteins are known and the Cas9 proteins can be derived from any bacterial species, subspecies or strain that encodes the same. Also useful are proteins encoded by Cas9 orthologs, Cas9-like synthetic proteins, and variants and modifications thereof. In one embodiment, Cas9 orthologs are selected based on the particular protospacer adjacent motif (PAM) sequences present on the target nucleic acid. For example, S. pyogenes Cas9 targets NGG sequences. One of skill in the art can readily determine the particular Cas9 to mutate based on the particular specificity desired.
- Moreover, the nickases used in the present methods should be paired such that nicks occur on the same strand. For example, both nickases used can include a D10A mutation, or both can include a H840A mutation. One nickase can be a S. pyogenes Cas9 nickase and the other can be a nickase that targets a PAM with a different adjacent sequence than targeted by the S. pyogenes Cas9 nickase, such as a nickase designed from an orthologous Cas9 protein, so long as the same strand is nicked. The appropriate nickases for use in the present methods are therefore based on the nucleic acid target sequence and on a determination of PAM-adjacent sequences present at the desired cleavage sites. In this way, the method provides flexibility for single-stranded cleavage of the target nucleic acid.
- The nickases can cleave the sense strand of the double-stranded target nucleic acid or the anti-sense strand of the double-stranded target nucleic acid (e.g., DNA). The nickases can both cleave the same strand of the double-stranded target nucleic acid.
- The two nickases can be designed to cleave at a distance of at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 500, 1000, or 5000 or more bases away from each other. The two nickases can be designed to cleave at a distance of at most 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 500, 1000, or 5000 or more bases away from each other. The distance between the nicks will determine the length of the donor polynucleotide to be provided for insertion.
- As explained above, once the target nucleic acid is nicked, a donor polynucleotide can be directed to, and inserted into a target site of interest to modify the target nucleic acid. Targeted DNA modifications using donor polynucleotides for large changes (e.g., more than 100 bp insertions or deletions) traditionally use plasmid-based donor templates that contain homology arms flanking the site of alteration. Each arm can vary in length, but is typically longer than about 200 bp for large insertion, the size of the arms depending on the size of the donor polynucleotide and the target polynucleotide, as explained in detail above.
- For shorter modifications (e.g., less than 50 bp), single-stranded oligonucleotides such as DNA oligonucleotides, partially double-stranded olignucleotides, nicked double-stranded donors, and the like, can be used in place of targeting plasmids. In this embodiment, for example, single-stranded oligonucleotides containing flanking sequences with homology in proximity to each nick, can be used, and can be oriented in either the sense or antisense direction relative to the target locus. For single-stranded DNA oligonucleotide design, typically an oligonucleotide with around 100-150 bp total homology is used. The mutation is introduced in the middle, giving approximately 50-75 bp homology arms. However, these numbers can vary, depending on the size of the donor polynucleotide and the target polynucleotide. Non-symmetrical polynucleotide donors can also be used that are composed of two DNA oligonucleotides. They are partially complementary, and each includes a flanking region of homology. For some modifications, the donor polynucleotide can have at least one arm with approximately 10 bases of homology to the target sequence. For some modifications, the donor polynucleotide can have at least one arm with less than 100 bases of homology to the target sequence. For other modifications, the donor can have more than 100 bases of homology to the target sequence. In some cases, the donor can have homology arms of the same length. In other cases, the donor can have homology arms of different lengths. In some cases, at least one of the homology arms is of zero length.
- Thus, a donor polynucleotide can be designed to anneal to the single-stranded gap that results from the nicks made by the two nickases. As explained above, the donor polynucleotide can additionally comprise regions of homology with the sequences outside the breaks. The size of the regions of homology will be determined by the size of the target polynucleotide and can be at least 5, 10, 15, 20, 25, 30, 35 or more nucleotides in length, the size depending on the size of the donor polynucleotide and the target nucleic acid. The regions of homology can be at most 5, 10, 15, 20, 25, 30, 35 or more nucleotides in length. The donor polynucleotide can be single-stranded. The single-stranded donor polynucleotide can be inserted into the break created by the two tandem nickases.
-
FIG. 6 depicts an exemplary embodiment of the present methods. Here, two Cas9 D10A nickases are used in tandem to excise a single-stranded region of DNA on the same strand of a target double-stranded nucleic acid. As shown inFIG. 6A , two Cas9 nickases (in this case S. pyogenes Cas9 nickases with D10A mutations in the HNH endonuclease domain) are targeted to two spaced-apart positions on the sense strand of a target polynucleotide using two sgRNA/Cas9 nickase complexes. Targeting is accomplished using a spacer sequence present in the sgRNA that has been designed to specifically target a complementary region of in the target nucleic acid sequence. Binding specificity is determined by both sgRNA-DNA base pairing and the PAM, in this case, NGG, juxtaposed to the DNA complementary region (see, e.g., Mojica F. J. et al., “Short motif sequences determine the targets of the prokaryotic CRISPR defence system” Microbiology (2009) 155:733-740; Shah S. A. et al., “Protospacer recognition motifs: mixed identities and functional diversity” RNA Biology (2013) 10:891-899; Jinek M. et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity” Science (2012) 337:816-821). The targeted single strand is then cleaved (FIG. 6B ) and the donor, with overlapping flanking regions, inserted (FIG. 6C ). - In another embodiment, the invention is directed to additional methods for increasing HDR. The current methodology for introducing a desired change into a gene includes transfecting, electroporating, or microinjecting a site-specific endonuclease and donor molecules into a cell or embryo and using passive diffusion to locate the donor molecules throughout the nucleus (Lin, S. et al. “Enhanced homology-directed human genome engineering by controlled timing of CRISPR/Cas9 delivery,” eLife (2014) Dec; doi: 10.7554/eLife.04766). However, this method of HDR typically has low efficiency. Unlike passive diffusion, the methods described below position the donor molecule near the cut site to increase HDR efficiency.
- In these methods, one or more sgRNA/dCas9 complexes are used, along with a catalytically active sgRNA/Cas9 complex. The one or more sgRNA/dCas9 complexes include a polynucleotide donor associated therewith to position the donor polynucleotide near a target site in order to increase HDR efficiency. Thus, the tethered dCas9 can position the donor molecule in an orientation that will increase the likelihood that the donor molecule will be incorporated into the target site through HDR, thereby introducing a desired change to the target sequence.
- As explained above, the donor polynucleotide can be double- or single-stranded DNA, RNA, a vector, plasmid, or the like and can be used to transfer genetic information (i.e., polynucleotide sequences) from the donor at the site of the break in the target nucleic acid. The donor can be used to insert or replace polynucleotide sequences in a target sequence, for example, to introduce a polynucleotide that encodes a protein or functional RNA (e.g., siRNA), to introduce a protein tag, to modify a regulatory sequence of a gene, or to introduce a regulatory sequence to a gene (e.g. a promoter, an enhancer, an internal ribosome entry sequence, a start codon, a stop codon, a localization signal, or polyadenylation signal), to modify a nucleic acid sequence (e.g., introduce a mutation), and the like.
- A single sgRNA/dCas9 complex can be used with the associated donor, as shown in
FIG. 7A . Alternatively, two such complexes can be used to position the donor across the cut site as shown inFIG. 7B . The dCas9 and Cas9 molecules and guide polynucleotides used in the complexes can be any of those as described above. - When one sgRNA/dCas9 complex is used, the complex can target nucleic acid either upstream or downstream of the nucleic acid targeted by the catalytically active sgRNA/Cas9 complex. A donor polynucleotide is associated with the sgRNA/dCas9 complex. In this way, the donor polynucleotide is brought into proximity with the cleaved target nucleic acid and HDR will insert at least a portion of the donor polynucleotide at the cleaved site.
- When two sgRNA/dCas9 complexes are used, the second sgRNA/dCas9 complex is designed to target nucleic acid downstream of the catalytically active sgRNA/Cas9 complex when the first sgRNA/dCas9 targets nucleic acid upstream of the catalytically active sgRNA/Cas9 complex. Alternatively, the second sgRNA/dCas9 complex is designed to target nucleic acid upstream of the catalytically active sgRNA/Cas9 complex when the first sgRNA/dCas9 targets nucleic acid downstream of the catalytically active sgRNA/Cas9 complex. Thus, the target for the active sgRNA/Cas9 complex is in a position between the two inactive complexes. Additionally, the 5′ end of the polynucleotide donor will be associated with one of the inactive sgRNA/dCas9 complexes and the 3′ end associated with the other of the inactive complexes such that the polynucleotide donor is positioned across the cleavage site for insertion using HDR. One of skill in the art can readily determine which end of the polynucleotide donor to associate with each complex based on the desired target.
- The donor is tethered to the complexes using methods well known in the art. To do so, the backbone of the sgRNA can be extended to include a region complementary to the donor molecule. For example, the sgRNA in the sgRNA/dCas9 complex can include a number of extra nucleotides, e.g., 5-20, such as 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or even more, extra nucleotides at the 3′ end of the sgRNA that will bind in a complementary fashion to the 5′ or 3′ end of a single-stranded DNA donor polynucleotide. In this manner, the donor polynucleotide will be positioned to interact with the sgRNA/Cas9-induced cut site and the cell's endogenous HDR machinery will incorporate the donor into the cleavage site. The sgRNA/dCas9 tethered donor polynucleotide is positioned upstream or downstream of the double-stranded break and is available at a higher local concentration for HDR.
- In all of the embodiments of the above-described methods, the various components can be provided to a cell or in vitro, for example, using expression cassettes encoding a Cas9, a dCas9, sgRNA; a donor polynucleotide, etc. These components can be present on a single cassette or multiple cassettes, in the same or different constructs. Expression cassettes typically comprise regulatory sequences that are involved in one or more of the following: regulation of transcription, post-transcriptional regulation, and regulation of translation. Expression cassettes can be introduced into a wide variety of organisms including bacterial cells, yeast cells, plant cells, and mammalian cells. Expression cassettes typically comprise functional regulatory sequences corresponding to the organism(s) into which they are being introduced.
- In one aspect, all or a portion of the various components of the methods are provided in vectors, including expression vectors, comprising polynucleotides coding for a Cas9, a dCas9, a sgRNA and/or a donor polynucleotide. Vectors useful for practicing the present invention include plasmids, viruses (including phage), and Integra table DNA fragments (i.e., fragments integratable into the host genome by homologous recombination). A vector replicates and functions independently of the host genome, or may, in some instances, integrate into the genome itself. Suitable replicating vectors will contain a replicon and control sequences derived from species compatible with the intended expression host cell. Transformed host cells are cells that have been transformed or transfected with the vectors constructed using recombinant DNA techniques
- General methods for construction of expression vectors are known in the art. Expression vectors for most host cells are commercially available. There are several commercial software products designed to facilitate selection of appropriate vectors and construction thereof, such as insect cell vectors for insect cell transformation and gene expression in insect cells, bacterial plasmids for bacterial transformation and gene expression in bacterial cells, yeast plasmids for cell transformation and gene expression in yeast and other fungi, mammalian vectors for mammalian cell transformation and gene expression in mammalian cells or mammals, viral vectors (including retroviral, lentiviral, and adenoviral vectors) for cell transformation and gene expression and methods to easily enable cloning of such polynucleotides. SnapGene™ (GSL Biotech LLC, Chicago, Ill.; snapgene.com/resources/plasmid_files/your_time_is_valuable/), for example, provides an extensive list of vectors, individual vector sequences, and vector maps, as well as commercial sources for many of the vectors.
- Expression cassettes typically comprise regulatory sequences that are involved in one or more of the following: regulation of transcription, post-transcriptional regulation, and regulation of translation. Expression cassettes can be introduced into a wide variety of organisms including bacterial cells, yeast cells, mammalian cells, and plant cells. Expression cassettes typically comprise functional regulatory sequences corresponding to the host cells or organism(s) into which they are being introduced. Expression vectors can also include polynucleotides encoding protein tags (e.g., poly-His tags, hemagglutinin tags, fluorescent protein tags, bioluminescent tags, nuclear localization tags). The coding sequences for such protein tags can be fused to the coding sequences or can be included in an expression cassette, for example, in a targeting vector.
- In some embodiments, polynucleotides encoding one or more of the various components are operably linked to an inducible promoter, a repressible promoter, or a constitutive promoter.
- Several expression vectors have been designed for expressing guide polynucleotides. See, e.g., Shen, B. et al. “Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects” (2014) March 2. doi: 10.1038/nmeth.2857. 10.1038/nmeth.2857. Additionally, vectors and expression systems are commercially available, such as from New England Biolabs (Ipswich, Mass.) and Clontech Laboratories (Mountain View, Calif.). Vectors can be designed to simultaneously express a target-specific sgRNA using a U2 or U6 promoter, a Cas9 and/or dCas9, and if desired, a marker protein, for monitoring transfection efficiency and/or for further enriching/isolating transfected cells by flow cytometry.
- Vectors can be designed for expression of various components of the described methods in prokaryotic or eukaryotic cells. Alternatively, transcription can be in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. Other RNA polymerase and promoter sequences can be used.
- Vectors can be introduced into and propagated in a prokaryote. Prokaryotic vectors are well known in the art. Typically a prokaryotic vector comprises an origin of replication suitable for the target host cell (e.g., oriC derived from E. coli, pUC derived from pBR322, pSC101 derived from Salmonella), 15A origin (derived from p15A) and bacterial artificial chromosomes). Vectors can include a selectable marker (e.g., genes encoding resistance for ampicillin, chloramphenicol, gentamicin, and kanamycin). Zeocin™ (Life Technologies, Grand Island, N.Y.) can be used as a selection in bacteria, fungi (including yeast), plants and mammalian cell lines. Accordingly, vectors can be designed that carry only one drug resistance gene for Zeocin for selection work in a number of organisms. Useful promoters are known for expression of proteins in prokaryotes, for example, T5, T7, Rhamnose (inducible), Arabinose (inducible), and PhoA (inducible). Further, T7 promoters are widely used in vectors that also encode the T7 RNA polymerase. Prokaryotic vectors can also include ribosome binding sites of varying strength, and secretion signals (e.g., mal, sec, tat, ompC, and pelB). In addition, vectors can comprise RNA polymerase promoters for the expression of sgRNAs. Prokaryotic RNA polymerase transcription termination sequences are also well known (e.g., transcription termination sequences from S. pyogenes).
- Integrating vectors for stable transformation of prokaryotes are also known in the art (see, e.g., Heap, J. T., et al., “Integration of DNA into bacterial chromosomes from plasmids without a counter-selection marker,” Nucleic Acids Res. (2012) 40:e59).
- Expression of proteins in prokaryotes is typically carried out in Escherichia coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins.
- A wide variety of RNA polymerase promoters suitable for expression of the various components are available in prokaryotes (see, e.g., Jiang, Y., et al., “Multigene editing in the Escherichia coli genome via the CRISPR-Cas9 system,” Environ Microbiol. (2015) 81:2506-2514); Estrem, S. T., et al., (1999) “Bacterial promoter architecture: subsite structure of UP elements and interactions with the carboxy-terminal domain of the RNA polymerase alpha subunit,” Genes Dev. 15; 13(16):2134-47).
- In some embodiments, a vector is a yeast expression vector comprising one or more components of the above-described methods. Examples of vectors for expression in Saccharomyces cerivisae include, but are not limited to, the following: pYepSec1, pMFa, pJRY88, pYES2, and picZ. Methods for gene expression in yeast cells are known in the art (see, e.g., Methods in Enzymology, Volume 194, “Guide to Yeast Genetics and Molecular and Cell Biology, Part A,” (2004) Christine Guthrie and Gerald R. Fink (eds.), Elsevier Academic Press, San Diego, Calif.). Typically, expression of protein-encoding genes in yeast requires a promoter operably linked to a coding region of interest plus a transcriptional terminator. Various yeast promoters can be used to construct expression cassettes for expression of genes in yeast. Examples of promoters include, but are not limited to, promoters of genes encoding the following yeast proteins: alcohol dehydrogenase 1 (ADH1) or alcohol dehydrogenase 2 (ADH2), phosphoglycerate kinase (PGK), triose phosphate isomerase (TPI), glyceraldehyde-3-phosphate dehydrogenase (GAPDH; also known as TDH3, or triose phosphate dehydrogenase), galactose-1-phosphate uridyl-transferase (GALT), UDP-galactose epimerase (GAL10), cytochrome ci (CYC1), acid phosphatase (PHOS) and glycerol-3-phosphate dehydrogenase gene (GPD1). Hybrid promoters, such as the ADH2/GAPDH, CYC1/GAL10 and the ADH2/GAPDH promoter (which is induced at low cellular-glucose concentrations, e.g., about 0.1 percent to about 0.2 percent) also may be used. In S. pombe, suitable promoters include the thiamine-repressed nmtl promoter and the constitutive cytomegalovirus promoter in pTL2M.
- Yeast RNA polymerase III promoters (e.g., promoters from 5S, U6 or RPR1 genes) as well as polymerase III termination sequences are known in the art (see, e.g., www.yeastgenome.org; Harismendy, O., et al., (2003) “Genome-wide location of yeast RNA polymerase III transcription machinery,” The EMBO Journal. 22(18):4738-4747.)
- In addition to a promoter, several upstream activation sequences (UASs), also called enhancers, may be used to enhance polypeptide expression. Exemplary upstream activation sequences for expression in yeast include the UASs of genes encoding these proteins: CYC1, ADH2, GAL1, GALT, GAL10, and ADH2. Exemplary transcription termination sequences for expression in yeast include the termination sequences of the α-factor, CYC1, GAPDH, and PGK genes. One or multiple termination sequences can be used.
- Suitable promoters, terminators, and coding regions may be cloned into E. coli-yeast shuttle vectors and transformed into yeast cells. These vectors allow strain propagation in both yeast and E. coli strains. Typically, the vector contains a selectable marker and sequences enabling autonomous replication or chromosomal integration in each host. Examples of plasmids typically used in yeast are the shuttle vectors pRS423, pRS424, pRS425, and pRS426 (American Type Culture Collection, Manassas, Va.). These plasmids contain a
yeast 2 micron origin of replication, an E. coli replication origin (e.g., pMB1), and a selectable marker. - The various components can also be expressed in insects or insect cells. Suitable expression control sequences for use in such cells are well known in the art. In some embodiments, it is desirable that the expression control sequence comprises a constitutive promoter. Examples of suitable strong promoters include, but are not limited to, the following: the baculovirus promoters for the piO, polyhedrin (polh), p 6.9, capsid, UAS (contains a Gal4 binding site), Ac5, cathepsin-like genes, the B. mori actin gene promoter; Drosophila melanogaster hsp70, actin, α-1-tubulin or ubiquitin gene promoters, RSV or MMTV promoters, copia promoter, gypsy promoter, and the cytomegalovirus IE gene promoter. Examples of weak promoters that can be used include, but are not limited to, the following: the baculovirus promoters for the ie1, ie2, ieO, etl, 39K (aka pp31), and gp64 genes. If it is desired to increase the amount of gene expression from a weak promoter, enhancer elements, such as the baculovirus enhancer element, hr5, may be used in conjunction with the promoter.
- For the expression of some of the components of the present invention in insects, RNA polymerase III promoters are known in the art, for example, the U6 promoter. Conserved features of RNA polymerase III promoters in insects are also known (see, e.g., Hernandez, G., (2007) “Insect small nuclear RNA gene promoters evolve rapidly yet retain conserved features involved in determining promoter activity and RNA polymerase specificity,” Nucleic Acids Res. 2007 January; 35(1):21-34).
- In another aspect, the various components are incorporated into mammalian vectors for use in mammalian cells. A large number of mammalian vectors suitable for use with the systems of the present invention are commercially available (e.g., from Life Technologies, Grand Island, N.Y.; NeoBiolab, Cambridge, Mass.; Promega, Madison, Wis.; DNA2.0, Menlo Park, Calif.; Addgene, Cambridge, Mass.).
- Vectors derived from mammalian viruses can also be used for expressing the various components of the present methods in mammalian cells. These include vectors derived from viruses such as adenovirus, papovirus, herpesvirus, polyomavirus, cytomegalovirus, lentivirus, retrovirus, vaccinia and Simian Virus 40 (SV40) (see, e.g., Kaufman, R. J., (2000) “Overview of vector design for mammalian gene expression,” Molecular Biotechnology,
Volume 16,Issue 2, pp 151-160; Cooray S., et al., (2012) “Retrovirus and lentivirus vector design and methods of cell conditioning,” Methods Enzymol. 507:29-57). Regulatory sequences operably linked to the components can include activator binding sequences, enhancers, introns, polyadenylation recognition sequences, promoters, repressor binding sequences, stem-loop structures, translational initiation sequences, translation leader sequences, transcription termination sequences, translation termination sequences, primer binding sites, and the like. Commonly used promoters are constitutive mammalian promoters CMV, EF1a, SV40, PGK1 (mouse or human), Ubc, CAG, CaMKIIa, and beta-Act. and others known in the art (Khan, K. H. (2013) “Gene Expression in Mammalian Cells and its Applications,” Advanced Pharmaceutical Bulletin 3(2), 257-263). Further, mammalian RNA polymerase III promoters, including H1 and U6, can be used. - In some embodiments, a recombinant mammalian expression vector is capable of preferentially directing expression of the nucleic acid in a particular cell type (e.g., using tissue-specific regulatory elements to express a polynucleotide). Tissue-specific regulatory elements are known in the art and include, but are not limited to, the albumin promoter, lymphoid-specific promoters, neuron-specific promoters (e.g., the neurofilament promoter), pancreas-specific promoters, mammary gland-specific promoters (e.g., milk whey promoter), and in particular promoters of T cell receptors and immunoglobulins. Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters and the alpha-fetoprotein promoter.
- Numerous mammalian cell lines have been utilized for expression of gene products including HEK 293 (Human embryonic kidney) and CHO (Chinese hamster ovary). These cell lines can be transfected by standard methods (e.g., using calcium phosphate or polyethyleneimine (PEI), or electroporation). Other typical mammalian cell lines include, but are not limited to: HeLa, U2OS, 549, HT1080, CAD, P19, NIH 3T3, L929, N2a, Human embryonic kidney 293 cells, MCF-7, Y79, SO-Rb50, Hep G2, DUKX-X11, J558L, and Baby hamster kidney (BHK) cells.
- Methods of introducing polynucleotides (e.g., an expression vector) into host cells are known in the art and are typically selected based on the kind of host cell. Such methods include, for example, viral or bacteriophage infection, transfection, conjugation, electroporation, calcium phosphate precipitation, polyethyleneimine-mediated transfection, DEAE-dextran mediated transfection, protoplast fusion, lipofection, liposome-mediated transfection, particle gun technology, direct microinjection, and nanoparticle-mediated delivery.
- As explained above, one aspect of the present invention provides methods of increasing Cas9-mediated genome engineering efficiency by either decreasing the number of off-target nucleic acid double-stranded breaks, and/or enhancing HDR of a cleaved target nucleic acid, thus modifying genomes using HDR. The present invention also includes methods of modulating in vitro or in vivo transcription using the various components and complexes described herein. In one embodiment, a sgRNA/dCas protein complex can repress gene expression by interfering with transcription when the sgRNA directs DNA target binding of the complex to the promoter region of the gene. Use of the complexes to reduce transcription also includes complexes wherein the dCas protein is fused to a known down regulator of a target gene (e.g., a repressor polypeptide). For example, expression of a gene is under the control of regulatory sequences to which a repressor polypeptide can bind. A guide polynucleotide can direct DNA target binding of a repressor protein complex to the DNA sequences encoding the regulatory sequences or adjacent the regulatory sequences such that binding of the repressor protein complex brings the repressor protein into operable contact with the regulatory sequences. Similarly, dCas9 can be fused to an activator polypeptide to activate or increase expression of a gene under the control of regulatory sequences to which an activator polypeptide can bind.
- Another method of the present invention is the use of sgRNA/dCas9 complexes in methods to isolate or purify regions of genomic DNA (gDNA). In an embodiment of the method, a dCas protein is fused to an epitope (e.g., a FLAG® epitope, Sigma Aldrich, St. Louis, MO) and a sgRNA directs DNA target binding of a sgRNA/dCas9 protein-epitope complex to DNA sequences within the region of genomic DNA to be isolated or purified. An affinity agent is used to bind the epitope and the associated gDNA bound to the sgRNA/dCas9 protein-epitope complex.
- The present invention also encompasses gene-therapy methods for preventing or treating diseases, disorders, and conditions using the various methods described herein. In one embodiment, a gene-therapy method uses the introduction of nucleic acid sequences into an organism or cells of an organism (e.g., patient) to achieve expression of components of the present invention to provide modification of a target function. For example, cells from an organism may be engineered, ex vivo, by (i) introduction of vectors comprising expression cassettes expressing the various components, (ii) direct introduction of sgRNA and/or donor polynucleotides and Cas9 and/or dCas9 proteins, or (iii) introduction of combinations of these components. The engineered cells are provided to an organism (e.g., patient) to be treated.
- Examples of gene-therapy and delivery techniques for therapy are known in the art (see, e.g., Kay, M. A., (2011) “State-of-the-art gene-based therapies: the road ahead,”
Nature Reviews Genetics 12, 316-328; Wang, D., et al., (2014) “State-of-the-art human gene therapy: part I. Gene delivery technologies,” Discov Med. 18(97):67-77; Wang, D., et al., (2014) “State-of-the-art human gene therapy: part II. Gene therapy strategies and clinical applications,” Discov Med. 18(98):151-61; “The Clinibook: Clinical Gene Transfer State of the Art,” Odile Cohen-Haguenauer (Editor), EDP Sciences (Oct. 31, 2012), ISBN-10: 2842541715). - In some aspects, components of the present invention are delivered using nanoscale delivery systems, such as nanoparticles. Additionally, liposomes and other particulate delivery systems can be used. For example, vectors including the components of the present methods can be packaged in liposomes prior to delivery to the subject or to cells derived therefrom, such as described in U.S. Pat. Nos. 5,580,859; 5,549,127; 5,264,618; 5,703,055, all incorporated herein by reference in their entireties. Lipid encapsulation is generally accomplished using liposomes which are able to stably bind or entrap and retain nucleic acid.
- The methods described herein can also be used to generate non-human genetically modified organisms. Generally, in these methods expression cassettes comprising polynucleotide sequences of the various components, as well as a targeting vector are introduced into zygote cells to site-specifically introduce a selected polynucleotide sequence at a DNA target sequence in the genome to generate a modification of the genomic DNA. The selected polynucleotide sequence is present in the targeting vector. Modifications of the genomic DNA typically include, insertion of a polynucleotide sequence, deletion of a polynucleotide sequence, or mutation of a polynucleotide sequence, for example, gene correction, gene replacement, gene tagging, transgene insertion, gene disruption, gene mutation, mutation of gene regulatory sequences, and so on. In one embodiment of methods to generate non-human genetically modified organisms, the organism is a mouse. Generating transgenic mice involves five basic steps (Cho A., et al., “Generation of Transgenic Mice,” Current protocols in cell biology, (2009); CHAPTER.Unit-19.11): (1) purifying a transgenic construct (e.g., expression cassettes comprising the various components of the various methods described herein, as well as a targeting vector); (2) harvesting donor zygotes; (3) microinjecting the transgenic construct into the mouse zygote; (4) implanting the microinjected zygotes into pseudo-pregnant recipient mice; and (5) performing genotyping and analysis of the modification of the genomic DNA established in founder mice.
- In another embodiment of methods to generate non-human genetically modified organisms, the organism is a plant. Thus, the components described herein are used to effect efficient, cost-effective gene editing and manipulation in plant cells. It is generally preferable to insert a functional recombinant DNA in a plant genome at a non-specific location. However, in certain instances, it may be useful to use site-specific integration to introduce a recombinant DNA construct into the genome. Recombinant vectors for use in plant are known in the art. The vectors can include, for example, scaffold attachment regions (SARs), origins of replication, and/or selectable markers.
- Methods and compositions for transforming plants by introducing a recombinant DNA construct into a plant genome includes any of a number of methods known in the art. One method for constructing transformed plants is microprojectile bombardment. Agrobacterium-mediated transformation is another method for constructing transformed plants. Alternatively, other non-Agrobacterium species (e.g., Rhizobium) and other prokaryotic cells that are able to infect plant cells and introduce heterologous nucleotide sequences into the infected plant cell's genome can be used. Other transformation methods include electroporation, liposomes, transformation using pollen or viruses, chemicals that increase free DNA uptake, or free DNA delivery by means of microproj ectile bombardment. DNA constructs of the present invention may be introduced into the genome of a plant host using conventional transformation techniques that are well known to those skilled in the art (see, e.g., “Methods to Transfer Foreign Genes to Plants,” Y Narusaka, et al., cdn.intechopen.com/pdfs-wm/30876.pdf).
- While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. From the above description and the following Examples, one skilled in the art can ascertain essential characteristics of this invention, and without departing from the spirit and scope thereof, can make changes, substitutions, variations, and modifications of the invention to adapt it to various usages and conditions. Such changes, substitutions, variations, and modifications are also intended to fall within the scope of the present disclosure.
- Aspects of the present invention are further illustrated in the following Examples. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, concentrations, percent changes, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, temperature is in degrees Centigrade and pressure is at or near atmospheric. It should be understood that these Examples, while indicating some embodiments of the invention, are given by way of illustration only.
- The following examples are not intended to limit the scope of what the inventors regard as various aspects of the present invention.
- The following examples 1-4 illustrate the use of a catalytically inactive Cas9 (i.e. “dead” Cas9 or dCas9) to reduce off-target nuclease activity in eukaryotic cells. Additionally, this example shows how one can identify a specific spacer sequence (for incorporation into a sgRNA or crRNA) that is effective at blocking nuclease off-target activity in eukaryotic cells. Where the term sgRNA or single-guide RNA is used, it is understood by one skilled in the art that other guide polynucleotide systems, such as a crRNA/tracrRNA dual-guide system, present an alternative means of guiding dCas9 to the targeted site.
- sgRNA components of dCas9 nuclease-blocker (dCas9-NB, i.e. a Cas9 lacking catalytic activity) ribonucleoprotein (RNP) complexes (also termed “sgRNA/dCas9 complex” herein) and catalytically active Cas9 nuclease RNP complexes (also termed “sgRNA/Cas9 complex” herein) were produced by in vitro transcription (e.g., T7 Quick High Yield RNA Synthesis Kit, New England Biolabs, Ipswich, Mass.) from double-stranded DNA templates incorporating a T7 promoter at the 5′ end of the DNA sequence. Polymerase Chain Reaction (PCR) using 5′ overlapping primers was used to assemble the double-stranded DNA templates for transcription of sgRNA components. The sgRNA components, templates and primers used are identified in Table 1. The sequences of the oligonucleotide primers used in the assembly are presented in Table 2.
-
TABLE 1 Overlapping Primers for Generation of dCas9-NB and Cas9 Nuclease sgRNA Component Templates Component Target for DNA binding Primers Cas9 sgRNA VEGFA A, B, C, D, E dCas9 sgRNA AAVS1 A, B, C, D, F dCas9 sgRNA VEGFA off-target A2 A, B, C, D, G dCas9 sgRNA VEGFA off-target A3 A, B, C, D, H dCas9 sgRNA VEGFA off-target A4 A, B, C, D, I dCas9 sgRNA VEGFA off-target B1 A, B, C, D, J dCas9 sgRNA VEGFA off-target B2 A, B, C, D, K dCas9 sgRNA VEGFA off-target C1 A, B, C, D, L dCas9 sgRNA VEGFA off-target C3 A, B, C, D, M dCas9 sgRNA VEGFA off-target D2 A, B, C, D, N dCas9 sgRNA VEGFA off-target D3 A, B, C, D, O *DNA primer sequences are shown in Table 2 -
TABLE 2 DNA Primer Sequences Used A AAAAAAAGCACCGACTCGGTGCC SEQ ID NO: 1 B AGTAATAATACGACTCACTATAG SEQ ID NO: 2 C GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT SEQ ID NO: 3 ATCAAC D AAAAAAAGCACCGACTCGGTGCCACTTTTTCAAGTTGATAACGG SEQ ID NO: 4 ACTAGC E TAATACGACTCACTATAGGGTGGGGGGAGTTTGCTCCGTTTTAGA SEQ ID NO: 5 GCTAGAAATAGC F TAATACGACTCACTATAGGGGCCACTAGGGACAGGATGTTTTAG SEQ ID NO: 6 AGCTAGAAATAGC G TAATACGACTCACTATAGTGGAGGGAGTTTGCTCCTGGTTTTAGA SEQ ID NO: 7 GCTAGAAATAGC H TAATACGACTCACTATAGGACGGATTTGTGGGATGGAGTTTTAGA SEQ ID NO: 8 GCTAGAAATAGC I TAATACGACTCACTATAGCAGGACATTCTGACACCCCGTTTTAGA SEQ ID NO: 9 GCTAGAAATAGC J TAATACGACTCACTATAGGAGGCTCCCATCACGGGGGGTTTTAG SEQ ID NO: 10 AGCTAGAAATAGC K TAATACGACTCACTATAGTGGGGATCACAGGTTCCCCGTTTTAGA SEQ ID NO: 11 GCTAGAAATAGC L TAATACGACTCACTATAGAGAGCTCTTCTGACTACAGGTTTTAGA SEQ ID NO: 12 GCTAGAAATAGC M TAATACGACTCACTATAGGACCAAATGAGACCAGTCCGTTTTAG SEQ ID NO: 13 AGCTAGAAATAGC N TAATACGACTCACTATAGCCCATTATGATAGGGAGGGGTTTTAGA SEQ ID NO: 14 GCTAGAAATAGC O TAATACGACTCACTATAGCTCCTGGGGATGGAAGGGCGTTTTAG SEQ ID NO: 15 AGCTAGAAATAGC P CACTCTTTCCCTACACGACGCTCTTCCGATCTCCAGATGGCACAT SEQ ID NO: 16 TGTCAGA Q GGAGTTCAGACGTGTGCTCTTCCGATCTCCTAGTGACTGCCGTCT SEQ ID NO: 17 GC R GGAGTTCAGACGTGTGCTCTTCCGATCTacctggccATCATCCTTCTA SEQ ID NO: 18 S CACTCTTTCCCTACACGACGCTCTTCCGATCTCAGCAGACCCACT SEQ ID NO: 19 GAGTCAA T CAAGCAGAAGACGGCATACGAGATNNNNNNNNGTGACTGGAGT SEQ ID NO: 20 TCAGACGTGTGCTC U AATGATACGGCGACCACCGAGATCTACACNNNNNNNNACACTCT SEQ ID NO: 21 TTCCCTACACGACG - The PCR reaction to assemble the sgRNA DNA template proceeded as follows: Three “internal” DNA primers (C, D, E-O, Table 2) were present at a concentration of 2 nM each. Two “outer” DNA primers (A, B, Table 2) corresponding to the T7 promoter and the 3′ end of the RNA sequence were present at 640 nM to drive the amplification reaction. PCR reactions were performed using Kapa HiFi Hotstart™ PCR kit (Kapa Biosystems, Inc., Wilmington, Mass.) as per manufacturer's recommendation. PCR assembly reactions were carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 35 cycles of 15 seconds at 98° C., 15 seconds at 62° C., 15 seconds at 72° C., and a final extension at 72° C. for 2 minutes.
- Between approximately 0.25-0.5 μg of the DNA template for the sgRNA components were transcribed using T7 High Yield RNA synthesis Kit (New England Biolabs, Ipswich, Mass.) for approximately 16 hours at 37° C. Transcription reactions were DNAse I-treated (New England Biolabs, Ipswich, Mass.). The quality of the transcribed RNA was checked by capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa). The Cas9 and dCas9-NB sgRNA component sequences were as follows:
-
TABLE 3 Cas9 and dCas9-NB sgRNA Component Sequences DNA target RNA Sequence (5′ to 3′) VEGFA GGGUGGGGGGAGUUUGCUCCGUUUUAGAGCUAGAAAUAGCAAGU UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA GUCGGUGCUUUUUUU (SEQ ID NO: 22) AAVS1 GGGGCCACUAGGGACAGGAUGUUUUAGAGCUAGAAAUAGCAAGU UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA GCUUUUUUU (SEQ ID NO: 23) VEGFA GUCGGUGUGGAGGGAGUUUGCUCCUGGUUUUAGAGCUAGAAAUA off- GCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGG target CACCGAGUCGGUGCUUUUUUU (SEQ ID NO: 24) A2 VEGFA GGACGGAUUUGUGGGAUGGAGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GUCGGUGCUUUUUUU (SEQ ID NO: 25) A3 VEGFA GCAGGACAUUCUGACACCCCGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GUCGGUGCUUUUUUUU (SEQ ID NO: 26) A4 VEGFA GGAGGCUCCCAUCACGGGGGGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GUCGGUGCUUUUUUU (SEQ ID NO: 27) B1 VEGFA GUGGGGAUCACAGGUUCCCCGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GGUCGGUCUUUUUUU (SEQ ID NO: 28) B2 VEGFA GAGAGCUCUUCUGACUACAGGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GUCGGUGCUUUUUUU (SEQ ID NO: 29) C1 VEGFA GGACCAAAUGAGACCAGUCCGUUUUAGAGCUAGAAAUAGCAAGU off- UAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GGUCGGUCUUUUUUU (SEQ ID NO: 30) C3 VEGFA GCCCAUUAUGAUAGGGAGGGGUUUUAGAGCUAGAAAUAGCAAGU off- AUAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGA target GUCGGUGCUUUUUUU (SEQ ID NO: 31) D2 VEGFA GCUCCUGGGGAUUGGAAGGGCGUUUUAGAGCUAGAAAUAGCAAG off- UUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCG target AGUCGGUGCUUUUUUU (SEQ ID NO: 32) D3 - Protein components of Cas9 and dCas9-NB RNPs were expressed from bacterial expression vectors in E. coli (BL21 (DE3)) and purified using affinity, ion exchange and size exclusion chromatography according to methods described in Jinek et al., 2012. The coding sequence for S. pyogenes Cas9 included the two nuclear localization sequences (NLS) at the C-terminus. The dCas9 variant of NLS-tagged Cas9, in which active site residues from both nuclease domains were mutated (Jinek, et al., 2012), was prepared by introducing mutations into the coding sequence of S. pyogenes Cas9 by site directed mutagenesis (Q5 Site-directed Mutagenesis Kit, New England Biolabs, Ipswich, Mass.). This method for production of Cas9 and/or dCas9-NB RNPs can be applied to the production of other Cas9 and/or dCas9-NB RNPs as described herein.
- This example illustrates the use of a MiSeq Sequencer (Illumina, San Diego, Calif.) for deep sequencing analysis to evaluate and compare the DNA cleavage (as inferred from non-homologous end joining, or NHEJ) of selected Cas9 nuclease off-target sequences in the presence and absence of dCas9-NBs. In this example, Cas9 was directed by a specific sgRNA to a sequence (GGGTGGGGGGAGTTTGCTCCTGG, SEQ ID NO:82) within the human gene Vascular Endothelial Growth Factor A (VEGFA). dCas9 was directed towards an off-target, sequence (GGATGGAGGGAGTTTGCTCCTGG, SEQ ID NO:83) known to be targeted by Cas9 RNP nuclease off-target to prevent off-target cleavage as well as a sequence (GGGGCCACTAGGGACAGGATTGG, SEQ ID NO:84) within the control locus, Adeno-Associated Virus Integration Site 1 (AAVSJ).
- A. Transfection of Cas9/dCas9-NB RNPs:
- To assemble Cas9 and dCas9 RNPs, 1.3 μl of sgRNA (corresponding to approximately 1-9 μg or approximately 25-250 pmol) were incubated for 2 minutes at 95° C. then allowed to equilibrate to room temperature for about 5 minutes. Subsequently, Cas9 and dCas9 were mixed with a corresponding sgRNA to form RNPs in reaction buffer (20 mM HEPES, pH 7.5, 100 mM KCL, 5 mM MgCl2, 5% glycerol). 20 pmols Cas9 were combined with the target sgRNA, and 0 or 20 pmols of dCas9 were combined with off-target directed sgRNAs, and functional RNPs were assembled by incubating at 37° C. for 10 min. Finally, 20 pmols Cas9 RNP was combined with 0 (i.e. just the dCas9-NB sgRNA component) or 20 pmols dCas9 RNP immediately prior to transfection into cells. Experiments were performed in triplicate.
- Cas9/dCas9-NB RNP complexes were transfected into K562 cells (ATCC, Manassas, Va.), using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.) and the following protocol: RNP complexes were dispensed in a 5 μL final volume into individual wells of a 96-well plate. K562 cells suspended in media were transferred from culture flask to a 50 mL conical, cells were then pelleted by centrifugation for 3 minutes at 200×g, the culture medium aspirated and washed once with calcium and magnesium-free PBS. K562 cells were then pelleted by centrifugation for 3 minutes at 200×g, the PBS aspirated and cell pellet was resuspended in 10 mL of calcium and magnesium-free PBS.
- K562 cells were counted using the Countess® II Automated Cell Counter (Life Technologies, Grand Island, N.Y.). 4.2×107 cells were transferred to a 50 ml tube and pelleted. The PBS was aspirated and the cells were resuspended in Nucleofector™ SF (Lonza, Allendale, N.J.) solution to a density of 1×107 cells/mL. 20 μL of the cell suspension were then added to individual wells containing 5 μL of RNP complexes and the entire volume was transferred to the wells of a 96-well Nucleocuvette™ Plate (Lonza, Allendale, N.J.). The plate was loaded onto the Nucleofector™ 96-well Shuttle™ (Lonza, Allendale, N.J.) and cells were nucleofected using the 96-FF-120 Nucleofector™ program (Lonza, Allendale, N.J.). Post-nucleofection, 80 μL Iscove's Modified Dulbecco's Media (IMDM, Life Technologies, Grand Island, N.Y.), supplemented with 10% FBS (Fisher Scientific, Pittsburgh, Pa.) and supplemented with penicillin and streptomycin (Life Technologies, Grand Island, N.Y.), was added to each well and 50 μL of the cell suspension was transferred to a 96-well cell culture plate containing 150 μL pre-warmed IMDM complete culture medium. The plate was then transferred to a tissue culture incubator and maintained at 37° C. in 5% CO2 for approximately 48 hours.
- Genomic DNA (gDNA) was isolated from K562 cells 48 hours after Cas9/dCas9-NB transfection using 50 μL QuickExtract DNA Extraction solution (Epicentre, Madison, Wis.) per well followed by incubation at 37° C. for 10 minutes. 50 μL water was added to the samples, and next they were incubated at 75° C. for 10 minutes and 95° C. for 5 minutes to stop the reaction. sgDNA was stored at −20° C. until further processing.
- B. Sequencing Library Preparation:
- Using previously isolated sgDNA, a first PCR was performed using Q5 Hot Start High-
Fidelity 2× Master Mix (New England Biolabs, Ipswich, Mass.) at lx concentration, primers at 0.5 μM each, 3.75 μL of sgDNA in a final volume of 10 μL and amplified 98° C. for 1 minutes, 35 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 min. PCR reaction was diluted 1:100 in water. Target-specific primers are shown in Table 4: -
TABLE 4 Target-specific Primers Used for Sequencing Target Primers VEGFA on-target P, Q VEGFA off-target 1 R, S *DNA primer sequences are shown in Table 2 - A second ‘barcoding’ PCR was set up using unique primers for each sample facilitating multiplex sequencing (oligonucleotides T and U in Table 2, where a unique 8 bp index sequence, denoted by “NNNNNNNN (SEQ ID NO:33)” allowed demultiplexing of each amplicon during sequence analysis).
- The second PCR was performed using Q5 Hot Start High-
Fidelity 2× Master Mix (New England Biolabs, Ipswich, Mass.) at 1× concentration, primers at 0.5 μM each, 1 μL of 1:100 diluted first PCR, in a final volume of 10 μL and amplified 98° C. for 1 minute, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 min. PCR reactions were pooled into a single microfuge tube for SPRIselect bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing. - To pooled amplicons, 0.9× volumes of SPRIselect beads were added, mixed and incubated at room temperature (RT) for 10 minutes. The microfuge tube was placed on a magnetic tube stand (Beckman Coulter, Pasadena, Calif.) until solution had cleared. Supernatant was removed and discarded, and the residual beads were washed with 1 volume of 85% ethanol, and incubated at RT for 30 s. After incubation, ethanol was aspirated and beads were air dried at RT for 10 min. The microfuge tube was then removed from the magnetic stand and 0.25× volumes of water (Qiagen, Venlo, Limburg) was added to the beads, mixed vigorously, and incubated for 2 min. at RT. The microfuge tube was spun in a microcentrifuge to collect the contents of the tube, and was then returned to the magnet, incubated until solution had cleared, and the supernatant containing the purified amplicons were dispensed into a clean microfuge tube. The purified amplicon library was quantified using the Nanodrop™ 2000 system (Thermo Fisher Scientific, Wilmington, Del.).
- The amplicon library was normalized to 4 nM concentration as calculated from optical absorbance at 260 nm (Nanodrop™, Thermo Fisher Scientific, Wilmington, Del.) and size of the amplicons. Library was analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2, 300 Cycles (Illumina, San Diego, Calif.), with two 151-cycle paired-end run plus two eight-cycle index reads.
- C. Deep Sequencing Data Analysis:
- The identity of products in the sequencing data was analyzed based upon the index barcode sequences adapted onto the amplicons in the second round of PCR. A computational script was used to process the MiSeq data by executing the following tasks:
-
- 1. Reads were aligned to the human genome (build GRCh38/38) using Bowtie (bowtie-bio.sourceforge.net/index.shtml) software.
- 2. Aligned reads were compared to wild type loci; reads not aligning to any part of the loci were discarded.
- 3. Reads matching wild-type sequence were tallied. Reads with indels (surrounding 5 bp from the Cas9 cut site) were categorized by indel type and tallied.
- 4. Total indel reads were divided by the sum of wild-type reads and indel reads to give percent-mutated reads.
-
FIG. 8 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 on-target editing at the VEGFA locus. As can be seen from the data in the figure, the addition of a dCas9-NB targeted to the VEGFA on-target locus inhibits on-target editing, while dCas9-NBs targeted to distinct regions do not have a significant effect (n=3, error bars show standard deviation, *p<0.05, student's t-test (two-tailed) comparing 3 vs. 4). -
FIG. 9 shows the effects of dCas9-NBs on VEGFA sgRNA/Cas9 off-target editing at a known VEGFA off-target locus onhuman chromosome 15. As can be seen from the data in the figure, the addition of a dCas9-NB to the off-target locus either by the VEGFA on-target sgRNA or an sgRNA targeted specifically to thechromosome 15 off-target locus, impairs cleavage, while dCas9-NBs targeted to distinct regions do not have a significant effect (n=3, error bars show standard deviation, *p<0.05, student's t-test (two-tailed) comparing 3 vs. 4, 5 vs. 6, 7 vs. 8). - A description of the samples used in these experiments and
FIGS. 8 and 9 are shown in Table 5: -
TABLE 5 Sample Descriptions for FIGS. 8 and 9 Sample Description 1 AAVS1 sgRNA, 0 pmol dCas9 2 AAVS1 sgRNA, 20 pmol dCas9 3 VEGFA sgRNA, 0 pmol dCas9 4 VEGFA sgRNA, 20 pmol dCas9 5 VEGFA off-target sgRNA A2, 0 pmol dCas9 6 VEGFA off-target sgRNA A2, 20 pmol dCas9 7 VEGFA off-target sgRNA A3, 0 pmol dCas9 8 VEGFA off-target sgRNA A3, 20 pmol dCas9 9 VEGFA off-target sgRNA A4, 0 pmol dCas9 10 VEGFA off-target sgRNA A4, 20 pmol dCas9 11 VEGFA off-target sgRNA B1, 0 pmol dCas9 12 VEGFA off-target sgRNA B1, 20 pmol dCas9 13 VEGFA off-target sgRNA B2, 0 pmol dCas9 14 VEGFA off-target sgRNA B2, 20 pmol dCas9 15 VEGFA off-target sgRNA C1, 0 pmol dCas9 16 VEGFA off-target sgRNA C1, 20 pmol dCas9 17 VEGFA off-target sgRNA C3, 0 pmol dCas9 18 VEGFA off-target sgRNA C3, 20 pmol dCas9 19 VEGFA off-target sgRNA D2, 0 pmol dCas9 20 VEGFA off-target sgRNA D2, 20 pmol dCas9 21 VEGFA off-target sgRNA D3, 0 pmol dCas9 22 VEGFA off-target sgRNA D3, 20 pmol dCas9 - Following the guidance of the present specification and examples, the deep sequencing analysis described in this example can be practiced by one of ordinary skill in the art with other Cas9/dCas9 RNP complexes (i.e. assembled with distinct sgRNAs and/or distinct ratios of Cas9, dCas9, and sgRNA).
- This example illustrates the method through which off-target Cas9 nuclease sites may be identified. The method presented here is adapted from Tsai et. al., “GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases.,” Nat Biotechnol., 2015 February; 33(2):187-97.
- A. Identify a Target-Site of Interest:
- A given locus in a genome of interest (i.e. a human genome) is screened using bioinformatics approaches known to those skilled in the art to identify Cas9 target-sites. A 20 base pair target-site, followed by an NGG protospacer adjacent motif (PAM), is selected for nuclease targeting.
- B. Assemble GUIDE-Seq Components:
- Oligos are obtained (Integrated DNA Technologies, Coralville, Iowa) for generating a blunt, double-stranded oligodeoxynucleotide (dsODN) that will be utilized for the GUIDE-Seq method. The dsODN contains phosphothiorate linkages at the 5′ ends of both DNA strands. The dsODN is assembled by incubating the two oligos in annealing buffer (i.e. 10 mM Tris, pH 7.5-8.0, 50 mM NaCl, 1 mM EDTA) for 3 min at 95° C. and allowing the oligos to cool to RT.
- C. Transfection of GUIDE-Seq Components:
- Cells from a species of interest (i.e., human cells) are procured from a commercial repository (i.e. ATCC, DSMZ). Cells are grown to an appropriate density for transfection. Cells are transfected with an sgRNAs/Cas9 protein complex and the DNA donor oligo via methods known to those skilled in the art (i.e. nucleofection or lipid transfection of DNA plasmid encoding RNP components as well as dsODN).
- D. Sequencing Library Preparation and Analysis:
- gDNA is harvested 48hrs after cell transfection and purified using Agencourt DNAdvance (Beckman Coulter, Pasadena, Calif.). Purified gDNA is fragmented with methods known to those skilled in the art (i.e. mechanical shearing via sonication or enzymatic shearing with NEBfragmentase, (New England Biolabs, Ipswich, Mass.)) to an average length of 500 base pairs, then end-repaired, A-tailed and ligated to adapters. PCR with primers complementary to the dsODN tag and illumina sequencing adapter sequences (Illumina, San Diego, Calif.), respectively, are used for target-enrichment. Target-enriched library is sequenced using MiSeq Sequencer (Illumina, San Diego, Calif.). Reads are mapped back to the respective species' genome and read coverage is calculated using BedTools (bedtools.readthedocs.org/en/latest/). Integrative Genomics Viewer (IGV, broadinstitute.org/igv/) is used to map the starting (5′) and ending (3′) position of reads, and peaks are called using MACS2 (pypi.python.org/pypi/MACS2). The Sequencing data is used to confirm that a putative genomic locus is a candidate off-target sequence. Following the guidance of the present examples, the identification of novel off-target loci can be practiced by one of ordinary skill in the art.
- This example illustrates methods where dCas9-NBs may be assembled with truncated guides. The method presented here is adapted from Fu Y et. al., “Improving CRISPR-Cas nuclease specificity using truncated guide RNAs,” Nat Biotechnol. 2014 March; 32(3):279-84. Truncated single-guide RNAs (tru-sgRNAs) of 17-18nt have been shown to possess increased specificity relative to 20 nt sgRNAs. Thus, a dCas9-NB assembled with a tru-sgRNA may be targeted directly to a genomic motif and PAM of an off-target locus to reduce off-target editing while having minimal inhibition of on-target editing.
- A. Design of tru-sgRNA to Enable dCas9 Mediated Off-Target Nuclease Blocking:
- Using methods described in Example 3 herein, a given off-target genomic locus (i.e. spacer sequence) is identified. Next, a tru-sgRNA is designed to target said off-target location in the genome. The tru-sgRNA/dCas9 RNP may target a sequence contained entirely within the off-target motif, or it may target a sequence partially overlapping with the off-target motif.
- B. Production of dCas9 Nuclease Blocker Components:
- dCas9 is assembled with a short (i.e. 17 nt) tru-sgRNA and Cas9 is assembled with a sgRNA (i.e. 20 nt) to produce functional RNPs. RNA components are transcribed from DNA templates incorporating a T7 promoter at the 5′ end as described in the Experimental section herein. dCas9 (D10A, H840A) and Cas9 proteins are recombinantly expressed in E. coli. RNPs are assembled by incubating protein and RNA components together at 37° C. for 10 minutes.
- C. Transfection of tru-sgRNA Containing dCas9-NB and sgRNA Containing Cas9 RNP:
- Cells from species of interest are procured from a commercial repository (i.e. ATCC, DSMZ). Cells are grown to a level of confluency that enables transfection. Tru-sgRNAs complexed with dCas9 are mixed with sgRNAs assembled with Cas9. Next, the mixture is transfected into a cell line of interest using methods known to those skilled in the art (i.e. nucleofection or lipid transfection) as described in Example 1 herein.
- D. Sequencing Library Preparation:
- gDNA is then harvested 48 hours later using Quick Extract (Epicentre, Madison, Wis.) per the manufacturer's instructions. Two rounds of PCR, as described in Example 1 herein, are used to amplify and barcode the genomic region targeted by the tru-sgRNA/dCas9-NB. Adapter oligos and dimers are removed by performing SPRlselect bead (Beckman Coulter, Pasadena, Calif.)-based clean up of the sequencing library. Sequencing library concentration is determined by the Nanodrop™ 2000 system (Thermo Scientific, Wilmington Del.).
- E. Deep Sequencing Analysis:
- The library is analyzed on MiSeq Sequencer as follows:
-
- 1. Reads are aligned to the human genome (build GRCh38/38) using Bowtie (bowtie-bio.sourceforge.net/index.shtml) software.
- 2. Aligned reads are compared to wild type loci; reads not aligning to any part of the loci are discarded.
- 3. Reads matching wild-type sequence are tallied. Reads with indels are categorized by indel type and tallied.
- 4. Total indel reads are divided by the sum of wild-type reads and indel reads are percent-mutated reads.
- This system consists of a site-specific endonuclease (e.g, Cas9 complexed with a sgRNA) that targets a DNA target sequence of interest), and one or more catalytically inactive dCas9 molecules complexed with a sgRNA that targets DNA sequences adjacent to the cut site (See
FIGS. 7A and 7B ). These dCas9 molecules are also tethered to a HDR molecule (i.e. dsDNA, ssDNA, RNA, a plasmid, or the like). The tethered dCas9 is used to position the donor molecule in an orientation that will increase the likelihood that the donor molecule will be incorporated into the target site through HDR, thereby introducing a desired change to the target sequence. - A. DNA and RNA Constructs:
- Oligonucleotides are ordered from manufacturers (e.g., Integrated DNA Technologies, Coralville, Iowa; or Eurofins, Luxembourg). sgRNA transcription constructs are assembled by polymerase chain reaction (PCR).
- The primers for sgRNA transcription constructs consist of a primer containing a 5′ T7 promoter sequence, a primer containing a unique spacer sequence, primers containing the sgRNA TRCR backbone, and a reverse primer that may contain a complementary sequence to the homology donor for tethering the donor to the 3′ end of the sgRNA.
- T7 sgRNA transcription constructs are PCR-amplified. Two outer primers (forward oligo contains T7 promoter oligo; reverse oligos contain 3′ end of sgRNA backbone or homology donor complementary sequence for tethering) are present in PCR reaction at 640 nM. Unique spacer and sgRNA backbone oligos are present at 2 nM. PCR reactions are performed using Q5 Hot Start High-
Fidelity 2× Master Mix (New England Biolabs, Ipswich, MA) following manufacturer's recommendations. PCR T7 sgRNA transcription construct assembly PCR is carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 29 cycles of 98° C. for 20 seconds, 62° C. for 20 seconds, and 72° C. for 15 seconds, followed by a final extension of 72° C. for 2 minutes. DNA constructs are evaluated by capillary electrophoresis (Fragment Analyzer, Advanced Analytical Technologies, Ames, Iowa). - RNA components are produced through in vitro transcription (T7 Quick High Yield RNA Synthesis Kit, New England Biolabs, Ipswich, MA) from a double-stranded DNA template. The RNA is then treated with DNase I (New England Biolabs, Ipswich, MA) to remove any double-stranded DNA and incubated at 37° C. for 10 minutes. 0.5 M EDTA is then added to the transcription reactions and incubated at 75° C. for 10 minutes to inactivate the DNase I.
- Homology donors are ordered as single-stranded DNA oligos of approximately 90 nucleotides in length. The homology donors are complementary to the coding sequence and are designed to be centered on the cut site with the PAM replaced with a EcoR1 restriction enzyme site and homology arms of approximately 42 nucleotides in length matching the target sequence.
- B. sgRNA/Cas9 and sgRNA/dCas9 Complex Generation:
- S. pyogenes catalytically active Cas9 and catalytically inactive dCas9 are C-terminally tagged with two nuclear localization sequences (NLS) and recombinantly expressed in E. coli. All sgRNA and tethered sgRNA are incubated for 2 minutes at 95° C., removed from the thermal cycler and allowed to equilibrate to room temperature. Cas9 Ribonucleoprotein (RNP) complexes (also termed “sgRNA/Cas9 complex” and “sgRNA/dCas9 complex” herein) are set up in triplicate with 2 μM Cas9 or 2 μM dCas9, 6 μM sgRNA or 6 μM tethered sgRNA and 2 μM donor oligo in binding buffer (20 mM HEPES, 100 mM KCl, 5 mM MgCl2, 1 mM DTT, and 5% glycerol at pH 7.4) in a volume of 6 μl. The RNPs are then allowed to bind at 37° C. for 10 minutes. After annealing, the Cas9 RNP and dCas9 RNP-donor tethers can be combined to a final volume of 12 μl.
- C. Experimental Conditions:
- Experimental conditions for the use of various embodiments of the invention are described below and illustrated in
FIG. 10 . -
- 1) No transfection control—cells are not electroporated.
- 2) Cas9+standard sgRNA—cells are transfected with sgRNA and catalytically active Cas9 targeting the locus of interest.
- 3) Cas9+standard sgRNA for target locus adjacent site—cells are transfected with sgRNA and catalytically active Cas9 targeting the locus adjacent site to determine if spacer and PAM at that site provide good cleavage activity at that site
- 4) Donor only control—cells are transfected with the donor polynucleotide to determine if there is incorporation through HDR at the site
- 5) Tethered sgRNA/dCas9 and donor transfection—cells are transfected with donor polynucleotide and with tethered sgRNA/dCas9 complexes for the site adjacent to the target locus.
- 6) Tethered sgRNA/Cas9 at the target locus—cells are transfected with catalytically active tethered sgRNA/Cas9 complex with donor polynucleotide to the target locus.
- 7) Standard sgRNA/Cas9 and donor transfection—cells are transfected with catalytically active sgRNA/Cas9 complex and donor polynucleotide. This will determine the HDR incorporation rates for a standard HDR experiment.
- 8) Standard sgRNA/Cas9 and tethered sgRNA/dCas9 (one site) and donor transfection—cells are transfected with catalytically active sgRNA/Cas9 complex targeting the target locus and with tethered sgRNA/dCas9 targeting the target adjacent locus and donor polynucleotide.
- 9) Standard sgRNA/Cas9 and tethered sgRNA/dCas9 (two sites) and donor transfection—cells are transfected with catalytically active sgRNA/Cas9 complex targeting the target locus and with tethered sgRNA/dCas9 targeting upstream and downstream of the target adjacent locus and donor polynucleotide. The two tethers on the two sgRNA/dCas9 complexes stretch the donor polynucleotide across the double-stranded break and make that region available for HDR.
- D. Cell Culture and Transfections:
- K562 cells (ATCC, Manassas, Va.) are cultured in suspension in IMDM medium supplemented with 10% FBS and 1% penicillin and streptomycin at 37° C. with 100% humidity. K562 cells are transfected using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.). RNPs and dCas9 RNPs are arranged in a 96-well plate with 2 μl of Cas9 RNP or 4 μl of Cas9 and dCas9 RNP combined. K562 cells are transferred to a 50 ml conical centrifuge tube and centrifuged at 200×G for 3 minutes. The media is aspirated and the cell pellet washed in calcium and magnesium-free PBS. The cells are centrifuged once more and then resuspended in Nucleofector SF buffer (Lonza, Allendale, N.J.) at a concentration of 1×107 cells/ml. 20 μl of this cell suspension is added to the RNP in the 96 well plate, mixed, and then the entire volume is transferred to a 96-well Nucleocuvette™ Plate (Lonza, Allendale, N.J.). The plate is then loaded into the Nucleofector™ 96-well Shuttle™ (Lonza, Allendale, N.J.) and cells are nucleofected using the 96-FF-120 Nucleofector™ program (Lonza, Allendale, N.J.). Immediately following nucleofection, 80 μl of complete IMDM medium is added to each well of the 96-well Nucleocuvette™ Plate. The entire contents of the well are then transferred to a 96-well tissue culture plate containing 100 μl of complete IMDM medium. The cells are cultured at 37° C. with 100% humidity conditions for 48 hours.
- After 48 hours the K562 cells are centrifuged at 500×G for 5 minutes and the medium is removed. The cells are washed 1 time in calcium and magnesium-free PBS. The cell pellets are then resuspended in 50 μl of QuickExtract DNA Extraction solutions (Epicentre, Madison, Wis.). The gDNA samples obtained are then incubated at 37° C. for 10 minutes, 65° C. for 6 minutes and 95° C. for 3 minutes to stop the reaction. gDNA samples are then diluted with 50 μl of water and stored at −20° C.
- This gDNA is PCR-amplified using Q5 Hot Start High-
Fidelity 2× Master Mix (New England Biolabs, Ipswich, Mass.) at 1× concentration, primers at 0.5 μM each, 3.75 μL of gDNA in a final volume of 10 L and amplified 98° C. for 1 minutes, 35 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 min. PCR reaction was diluted 1:100 in water. - A second “barcoding” PCR is set up using unique primers for each sample, facilitating multiplex sequencing. The second PCR is performed using Q5 Hot Start High-
Fidelity 2× Master Mix (New England Biolabs, Ipswich, Mass.) at 1× concentration, primers at 0.5 μM each, 1 μL of 1:100 diluted first PCR, in a final volume of 10 μL and amplified 98° C. for 1 minutes, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes. - E. SPRIselectclean-up:
- PCR reactions are pooled into a single microfuge tube for SPRIselect™ bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing.
- To pooled amplicons, 0.9× volumes of SPRIselect™ beads are added, and mixed and incubated at room temperature (RT) for 10 minutes. The microfuge tube is placed on a magnetic tube stand (Beckman Coulter, Pasadena, Calif.) until the solution has cleared. Supernatant is removed and discarded, and the residual beads are washed with 1 volume of 85% ethanol, and incubated at RT for 30 s. After incubation, ethanol is aspirated and beads are air dried at RT for 10 min. The microfuge tube is then removed from the magnetic stand and 0.25× volumes of Qiagen EB buffer (Qiagen, Venlo, Netherlands) was added to bead, mixed vigorously, and incubated for 2 minutes at RT. The microfuge tube is returned to the magnet, incubated until solution has cleared and supernatant containing the purified amplicons is dispensed into a clean microfuge tube. The purified amplicon library is quantified using the Nanodrop™ 2000 system (Thermo Fisher Scientific, Wilmington Del.) and library quality analyzed using the Fragment Analyzer™ system (Advanced Analytical Technologies, Inc., Ames, Iowa) and the DNF-910 dsDNA Reagent Kit™ (Advanced Analytical Technologies, Inc. Ames, Iowa).
- F. Deep Sequencing Set-Up:
- The amplicon library is normalized to 4 nmolar concentration as calculated from Nanodrop values and size of the amplicons. The library is analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2™, 300 Cycles (Illumina, San Diego, Calif.), with two 151-cycle paired-end run plus two eight-cycle index reads.
- G. Deep Sequencing Data Analysis:
- The identity of products in the sequencing data is analyzed based upon the index barcode sequences adapted onto the amplicons in the second round of PCR. A computational script is used to process the MiSeq data by executing the following tasks:
- 1. Reads are aligned to the human genome (build GRCh38/38) using Bowtie (bowtie-bio.sourceforge.net/index.shtml) software.
- 2. Aligned reads are compared to expected wild type target locus sequence. Reads not aligning to any part of the target locus are discarded.
- 3. Reads matching wild-type target sequence are tallied. Reads with indels are categorized by indel type and tallied.
- 4. Total indel reads are divided by the sum of wild-type reads and indel reads give percent-mutated reads.
- This data is then analyzed to determine if sgRNA/dCas9 tethered donor polynucleotides increase HDR efficiency compared to passively diffused donor polynucleotides.
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain is inactivated (Cas9D10A) to engage preferentially homology-directed repair (HDR) pathways and block mutagenic repair pathways at break sites in eukaryotic cells. In this example Cas9D10A is used with two specific, single-guide RNAs (sgRNAs) that deliver the nickase to two sites on the same strand 30-60 nucleotides apart. Spacer sequences were chosen from available sequences in human genomic DNA so that each of the two sgRNAs would target Cas9 to a location on either side of the desired region for modification.
- Production of Cas9D10A Nickase and Cas9 Nuclease Components:
- sgRNA components of Cas9D10A Ribonucleoprotein (RNP) complexes (also termed “sgRNA/Cas9 nickase complexes” herein) and catalytically active Cas9 nuclease RNP complexes (also termed “sgRNA/Cas9 complexes” herein) were produced by in vitro transcription (e.g., T7 Quick High Yield RNA Synthesis Kit, New England Biolabs, Ipswich, Mass.) from double-stranded DNA templates incorporating a T7 promoter at the 5′ end of the DNA sequence. Polymerase Chain Reaction (PCR) using 5 overlapping primers assembled the double-stranded DNA templates for the sgRNA components. The oligonucleotides used in the assembly are presented in Table 6.
-
TABLE 6 Overlapping Primers for Generating Cas9D10A and Cas9 Nuclease sgRNA Component Templates Target for Component DNA binding Primers Cas9 and CD34 Target 1 A, B, C, D, E Cas9D10A sgRNA Cas9 and CD34 Target 2 A, B, C, D, F Cas9D10A sgRNA Cas9 and CD34 Target 3 A, B, C, D, G Cas9D10A sgRNA Cas9 and CD34 Target 4 A, B, C, D, H, Cas9D10A sgRNA Cas9 and CD34 Target 5 A, B, C, D, I Cas9D10A sgRNA A GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCT AGTCCGTTATCAAC (SEQ ID NO: 3) B AAAAAAAGCACCGACTCGGTGCCACTTTTTCAAGTT GATAACGGACTAGC (SEQ ID NO: 4) C AAAAAAAGCACCGACTCGGTGCC (SEQ ID NO: 1) D AGTAATAATACGACTCACTATAG (SEQ ID NO: 2) E TAATACGACTCACTATAGGAACACTGTGCTGATTAC AGGTTTTAGAGCTAGAAATAGC (SEQ ID NO: 34) F TAATACGACTCACTATAGGTTTGTGTTTCCATAAAC TGGTTTTAGAGCTAGAAATAGC (SEQ ID NO: 35) G TAATACGACTCACTATAGGCTACTAACTTGAGCTCC CCGTTTTAGAGCTAGAAATAGC (SEQ ID NO: 36) H TAATACGACTCACTATAGTCCCAAAGGCGGAGGGCG TTGTTTTAGAGCTAGAAATAGC (SEQ ID NO: 37) I TAATACGACTCACTATAGAGGCTGGGTTGCCGCCGT CGGTTTTAGAGCTAGAAATAGC (SEQ ID NO: 38) - The DNA primers were present at a concentration of 2 nM each. Two outer DNA primers corresponding to the T7 promoter (forward primer: Oligonucleotide A, Table 1), and the 3′ end of the RNA sequence (reverse primers: Oligonucleotides C, Table 1) were used at 640 nM to drive the amplification reaction. PCR reactions were performed using Kapa HiFiHotstart PCR™ kit (Kapa Biosystems, Inc., Wilmington, Mass.) as per manufacturer's recommendation. PCR assembly reactions were carried out using the following thermal cycling conditions: 98° C. for 2 minutes, 35 cycles of 15 seconds at 98° C., 15 seconds at 62° C., 15 seconds at 72° C., and a final extension at 72° C. for 2 min. Following the PCR reaction, the quantity of PCR product was determined using capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa).
- Between 0.25-0.5 μg of the DNA template for the sgRNA components were transcribed using T7 High Yield RNA synthesis Kit (New England Biolabs, Ipswich, Mass.) for approximately 16 hours at 37° C. Transcription reactions were DNAse I treated (New England Biolabs, Ipswich, Mass.). The quality of the transcribed RNA was checked by capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ames, Iowa). Protein components of RNPs were expressed from bacterial expression vectors in E. coli (BL21 (DE3)) and purified using affinity, ion exchange and size exclusion chromatography according to methods described in Jinek, M., et al., “A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity,” Science 337 (2012) 816-821. This method for production of Cas9 and/or Cas9D10A/Cas9H840A RNPs can be applied to the production of other Cas9 and/or Cas9D10A/Cas9H840A RNPs as described herein. The coding sequence for S. pyogenes Cas9 included the two nuclear localization sequences (NLS) at the C-terminus. Cas9D10A or Cas9H840A nickase variants of NLS-tagged Cas9, where an active site residue of either nuclease domain is mutated (Jinek, et al., 2012) were prepared by introducing mutations into the coding sequence of S. pyogenes Cas9 by site directed mutagenesis (e.g. Q5 Site-directed Mutagenesis Kit, New England Biolabs, Ipswich, Mass.).
- This example illustrates the use of a MiSeq Sequencer (Illumina, San Diego, Calif.) for deep sequencing analysis to quantify total editing events initiated by DNA cleavage by Cas9 or Cas9D10A and compare DNA repair types. Example DNA repair types can include mutagenic end-joining pathways such as non-homologous end joining (NHEJ) or insertion of material from a donor sequence by homology directed repair (HDR). In this example, Cas9 and Cas9D10A were directed to the human gene CD34 at five independent sites by specific sgRNAs.
- A. Transfection of Cas9/Cas9D10A RNPs:
- To assemble Cas9 and Cas9D10A RNPs, 1.36 μl of sgRNA (corresponding to approximately 1-5 μg) were incubated for 2 minutes at 95° C. then allowed to equilibrate to room temperature for approximately 5 minutes. Subsequently, Cas9 and Cas9D10A were mixed with a corresponding sgRNA to form RNPs in reaction buffer (20 mM HEPES, pH 7.5, 100 mM KCL, 5 mM MgCl2, 5% glycerol). 20 pmols of Cas9 or Cas9D10A were combined with the target sgRNA and functional RNPs were assembled by incubating at 37° C. for 10 minutes. Finally, 20 pmols of Cas9 or Cas9D10A RNP was combined with 100 pmols of DNA donor oligonucleotide template for HDR prior to transfection into cells. Experiments were performed in triplicate.
-
TABLE 7 DNA Oligonucleotide Donor Templates CD34 TCAGTTTATGGAAACACAAACTCTTCTGTCCAGTCACAGA Target 1 gaattcCTGTAATCAGCACAGTGTTCACCACCCCAGCCAA TCGTTCAACT (SEQ ID NO: 39) CD34 CCAGAAACGACAGTCAAATTCACATCTACCTCTGTGATAA Target 2 gaattcCAGTTTATGGAAACACAAACTCTTCTGTCCAGTC ACAGACCTCT (SEQ ID NO: 40) CD34 ACCCAGCCTCCCTCCTAACGCCCTCCGCCTTTGGGACCAA Target 3 gaattcGGGGAGCTCAAGTTAGTAGCAGCCAAGGAGAGGC GCTGCCTTGC (SEQ ID NO: 41) CD34 CCACCTTTTTTGGCCTCGACGGCGGCAACCCAGCCTCCCT Target 4 gaattcAACGCCCTCCGCCTTTGGGACCAACCAGGGGAGC TCAAGTTAGT (SEQ ID NO: 42) CD34 CGAGGCATCTGGAGCCCGAACAAACCTCCACCTTTTTTGG Target 5 gaattcCGACGGCGGCAACCCAGCCTCCCTCCTAACGCCC TCCGCCTTTG (SEQ ID NO: 43) CD34 CACATCTACCTCTGTGATAAgCTCAGTTTATGGAAttCAC Targets AAAACTCTTCTGTCCGTCACAGAgCTCTGTAATCAGCACA 1 + 2 GTGTTCACCA Cas9D10A (SEQ ID NO: 44) CD34 CCTCGACGGCGGCAACCCAGCCTCCCTgCTAACGCCCTCC Targets GaaTTcTGGGACCAAgCAGGGGAGCTCAAGTTAGTAGCAG 3 + 4 CCAAGGAGAG Cas9D10A (SEQ ID NO: 45) CD34 CCGAACAAACCTCCACCTTTTTTGGCgTCGACGGCGGCAA Targets CCgAattCCTCCCTCgTAACGCCCTCCGCCTTTGGGACCA 4 + 5 ACCAGGGGAG Cas9D10A (SEQ ID NO: 46) CD34 CCACCTTTTTTGGgCTCGACGGCGGCAACCCAGCCTCCCT Targets CCgAAttCGCCCTCCGCCTTTGGGACCAAgCAGGGGAGCT 3 + 5 CAAGTTAGTA Cas9D10A (SEQ ID NO: 47) - Cas9/Cas9D10ARNP complexes were transfected into K562 cells (ATCC, Manassas, Va.), using the Nucleofector® 96-well Shuttle System (Lonza, Allendale, N.J.) and the following protocol: RNP and RNP plus donor complexes were dispensed in a 2-3 μL final volume into individual wells of a 96-well plate. K562 cells suspended in media were transferred from culture flask to a 50 mL conical, cells were then pelleted by centrifugation for 3 minutes at 200×g, the culture medium aspirated and washed once with calcium and magnesium-free PBS. K562 cells were then pelleted by centrifugation for 3 minutes at 200×g, the PBS aspirated and cell pellet was resuspended in 10 mL of calcium and magnesium-free PBS.
- K562 cells were counted using the Countess® II Automated Cell Counter™ (Life Technologies, Grand Island, N.Y.). 2.2×107 cells were transferred to a 50 ml tube and pelleted. The PBS was aspirated and the cells were resuspended in Nucleofector™ SF (Lonza, Allendale, N.J.) solution to a density of 1×107 cells/mL. 20 μL of the cell suspension are then added to individual wells containing 2-3 μL of RNP and RNP plus Donor complexes and the entire volume was transferred to the wells of a 96-well Nucleocuvette™ Plate (Lonza, Allendale, N.J.). The plate was loaded onto the Nucleofector™ 96-well Shuttle™ (Lonza, Allendale, N.J.) and cells were nucleofected using the 96-FF-120 Nucleofector™ program (Lonza, Allendale, N.J.). Post-nucleofection, 80 μL Iscove's Modified Dulbecco's Media (IMDM, Life Technologies, Grand Island, N.Y.), supplemented with 10% FBS (Fisher Scientific, Pittsburgh, Pa.) and supplemented with penicillin and streptomycin (Life Technologies, Grand Island, N.Y.), was added to each well and 50 μL of the cell suspension was transferred to a 96-well cell culture plate containing 150 μL pre-warmed IMDM complete culture medium. The plate was then transferred to a tissue culture incubator and maintained at 37° C. in 5% CO2 for approximately 48 hours.
- Genomic DNA (gDNA) was isolated from K562 cells 48 hours after Cas9/Cas9D10A transfection using 50 μL QuickExtract DNA Extraction solution (Epicentre, Madison, Wis.) per well followed by incubation at 37° C. for 10 minutes. 50 μL water was added to the samples, and next they were incubated at 75° C. for 10 minutes and 95° C. for 5 minutes to stop the reaction. gDNA was stored at −80° C. until further processing.
- B. Sequencing Library Preparation:
- Using previously isolated gDNA, a first PCR was performed using Q5 Hot Start High-
Fidelity 2× Master Mix™ (New England Biolabs, Ipswich, Mass.) at 1× concentration, primers at 0.5 μM each, 3.75 μL of gDNA in a final volume of 10 L and amplified 98° C. for 1 minute, 35 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes. PCR reaction was diluted 1:100 in water. Target-specific primers are shown in Table 8. -
TABLE 8 Target-specific Primers Used CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTTGCAA Target GGCTAGTGCTAGTGG 1_F (SEQ ID NO: 48) CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTA Target CATGCACACCCATGTTTTG 1_R (SEQ ID NO: 49) CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTAACAT Target TTCCAGGTGACAGGC 2_F (SEQ ID NO: 50) CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTA Target CATGCACACCCATGTTTTG 2_R (SEQ ID NO: 51) CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTGTGGG Target GGATTCTTGCTTTTT 3_F (SEQ ID NO: 52) CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTC Target TCCAGAAAGCTGAACGAGG 3_R (SEQ ID NO: 53) CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTTTTCC Target TCTCTTCTCCCCTCC 4_F (SEQ ID NO: 54) CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTC Target TGCCACAAAGGGGTTAAAA 4_R (SEQ ID NO: 55) CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTTTTCC Target TCTCTTCTCCCCTCC 5_F (SEQ ID NO: 56) CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTC Target TGCCACAAAGGGGTTAAAA 5_R (SEQ ID NO: 57) CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTTGCAA Targets GGCTAGTGCTAGTGG 1 + 2 (SEQ ID NO: 58) Cas9D10A_F CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTC Targets ACATGCACACCCATGTTTT 1 + 2 (SEQ ID NO: 59) Cas9D10A_R CD34 GGAGTTCAGACGTGTGCTCTTCCGATCTTCTCT Targets TCTCCCCTCCCTTTT 3-5 (SEQ ID NO: 60) Cas9D10A_F CD34 CACTCTTTCCCTACACGACGCTCTTCCGATCTG Targets CCACAAAGGGGTTAAAAGTT 3-5 (SEQ ID NO: 61) Cas9D10A_R - A second ‘barcoding’ PCR was set up using unique primers for each sample facilitating multiplex sequencing. Primer pairs are shown in Table 9.
-
TABLE 9 Barcoding Primers ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACTGAACCTT FORi5_BC9 ACACTCTTTCCCTACACGACG (SEQ ID NO: 62) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACTGCTAAGT FORi5_BC10 ACACTCTTTCCCTACACGACG (SEQ ID NO: 63) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACTAAGTTCC FORi5_BC11 ACACTCTTTCCCTACACGACG (SEQ ID NO: 64) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACATAGAGGC FORi5_BC12 ACACTCTTTCCCTACACGACG (SEQ ID NO: 65) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACGGCTCTGA FORi5_BC13 ACACTCTTTCCCTACACGACG (SEQ ID NO: 66) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACAGGCGAAG FORi5_BC14 ACACTCTTTCCCTACACGACG (SEQ ID No: 67) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACTAATCTTA FORi5_BC15 ACACTCTTTCCCTACACGACG (SEQ ID NO: 68) ILMN_AMP_ AATGATACGGCGACCACCGAGATCTACACCAGGACGT FORi5_BC16 ACACTCTTTCCCTACACGACG (SEQ ID NO: 69) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATACATCGGTGAC REVi7_BC49 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 70) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATGCCTAAGTGAC REVi7_BC50 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 71) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATTCAAGTGTGAC REVi7_BC51 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 72) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATCTGATCGTGAC REVi7_BC52 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 73) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATGTAGCCGTGAC REVi7_BC53 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 74) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATTTGACTGTGAC REVi7_BC54 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 75) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATGGAACTGTGAC REVi7_BC55 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 76) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATTGACATGTGAC REVi7_BC56 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 77) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATGGACGGGTGAC REVi7_BC57 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 78) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATCCACTCGTGAC REVi7_BC58 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 79) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATCTTTTGGTGAC REVi7-BC59 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 80) ILMN_AMP_ CAAGCAGAAGACGGCATACGAGATATTGAGTGGTGAC REVi7_BC60 TGGAGTTCAGACGTGTGCTC (SEQ ID NO: 81) - The second PCR was performed using Q5 Hot Start High-
Fidelity 2× Master Mix™ (New England Biolabs, Ipswich, Mass.) at 1× concentration, primers at 0.5 μM each, 1 μL of 1:100 diluted first PCR, in a final volume of 10 μL and amplified 98° C. for 1 minute, 12 cycles of 10 s at 98° C., 20 s at 60° C., 30 s at 72° C., and a final extension at 72° C. for 2 minutes. PCR reactions were pooled into a single microfuge tube for SPRIselect™ bead (Beckman Coulter, Pasadena, Calif.) based clean up of amplicons for sequencing. - To pooled amplicons, 0.9× volumes of SPRIselect™ beads were added, mixed and incubated at room temperature (RT) for 10 minutes. The microfuge tube was placed on a magnetic tube stand (Beckman Coulter, Pasadena, Calif.) until the solution had cleared. Supernatant was removed and discarded, and the residual beads were washed with 1 volume of 85% ethanol, and incubated at RT for 30 s. After incubation, ethanol was aspirated and beads were air dried at RT for 10 min. The microfuge tube was then removed from the magnetic stand and 0.25× volumes of water (Qiagen, Venlo, Limburg) was added to the bead, mixed vigorously, and incubated for 2 minutes at RT. The microfuge tube was spun in a microcentrifuge to collect the contents of the tube, and was then returned to the magnet, incubated until the solution had cleared, and the supernatant containing the purified amplicons were dispensed into a clean microfuge tube. The purified amplicon library was quantified using the Nanodrop™ 2000 system (Thermo Scientific, Wilmington Del.).
- The amplicon library was normalized to 4 nmolar concentration as calculated from Nanodrop values and size of the amplicons. The library was analyzed on MiSeq Sequencer with MiSeq Reagent Kit v2™, 300 Cycles (Illumina, San Diego), with two 151-cycle paired-end run plus two eight-cycle index reads.
- C. Deep Sequencing Data Analysis:
- The identity of products in the sequencing data was analyzed based upon the index barcode sequences adapted onto the amplicons in the second round of PCR. A computational script was used to process the MiSeq data by executing the following tasks:
-
- 1. Reads were aligned to the human genome (build GRCh38/38) using Bowtie (bowtie-bio.sourceforge.net/index.shtml) software.
- 2. Aligned reads were compared to wild type loci. Reads not aligning to any part of the loci were discarded.
- 3. Reads matching wild-type sequence were tallied. Reads with indels (surrounding the Cas9 cut site) were categorized by indel type and tallied.
- 4. Total indel reads were divided by the sum of wild-type reads and indel reads gave percent-mutated reads.
- Indel structures were compared between sequence data that was generated from cells transfected with wild-type Cas9 RNP or Cas9 RNP+Donor, for each of the individual targets, and for Cas9D10A RNP and Cas9D10A RNP+Donor for each of the pairs of targets. The experimental data demonstrated that cells transfected with Cas9 RNP exhibited a number of classes of mutant edits. Cas9 RNP+Donor showed a similar spectrum of mutant edits and donor-dependent edits, whereas cells transfected with Cas9D10A RNP only, showed no evidence of editing but Cas9D10A RNP+Donor demonstrated similar levels of donor insertion to the Cas9 RNP+Donor, but with no measurable mutant edits that could not be attributed to incorporation of the donor sequence.
-
FIG. 12 shows a comparison of repair types using either Cas9 or Cas9D10A atTargets 3 and 4 (human CD34 locus). Cas9 RNP complexed with sgRNA was used to target eitherCD34 Target 3 orCD34 Target 4. Cas9D10A RNPs complexed with sgRNA were used to targetCD34 Target 3 andTarget 4. Negative controls were Cas9 or Cas9D10A only, without sgRNA. The distribution of repair is shown by the bars. As can be seen, Cas9 RNP displayed only mutagenic repair. Cas9 RNP+Donor demonstrated mutagenic repair and HDR, whereas Cas9D10A RNP showed barely detectable mutagenic repair. Cas9D10A RNP+Donor demonstrated HDR edits with barely detectable mutagenic repair. - Table 10 contains an average of three replicates (excluding negative controls n=2) and standard deviation (STD) of each class.
-
TABLE 10 Data Used in FIG. 12 % % Muta- Uned- Uned- genic % ited MUT HDR Sample Nuclease ited Repair HDR STD STD STD Target 3 Cas9 7.7 92.3 0 0.57 0.57 0 Target 4Cas9 58.3 41.7 0 2.39 2.3 0 Target Cas9 2 59.7 38.3 0 2.31 2.31 3 + Donor Target Cas9 22.3 54.3 23.3 2.08 2.51 1.52 4 + Donor Target Cas9 100 0 0 ND ND ND 3 neg Target Cas9 100 0 0 ND ND ND 4 neg Target Cas9D10A 99.3 0.7 0 0.05 0.05 0 3 + 4 Target Cas9D10A 81.8 0.5 17.6 1.52 0 1.52 3 + 4 Donor Target Cas9D10A 99.4 0.6 0 ND ND ND 3 + 4 neg - Following the guidance of the present specification and examples, the deep sequencing analysis described in this example can be practiced by one of ordinary skill in the art with other Cas9/Cas9D10A RNP complexes (i.e. assembled with distinct sgRNAs and distinct ratios of Cas9/Cas9D10A and donor oligonucleotide templates).
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage preferentially HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells. In this example, spacer sequences for the two sgRNA sequences are chosen to vary the length of the deletion around the desired target site. Sequences are chosen such that the paired nickases are targeted to two sites on the same strand varying the distance between two sites in a range from 20 to 2000 nucleotides apart. Donor polynucleotides are designed with different lengths and positions relative to the locations of the spacer sequences and tested in combination with each pair of Cas9 nickase sgRNPs. Using the methods described in Examples 6 and 7, experiments are conducted to measure the frequency and type of DNA repair that takes place with each combination of paired nickases. Data are analyzed to identify the combination of nickase sgRNPs and donor polynucleotide that leads to the highest frequency of HDR with the lowest frequency of mutant editing.
- This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage preferentially HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells. In this example, either paired Cas9D10A or paired Cas9H840A are used with two specific, sgRNAs that deliver the paired nickases to two sites on the same strand 20-2000 nucleotides apart. Donor oligonucleotides are designed to deliver specific nucleotide insertions or deletions at the desired site (
FIG. 11 ). Experiments are carried out varying spacing between nickases and varying donor sequence and length as described in Example 8 to identify the combination of reagents leading to the highest frequency of HDR and lowest frequency of mutagenic repair to introduce the intended modification at the desired site. - This example illustrates the use of a Cas9 nickase mutant where one nuclease domain will be inactivated (either Cas9D10A or Cas9H840A) to engage exclusively HDR pathways and block mutagenic repair pathways at break sites in eukaryotic cells. In this example, either paired Cas9D10A or paired Cas9H840A can be used in tandem complexed with two specific sgRNAs that deliver the paired nickases to two sites on the same strand 20-2000 nucleotides apart. The donor oligonucleotides are provided in different orientations and/or lengths to deliver specific nucleotide insertions or deletions between two target Cas9-nickase sites in human primary cells for therapeutic advantage.
- This example illustrates the use of pairs of Cas9 nickase mutants to engage preferentially homology-directed repair pathways and block mutagenic repair pathways at break sites in eukaryotic cells. In this example, Cas9D10A and Cas9H840A are used in combination with two specific sgRNAs that deliver the paired nickases to two sites resulting in nicking of the same strand 20-2000 nucleotides apart. The sgRNAs paired with Cas9D10A must be chosen to target protospacer sequences and PAM sequences on one strand. The sgRNAs paired with Cas9H840A must be chosen to target protospacer sequences and PAM sequences on the opposite strand to the Cas9D10A sgRNAs to ensure that the same strand is nicked twice. sgRNPs are assembled separately for each nickase mutant by combining the protein with the selected sgRNA. Donor oligonucleotides are designed to deliver specific nucleotide insertions or deletions at the desired site (
FIG. 11 ) and synthesized by an oligonucleotide manufacturer (e.g. Integrated DNA Technologies, Coralville, Iowa). Cas9D10a-sgRNPs are mixed with Cas9H840A-sgRNPs before transfection and the pair of nickases targeting the same strand transfected together with the donor oligonucleotide using methods described in above examples. Experiments are carried out varying spacing between nickases and varying donor sequence and length as described in Example 8 to identify the combination of reagents leading to the highest frequency of HDR and lowest frequency of mutagenic repair to introduce the intended modification at the desired site. - Although preferred embodiments of the subject methods have been described in some detail, it is understood that obvious variations can be made without departing from the spirit and the scope of the methods as defined by the appended claims.
Claims (21)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/887,893 US20180171360A1 (en) | 2014-08-27 | 2018-02-02 | Methods for increasing cas9-mediated engineering efficiency |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201462042358P | 2014-08-27 | 2014-08-27 | |
| US201462047495P | 2014-09-08 | 2014-09-08 | |
| US14/836,753 US9970030B2 (en) | 2014-08-27 | 2015-08-26 | Methods for increasing CAS9-mediated engineering efficiency |
| US15/887,893 US20180171360A1 (en) | 2014-08-27 | 2018-02-02 | Methods for increasing cas9-mediated engineering efficiency |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/836,753 Continuation US9970030B2 (en) | 2014-08-27 | 2015-08-26 | Methods for increasing CAS9-mediated engineering efficiency |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20180171360A1 true US20180171360A1 (en) | 2018-06-21 |
Family
ID=54140655
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/836,753 Active US9970030B2 (en) | 2014-08-27 | 2015-08-26 | Methods for increasing CAS9-mediated engineering efficiency |
| US15/887,893 Abandoned US20180171360A1 (en) | 2014-08-27 | 2018-02-02 | Methods for increasing cas9-mediated engineering efficiency |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/836,753 Active US9970030B2 (en) | 2014-08-27 | 2015-08-26 | Methods for increasing CAS9-mediated engineering efficiency |
Country Status (11)
| Country | Link |
|---|---|
| US (2) | US9970030B2 (en) |
| EP (1) | EP3186376B1 (en) |
| AU (1) | AU2015308910B2 (en) |
| CA (1) | CA2959070C (en) |
| DK (1) | DK3186376T3 (en) |
| ES (1) | ES2730378T3 (en) |
| GB (1) | GB2544001A (en) |
| IL (1) | IL250700B (en) |
| NZ (1) | NZ728437A (en) |
| SG (1) | SG11201701245QA (en) |
| WO (1) | WO2016033246A1 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2022060707A1 (en) * | 2020-09-15 | 2022-03-24 | Rutgers, The State University Of New Jersey | Systems for gene editing and methods of use thereof |
| WO2024086805A3 (en) * | 2022-10-21 | 2024-06-06 | Watchmaker Genomics, Inc. | Methods and compositions for sequencing library normalization |
Families Citing this family (69)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2853829C (en) | 2011-07-22 | 2023-09-26 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
| RS58255B1 (en) | 2013-04-16 | 2019-03-29 | Regeneron Pharma | Targeted modification of rat genome |
| US20150044192A1 (en) | 2013-08-09 | 2015-02-12 | President And Fellows Of Harvard College | Methods for identifying a target site of a cas9 nuclease |
| US9359599B2 (en) | 2013-08-22 | 2016-06-07 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
| US9228207B2 (en) | 2013-09-06 | 2016-01-05 | President And Fellows Of Harvard College | Switchable gRNAs comprising aptamers |
| US9388430B2 (en) | 2013-09-06 | 2016-07-12 | President And Fellows Of Harvard College | Cas9-recombinase fusion proteins and uses thereof |
| US9526784B2 (en) | 2013-09-06 | 2016-12-27 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
| KR102170502B1 (en) | 2013-12-11 | 2020-10-28 | 리제너론 파마슈티칼스 인코포레이티드 | Methods and compositions for the targeted modification of a genome |
| US20150165054A1 (en) | 2013-12-12 | 2015-06-18 | President And Fellows Of Harvard College | Methods for correcting caspase-9 point mutations |
| EP3177718B1 (en) | 2014-07-30 | 2022-03-16 | President and Fellows of Harvard College | Cas9 proteins including ligand-dependent inteins |
| EP3221457B1 (en) | 2014-11-21 | 2019-03-20 | Regeneron Pharmaceuticals, Inc. | Methods and compositions for targeted genetic modification using paired guide rnas |
| AU2015355546B2 (en) | 2014-12-03 | 2021-10-14 | Agilent Technologies, Inc. | Guide RNA with chemical modifications |
| EP3230445B1 (en) | 2014-12-12 | 2024-01-24 | Tod M. Woolf | Compositions and methods for editing nucleic acids in cells utilizing oligonucleotides |
| CN114231527A (en) | 2015-04-06 | 2022-03-25 | 里兰斯坦福初级大学理事会 | Chemically modified guide RNAs for CRISPR/CAS-mediated gene regulation |
| WO2017070633A2 (en) | 2015-10-23 | 2017-04-27 | President And Fellows Of Harvard College | Evolved cas9 proteins for gene editing |
| US11905521B2 (en) * | 2015-11-17 | 2024-02-20 | The Chinese University Of Hong Kong | Methods and systems for targeted gene manipulation |
| DK3420080T3 (en) * | 2016-02-22 | 2019-11-25 | Caribou Biosciences Inc | PROCEDURE FOR MODULATING DNA REPAIR RESULTS |
| US20200291369A1 (en) * | 2016-03-11 | 2020-09-17 | Erasmus University Medical Center Rotterdam | Improved CRISPR-Cas9 Genome Editing Tool |
| EP3219799A1 (en) | 2016-03-17 | 2017-09-20 | IMBA-Institut für Molekulare Biotechnologie GmbH | Conditional crispr sgrna expression |
| EP4495235A3 (en) * | 2016-04-13 | 2025-05-21 | Editas Medicine, Inc. | Grna fusion molecules, gene editing systems, and methods of use thereof |
| EP4023228A1 (en) * | 2016-05-06 | 2022-07-06 | Tod M. Woolf | Genome editing oligonucleotide without programmable nucleases |
| NZ747857A (en) | 2016-05-20 | 2023-01-27 | Regeneron Pharma | Methods for breaking immunological tolerance using multiple guide rnas |
| DK3272867T3 (en) * | 2016-06-02 | 2019-12-02 | Sigma Aldrich Co Llc | USE OF PROGRAMMABLE DNA BINDING PROTEINS FOR IMPROVING TARGETED RE-MODIFICATION |
| US10767175B2 (en) | 2016-06-08 | 2020-09-08 | Agilent Technologies, Inc. | High specificity genome editing using chemically modified guide RNAs |
| WO2017214632A2 (en) * | 2016-06-10 | 2017-12-14 | University Of Wyoming | Recombinant insect vectors and methods of use |
| JP6855037B2 (en) * | 2016-07-19 | 2021-04-07 | 国立大学法人大阪大学 | Genome editing method |
| WO2018027078A1 (en) | 2016-08-03 | 2018-02-08 | President And Fellows Of Harard College | Adenosine nucleobase editors and uses thereof |
| JP7201153B2 (en) | 2016-08-09 | 2023-01-10 | プレジデント アンド フェローズ オブ ハーバード カレッジ | Programmable CAS9-recombinase fusion protein and uses thereof |
| CN109890202A (en) * | 2016-08-18 | 2019-06-14 | 凯诺比生长公司 | Increase and decrease plant and the method for cannboid synthesis |
| US11542509B2 (en) | 2016-08-24 | 2023-01-03 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
| EP3526320A1 (en) | 2016-10-14 | 2019-08-21 | President and Fellows of Harvard College | Aav delivery of nucleobase editors |
| WO2018119359A1 (en) | 2016-12-23 | 2018-06-28 | President And Fellows Of Harvard College | Editing of ccr5 receptor gene to protect against hiv infection |
| US12110545B2 (en) | 2017-01-06 | 2024-10-08 | Editas Medicine, Inc. | Methods of assessing nuclease cleavage |
| KR20190123328A (en) | 2017-03-09 | 2019-10-31 | 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 | Cancer vaccine |
| EP3592853A1 (en) | 2017-03-09 | 2020-01-15 | President and Fellows of Harvard College | Suppression of pain by gene editing |
| JP2020510439A (en) | 2017-03-10 | 2020-04-09 | プレジデント アンド フェローズ オブ ハーバード カレッジ | Base-editing factor from cytosine to guanine |
| WO2018170015A1 (en) * | 2017-03-14 | 2018-09-20 | The Regents Of The University Of California | Engineering crispr cas9 immune stealth |
| KR102687373B1 (en) | 2017-03-23 | 2024-07-23 | 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 | Nucleobase editing agent comprising a nucleic acid programmable DNA binding protein |
| KR102758434B1 (en) * | 2017-03-30 | 2025-01-21 | 고쿠리츠 다이가쿠 호진 교토 다이가쿠 | Method for inducing exon skipping by genome editing |
| CN110691848B (en) * | 2017-04-10 | 2024-04-02 | 马克斯·普朗克科学促进学会 | Compounds for improving genome editing efficiency |
| US11560566B2 (en) | 2017-05-12 | 2023-01-24 | President And Fellows Of Harvard College | Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation |
| EP3658573A1 (en) | 2017-07-28 | 2020-06-03 | President and Fellows of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (pace) |
| US11319532B2 (en) | 2017-08-30 | 2022-05-03 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
| JP2020534812A (en) * | 2017-09-08 | 2020-12-03 | ライフ テクノロジーズ コーポレイション | Methods for improved homologous recombination and compositions thereof |
| EP3697906A1 (en) | 2017-10-16 | 2020-08-26 | The Broad Institute, Inc. | Uses of adenosine base editors |
| CN111448313A (en) * | 2017-11-16 | 2020-07-24 | 阿斯利康(瑞典)有限公司 | Compositions and methods for improving the effectiveness of Cas9-based knock-in strategies |
| WO2019099982A1 (en) * | 2017-11-17 | 2019-05-23 | The Johns Hopkins University | Compositions and methods for efficient genome editing |
| US12406749B2 (en) | 2017-12-15 | 2025-09-02 | The Broad Institute, Inc. | Systems and methods for predicting repair outcomes in genetic engineering |
| GB2589246A (en) | 2018-05-16 | 2021-05-26 | Synthego Corp | Methods and systems for guide RNA design and use |
| EP3797160A1 (en) | 2018-05-23 | 2021-03-31 | The Broad Institute Inc. | Base editors and uses thereof |
| NZ775055A (en) | 2018-10-16 | 2024-11-29 | Blueallele Corp | Methods for targeted insertion of dna in genes |
| WO2020092453A1 (en) | 2018-10-29 | 2020-05-07 | The Broad Institute, Inc. | Nucleobase editors comprising geocas9 and uses thereof |
| WO2020154500A1 (en) | 2019-01-23 | 2020-07-30 | The Broad Institute, Inc. | Supernegatively charged proteins and uses thereof |
| SG11202109882VA (en) | 2019-03-19 | 2021-10-28 | Broad Inst Inc | Methods and compositions for editing nucleotide sequences |
| EP3956349A1 (en) | 2019-04-17 | 2022-02-23 | The Broad Institute, Inc. | Adenine base editors with reduced off-target effects |
| AU2020284581A1 (en) * | 2019-05-29 | 2021-12-23 | Monsanto Technology Llc | Directed genome engineering using enhanced targeted editing technologies |
| US12435330B2 (en) | 2019-10-10 | 2025-10-07 | The Broad Institute, Inc. | Methods and compositions for prime editing RNA |
| KR20230019843A (en) | 2020-05-08 | 2023-02-09 | 더 브로드 인스티튜트, 인코퍼레이티드 | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
| GB202007943D0 (en) * | 2020-05-27 | 2020-07-08 | Snipr Biome Aps | Products & methods |
| EP4288541A1 (en) | 2021-02-05 | 2023-12-13 | Christiana Care Gene Editing Institute, Inc. | Methods of and compositions for reducing gene expression and/or activity |
| US20220325338A1 (en) * | 2021-04-02 | 2022-10-13 | The Regents Of The University Of California | Systems and methods for performing and measuring homologous chromosome template repair |
| WO2022256448A2 (en) * | 2021-06-01 | 2022-12-08 | Artisan Development Labs, Inc. | Compositions and methods for targeting, editing, or modifying genes |
| CN115466780B (en) * | 2021-06-10 | 2025-08-15 | 北京大学 | Method for researching conformational change of Cas9 protein when recognizing target gene locus |
| JP2024534945A (en) | 2021-09-10 | 2024-09-26 | アジレント・テクノロジーズ・インク | Guide RNA for Prime Editing with Chemical Modifications |
| CN113801852B (en) * | 2021-10-18 | 2023-08-18 | 齐齐哈尔医学院 | A GPD1L-deleted human embryonic stem cell line and its construction method and application |
| US20250129387A1 (en) * | 2022-02-02 | 2025-04-24 | Biotalys NV | Methods for genome editing |
| CN114703174B (en) * | 2022-04-12 | 2023-10-24 | 中国科学院海洋研究所 | CRISPR/Cas9 gene knockout method for rapidly obtaining genotype and phenotype mutation and application thereof |
| WO2024092217A1 (en) * | 2022-10-27 | 2024-05-02 | The Trustees Of Columbia University In The City Of New York | Systems and methods for gene insertions |
| CN117126828A (en) * | 2023-09-15 | 2023-11-28 | 通用生物(安徽)股份有限公司 | Method for improving in-vitro enzyme digestion efficiency of Cas9 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030003547A1 (en) * | 2000-05-03 | 2003-01-02 | Institut Curie | Methods and compositions for effecting homologous recombination |
| US8871445B2 (en) * | 2012-12-12 | 2014-10-28 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US9023649B2 (en) * | 2012-12-17 | 2015-05-05 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US20160032292A1 (en) * | 2014-07-17 | 2016-02-04 | Georgia Tech Research Corporation | Aptamer-Guided Gene Targeting |
| US20180230494A1 (en) * | 2014-10-01 | 2018-08-16 | The General Hospital Corporation | Methods for increasing efficiency of nuclease-induced homology-directed repair |
Family Cites Families (50)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10087431B2 (en) | 2010-03-10 | 2018-10-02 | The Regents Of The University Of California | Methods of generating nucleic acid fragments |
| AU2011253222B2 (en) | 2010-05-10 | 2014-08-21 | The Regents Of The University Of California | Endoribonuclease compositions and methods of use thereof |
| US9593338B2 (en) | 2011-09-28 | 2017-03-14 | The Regents Of The University Of California | Synthetic transcriptional control elements and methods of generating and using such elements |
| US9895451B2 (en) | 2011-12-02 | 2018-02-20 | Yale University | Formulations for targeted release of agents to low pH tissue environments or cellular compartments and methods of use thereof |
| GB201122458D0 (en) | 2011-12-30 | 2012-02-08 | Univ Wageningen | Modified cascade ribonucleoproteins and uses thereof |
| WO2013122816A1 (en) | 2012-02-13 | 2013-08-22 | The Regents Of The University Of California | Methods for genome-wide screening and construction of genetic interaction maps |
| CA2865011C (en) | 2012-02-29 | 2021-06-15 | Sangamo Biosciences, Inc. | Methods and compositions for treating huntington's disease |
| AU2013235340B2 (en) | 2012-03-17 | 2019-05-23 | The Regents Of The University Of California | Fast diagnosis and personalized treatments for acne |
| US9637739B2 (en) | 2012-03-20 | 2017-05-02 | Vilnius University | RNA-directed DNA cleavage by the Cas9-crRNA complex |
| WO2013141680A1 (en) | 2012-03-20 | 2013-09-26 | Vilnius University | RNA-DIRECTED DNA CLEAVAGE BY THE Cas9-crRNA COMPLEX |
| PE20190844A1 (en) | 2012-05-25 | 2019-06-17 | Emmanuelle Charpentier | MODULATION OF TRANSCRIPTION WITH ADDRESSING RNA TO GENERIC DNA |
| US9688971B2 (en) | 2012-06-15 | 2017-06-27 | The Regents Of The University Of California | Endoribonuclease and methods of use thereof |
| KR20230065381A (en) | 2012-07-25 | 2023-05-11 | 더 브로드 인스티튜트, 인코퍼레이티드 | Inducible dna binding proteins and genome perturbation tools and applications thereof |
| HK1207111A1 (en) | 2012-08-03 | 2016-01-22 | 加利福尼亚大学董事会 | Methods and compositions for controlling gene expression by rna processing |
| IN2015DN01480A (en) | 2012-08-29 | 2015-07-03 | Sangamo Biosciences Inc | |
| UA118090C2 (en) | 2012-09-07 | 2018-11-26 | ДАУ АГРОСАЙЄНСІЗ ЕлЕлСі | Fad2 performance loci and corresponding target site specific binding proteins capable of inducing targeted breaks |
| DE202013012597U1 (en) * | 2012-10-23 | 2017-11-21 | Toolgen, Inc. | A composition for cleaving a target DNA comprising a guide RNA specific for the target DNA and a Cas protein-encoding nucleic acid or Cas protein, and their use |
| BR112015009812A2 (en) | 2012-10-31 | 2017-08-22 | Cellectis | METHOD FOR SPECIFIC GENETIC INSERTION INTO A PLANT GENOME, TRANSFORMED PLANT CELL AND ITS USE, HERBICIDIDE RESISTANT PLANT, KIT, VECTOR, AND HOST CELL |
| EP2928303A4 (en) | 2012-12-07 | 2016-07-13 | Haplomics Inc | REPAIR OF FACTOR VIII MUTATION AND INDUCTION OF TOLERANCE |
| WO2014093701A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Functional genomics using crispr-cas systems, compositions, methods, knock out libraries and applications thereof |
| US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
| EP2931898B1 (en) | 2012-12-12 | 2016-03-09 | The Broad Institute, Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
| DK2898075T3 (en) | 2012-12-12 | 2016-06-27 | Broad Inst Inc | CONSTRUCTION AND OPTIMIZATION OF IMPROVED SYSTEMS, PROCEDURES AND ENZYME COMPOSITIONS FOR SEQUENCE MANIPULATION |
| MX380562B (en) | 2012-12-12 | 2025-03-12 | Broad Inst Inc | MODIFICATIONS OF SYSTEMS, METHODS AND COMPOSITIONS OPTIMIZED GUIDE FOR SEQUENCE MANIPULATION. |
| EP2921557B1 (en) | 2012-12-12 | 2016-07-13 | The Broad Institute, Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
| WO2014093694A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Crispr-cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
| BR112015013784A2 (en) | 2012-12-12 | 2017-07-11 | Massachusetts Inst Technology | application, manipulation and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
| WO2014096972A2 (en) | 2012-12-21 | 2014-06-26 | Cellectis | Potatoes with reduced cold-induced sweetening |
| CN103981147B (en) | 2013-02-08 | 2017-11-10 | 中国科学院上海生命科学研究院 | A kind of new method for preparing hepatic parenchymal cells |
| US20140288421A1 (en) | 2013-03-12 | 2014-09-25 | The Regents Of The University Of California | Gas vesicle ultrasound contrast agents and methods of using the same |
| US20140288411A1 (en) | 2013-03-12 | 2014-09-25 | The Regents Of The University Of California | Gas vesicle magnetic resonance imaging contrast agents and methods of using the same |
| RU2018122288A (en) | 2013-03-14 | 2019-03-06 | Карибо Биосайенсиз, Инк. | COMPOSITIONS AND METHODS WITH PARTICIPATION OF NUCLEIC ACIDS AIMED AT NUCLEIC ACIDS |
| US11332719B2 (en) | 2013-03-15 | 2022-05-17 | The Broad Institute, Inc. | Recombinant virus and preparations thereof |
| HK1218232A1 (en) | 2013-03-21 | 2017-02-10 | 桑格摩生物科学股份有限公司 | Targeted disruption of t cell receptor genes using engineered zinc finger protein nucleases |
| EP4286517A3 (en) | 2013-04-04 | 2024-03-13 | President and Fellows of Harvard College | Therapeutic uses of genome editing with crispr/cas systems |
| US9902973B2 (en) | 2013-04-11 | 2018-02-27 | Caribou Biosciences, Inc. | Methods of modifying a target nucleic acid with an argonaute |
| JP2016521975A (en) | 2013-05-15 | 2016-07-28 | サンガモ バイオサイエンシーズ, インコーポレイテッド | Methods and compositions for the treatment of genetic conditions |
| US20140349405A1 (en) | 2013-05-22 | 2014-11-27 | Wisconsin Alumni Research Foundation | Rna-directed dna cleavage and gene editing by cas9 enzyme from neisseria meningitidis |
| US20160122774A1 (en) | 2013-05-29 | 2016-05-05 | Cellectis | A method for producing precise dna cleavage using cas9 nickase activity |
| AU2014274939B2 (en) | 2013-06-04 | 2020-03-19 | President And Fellows Of Harvard College | RNA-guideded transcriptional regulation |
| US9267135B2 (en) | 2013-06-04 | 2016-02-23 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
| JP6588438B2 (en) | 2013-08-28 | 2019-10-09 | サンガモ セラピューティクス, インコーポレイテッド | Composition for linking a DNA binding domain and a cleavage domain |
| AU2014350051A1 (en) | 2013-11-18 | 2016-07-07 | Crispr Therapeutics Ag | CRISPR-Cas system materials and methods |
| HUE051628T2 (en) | 2013-12-09 | 2021-03-01 | Sangamo Therapeutics Inc | Methods and compositions for genome engineering |
| WO2015089419A2 (en) | 2013-12-12 | 2015-06-18 | The Broad Institute Inc. | Delivery, use and therapeutic applications of the crispr-cas systems and compositions for targeting disorders and diseases using particle delivery components |
| US20150191744A1 (en) | 2013-12-17 | 2015-07-09 | University Of Massachusetts | Cas9 effector-mediated regulation of transcription, differentiation and gene editing/labeling |
| US10774338B2 (en) | 2014-01-16 | 2020-09-15 | The Regents Of The University Of California | Generation of heritable chimeric plant traits |
| ES2796087T3 (en) | 2014-02-03 | 2020-11-25 | Sangamo Therapeutics Inc | Methods and compositions for the treatment of beta thalassemia |
| JP6594891B2 (en) | 2014-03-18 | 2019-10-23 | サンガモ セラピューティクス, インコーポレイテッド | Methods and compositions for modulating zinc finger protein expression |
| CA2947035A1 (en) | 2014-05-08 | 2015-11-12 | Sangamo Biosciences, Inc. | Methods and compositions for treating huntington's disease |
-
2015
- 2015-08-26 GB GB1703026.3A patent/GB2544001A/en not_active Withdrawn
- 2015-08-26 WO PCT/US2015/047046 patent/WO2016033246A1/en not_active Ceased
- 2015-08-26 EP EP15763698.6A patent/EP3186376B1/en active Active
- 2015-08-26 US US14/836,753 patent/US9970030B2/en active Active
- 2015-08-26 CA CA2959070A patent/CA2959070C/en active Active
- 2015-08-26 SG SG11201701245QA patent/SG11201701245QA/en unknown
- 2015-08-26 ES ES15763698T patent/ES2730378T3/en active Active
- 2015-08-26 DK DK15763698.6T patent/DK3186376T3/en active
- 2015-08-26 AU AU2015308910A patent/AU2015308910B2/en active Active
- 2015-08-26 NZ NZ728437A patent/NZ728437A/en unknown
-
2017
- 2017-02-21 IL IL250700A patent/IL250700B/en active IP Right Grant
-
2018
- 2018-02-02 US US15/887,893 patent/US20180171360A1/en not_active Abandoned
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030003547A1 (en) * | 2000-05-03 | 2003-01-02 | Institut Curie | Methods and compositions for effecting homologous recombination |
| US8871445B2 (en) * | 2012-12-12 | 2014-10-28 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
| US9023649B2 (en) * | 2012-12-17 | 2015-05-05 | President And Fellows Of Harvard College | RNA-guided human genome engineering |
| US20160032292A1 (en) * | 2014-07-17 | 2016-02-04 | Georgia Tech Research Corporation | Aptamer-Guided Gene Targeting |
| US20180230494A1 (en) * | 2014-10-01 | 2018-08-16 | The General Hospital Corporation | Methods for increasing efficiency of nuclease-induced homology-directed repair |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2022060707A1 (en) * | 2020-09-15 | 2022-03-24 | Rutgers, The State University Of New Jersey | Systems for gene editing and methods of use thereof |
| WO2024086805A3 (en) * | 2022-10-21 | 2024-06-06 | Watchmaker Genomics, Inc. | Methods and compositions for sequencing library normalization |
| US12410469B2 (en) | 2022-10-21 | 2025-09-09 | Watchmaker Genomics, Inc. | Methods and compositions for sequencing library normalization |
Also Published As
| Publication number | Publication date |
|---|---|
| US20160257973A1 (en) | 2016-09-08 |
| IL250700A0 (en) | 2017-04-30 |
| US9970030B2 (en) | 2018-05-15 |
| ES2730378T3 (en) | 2019-11-11 |
| WO2016033246A1 (en) | 2016-03-03 |
| AU2015308910B2 (en) | 2017-12-07 |
| GB201703026D0 (en) | 2017-04-12 |
| EP3186376A1 (en) | 2017-07-05 |
| AU2015308910A1 (en) | 2017-02-09 |
| EP3186376B1 (en) | 2019-03-20 |
| CA2959070C (en) | 2020-11-10 |
| DK3186376T3 (en) | 2019-05-06 |
| NZ728437A (en) | 2018-02-23 |
| GB2544001A (en) | 2017-05-03 |
| CA2959070A1 (en) | 2016-03-03 |
| SG11201701245QA (en) | 2017-03-30 |
| IL250700B (en) | 2020-10-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9970030B2 (en) | Methods for increasing CAS9-mediated engineering efficiency | |
| US20250043257A1 (en) | Novel crispr-associated (cas) protein | |
| US10711258B2 (en) | Engineered nucleic-acid targeting nucleic acids | |
| US20190264193A1 (en) | Protein engineering methods | |
| HK1240618B (en) | Methods for increasing cas9-mediated engineering efficiency | |
| HK1240618A1 (en) | Methods for increasing cas9-mediated engineering efficiency | |
| HK40073630A (en) | Crispr-associated (cas) protein | |
| HK40005731B (en) | Crispr-associated (cas) protein |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| AS | Assignment |
Owner name: CARIBOU BIOSCIENCES, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CAMERON, PETER SEAN;MAY, ANDREW P.;NYE, CHRISTOPHER H.;AND OTHERS;REEL/FRAME:048016/0416 Effective date: 20150917 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |