WO2021155607A1 - Modified cytosine base editor and application thereof - Google Patents
- Publication number
- WO2021155607A1 (application PCT/CN2020/074561)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cytosine
- base editor
- target
- sequence
- editing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/34—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving hydrolase
Definitions
- the present invention belongs to the field of biotechnology. More specifically, the present invention relates to a modified cytosine base editor and its application.
- Base editing has been widely used for targeted base editing and has great potential in correcting disease-causing mutations.
- CRISPR/Cas and base editors-mediated gene editing methods have been developed, and have brought great hopes for the treatment of genetic diseases caused by pathogenic mutations.
- Clinical applications based on CRISPR/Cas gene editing or base editing require comprehensive analysis of off-target effects to reduce the risk of harmful mutations.
- HTGTS: high-throughput genome-wide translocation sequencing
- GUIDE-seq: genome-wide, unbiased identification of double-strand breaks enabled by sequencing
- CIRCLE-seq: circularization for in vitro reporting of cleavage effects by sequencing
- none of these methods can effectively detect single nucleotide variation (SNV). So far there is no effective method to detect SNV in this field.
- the defect lies in the low editing efficiency of homology-mediated repair.
- XTEN linker: a 16-amino-acid linker
- the second-generation base editor (BE2) system additionally fuses the base excision repair inhibitor UGI to dCas9, which increases the editing efficiency threefold, up to about 20%.
- BE3 creates a nick in the non-complementary DNA strand, and the cell uses the uracil (U)-containing DNA strand as a template for repair, thereby replicating this base editing.
- this BE3 system significantly improves the base editing efficiency, and its average indel (insertion-deletion) incidence is only 1.1%.
- CBE cytosine base editor
- the purpose of the present invention is to provide a modified cytosine base editor and its application.
- a method for improving the efficiency or fidelity of targeted editing of a cytosine base editor includes: modifying the cytosine deaminase in the cytosine base editor
- the cytosine deaminase includes APOBEC1 or its homologues, and the modification includes mutating the amino acids corresponding to Trp(W) at position 90 and Arg(R) at position 126 of APOBEC1, and connecting the cytosine base editor to a nuclear localization sequence.
- the cytosine base editor is the BE3 base editor system.
- the APOBEC1 homologue includes an enzyme selected from the group consisting of AID, APOBEC3G, APOBEC3A, CDA1.
- the mutation is a mutation of the amino acid of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 to Tyr(Y); and/or a mutation of the amino acid corresponding to Arg at position 126 to Glu(E).
- the N-terminal and/or C-terminal of the cytosine base editor is connected to a nuclear localization sequence. Preferably, it is at the C-terminus of UGI in the cytosine base editor or at the N-terminus of cytosine deaminase.
- a modified cytosine deaminase comprising APOBEC1 or a homologue thereof, the cytosine deaminase having mutations in the amino acids corresponding to Trp(W) at position 90 and Arg(R) at position 126 of APOBEC1.
- the APOBEC1 homologue includes an enzyme selected from the group consisting of AID, APOBEC3G, APOBEC3A, CDA1.
- the mutation is that the amino acid of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 is mutated to Tyr(Y); and/or the amino acid corresponding to Arg at position 126 is mutated to Glu(E).
- a cytosine base editor which comprises the modified cytosine deaminase.
- the cytosine base editor is also connected to the nuclear localization sequence; preferably, the N-terminal and/or C-terminal is connected to the nuclear localization sequence.
- the cytosine base editor and the nuclear localization sequence further include a linking sequence, such as a tag sequence (more specifically, a Flag tag).
- an isolated polynucleotide which encodes the modified cytosine deaminase or the cytosine base editor.
- the cytosine base editor has the nucleotide sequence shown in SEQ ID NO: 2.
- a recombinant expression vector which comprises the polynucleotide.
- a genetically engineered host cell which contains the vector or the polynucleotide integrated in the genome.
- the editor is BE3 base editor.
- the use of the cytosine base editor is provided for gene editing, reducing off-target effects, improving the efficiency of targeted editing, or improving the fidelity of targeted editing.
- a method for gene editing which includes mediating gene editing with the cytosine base editor.
- the nucleic acid sequence encoding the cytosine base editor and the sgRNA are co-injected into the recipient to perform gene editing.
- the recipient includes: somatic cells or germ cells.
- the germ cells include embryonic cells or fertilized eggs.
- a reagent or kit for gene editing which includes the modified cytosine deaminase; or the cytosine base editor.
- FIG. 1 Targeted editing efficiency of cytosine base editor (CBE) variants.
- a The predicted rAPOBEC1 structure in various rAPOBEC1 variants. The mutated residues are highlighted and marked on the structure.
- b Sequence alignment between hAPOBEC3G and rAPOBEC1. Letters, identical residues; +, conservative substitutions.
- the green triangles represent the residues in the hydrophobic active region of APOBEC3G, and the yellow stars represent the residues in the ssDNA binding region.
- c The crystal structure of APOBEC3G.
- FIG. 2 DNA and RNA off-target activity of CBE variants.
- b The mutation type distribution of Cre, BE3 and 4 CBE variant treatment groups.
- c Comparison of the total number of RNA off-target SNVs detected 36 hours after transfection. n = 3 replicates per group. The P value above each column was calculated against the GFP group using a two-tailed Student's t test.
- d Distribution of mutation types in GFP, BE3 and 4 CBE variant treatment groups.
- FIG. 3 Activity of BE3-FNLS or BE3-hA3A variants.
- a Targeting efficiency of BE3-FNLS or BE3-hA3A at each target. See Table 6 for target site related sequences and primer sequences.
- c Comparison of the total number of RNA off-target SNVs detected 36 h after transfection. 3 repetitions per group.
- FIG. 4 Targeted editing of BE3 and BE3 variants at different target sites.
- a Target editing efficiency and indels frequency of different versions of CBE variants at an additional 11 target sites.
- b Comparison of targeting efficiency of CBE variants.
- c Comparison of indel frequencies between CBE variants.
- d On-target efficiency of each target engineered BE3 variant.
- e Comparison of editing efficiency of CBE variants at each C of the target site.
- f Comparison of the editing efficiency in the window and the editing efficiency outside the window of the CBE variant. Edit window: 5-7 bases.
- n = 3 biological replicates per group.
- P values were calculated using a two-sided t test.
- the target site sequence and primer sequence are shown in Table 6.
- FIG. 5 Embryonic development rate of BE3 and BE3 variants.
- a Use sgRNA-D to detect the blastocyst rate of BE3 and BE3 variants.
- b Use other sgRNAs to detect the blastocyst rate of BE3-hA3A and BE3-FNLS.
- n = 3 biological replicates per group.
- FIG. 6 Targeted editing efficiency of CBE variants and editing efficiency of non-targeted SNVs.
- a Target editing efficiency of BE3 and CBE variants in WGS data.
- b Comparison of C-to-T and G-to-A conversions between the CBE variant treatment groups and the Cre or BE3 group. P values were calculated using a two-sided t test. *P < 0.05, **P < 0.01, ***P < 0.001.
- FIG. 7 Venn diagram of SNVs detected in each embryo through WGS data. a. SNVs identified in embryos treated with BE3 R126E. b. SNVs identified in embryos treated with BE3 R132E. c. SNVs identified in embryos treated with YE1-BE3. d. SNVs identified in embryos treated with FE1-BE3.
- Figure 8 Non-target SNVs characteristics of CBE variants.
- the inventor's analysis detected the overlap between the SNVs and the extra-target sites predicted by Cas-OFFinder and CRISPOR.
- FIG. 10 Detection of RNA off-target efficiency of CBE variants 72 hours after transfection.
- a Comparison of the total number of RNA non-targeted SNVs detected 72 hours after transfection.
- the P value of each group was calculated.
- b Distribution of mutation types in the GFP, BE3 and BE3 variant treatment groups.
- c Off-target efficiency of BE3 variant RNA 72 hours after transfection.
- FIG. 11 Target editing efficiency and off-target of BE3-FNLS.
- a Comparison of targeting efficiency of CBE variants.
- b Comparison of editing efficiency of CBE variants at each C of the target site.
- c SNVs found in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS.
- d The overlap between the SNVs detected in the inventors' analysis and the off-target sites predicted by Cas-OFFinder and CRISPOR.
- e Distribution of DNA non-targeted SNV mutation types in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS.
- f Distribution of DNA non-targeted SNV mutation types in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS.
- g RNA non-targeted SNV mutation types in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS.
- h. The RNA off-target rate of BE3 and BE3-FNLS 36 hours after transfection.
- n = 3 biological replicates per group.
- FIG. 12 Activity of BE3 and BE3 variants at indicated off-target sites.
- a sgRNA-dependent off-target effects of BE3 variants.
- b The editing frequency of BE3 variants at designated off-target sites.
- the P value was compared with the YE1-BE3-FNLS group using a two-sided t test. Compared with the YE1-BE3-FNLS group, a red star indicates an increase in the editing frequency, and a green star indicates a decrease in the editing frequency.
- Figure 13 Schematic diagram of YE1-BE3-FNLS plasmid.
- the inventors analyzed the DNA and RNA off-target effects of multiple CBE variants in depth, using Genome-wide Off-target analysis by Two-cell embryo Injection (GOTI) and RNA-Seq, respectively. On this basis, the cytosine base editor was modified, and its targeted editing efficiency and fidelity were significantly improved.
- the cytosine base editor includes cytosine deaminase.
- the cytosine deaminase includes APOBEC1 or a homologue thereof.
- the APOBEC1 homologues include enzymes that have the same or similar functions as APOBEC1, enzymes that have substantially the same or substantially similar domains as APOBEC1, or enzymes that come from a different species than APOBEC1 but play the same role in their respective species.
- the APOBEC1 homologues include but are not limited to enzymes selected from the group consisting of AID, APOBEC3G, APOBEC3A, CDA1.
- the present invention first provides a modified cytosine deaminase.
- the cytosine deaminase has mutations in the amino acids corresponding to Trp(W) at position 90 and Arg(R) at position 126 of APOBEC1, and the cytosine deaminase is linked to a nuclear localization sequence.
- the mutation is a mutation of the amino acid of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 to Tyr(Y); and/or a mutation of the amino acid corresponding to Arg at position 126 to Glu(E).
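The substitution notation used here (W90Y, R126E) can be illustrated with a minimal Python sketch. The helper name `apply_point_mutations` and the 150-residue toy sequence are hypothetical and chosen only for illustration; the toy sequence is not SEQ ID NO: 1, and positions are assumed to be 1-based as in the text.

```python
import re


def apply_point_mutations(protein: str, mutations: list[str]) -> str:
    """Apply substitutions written as <old residue><1-based position><new residue>."""
    residues = list(protein)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"cannot parse mutation {mutation!r}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        # Refuse to mutate if the residue at that position is not the expected one.
        if residues[position - 1] != old:
            raise ValueError(
                f"expected {old} at position {position}, found {residues[position - 1]}"
            )
        residues[position - 1] = new
    return "".join(residues)


# Hypothetical 150-residue stand-in (NOT SEQ ID NO: 1): alanine everywhere
# except W at position 90 and R at position 126.
toy_deaminase = "A" * 89 + "W" + "A" * 35 + "R" + "A" * 24
ye1_like = apply_point_mutations(toy_deaminase, ["W90Y", "R126E"])
print(ye1_like[89], ye1_like[125])
```

Checking the wild-type residue before substituting guards against numbering mismatches, which matter when the same positions are transferred to an APOBEC1 homologue through an alignment.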
- the cytosine deaminase and the nuclear localization sequence are also connected by a linking sequence, and the linking sequence may be any linking sequence that does not affect the functions of the two; for example, it is a tag sequence or a flexible linking sequence known in the art.
- Appropriate labels can be used in the present invention.
- the tag can be FLAG, HA, HA1, c-Myc, Poly-His, Poly-Arg, Strep-TagII, AU1, EE, T7, 4A6, ε, B, gE or Ty1.
- the modified cytosine deaminase (modified enzyme) of the present invention can be a recombinant protein, a natural protein, a synthetic protein, and a recombinant protein is preferred.
- the protein of the present invention can be a natural purified product, or a chemically synthesized product, or produced from a prokaryotic or eukaryotic host (for example, bacteria, yeast, higher plant, insect, and mammalian cells) using recombinant technology.
- the present invention also includes fragments, derivatives and analogs of the engineered enzymes.
- fragment refers to a protein that substantially retains the same biological function or activity as the engineered enzyme of the present invention.
- the protein fragment, derivative or analogue of the present invention may be (i) a protein in which one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) are substituted, and such substituted amino acid residues may or may not be encoded by the genetic code, or (ii) a protein with a substitution group in one or more amino acid residues, or (iii) a protein formed by fusing an additional amino acid sequence to the protein sequence (such as a leader sequence, a secretory sequence, a sequence used to purify the protein, a proprotein sequence, or a fusion protein). According to the definition herein, these fragments, derivatives and analogs fall within the scope well known to those skilled in the art.
- the term "engineered enzyme" also includes (but is not limited to): deletions, insertions and/or substitutions of several (usually 1-20, more preferably 1-10, still more preferably 1-8, 1-5, 1-3, or 1-2) amino acids, and the addition or deletion of one or several (usually within 20, preferably within 10, more preferably within 5) amino acids at the C-terminus and/or N-terminus.
- when amino acids are substituted with amino acids of similar properties, the function of the protein is usually not changed.
- adding one or several amino acids to the C-terminus and/or N-terminus usually does not change the function of the protein.
- the term also includes active fragments and active derivatives of engineered enzymes.
- the term "engineered enzyme" also includes (but is not limited to): derived proteins that retain the protein activity and have more than 80%, preferably more than 85%, more preferably more than 90%, and still more preferably 95% or more, such as 98% or more or 99% or more, sequence identity with the amino acid sequence of the modified enzyme.
- in these derived proteins, the amino acids corresponding to Trp(W) at position 90 and Arg(R) at position 126 of APOBEC1 carry the mutations, and the nuclear localization sequence is also connected.
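As an illustration of the sequence identity thresholds above, the following Python sketch computes percent identity over a pairwise alignment that is assumed to be already gapped to equal length. The function name and the example peptides are hypothetical, and the denominator used (all alignment columns except gap-versus-gap columns) is only one of several conventions in use; the text does not specify which applies.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns; '-' marks a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue  # a column that is a gap in both sequences carries no information
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


# Hypothetical 10-residue peptides that differ at one position.
identity = percent_identity("MKTAYIAKQR", "MKTAYIAKQK")
print(identity, identity > 80)
```

A derived protein would additionally have to retain activity and carry the position 90 and 126 mutations; identity alone does not establish that.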
- the present invention also provides a polynucleotide sequence encoding the engineered enzyme of the present invention or a conservative variant protein thereof.
- the polynucleotide of the present invention may be in the form of DNA or RNA.
- the form of DNA includes cDNA, genomic DNA or synthetic DNA.
- DNA can be single-stranded or double-stranded.
- DNA can be a coding strand or a non-coding strand.
- the polynucleotide encoding the mature protein of the mutant includes: the coding sequence of the mature protein only; the coding sequence of the mature protein and various additional coding sequences; or the coding sequence of the mature protein (and optional additional coding sequences) and non-coding sequences.
- polynucleotide encoding a protein may include a polynucleotide encoding the protein, or a polynucleotide that also includes additional coding and/or non-coding sequences.
- the full-length nucleotide sequence of the modified enzyme of the present invention or its fragments can usually be obtained by PCR amplification method, recombination method or artificial synthesis method.
- primers can be designed according to the relevant nucleotide sequence disclosed in the present invention, especially the open reading frame sequence, and a commercially available cDNA library, or a cDNA library prepared by a conventional method known to those skilled in the art, can be used as a template to amplify the relevant sequences. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then splice the amplified fragments together in the correct order.
- the recombination method can be used to obtain the relevant sequence in large quantities. This is usually done by cloning it into a vector, then transferring it into a cell, and then isolating the relevant sequence from the proliferated host cell by conventional methods.
- artificial synthesis methods can also be used to synthesize the relevant sequences, especially when the fragment length is short. Usually, multiple small fragments are synthesized first and then ligated to obtain fragments with very long sequences.
- the DNA sequence encoding the protein (or fragment or derivative thereof) of the present invention can be obtained completely through chemical synthesis. This DNA sequence can then be introduced into various existing DNA molecules (or such as vectors) and cells known in the art. In addition, mutations can also be introduced into the protein sequence of the present invention through chemical synthesis.
- the present invention also relates to a vector containing the polynucleotide of the present invention, a host cell produced by genetic engineering using the vector of the present invention or the modified enzyme coding sequence, and a method for producing the protein of the present invention through recombinant technology.
- the polynucleotide sequence of the present invention can be used to express or produce a recombinant engineered enzyme.
- the present invention also provides a cytosine base editor containing the modified enzyme or its polynucleotide sequence.
- the cytosine base editor is BE3 base editor.
- Other components of the cytosine base editor are known to those skilled in the art.
- the modified enzyme polynucleotide sequence or the cytosine base editor polynucleotide sequence can be inserted into a recombinant expression vector.
- recombinant expression vector refers to bacterial plasmids, bacteriophages, yeast plasmids, plant cell viruses, mammalian cell viruses or other vectors well known in the art. In short, any plasmid and vector can be used as long as it can replicate and stabilize in the host.
- An important feature of an expression vector is that it usually contains an origin of replication, a promoter, a marker gene, and translation control elements.
- methods well known to those skilled in the art can be used to construct an expression vector containing the modified enzyme polynucleotide sequence or the cytosine base editor polynucleotide sequence and appropriate transcription/translation control signals. These methods include in vitro recombinant DNA technology, DNA synthesis technology, and in vivo recombination technology.
- the DNA sequence can be effectively linked to an appropriate promoter in the expression vector to guide mRNA synthesis.
- the expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
- the expression vector preferably contains one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells.
- a vector containing the above-mentioned appropriate DNA sequence and an appropriate promoter or control sequence can be used to transform an appropriate host cell into a recipient cell.
- the present invention also provides a method for gene editing, which includes mediating gene editing with the cytosine base editor of the present invention.
- other gene editing reagents known in the art can be used.
- sgRNA can be designed in a manner known in the art.
- the target of gene editing is not particularly limited, and it may be a somatic cell or a germ cell, and it may be an animal cell or a human cell.
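The text leaves sgRNA design to methods known in the art. Purely as an illustrative sketch, the Python function below scans one strand of a DNA sequence for 20-nt protospacers followed by an NGG PAM and reports those with a C inside an editing window. The NGG PAM (typical for SpCas9), the 20-nt protospacer length, the position numbering from the PAM-distal end, and the function name are assumptions not stated in the text; the default window of positions 5-7 is taken from the FIG. 4 caption. The reverse strand is not scanned.

```python
def find_candidate_sites(dna, window=(5, 7), spacer_len=20):
    """Return (start, protospacer, pam, c_positions) for forward-strand NGG sites."""
    dna = dna.upper()
    sites = []
    # Each candidate needs spacer_len bases plus a 3-base PAM to fit in the sequence.
    for start in range(len(dna) - spacer_len - 2):
        protospacer = dna[start:start + spacer_len]
        pam = dna[start + spacer_len:start + spacer_len + 3]
        if pam[1:] != "GG":
            continue
        # 1-based protospacer positions inside the window that carry a cytosine.
        c_positions = [
            position
            for position in range(window[0], window[1] + 1)
            if protospacer[position - 1] == "C"
        ]
        if c_positions:
            sites.append((start, protospacer, pam, c_positions))
    return sites


# Hypothetical toy sequence: one protospacer with a C at position 5, then an AGG PAM.
toy_dna = "TT" + "ATTACATATTATTATTATTA" + "AGG" + "TT"
for site in find_candidate_sites(toy_dna):
    print(site)
```

A practical design would also consider the reverse strand, predicted off-target sites, and the sequence context around the target C.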
- the off-target effects of DNA and RNA limit its application in science and medicine.
- the inventors screened more than 20 reasonably designed CBE mutants in detail, and analyzed the off-target effects of DNA and RNA using GOTI and RNA-Seq, respectively.
- the predicted residue mutations at the DNA binding site significantly reduced off-target effects, in some cases to levels comparable to unedited controls.
- the CBE variant YE1-BE3-FNLS obtained in the present invention has very low off-target efficiency and bystander editing while maintaining extremely high targeted editing efficiency.
- the inventors not only identified multiple residues that can specifically affect RNA and DNA off-target activity and narrow the base editing window, but also introduced a CBE variant with high fidelity and high editing efficiency, thereby expanding the application of these powerful tools in the laboratory and in the clinic.
- the inventors screened dozens of rAPOBEC1 mutations based on the findings of multiple previous studies, and found that the BE3 R132E, YE1-BE3, and FE1-BE3 mutations significantly reduced DNA and RNA off-target effects while maintaining targeted editing activity. Interestingly, the inventors observed that the variants with reduced DNA/RNA off-target effects (BE3 R132E, YE1-BE3, FE1-BE3 and YE1-BE3-FNLS) also have narrower base editing windows. Rees et al. reported that bases located outside the active window but within the ssDNA R-loop region can still be edited, albeit with lower efficiency, especially if they are located in the favorable editing motif of rAPOBEC1. These observations may help explain these results.
- rAPOBEC1 and hAPOBEC3A are considered to have only one catalytic domain
- the inventors predicted the possible impact of the mutation introduced in the DNA binding motif.
- rAPOBEC1 may adopt different binding modes to adapt to ssDNA and RNA. This highlights the necessity for base editing researchers to evaluate the off-target effects of base editing on DNA and RNA.
- the inventors speculate that this heterogeneity of binding modes may help explain why some CBE variants discovered by the inventors retain high DNA off-target effects while showing significantly reduced RNA off-target effects (and vice versa).
- R132E affects the interaction of rAPOBEC1 with DNA and RNA
- R126E mainly affects its DNA binding ability
- Y130F mainly affects its RNA binding ability.
- the YE1-BE3-FNLS variant contains both the R126E mutation and a tyrosine substitution at the W90 residue in the hydrophobic region of rAPOBEC1
- this residue is considered to be involved in the binding of rAPOBEC1 to ssDNA/RNA.
- the inventors preliminarily speculate that the W90Y mutation helps to explain the high fidelity of YE1-BE3-FNLS, which may be due to a change in the rAPOBEC1-RNA interaction.
- a mixture of the mRNA of a gene editing tool such as Cas9/BE3 and the sgRNA was injected into one blastomere of a 2-cell stage embryo derived from a cross of a wild-type female mouse × an Ai9 male mouse.
- the action of Cre produces chimeric embryos, in which the injected cells are marked with tdTomato (red), a positive tdTomato indicates that editing has occurred, and a negative tdTomato indicates unedited.
- TdTomato positive cells and tdTomato negative cells were separated from chimeric embryos by FACS at E14.5 and used for WGS analysis respectively.
- Off-target SNV and indel were identified by comparing tdTomato+ cells and tdTomato- cells using three algorithms (Mutect2, Lofreq and Strelka for SNV analysis, and Mutect2, Scalpel and Strelka for indel analysis).
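The comparison described above can be sketched in Python as a set operation. The text names the three callers but does not state how their outputs are combined, so the sketch assumes that only variants reported by all three callers are kept, and that any variant also seen in the tdTomato- control is discarded. The function name and the toy calls are hypothetical.

```python
def consensus_variants(calls_by_caller, control_variants=()):
    """Keep variants reported by every caller and absent from the control sample.

    Each variant is a (chromosome, position, ref, alt) tuple.
    """
    call_sets = [set(calls) for calls in calls_by_caller.values()]
    if not call_sets:
        return set()
    shared = set.intersection(*call_sets)
    return shared - set(control_variants)


# Hypothetical calls for one tdTomato+ sample from the three SNV callers.
toy_calls = {
    "Mutect2": {("chr1", 1000, "C", "T"), ("chr2", 500, "G", "A"), ("chr3", 42, "A", "G")},
    "Lofreq": {("chr1", 1000, "C", "T"), ("chr2", 500, "G", "A")},
    "Strelka": {("chr1", 1000, "C", "T"), ("chr2", 500, "G", "A"), ("chr5", 7, "T", "C")},
}
# Hypothetical variant also present in the matched tdTomato- cells.
toy_control = {("chr2", 500, "G", "A")}

print(sorted(consensus_variants(toy_calls, toy_control)))
```

Requiring agreement between callers trades sensitivity for fewer caller-specific false positives; the same pattern would apply to indels with Mutect2, Scalpel and Strelka.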
- Heterozygous Ai9 (full name B6.Cg-Gt(ROSA)26Sortm9(CAG-td-Tomato)Hze/J; JAX strain 007909) male mice and female C57BL/6 mice (4 weeks old) were mated for embryo collection. ICR females were used as recipients. The use and care of animals followed the guidelines of the Biomedical Research Ethics Committee of the Shanghai Institute of Biology, Chinese Academy of Sciences.
- the wild-type APOBEC1 protein sequence is shown in SEQ ID NO:1:
- (1) Cytosine base editor 3 (BE3, rAPOBEC1-nCas9-UGI): comprises the Apobec1, Sp nCas9 and UGI enzymes. Apobec1 and Sp nCas9 are connected by a 16-amino-acid peptide (sequence SGSETPGTSESATPES), and Sp nCas9 and UGI are connected by a 4-amino-acid peptide (sequence SGGS). rAPOBEC1 is recombinant APOBEC1.
- BE3 R126E: the BE3 of (1), in which position 126 of the rAPOBEC1 sequence was changed from R to E.
- BE3 R128E: the BE3 of (1), in which position 128 of the rAPOBEC1 sequence was changed from R to E.
- BE3 R132E: the BE3 of (1), in which position 132 of the rAPOBEC1 sequence was changed from R to E.
- YE1-BE3: the BE3 of (1), in which W at position 90 of the rAPOBEC1 sequence was changed to Y (W90Y), and position 126 was changed from R to E (R126E).
- FE1-BE3: the BE3 of (1), in which W at position 90 of the rAPOBEC1 sequence was changed to F (W90F), and position 126 was changed from R to E (R126E).
- BE3-hA3A: a new BE3 editing tool in which Apobec1 of BE3 is replaced with human APOBEC3A.
- BE3-hA3A Y130F: BE3-hA3A in which Y130 of human APOBEC3A is mutated to F.
- BE3-FNLS: the BE3 of (1), in which the N-terminus of the rAPOBEC1 sequence is connected to a Flag tag and an NLS nuclear localization sequence (sequence: PKKKRKV), with a further NLS at the C-terminus of the base editor.
- YE1-BE3-FNLS: BE3-FNLS in which W at position 90 of the rAPOBEC1 sequence was changed to Y (W90Y), and position 126 was changed from R to E (R126E).
- the schematic diagram is shown in Figure 13.
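The domain order of BE3-FNLS described above can be sketched in Python as a concatenation of named segments. The linker and NLS sequences are those given in the text; the Flag sequence DYKDDDDK is the commonly used epitope and is an assumption here, as are the Flag-then-NLS order at the N-terminus, the absence of extra junction residues, and the short toy strings standing in for the rAPOBEC1, nCas9 and UGI domains.

```python
LINKER_16AA = "SGSETPGTSESATPES"  # rAPOBEC1-nCas9 linker, from the text
LINKER_4AA = "SGGS"               # nCas9-UGI linker, from the text
NLS = "PKKKRKV"                   # nuclear localization sequence, from the text
FLAG = "DYKDDDDK"                 # assumption: standard Flag epitope, not given in the text


def assemble(parts):
    """Concatenate (name, sequence) parts and record 1-based inclusive coordinates."""
    sequence = ""
    layout = []
    for name, part in parts:
        start = len(sequence) + 1
        sequence += part
        layout.append((name, start, len(sequence)))
    return sequence, layout


# Toy placeholders; the real domain sequences are not reproduced here.
toy_domains = {"rAPOBEC1": "M" + "A" * 9, "nCas9": "G" * 12, "UGI": "T" * 6}

fusion, layout = assemble([
    ("Flag", FLAG),
    ("NLS (N-terminal)", NLS),
    ("rAPOBEC1", toy_domains["rAPOBEC1"]),
    ("16-aa linker", LINKER_16AA),
    ("nCas9", toy_domains["nCas9"]),
    ("4-aa linker", LINKER_4AA),
    ("UGI", toy_domains["UGI"]),
    ("NLS (C-terminal)", NLS),
])
for name, start, end in layout:
    print(f"{name}: {start}-{end}")
```

Recording coordinates alongside the sequence makes it easy to confirm where positions such as 90 and 126 of the deaminase fall once N-terminal tags shift the numbering of the full fusion.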
- sequence of YE1-BE3 after mutation is (SEQ ID NO: 223):
- the mutant was inserted into the pCMV-BE3 plasmid to obtain the CBE mutant plasmid.
- the T7 promoter was added to the base editor coding region by PCR amplification of the plasmids (YE1-BE3, BE3-FNLS, YE1-BE3-FNLS, hA3A-BE3) using primers F and R (base editor IVT F and base editor IVT R, etc. in Table 2).
- the T7-sgRNA PCR product was purified and used as the template for IVT with the MEGAshortscript T7 kit (Life Technologies). The T7 promoter was added to the Cre in vitro transcription template by PCR amplification. The T7-Cre PCR product was purified and used as the in vitro transcription template with the mMESSAGE mMACHINE T7 ULTRA kit (Life Technologies). The Cas9 mRNA, Cre mRNA and sgRNAs were purified with the MEGAclear kit (Life Technologies) and eluted with RNase-free water.
- IVT: in vitro transcription. Tyr-C, Tyr-D and Tyr-F: sgRNAs targeting the Tyr gene, designated C, D and F, respectively.
- the gene editing steps are: after mixing the mRNA and sgRNA, using a microinjector, inject it into a blastomere of a mouse 2-cell stage embryo.
- the NEBuilder-HiFi-DNA assembly master mix (New England Biolabs) was used for site-directed mutagenesis of BE3. Briefly, the inventors used a primer containing the desired point mutation to amplify a suitable vector plasmid by PCR.
- pCMV-BE3 variants-polyA-pCMV-mCherry-polyA is assembled by NEBuilder-HiFi DNA, and the PCR-amplified pCMV-mCherry polyA is combined with the digested pCMV-BE3 variant backbone.
- the PCR amplified U6-sgRNA was combined with the digested pCMV-EGFP-polyA backbone, and then assembled by NEBuilder-HiFi-DNA to obtain pCMV-EGFP-polyA-U6-sgRNA.
- HEK293T cells were cultured in DMEM containing 10% fetal bovine serum (FBS) in a humidified incubator at 37°C with 5% CO2.
- the pCMV-BE3 (WT/BE3 variant)-polyA-pCMV-mCherry-polyA and pCMV-EGFP-polyA-U6-sgRNA expression plasmids were co-transfected with Lipofectamine 3000 (ThermoFisher Scientific) according to the instructions. 36 or 72 hours after transfection, the cells were washed with phosphate buffered saline (PBS) and trypsinized with 0.05% trypsin-EDTA. The cell suspension was filtered through a 40 μm cell strainer, and EGFP/mCherry positive cells were separated by flow cytometry.
- RNAs were fragmented and converted into cDNA using random hexamers or oligo(dT) primers. The 5' and 3' ends of the cDNA were ligated to adaptors, and correctly ligated cDNA fragments were enriched and amplified by PCR. The library concentration was determined with a bioanalyzer, and sequencing was performed on the Illumina HiSeq platform.
- the transfected cells were taken, and EGFP+/mCherry+ cells were sorted by FACS. According to the instructions, use Tiangen DNA Extraction Kit (TIANGEN) to extract genomic DNA.
- the gene-specific primers on both sides of the target sequence (Tables 3 and 4) were used to amplify the target genomic site by PCR.
- ExTaq (TAKARA) was activated at 95°C for 3 minutes, followed by 34 cycles of PCR (95°C for 30 seconds, 62°C for 30 seconds, 72°C for 1 minute) and a final extension at 72°C for 5 minutes. The DNA amplicons were purified using the Universal DNA Purification Kit (TIANGEN) according to the instructions, ligated to adapters, and sequenced on the Illumina HiSeq X Ten platform.
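How per-cytosine editing efficiency might be derived from such amplicon reads is sketched below in Python. The sketch assumes reads that are already aligned to the protospacer without indels and on the same strand, counts a C as edited when the read shows T at that position, and uses the 5-7 editing window named in the FIG. 4 caption. The function names, the reference protospacer and the four toy reads are hypothetical.

```python
def introduce_c_to_t(reference, positions):
    """Return a copy of the reference with C-to-T changes at 1-based positions."""
    bases = list(reference)
    for position in positions:
        if bases[position - 1] != "C":
            raise ValueError(f"position {position} is not a C")
        bases[position - 1] = "T"
    return "".join(bases)


def c_to_t_efficiency(reference, reads):
    """Fraction of reads showing T at each reference C, keyed by 1-based position."""
    if not reads:
        raise ValueError("no reads supplied")
    if any(len(read) != len(reference) for read in reads):
        raise ValueError("reads must be pre-aligned to the reference length")
    efficiency = {}
    for index, base in enumerate(reference):
        if base == "C":
            converted = sum(1 for read in reads if read[index] == "T")
            efficiency[index + 1] = converted / len(reads)
    return efficiency


def split_by_window(efficiency, window=(5, 7)):
    """Separate per-position efficiencies into inside and outside the window."""
    inside = {p: e for p, e in efficiency.items() if window[0] <= p <= window[1]}
    outside = {p: e for p, e in efficiency.items() if p not in inside}
    return inside, outside


# Hypothetical protospacer with cytosines at positions 5, 7 and 17.
toy_reference = "ATTACACTTTATTATTCTTA"
toy_reads = [
    toy_reference,                            # unedited read
    introduce_c_to_t(toy_reference, [5]),
    introduce_c_to_t(toy_reference, [5, 7]),
    introduce_c_to_t(toy_reference, [5]),
]
inside, outside = split_by_window(c_to_t_efficiency(toy_reference, toy_reads))
print(inside, outside)
```

Real amplicon data would first need alignment, quality filtering and indel handling, none of which is covered by this sketch.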
- the prepared tissue was digested in 5 ml of trypsin-EDTA (0.05%) at 37°C for 30 minutes, and the digestion was stopped by adding 5 ml of DMEM medium containing 10% fetal bovine serum (FBS). The fetal tissue was then homogenized by pipetting 30-40 times with a 1 ml pipette. The cell suspension was centrifuged for 6 min (800 rpm), and the pellet was resuspended in DMEM medium containing 10% FBS. Finally, the cell suspension was filtered through a 40 μm cell strainer, and the tdTomato+/tdTomato- cells were separated with a flow cytometer. A second round of flow cytometry and fluorescence microscope analysis showed that the purity of the samples was greater than 95%.
- Genomic DNA was extracted from the cells using the DNeasy Blood and Tissue Kit (Cat. No. 69504, Qiagen) according to the instructions.
- Whole-genome sequencing was performed on the Illumina HiSeq X Ten platform at an average coverage of 50×.
- BWA v0.7.12
- mm10 reference genome
- Picard tool v2.3.0
- The inventors called single-nucleotide variants using Mutect2 (v3.5), Lofreq (v2.1.2) and Strelka (v2.7.1) with default parameters.
- For RNA-seq data analysis, FastQC (v0.11.3) and Trimmomatic (v0.36) were used for quality control. Qualified reads were mapped to the reference genome (integrated GRCh38) with STAR (v2.5.2b) in 2-pass mode with default parameters. The Picard tool (v2.3.0) was then used to sort the mapped BAM files and mark duplicates.
- The refined BAM files were then processed by splitting reads that span splice junctions, local realignment, base recalibration and variant calling, using the SplitNCigarReads, IndelRealigner, BaseRecalibrator and HaplotypeCaller tools in GATK (v3.5).
- PS protein structure prediction server predicts the structure of rAPOBEC1.
- the crystal structure of APOBEC3G was downloaded from PDB (http://www.rcsb.org/3d-view/3IQS) and presented using PyMOL (v2.3.2).
- The present invention uses R version 3.5.1 (http://www.R-project.org/) for statistical analysis. All tests were two-sided, and P < 0.05 was considered significant.
- The variants include deletions or mutations in the leucine-rich N-terminal or C-terminal regions of rAPOBEC1 (Del32, R33A, K34A, Del34, Del77, Del116, Del169, Del182, P190A and P191A) and mutations at the predicted catalytic active sites of rAPOBEC1 (H61A, H61R, V62A, E63A, E63Q, C93S, C96S).
- the inventor predicted that the R126 site of rAPOBEC1 (corresponding to the R320 site of APOBEC3G) interacts with the phosphate backbone of ssDNA ( Figure 1b, c), and the R126E mutant can retain target editing activity.
- R128 and R132 are close to R126, and the inventors also introduced R128E and R132E mutations ( Figure 1a-c).
- the inventors also studied the effect of a combination of point mutations (W90A, W90F, W90Y) at the active site of the hydrophobic domain of rAPOBEC1, and these mutations can reduce the width of the base editing window.
- The inventors transfected HEK293T cells with the CBE mutant plasmids to analyze their editing activity and off-target effects.
- Of the 23 mutants screened, 7 retained on-target activity: R33A, K34A, V62A, W90F+R126E, W90Y+R126E, R126E and R132E. Four of these (W90F+R126E, W90Y+R126E, R126E, R132E) did not increase the indel frequency (Figure 1d, Figure 4 and Table 5).
- These experiments show that the editing window of the W90F+R126E, W90Y+R126E and R132E variants becomes narrower (Figure 4).
- In embryos treated with BE3 R126E, the number of off-target DNA SNVs was reduced from 283 ± 32 to 28 ± 6.
- The number of off-target DNA SNVs in embryos treated with BE3 R132E was 47 ± 8.
- The number of off-target DNA SNVs in embryos treated with YE1-BE3 was 12 ± 2.
- The number of off-target DNA SNVs with FE1-BE3 treatment was 27 ± 19.
- RNA-seq was used to evaluate the off-target effects of these variants on the transcriptome of transfected HEK293T cells.
- Three variants, BE3 R132E, YE1-BE3 and FE1-BE3, showed significantly reduced RNA off-target editing 36 h after transfection (Figures 2c and 2d).
- RNA off-target editing by the BE3 R126E variant was not reduced 36 h after transfection, but it was significantly reduced 72 h after transfection.
- BE3-hA3A Figure 3a and Figure 11
- BE3 hA3AY130F
- In this BE3 variant, Y130 of human APOBEC3A (hA3A) is mutated to F. This mutation was observed to significantly reduce the number of off-target SNVs.
- the inventors used GOTI to analyze the off-target effects of BE3-hA3A, but found that BE3-hA3A is obviously toxic to embryos ( Figure 5).
- the inventors constructed a high-fidelity variant YE1-BE on the basis of the BE3-FNLS editor.
- a nuclear localization signal peptide was added to the C-terminus and N-terminus of the variant.
- the sequence was optimized for codons expressed in human cells.
- the codon-optimized DNA sequence is (SEQ ID NO: 2):
- In addition to this new YE1-BE3-FNLS variant, the inventors tested the on-target editing efficiency and bystander editing of BE3, YE1-BE3, BE3-hA3A, BE3-hA3A Y130F and BE3-FNLS at 21 target sites in HEK293T cells. YE1-BE3-FNLS had the highest targeting efficiency, 70.7 ± 5.2% (Figure 3d). Notably, YE1-BE3-FNLS had the lowest indel level among the tested variants, 0.8 ± 0.2%, and also the lowest level of other bystander edits, 0.6 ± 0.4% (Figure 3d-e).
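The statistics item above specifies two-sided tests with P < 0.05 as the significance threshold, and the figure legends name the two-tailed Student's t-test. The source ran its analysis in R 3.5.1; the Python sketch below only illustrates the calculation. The function names and replicate values are hypothetical, and the p-value is obtained by plain numerical integration of the t density rather than from a statistics library.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=20000):
    """Two-sided p-value: 1 minus twice the density area over [0, |t|].

    The area is computed with Simpson's rule; steps must be even.
    """
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1 - 2 * area)


def student_t_test(a, b):
    """Pooled-variance (Student's) t-test for two independent samples."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df, two_sided_p(t, df)


# Hypothetical replicate values for illustration; not data from the source.
control = [10.0, 12.0, 11.0]
treated = [30.0, 33.0, 27.0]
t, df, p = student_t_test(treated, control)
print(f"t = {t:.2f}, df = {df}, significant at 0.05: {p < 0.05}")
```

The pooled-variance form assumes equal variances in the two groups; whether the source used this form or Welch's correction is not stated.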
Abstract
Description
The present invention belongs to the field of biotechnology. More specifically, it relates to a modified cytosine base editor and its application.
Base editing has been widely used to make targeted base changes and has great potential for correcting disease-causing mutations.
Gene editing methods mediated by CRISPR/Cas and base editors have been developed and hold great promise for treating genetic diseases caused by pathogenic mutations. Clinical application of CRISPR/Cas gene editing or base editing requires comprehensive analysis of off-target effects to reduce the risk of harmful mutations. A variety of methods have been developed in the field to detect genome-wide off-target activity in gene-edited cells, including high-throughput genome-wide translocation sequencing (HTGTS), genome-wide unbiased identification of double-strand breaks enabled by sequencing (GUIDE-seq), and circularization for in vitro reporting of cleavage effects by sequencing (CIRCLE-seq). However, none of these methods can effectively detect single-nucleotide variants (SNVs). To date, the field has had no effective method for detecting SNVs.
In addition, a drawback of CRISPR/Cas in application is the low editing efficiency of homology-directed repair. Researchers linked the cytidine deaminase APOBEC1 to dCas9 with a 16-amino-acid XTEN linker to construct the first-generation base editor (BE1). To increase editing efficiency in vivo, the second-generation base editor (BE2) system not only links the cytidine deaminase to dCas9 but also fuses the base excision repair inhibitor UGI to dCas9, which raises editing efficiency threefold, to a maximum of about 20%.
To improve base editing efficiency further, researchers replaced dCas9 with Cas9n to mimic mismatch repair, thereby constructing the third-generation base editor (BE3). BE3 nicks the non-edited DNA strand, and the cell uses the uracil (U)-containing strand as the template for repair, thereby copying the base edit. Across multiple target genes in human cell lines, the BE3 system significantly increased base editing efficiency, with an average indel (insertion-deletion) rate of only 1.1%. For the target genes tested, these figures are a large improvement over Cas9-mediated HDR, for which the average editing frequency was only 0.5% and more indels were observed than with single-base editing. CRISPR base edits persisted through multiple cell divisions, indicating that the method produces stable base edits. However, the BE3 system is also subject to off-target editing.
However, previous studies have shown that cytosine base editors (CBEs) have DNA and RNA off-target effects. Reducing unwanted DNA and RNA off-target effects is important for both scientific research and therapeutic applications.
Summary of the invention
The purpose of the present invention is to provide a modified cytosine base editor and its application.
In the first aspect of the present invention, a method for improving the targeted editing efficiency or fidelity of a cytosine base editor is provided, comprising: modifying the cytosine deaminase in the cytosine base editor, wherein the cytosine deaminase includes APOBEC1 or a homologue thereof, and the modification includes mutating the amino acids of the cytosine deaminase corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1, and linking the cytosine base editor to a nuclear localization sequence.
In a preferred example, the cytosine base editor is the BE3 gene editor system.
In another preferred example, the APOBEC1 homologue includes an enzyme selected from the group consisting of AID, APOBEC3G, APOBEC3A and CDA1.
In another preferred example, the mutation changes the residue of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 to Tyr (Y); and/or changes Arg at position 126 to Glu (E).
In another preferred example, a nuclear localization sequence is linked to the N-terminus and/or C-terminus of the cytosine base editor, preferably at the C-terminus of UGI in the cytosine base editor or at the N-terminus of the cytosine deaminase.
In another aspect of the present invention, a modified cytosine deaminase is provided. The cytosine deaminase includes APOBEC1 or a homologue thereof, and the amino acids of the cytosine deaminase corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1 are mutated.
In another preferred example, the APOBEC1 homologue includes an enzyme selected from the group consisting of AID, APOBEC3G, APOBEC3A and CDA1.
In another preferred example, the mutation is that the residue of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 is mutated to Tyr (Y); and/or Arg at position 126 is mutated to Glu (E).
In another aspect of the present invention, a cytosine base editor is provided, which comprises the modified cytosine deaminase.
In another preferred example, the cytosine base editor is also linked to a nuclear localization sequence; preferably, its N-terminus and/or C-terminus is linked to the nuclear localization sequence.
In another preferred example, a linking sequence, such as a tag sequence (more specifically, a Flag tag), is further included between the cytosine base editor and the nuclear localization sequence.
In another aspect of the present invention, an isolated polynucleotide is provided, which encodes the modified cytosine deaminase or the cytosine base editor.
In another preferred example, the cytosine base editor has the nucleotide sequence shown in SEQ ID NO: 2.
In another aspect of the present invention, a recombinant expression vector is provided, which comprises the polynucleotide.
In another aspect of the present invention, a genetically engineered host cell is provided, which contains the vector or has the polynucleotide integrated into its genome.
In another preferred example, the editor is the BE3 base editor.
In another aspect of the invention, use of the cytosine base editor is provided for performing gene editing, reducing off-target effects, improving targeted editing efficiency, or improving the fidelity of targeted editing.
In another aspect of the invention, a method for gene editing is provided, which includes mediating gene editing with the cytosine base editor.
In another preferred example, the nucleic acid sequence encoding the cytosine base editor and the sgRNA are co-injected into a recipient to perform gene editing.
In another preferred example, the recipient includes somatic cells or germ cells.
In another preferred example, the germ cells include embryonic cells or fertilized eggs.
In another aspect of the invention, a reagent or kit for gene editing is provided, which includes the modified cytosine deaminase or the cytosine base editor.
Other aspects of the present invention will be apparent to those skilled in the art from the disclosure herein.
Figure 1. On-target editing efficiency of cytosine base editor (CBE) variants. a. Predicted rAPOBEC1 structure for the various rAPOBEC1 variants. The mutated residues are highlighted and labeled on the structure. b. Sequence alignment between hAPOBEC3G and rAPOBEC1. Amino acid letters, identical residues; +, common substitutions. Green triangles mark residues in the hydrophobic active region of APOBEC3G, and yellow stars mark residues in the ssDNA-binding region. c. Crystal structure of APOBEC3G. d. On-target editing efficiency and indel frequency of different CBE variants; n = 3 biological replicates per group. Green and yellow indicate residues in helix and loop structures, respectively. Purple triangles mark the variants subsequently tested for off-target effects.
Figure 2. DNA and RNA off-target activity of CBE variants. a. Comparison of the total number of off-target SNVs detected. Cre group n=2, BE3 group n=6, BE3 R126E group n=10, BE3 R132E group n=3, YE1-BE3 group n=8, FE1-BE3 group n=3. The P values shown on the graph were calculated with a two-tailed Student's t-test against the Cre group. b. Distribution of mutation types in the Cre, BE3 and 4 CBE variant treatment groups. c. Comparison of the total number of RNA off-target SNVs detected 36 hours after transfection; 3 replicates per group. The P values above each bar were calculated with a two-tailed Student's t-test against the GFP group. d. Distribution of mutation types in the GFP, BE3 and 4 CBE variant treatment groups.
Figure 3. Activity of BE3-FNLS and BE3-hA3A variants. a. Targeting efficiency of BE3-FNLS or BE3-hA3A at each target site. Target-site sequences and primer sequences are given in Table 6. b. Comparison of the total number of off-target SNVs detected. Cre group n=2, BE3 group n=6, YE1-BE3 group n=5, BE3-hA3A Y130F group n=3, YE1-BE3-FNLS group n=3. P values were calculated for each group against the Cre group. c. Comparison of the total number of RNA off-target SNVs detected 36 h after transfection; 3 replicates per group. P values were calculated for each group against the GFP group. d. Comparison of editing efficiency inside and outside the editing window (positions 5-7). e. Distribution of indels for each variant at each site. P values were calculated with a two-tailed Student's t-test.
Figure 4. On-target editing by BE3 and BE3 variants at different target sites. a. On-target editing efficiency and indel frequency of different CBE variants at 11 additional target sites. b. Comparison of the targeting efficiency of CBE variants. c. Comparison of indel frequencies between CBE variants. d. On-target efficiency of the engineered BE3 variants at each target site. e. Comparison of the editing efficiency of CBE variants at each C in the target site. f. Comparison of the editing efficiency of CBE variants inside and outside the editing window (positions 5-7). n = 3 biological replicates per group. P values were calculated with a two-sided t-test. Target-site sequences and primer sequences are given in Table 6.
Figure 5. Embryonic development rates with BE3 and BE3 variants. a. Blastocyst rates for BE3 and BE3 variants with sgRNA-D. b. Blastocyst rates for BE3-hA3A and BE3-FNLS with other sgRNAs. n = 3 biological replicates per group.
Figure 6. On-target editing efficiency of CBE variants and editing at off-target SNVs. a. On-target editing efficiency of BE3 and CBE variants in the WGS data. b. Comparison of C-to-T and G-to-A conversions between the CBE variant treatment groups and the Cre or BE3 group. P values were calculated with a two-sided t-test. *P<0.05, **P<0.01, ***P<0.001.
Figure 7. Venn diagrams of the SNVs detected in each embryo from WGS data. a. SNVs identified in embryos treated with BE3 R126E. b. SNVs identified in embryos treated with BE3 R132E. c. SNVs identified in embryos treated with YE1-BE3. d. SNVs identified in embryos treated with FE1-BE3.
Figure 8. Characteristics of the off-target SNVs of CBE variants. Overlap between the SNVs detected in the inventors' analysis and the off-target sites predicted by Cas-OFFinder and CRISPOR.
Figure 9. RNA off-target editing efficiency of BE3 variants 36 hours after transfection.
Figure 10. RNA off-target editing efficiency of CBE variants 72 hours after transfection. a. Comparison of the total number of RNA off-target SNVs detected 72 hours after transfection. GFP group n=6, BE3 group n=9, BE3 R126E group n=7, YE1-BE3 group n=2. P values were calculated for each group with Student's t-test against the GFP group. b. Distribution of mutation types in the GFP, BE3 and BE3 variant treatment groups. c. RNA off-target editing efficiency of BE3 variants 72 hours after transfection.
Figure 11. On-target editing efficiency and off-target effects of BE3-FNLS. a. Comparison of the targeting efficiency of CBE variants. b. Comparison of the editing efficiency of CBE variants at each C in the target site. c. SNVs found in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS. d. Overlap between the SNVs detected in the inventors' analysis and the off-target sites predicted by Cas-OFFinder and CRISPOR. e. Distribution of mutation types among DNA off-target SNVs in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS. f. Distribution of mutation types among RNA off-target SNVs in embryos treated with BE3-hA3A Y130F and YE1-BE3-FNLS. g. Expression level of APOBEC1 in BE3 and BE3-FNLS. h. RNA off-target editing rates of BE3 and BE3-FNLS 36 hours after transfection. n = 3 biological replicates per group. P values were calculated with a two-sided t-test.
Figure 12. Activity of BE3 and BE3 variants at the indicated off-target sites. a. sgRNA-dependent off-target effects of BE3 variants. b. Editing frequency of BE3 variants at the indicated off-target sites. P values were calculated with a two-sided t-test against the YE1-BE3-FNLS group. Relative to the YE1-BE3-FNLS group, a red star indicates a higher editing frequency and a green star a lower one. *P<0.05, **P<0.01, ***P<0.001. Plasmids expressing BE3, BE3 R126E, BE3 R132E, YE1-BE3, FE1-BE3, BE3-hA3A, BE3-hA3A Y130F, BE3-FNLS, YE1-BE3-FNLS and sgRNAs were transfected into HEK293T cells with Lipofectamine 3000. Three days after transfection, genomic DNA was extracted and amplified by PCR, and high-throughput DNA sequencing was used to analyze editing efficiency at the target sites and at the top ten predicted off-target sites of these sgRNAs. Target-site sequences and primer sequences are given in Table 4. Each cell shows the percentage of total reads carrying a C-to-T edit. n = 3 biological replicates per group.
Figure 13. Schematic diagram of the YE1-BE3-FNLS plasmid.
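The legends of Figures 3, 4 and 12 report editing as the percentage of total reads carrying a C-to-T edit, and compare editing inside the editing window (positions 5-7) with editing outside it. The following Python sketch shows one way such a split could be computed. It is only an illustration: the function names, the target sequence and the read counts are hypothetical, and numbering positions from 1 at the first base of the target sequence is an assumption, since the numbering convention is not stated here.

```python
EDIT_WINDOW = range(5, 8)  # positions 5-7, as given in the figure legends


def percent_edited(edited_reads, total_reads):
    """Share of reads carrying a C-to-T conversion, in percent."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return 100.0 * edited_reads / total_reads


def split_by_window(target, edited_by_pos, total_reads, window=EDIT_WINDOW):
    """Return per-position editing percentages for every C in the target.

    target: target sequence; position 1 is its first base (assumed numbering).
    edited_by_pos: {1-based position: reads with C-to-T at that position}.
    Returns two dicts: positions inside the window and positions outside it.
    """
    inside, outside = {}, {}
    for pos, base in enumerate(target.upper(), start=1):
        if base != "C":
            continue
        pct = percent_edited(edited_by_pos.get(pos, 0), total_reads)
        (inside if pos in window else outside)[pos] = pct
    return inside, outside


# Hypothetical target sequence and read counts, for illustration only.
target = "GACCCATCGGTACGTTAGCA"
counts = {3: 40, 4: 120, 5: 600, 8: 30}
inside, outside = split_by_window(target, counts, total_reads=1000)
print("in window:", inside)
print("out of window:", outside)
```

A narrower editing window, as described for some variants, would show up here as lower percentages in the second dictionary relative to the first.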
In the present invention, the inventors analyzed the DNA and RNA off-target effects of multiple CBE variants using Genome-wide Off-target analysis by Two-cell embryo Injection (GOTI) and RNA-seq. Based on in-depth analysis, they modified the cytosine base editor, significantly improving its targeted editing efficiency and fidelity.
The cytosine base editor includes a cytosine deaminase. The cytosine deaminase includes APOBEC1 or a homologue thereof. APOBEC1 homologues include enzymes that perform the same or a similar function as APOBEC1, enzymes that have substantially the same or substantially similar domains as APOBEC1, and enzymes that come from a different species than APOBEC1 but play the same role in their own species. For example, the APOBEC1 homologues include, but are not limited to, enzymes selected from the group consisting of AID, APOBEC3G, APOBEC3A and CDA1.
The present invention first provides a modified cytosine deaminase in which the amino acids corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1 are mutated, and which is linked to a nuclear localization sequence. Preferably, the residue of the cytosine deaminase corresponding to Trp at position 90 of APOBEC1 is mutated to Tyr (Y); and/or Arg at position 126 is mutated to Glu (E).
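The substitutions named above (Trp at position 90 to Tyr, Arg at position 126 to Glu, written W90Y and R126E elsewhere in this document) follow the usual notation of original residue, 1-based position and replacement residue. The Python sketch below parses such labels and applies them to a sequence, checking that the expected original residue is actually present. The function names are hypothetical, and the example uses a made-up toy sequence rather than the rAPOBEC1 sequence; deletion labels such as Del32 are not handled.

```python
import re

# One-letter residue, 1-based position, one-letter replacement, e.g. "R126E".
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label):
    """Split a label such as 'R126E' into ('R', 126, 'E')."""
    match = SUBSTITUTION.match(label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_substitutions(sequence, labels):
    """Return a copy of the protein sequence with each substitution applied."""
    residues = list(sequence)
    for label in labels:
        ref, pos, alt = parse_substitution(label)
        if not 1 <= pos <= len(residues):
            raise ValueError(f"{label}: position outside the sequence")
        if residues[pos - 1] != ref:
            raise ValueError(
                f"{label}: expected {ref} at position {pos}, "
                f"found {residues[pos - 1]}"
            )
        residues[pos - 1] = alt
    return "".join(residues)


# Toy 10-residue sequence (hypothetical), with W at position 3 and R at 9.
toy = "MKWARDLQRE"
print(apply_substitutions(toy, ["W3Y", "R9E"]))
```

Checking the original residue before replacing it guards against applying a label to the wrong sequence or to a homologue with different numbering.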
In a preferred mode of the present invention, the cytosine deaminase and the nuclear localization sequence are further joined through a linking sequence. The linking sequence may be any linking sequence that does not affect the function of either component, for example a tag sequence or a flexible linker known in the art. Any suitable tag can be used in the present invention. For example, the tag can be FLAG, HA, HA1, c-Myc, Poly-His, Poly-Arg, Strep-TagII, AU1, EE, T7, 4A6, ε, B, gE or Ty1.
The modified cytosine deaminase (modified enzyme) of the present invention can be a recombinant protein, a natural protein or a synthetic protein, preferably a recombinant protein. The protein of the present invention can be a naturally purified product, a chemically synthesized product, or a product obtained from a prokaryotic or eukaryotic host (for example, bacteria, yeast, higher plant, insect and mammalian cells) using recombinant technology.
The present invention also includes fragments, derivatives and analogs of the modified enzyme. As used herein, the terms "fragment", "derivative" and "analog" refer to a protein that substantially retains the same biological function or activity as the modified enzyme of the present invention. The protein fragment, derivative or analog of the present invention may be (i) a protein in which one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) are substituted, where the substituted amino acid residues may or may not be encoded by the genetic code, or (ii) a protein with a substituent group on one or more amino acid residues, or (iii) a protein formed by fusing an additional amino acid sequence to the protein sequence (such as a leader sequence, a secretory sequence, a sequence used to purify the protein, a proprotein sequence, or a fusion protein). According to the definitions herein, these fragments, derivatives and analogs are within the scope known to those skilled in the art. However, the amino acid sequences of the modified enzyme and of its fragments, derivatives and analogs must contain the conserved mutations of the present invention described above; that is, the amino acids corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1 carry the mutations, and the nuclear localization sequence is also attached.
In the present invention, the term "modified enzyme" also includes (but is not limited to) variants with deletion, insertion and/or substitution of several (usually 1-20, preferably 1-10, more preferably 1-8, 1-5, 1-3 or 1-2) amino acids, and variants with one or several (usually within 20, preferably within 10, more preferably within 5) amino acids added to or deleted from the C-terminus and/or N-terminus. For example, in the art, substitution with amino acids of similar properties usually does not change the function of a protein. As another example, adding one or several amino acids to the C-terminus and/or N-terminus usually does not change the function of a protein either. The term also includes active fragments and active derivatives of the modified enzyme. However, these variant forms must contain the conserved mutations of the present invention described above; that is, the amino acids corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1 carry the mutations, and the nuclear localization sequence is also attached.
In the present invention, the term "modified enzyme" also includes (but is not limited to) derived proteins that retain the protein activity and have at least 80%, preferably at least 85%, more preferably at least 90%, still more preferably at least 95%, such as at least 98% or at least 99%, sequence identity with the amino acid sequence of the modified enzyme. Likewise, these derived proteins must contain the conserved mutations of the present invention described above; that is, the amino acids corresponding to Trp (W) at position 90 and Arg (R) at position 126 of APOBEC1 carry the mutations, and the nuclear localization sequence is also attached.
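The paragraph above combines two conditions: a minimum sequence identity (80% and upward) and retention of the required mutated residues. A minimal Python sketch of such a check follows. It assumes the two sequences have already been aligned to equal length with "-" for gaps, and it defines identity as matching columns divided by aligned columns; the source does not specify how identity is calculated, so that definition, the function names and the toy sequences are all assumptions.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over two pre-aligned sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns where both sequences have a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def qualifies(aligned_ref, aligned_candidate, required, threshold=80.0):
    """Check identity and retention of required residues.

    required: {1-based position in the ungapped reference: residue that the
    candidate must carry at the aligned position}.
    """
    ref_pos = 0
    found = {}
    for ref_res, cand_res in zip(aligned_ref, aligned_candidate):
        if ref_res == "-":
            continue  # insertion in the candidate; no reference position
        ref_pos += 1
        if ref_pos in required:
            found[ref_pos] = cand_res
    keeps_mutations = all(found.get(p) == r for p, r in required.items())
    identity = percent_identity(aligned_ref, aligned_candidate)
    return keeps_mutations and identity >= threshold


# Toy sequences (hypothetical); positions 3 and 9 stand in for 90 and 126.
engineered = "MKYARDLQEE"
required = {3: "Y", 9: "E"}
print(qualifies(engineered, "MKYAKDLQEE", required))  # one unrelated change
print(qualifies(engineered, "MKWARDLQEE", required))  # required residue lost
```

The second candidate is 90% identical but fails because the residue at the first required position has reverted, which mirrors the requirement that the mutations be present in every variant form.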
The present invention also provides a polynucleotide sequence encoding the modified enzyme of the present invention or a conservative variant protein thereof.
The polynucleotide of the present invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA and synthetic DNA. The DNA may be single-stranded or double-stranded, and may be the coding strand or the non-coding strand.
The polynucleotide encoding the mature protein of the mutant includes: the coding sequence of the mature protein alone; the coding sequence of the mature protein together with various additional coding sequences; and the coding sequence of the mature protein (and optional additional coding sequences) together with non-coding sequences.
A "polynucleotide encoding a protein" may be a polynucleotide that includes only the sequence encoding the protein, or a polynucleotide that also includes additional coding and/or non-coding sequences.
本发明的经改造的酶核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据本发明所公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。The full-length nucleotide sequence of the modified enzyme of the present invention or its fragments can usually be obtained by PCR amplification method, recombination method or artificial synthesis method. For the PCR amplification method, primers can be designed according to the relevant nucleotide sequence disclosed in the present invention, especially the open reading frame sequence, and a commercially available cDNA library or a cDNA prepared by a conventional method known to those skilled in the art can be used. The library is used as a template to amplify the relevant sequences. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then splice the amplified fragments together in the correct order.
Once the relevant sequence has been obtained, it can be produced in large quantities by recombination. This is usually done by cloning it into a vector, transferring the vector into cells, and then isolating the relevant sequence from the propagated host cells by conventional methods.
In addition, the relevant sequence can be produced by chemical synthesis, especially when the fragment is short. Long sequences are usually obtained by first synthesizing several small fragments and then ligating them. At present, a DNA sequence encoding the protein of the present invention (or a fragment or derivative thereof) can be obtained entirely by chemical synthesis. The DNA sequence can then be introduced into various existing DNA molecules (such as vectors) and cells known in the art. Mutations can also be introduced into the protein sequence of the present invention by chemical synthesis.
The present invention also relates to vectors comprising the polynucleotide of the present invention, to host cells genetically engineered with the vector or with the coding sequence of the engineered enzyme, and to methods for producing the protein of the present invention by recombinant technology.
The polynucleotide sequence of the present invention can be used to express or produce the recombinant engineered enzyme by conventional recombinant DNA technology. This generally involves the following steps: (1) transforming or transducing a suitable host cell with the polynucleotide encoding the engineered enzyme of the present invention, or with a recombinant expression vector containing that polynucleotide; (2) culturing the host cell in a suitable medium; (3) isolating and purifying the protein from the medium or the cells.
The present invention also provides a cytosine base editor containing the engineered enzyme or its polynucleotide sequence. In a preferred embodiment, the cytosine base editor is the BE3 base editor. The other components of the cytosine base editor are known to those skilled in the art.
In the present invention, the polynucleotide sequence of the engineered enzyme or of the cytosine base editor can be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to bacterial plasmids, bacteriophages, yeast plasmids, plant cell viruses, mammalian cell viruses, or other vectors well known in the art. In short, any plasmid or vector can be used as long as it can replicate and remain stable in the host. An important feature of an expression vector is that it usually contains an origin of replication, a promoter, a marker gene, and translation control elements.
Methods well known to those skilled in the art can be used to construct expression vectors containing the polynucleotide sequence of the engineered enzyme or of the cytosine base editor together with appropriate transcription/translation control signals. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, and in vivo recombination techniques. The DNA sequence can be operably linked to an appropriate promoter in the expression vector to direct mRNA synthesis. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator. The expression vector preferably contains one or more selectable marker genes to provide a phenotypic trait for selecting transformed host cells.
A vector containing the appropriate DNA sequence described above together with an appropriate promoter or control sequence can be used to transform an appropriate host cell or recipient cell.
The present invention also provides a gene editing method, which comprises mediating gene editing with the cytosine base editor of the present invention. Apart from the use of this cytosine base editor, the other gene editing reagents may be those known in the art; for example, the sgRNA can be designed in a manner known in the art.
In the present invention, the cells subjected to gene editing are not particularly limited; they may be somatic cells or germ cells, and may be animal cells or human cells.
Although cytosine base editors (CBEs) hold great promise as powerful gene editing tools, their DNA and RNA off-target effects limit their scientific and medical applications. In specific embodiments of the present invention, the inventors screened more than 20 rationally designed CBE mutants and analyzed their DNA and RNA off-target effects using GOTI and RNA-Seq, respectively. Mutations of residues in the predicted DNA-binding site significantly reduced off-target effects, in some cases to levels comparable to those of unedited controls. The CBE variant obtained in the present invention, YE1-BE3-FNLS, has very low off-target activity and bystander editing while retaining very high on-target editing efficiency. The inventors thus not only identified multiple residues that specifically affect RNA and DNA off-target activity and narrow the base editing window, but also introduced a CBE variant that combines high fidelity with high editing efficiency, thereby extending the laboratory and clinical applications of these powerful tools.
In specific embodiments of the present invention, the inventors screened dozens of rAPOBEC1 mutations on the basis of findings from several previous studies and found that the BE3 R132E, YE1-BE3, and FE1-BE3 variants significantly reduced DNA and RNA off-target effects while retaining their on-target editing activity. Interestingly, the inventors observed that the variants with reduced DNA/RNA off-target effects (BE3 R132E, YE1-BE3, FE1-BE3, and YE1-BE3-FNLS) also had narrowed base editing windows. Rees et al. reported that bases located outside the activity window but within the ssDNA R-loop region can still be edited, albeit with lower efficiency, especially if they lie within a sequence motif favored by rAPOBEC1. This may help explain these results.
Given that rAPOBEC1 and hAPOBEC3A are each thought to have only a single catalytic domain, the inventors predicted the likely effects of mutations introduced into the DNA-binding motif. However, in view of the known ability of BEs to edit both DNA and RNA, rAPOBEC1 may adopt different binding modes to accommodate ssDNA and RNA. This underscores the need for researchers working on base editing to evaluate off-target effects on both DNA and RNA. The inventors speculate that this heterogeneity of binding modes may help explain the observation that some CBE variants retain high DNA off-target effects while showing significantly reduced RNA off-target effects (and vice versa).
In particular, the inventors speculate that R132E affects the interaction of rAPOBEC1 with both DNA and RNA, whereas R126E mainly affects its DNA-binding ability and Y130F mainly affects its RNA-binding ability. The YE1-BE3-FNLS variant contains both the R126E mutation and a tyrosine substitution at residue W90, which lies in the hydrophobic region of rAPOBEC1 and is thought to participate in the binding of rAPOBEC1 to ssDNA/RNA. The inventors therefore tentatively speculate that the W90Y mutation alters the rAPOBEC1-RNA interaction, which may help explain the high fidelity of YE1-BE3-FNLS.
Notably, YE1-BE3-FNLS, the preferred variant identified by the inventors, had both the highest on-target editing efficiency and the lowest levels of indels and bystander editing. A previous study suggested that multiple Cs in the editing window may increase the probability of indels during editing. Compared with BE3-FNLS, YE1-BE3-FNLS has a significantly narrower base editing window and therefore fewer Cs fall within it, which may explain the significant decrease in indel frequency. In summary, the inventors' work illustrates how engineering driven by biological insight can minimize the off-target effects of cytosine base editing, so that these powerful gene editing tools can be used in research and therapeutic applications.
The present invention is further described below with reference to specific examples. It should be understood that these examples are intended only to illustrate the invention and not to limit its scope. Experimental methods for which specific conditions are not given in the following examples are generally carried out under conventional conditions, such as those described in J. Sambrook et al., Molecular Cloning: A Laboratory Manual, 3rd edition, Science Press, 2002, or under the conditions recommended by the manufacturer.
Materials and Methods
GOTI method
A mixture of mRNA encoding a gene editing tool (such as Cas9 or BE3) and sgRNA was injected into one blastomere of a 2-cell-stage embryo derived from a cross between a wild-type female mouse and an Ai9 male mouse. The action of Cre produces a chimeric embryo in which the injected cells are labeled with tdTomato (red); tdTomato-positive cells are edited and tdTomato-negative cells are unedited. At E14.5, tdTomato-positive and tdTomato-negative cells were isolated from the chimeric embryos by FACS and subjected separately to WGS analysis. Off-target SNVs and indels were identified by comparing tdTomato+ cells with tdTomato- cells using three algorithms (Mutect2, Lofreq, and Strelka for SNV analysis; Mutect2, Scalpel, and Strelka for indel analysis).
Animal care
Heterozygous Ai9 male mice (full name B6.Cg-Gt(ROSA)26Sortm9(CAG-td-Tomato)Hze/J; JAX strain 007909) were mated with female C57BL/6 mice (4 weeks old) for embryo collection. ICR females were used as recipients. The use and care of animals followed the guidelines of the Biomedical Research Ethics Committee of the Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences.
Mutant construction
The wild-type APOBEC1 protein sequence is shown in SEQ ID NO:1:
(1) Cytosine base editor 3 (BE3, rAPOBEC1-nCas9-UGI) comprises APOBEC1, the Sp nCas9 enzyme, and the UGI enzyme. APOBEC1 is joined to Sp nCas9 by a 16-amino-acid linker peptide (sequence SGSETPGTSESATPES), and Sp nCas9 is joined to UGI by a 4-amino-acid linker peptide (sequence SGGS). Here rAPOBEC1 denotes recombinant APOBEC1.
BE3 R126E: the BE3 of paragraph (1) above, in which residue 126 of the rAPOBEC1 sequence is mutated from R to E.
BE3 R128E: the BE3 of (1), in which residue 128 of the rAPOBEC1 sequence is mutated from R to E.
BE3 R132E: the BE3 of (1), in which residue 132 of the rAPOBEC1 sequence is mutated from R to E.
YE1-BE3: the BE3 of (1), in which residue 90 of the rAPOBEC1 sequence is mutated from W to Y (W90Y) and residue 126 from R to E (R126E).
FE1-BE3: the BE3 of (1), in which residue 90 of the rAPOBEC1 sequence is mutated from W to F (W90F) and residue 126 from R to E (R126E).
BE3-hA3A: a new BE3 editing tool constructed by replacing the APOBEC1 of BE3 with human APOBEC3A.
BE3-hA3A Y130F: BE3-hA3A in which Y130 of human APOBEC3A is mutated to F.
BE3-FNLS: the BE3 of (1), in which a Flag tag and a nuclear localization sequence (NLS; sequence PKKKRKV) are attached to the N-terminus of the rAPOBEC1 sequence, and a further NLS is present at the C-terminus of the base editor.
YE1-BE3-FNLS: BE3-FNLS in which residue 90 of the rAPOBEC1 sequence is mutated from W to Y (W90Y) and residue 126 from R to E (R126E). A schematic diagram is shown in Figure 13.
The sequence of YE1-BE3 after mutation is (SEQ ID NO:223):
The mutants were inserted into the pCMV-BE3 plasmid to obtain the CBE mutant plasmids.
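The substitution notation used for the variants above (for example W90Y and R126E, with 1-based residue numbering) can be applied to a protein sequence programmatically. The following is a minimal sketch; the function name is hypothetical, and the placeholder sequence is not SEQ ID NO:1 but a toy string that merely carries W at position 90 and R at position 126.

```python
import re


def apply_substitutions(protein, substitutions):
    """Apply substitutions such as 'W90Y' (1-based positions) to a protein sequence."""
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        wild_type, mutant = match.group(1), match.group(3)
        index = int(match.group(2)) - 1
        # Refuse to mutate if the stated wild-type residue is not at that position.
        if not 0 <= index < len(residues) or residues[index] != wild_type:
            raise ValueError(f"{sub}: residue {wild_type} not found at that position")
        residues[index] = mutant
    return "".join(residues)


# Placeholder sequence (NOT the real rAPOBEC1): W at position 90, R at position 126.
toy = "A" * 89 + "W" + "A" * 35 + "R" + "A" * 10
ye1_like = apply_substitutions(toy, ["W90Y", "R126E"])
```

Checking the wild-type residue before substituting guards against off-by-one numbering errors when a tag or linker has been added upstream of the deaminase.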
Preparation of mutant base editor mRNA and sgRNA
A T7 promoter was added to the base editor coding region by PCR amplification of the plasmids (YE1-BE3, BE3-FNLS, YE1-BE3-FNLS, hA3A-BE3) with primers F and R (base editor IVT F, base editor IVT R, etc. in Table 2). The T7-base editor PCR product was purified and used as the template for in vitro transcription (IVT) with the mMESSAGE mMACHINE T7 ULTRA kit (Life Technologies). A T7 promoter was added to the sgRNA template by PCR amplification from px330. The T7-sgRNA PCR product was purified and used as the template for IVT with the MEGAshortscript T7 kit (Life Technologies). A T7 promoter was likewise added to the Cre template by PCR amplification, and the purified T7-Cre PCR product was used as the template for in vitro transcription with the mMESSAGE mMACHINE T7 ULTRA kit (Life Technologies). Cas9 mRNA, Cre mRNA, and sgRNAs were purified with the MEGAclear kit (Life Technologies) and eluted in RNase-free water.
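Adding a T7 promoter by PCR is commonly done by placing the promoter as a 5' extension on the forward primer. The sketch below shows this under stated assumptions: the gene-specific portion is a hypothetical placeholder, and the actual primers of Table 2 are not reproduced here.

```python
# T7 promoter followed by GGG; T7 RNA polymerase starts transcription at the first G.
T7_PROMOTER = "TAATACGACTCACTATAGGG"


def add_t7_promoter(gene_specific_primer):
    """Return a forward primer carrying the T7 promoter as a 5' extension."""
    return T7_PROMOTER + gene_specific_primer.upper()


# Hypothetical 21-nt gene-specific sequence, used only to show the layout.
ivt_forward = add_t7_promoter("atggactataaggaccacgac")
```

The reverse primer needs no extension, because the promoter only has to sit upstream of the transcribed region.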
Table 1. sgRNA sequences
Table 2. Primers
Here, IVT denotes in vitro transcription; Tyr-C, Tyr-D, and Tyr-F denote the sgRNAs targeting the Tyr gene with codes C, D, and F, respectively.
After the above mRNA and sgRNA were obtained, gene editing was carried out as follows: the mRNA and sgRNA were mixed and injected with a microinjector into one blastomere of a mouse 2-cell-stage embryo.
Blastocyst rate calculation
Superovulated female C57BL/6 mice (4 weeks old) were mated with male C57BL/6 mice, and fertilized embryos were collected from the oviducts 24 hours after hCG injection. Cas9 mRNA and sgRNA were mixed and injected into the cytoplasm of the fertilized eggs in M2 medium containing 5 μg/ml cytochalasin B (CB), using a FemtoJet microinjector (Eppendorf) under constant-flow settings. The injected embryos were cultured in KSOM at 37°C under 5% CO2 in air to the blastocyst stage.
2-cell embryo injection, embryo culture, and embryo transfer
Superovulated C57BL/6 female mice (4 weeks old) were mated with heterozygous Ai9 male mice (full name B6.Cg-Gt(ROSA)26Sortm9(CAG-td-Tomato)Hze/J; JAX strain 007909), and fertilized embryos were collected from the oviducts 23 hours after hCG injection. For 2-cell editing, a mixture of BE3 mRNA (50 ng/μl) or BE3 variant mRNA (50 ng/μl), sgRNA (50 ng/μl), and Cre mRNA (2 ng/μl) was injected into the cytoplasm of a blastomere of a 2-cell embryo 48 h after hCG injection, in M2 medium containing 5 μg/ml cytochalasin B (CB), using a FemtoJet microinjector (Eppendorf) under constant-flow settings. The injected embryos were cultured in KSOM medium containing amino acids at 37°C under 5% CO2 for 2 hours and then transferred into the oviducts of pseudopregnant ICR females at 0.5 dpc.
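The final concentrations of the injection mixture (50 ng/μl editor mRNA, 50 ng/μl sgRNA, 2 ng/μl Cre mRNA) translate into pipetting volumes through the dilution relation C1·V1 = C2·V2. The sketch below works this out; the stock concentrations and the 10 μl total volume are assumptions chosen for illustration and are not stated in this disclosure.

```python
def injection_mix(final_ng_per_ul, stock_ng_per_ul, total_ul):
    """Return the volume (ul) of each stock and of water, using C1*V1 = C2*V2."""
    volumes = {
        name: final_ng_per_ul[name] * total_ul / stock_ng_per_ul[name]
        for name in final_ng_per_ul
    }
    water = total_ul - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested mix")
    volumes["RNase-free water"] = water
    return volumes


final = {"editor mRNA": 50, "sgRNA": 50, "Cre mRNA": 2}      # ng/ul, from the text
stock = {"editor mRNA": 500, "sgRNA": 500, "Cre mRNA": 100}  # ng/ul, assumed stocks
mix = injection_mix(final, stock, total_ul=10)
```

With these assumed stocks, the mix takes 1 μl of editor mRNA, 1 μl of sgRNA, 0.2 μl of Cre mRNA, and about 7.8 μl of water.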
Cloning
Site-directed mutagenesis of BE3 was performed with NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs). Briefly, the inventors amplified the appropriate vector plasmid by PCR with a primer containing the desired point mutation. pCMV-BE3 variants-polyA-pCMV-mCherry-polyA was generated by NEBuilder HiFi DNA assembly of PCR-amplified pCMV-mCherry-polyA with the digested pCMV-BE3 variant backbone. pCMV-EGFP-polyA-U6-sgRNA was generated by NEBuilder HiFi DNA assembly of PCR-amplified U6-sgRNA with the digested pCMV-EGFP-polyA backbone.
Cell culture, transfection, and FACS
HEK293T cells were cultured in DMEM containing 10% fetal bovine serum (FBS) in a humidified incubator at 37°C with 5% CO2. The pCMV-BE3 (WT/BE3 variant)-polyA-pCMV-mCherry-polyA and pCMV-EGFP-polyA-U6-sgRNA expression plasmids were co-transfected using Lipofectamine 3000 (ThermoFisher Scientific) according to the manufacturer's instructions. At 36 or 72 hours after transfection, the cells were washed with phosphate-buffered saline (PBS) and dissociated with 0.05% trypsin-EDTA. The cell suspension was passed through a 40 μm cell strainer, and EGFP/mCherry-positive cells were isolated by flow cytometry.
RNA sequencing
Approximately 500,000 cells (top 5% of EGFP/mCherry signal) were collected, and RNA was extracted according to a standard protocol. To construct the library, mRNAs were fragmented and converted to cDNA using random hexamers or oligo(dT) primers. Adapters were ligated to the 5′ and 3′ ends of the cDNA, and correctly ligated cDNA fragments were enriched and amplified by PCR. Library concentration was measured with a Bioanalyzer. Sequencing was performed on the Illumina HiSeq platform.
Deep sequencing of DNA amplicons
Transfected cells were harvested after 72 hours, and EGFP+/mCherry+ cells were sorted by flow cytometry (FACS). Genomic DNA was extracted with the TIANGEN genomic DNA extraction kit according to the manufacturer's instructions. The genomic loci of interest were amplified by PCR with gene-specific primers flanking the target sequence (Tables 3 and 4). ExTaq (TAKARA) was activated at 95°C for 3 minutes, followed by 34 PCR cycles (95°C for 30 seconds, 62°C for 30 seconds, 72°C for 1 minute) and a final extension at 72°C for 5 minutes. The DNA amplicons were purified with the Universal DNA Purification Kit (TIANGEN) according to the manufacturer's instructions, ligated to adapters, and sequenced on the Illumina HiSeq X Ten platform.
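As a small worked calculation, the programmed hold time of the cycling conditions above can be summed as follows. This sketch counts only the stated holds and ignores temperature ramping, which depends on the instrument; the function name is illustrative.

```python
def pcr_hold_time_seconds(initial, cycles, steps, final_extension):
    """Sum the programmed hold times of a PCR program (ramp times excluded)."""
    return initial + cycles * sum(steps) + final_extension


# 95 C 3 min; 34 x (95 C 30 s, 62 C 30 s, 72 C 1 min); 72 C 5 min
total_s = pcr_hold_time_seconds(
    initial=3 * 60, cycles=34, steps=(30, 30, 60), final_extension=5 * 60
)
total_min = total_s / 60
```

The holds add up to 4560 seconds, that is 76 minutes, before ramping is taken into account.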
Table 3. Primers used for deep sequencing of on-target sites
Table 4. Primers used for deep sequencing of off-target sites
FACS for GOTI
To isolate mouse embryonic cells, the prepared tissue was digested in 5 ml of trypsin-EDTA (0.05%) at 37°C for 30 min, and digestion was stopped by adding 5 ml of DMEM medium containing 10% fetal bovine serum (FBS). The fetal tissue was then homogenized by pipetting 30-40 times with a 1 ml pipette. The cell suspension was centrifuged for 6 min (800 rpm), and the pellet was resuspended in DMEM medium containing 10% FBS. Finally, the cell suspension was passed through a 40 μm cell strainer, and tdTomato+/tdTomato- cells were isolated by flow cytometry. A second round of flow cytometry and fluorescence microscopy confirmed that sample purity was greater than 95%.
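Whole-genome sequencing and RNA-seq data analysis
Genomic DNA was extracted from cells with the DNeasy Blood and Tissue Kit (catalog no. 69504, Qiagen) according to the manufacturer's instructions. Whole-genome sequencing was performed on an Illumina HiSeq X Ten at a mean coverage of 50×. BWA (v0.7.12) was used to map qualified sequencing reads to the reference genome (mm10). Picard tools (v2.3.0) were then used to sort the mapped BAM files and mark duplicates. To identify genome-wide de novo SNVs with high confidence, the inventors performed single-nucleotide variant calling with three algorithms, Mutect2 (v3.5), Lofreq (v2.1.2), and Strelka (v2.7.1), each with default parameters. In parallel, genome-wide de novo indels were detected with Mutect2 (v3.5), Scalpel (v0.5.3), and Strelka (v2.7.1), each with default parameters. Only the variants called by all three SNV algorithms, or by all three indel algorithms, were considered true variants. All sequencing data were deposited in the NCBI Sequence Read Archive (SRA).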
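The rule that only the overlap of the three algorithms counts as a true variant amounts to a set intersection. The sketch below illustrates it on toy calls; the tuple representation (chromosome, position, reference allele, alternate allele) and the example variants are hypothetical, and in practice the calls would be parsed from each caller's VCF output.

```python
def consensus_calls(*call_sets):
    """Keep only the variants reported by every caller."""
    return set.intersection(*(set(calls) for calls in call_sets))


# Hypothetical toy calls as (chromosome, position, ref, alt).
mutect2 = {("chr1", 1000, "C", "T"), ("chr2", 500, "G", "A"), ("chr3", 42, "C", "T")}
lofreq = {("chr1", 1000, "C", "T"), ("chr3", 42, "C", "T"), ("chr5", 7, "A", "G")}
strelka = {("chr1", 1000, "C", "T"), ("chr3", 42, "C", "T")}

high_confidence = consensus_calls(mutect2, lofreq, strelka)
```

Requiring agreement among all callers trades sensitivity for specificity, which suits a comparison in which false positives would inflate the apparent off-target count.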
Potential off-target sites of the target sites were predicted with two previously reported algorithms, Cas-OFFinder (http://www.rgenome.net/Cas-officer/) and CRISPOR (http://CRISPOR.tefor.net/). SNVs and indels were annotated with ANNOVAR (version 2016-02-01) using the RefSeq database.
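The idea behind predicting sgRNA-dependent off-target sites, namely searching for genomic sites that resemble the protospacer and are followed by a PAM, can be sketched as a simple mismatch scan. This is not the algorithm of either tool named above; it is a simplified illustration that scans only the forward strand of a toy sequence for an NGG PAM, and the guide and sequences are hypothetical.

```python
def candidate_off_targets(sequence, guide, max_mismatches=3):
    """Scan the forward strand for guide-like sites followed by an NGG PAM."""
    hits = []
    n = len(guide)
    for start in range(len(sequence) - n - 2):
        protospacer = sequence[start:start + n]
        pam = sequence[start + n:start + n + 3]
        if pam[1:] != "GG":
            continue
        mismatches = sum(a != b for a, b in zip(guide, protospacer))
        if mismatches <= max_mismatches:
            hits.append((start, protospacer, pam, mismatches))
    return hits


guide = "GACCTGAATGCTGTGCATCA"       # hypothetical 20-nt guide
near_match = "TACCTCAATGCTGTGCATCA"  # same site with two mismatches
sequence = "TTTT" + guide + "TGG" + "CCCCAAAA" + near_match + "AGG" + "TTTT"
hits = candidate_off_targets(sequence, guide)
```

A real search would also scan the reverse complement strand and, depending on the tool, allow bulges and alternative PAMs.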
For RNA-seq data analysis, FastQC (v0.11.3) and Trimmomatic (v0.36) were used for quality control. Qualified reads were mapped to the reference genome (Ensembl GRCh38) with STAR (v2.5.2b) in 2-pass mode with default parameters. Picard tools (v2.3.0) were then used to sort the mapped BAM files and mark duplicates. The refined BAM files were subjected to splitting of reads spanning splice junctions, local realignment, base recalibration, and variant calling with SplitNCigarReads, IndelRealigner, BaseRecalibrator, and HaplotypeCaller, respectively, from GATK (v3.5).
Structure prediction
The amino acid sequences of rat APOBEC1 and human APOBEC3G were retrieved from UniProt (https://www.UniProt.org/) and aligned using NCBI blastp (https://blast.NCBI.nlm.nih.gov/blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome). The structure of rAPOBEC1 was predicted by the Protein Structure Prediction Server (PS) on the basis of consensus sequence and secondary structure information from proteins of known structure. The crystal structure of APOBEC3G was downloaded from the PDB (http://www.rcsb.org/3d-view/3IQS) and rendered with PyMOL (v2.3.2).
Statistical analysis
Statistical analyses were performed with R version 3.5.1 (http://www.R-project.org/). All tests were two-sided, and P<0.05 was considered statistically significant.
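The text does not state which statistical tests were applied, so the following is only an illustration of what a two-sided test at the P<0.05 threshold looks like. It uses an exact permutation test on the difference in means, written in Python rather than R; the choice of test and the counts are assumptions made for this sketch, not the inventors' analysis or data.

```python
from itertools import combinations
from statistics import mean


def exact_permutation_p(group_a, group_b):
    """Two-sided exact permutation test on the absolute difference in means."""
    pooled = list(group_a) + list(group_b)
    observed = abs(mean(group_a) - mean(group_b))
    extreme = total = 0
    for idx in combinations(range(len(pooled)), len(group_a)):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Count relabelings at least as extreme as the observed split.
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical placeholder counts for two groups of four samples each.
variant_counts = [10, 12, 14, 11]
control_counts = [250, 300, 280, 310]
p_value = exact_permutation_p(variant_counts, control_counts)
significant = p_value < 0.05
```

With four samples per group there are 70 possible relabelings, so the smallest attainable two-sided P value is 2/70, about 0.029.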
Example 1. APOBEC1 mutants and their effects
The inventors introduced various mutations into recombinant APOBEC1 (rAPOBEC1) with the aim of affecting its DNA or RNA activity (Figure 1a). Specifically, the variants included deletions or mutations in the leucine-rich N-terminal or C-terminal regions of rAPOBEC1 (Del32, R33A, K34A, Del34, Del77, Del116, Del169, Del182, P190A, and P191A), as well as mutations in the predicted catalytic active site of rAPOBEC1 (H61A, H61R, V62A, E63A, E63Q, C93S, C96S). Based on structural studies of APOBEC3G, the inventors predicted that residue R126 of rAPOBEC1 (corresponding to R320 of APOBEC3G) interacts with the phosphate backbone of ssDNA (Figure 1b, c) and that the R126E mutant would retain on-target editing activity. Because R128 and R132 lie close to R126, the inventors also introduced the R128E and R132E mutations (Figure 1a-c). The inventors further examined the effect of combinations of point mutations (W90A, W90F, W90Y) at the active site within the hydrophobic domain of rAPOBEC1; these mutations can narrow the base editing window.
The inventors transfected HEK293T cells with the CBE mutant plasmids to analyze their editing activity and off-target effects. Testing at 10 genomic target sites identified 7 of the 23 mutants that retained on-target activity: R33A, K34A, V62A, W90F+R126E, W90Y+R126E, R126E, and R132E. Four of these mutants (W90F+R126E, W90Y+R126E, R126E, and R132E) showed no increase in insertion/deletion (indel) frequency (Figure 1d, Figure 4, and Table 5). These experiments also showed that the W90F+R126E, W90Y+R126E, and R132E variants had narrowed editing windows (Figure 4).
Table 5. P values for on-target efficiency and indel rate of CBE variants compared with BE3
Next, the inventors used GOTI to evaluate the DNA off-target activity of the mutants with high on-target DNA editing efficiency: BE3 R126E, BE3 R132E, BE3 W90Y+R126E (YE1-BE3), and BE3 W90F+R126E (FE1-BE3) (Table 6). First, it should be noted that the embryonic development of Ai9 mice was not adversely affected by any of these variants (Figure 5). The on-target efficiency of these variants was assessed by whole-genome sequencing (Figure 6). Compared with embryos treated with wild-type BE3 (283±32 DNA off-target SNVs), embryos treated with BE3 R126E, BE3 R132E, YE1-BE3, or FE1-BE3 had significantly fewer DNA off-target SNVs: 28±6 for BE3 R126E, 47±8 for BE3 R132E, 12±2 for YE1-BE3, and 27±19 for FE1-BE3.
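To put the reported means in proportion, the fold reduction relative to wild-type BE3 can be computed directly from the values given above (283 for BE3; 28, 47, 12, and 27 for the variants). The sketch uses only the means and ignores the reported spreads, so it is a rough guide rather than a statistical comparison.

```python
wild_type_be3 = 283  # mean DNA off-target SNVs for wild-type BE3, from the text
variant_means = {"BE3 R126E": 28, "BE3 R132E": 47, "YE1-BE3": 12, "FE1-BE3": 27}

# Fold reduction = wild-type mean divided by variant mean, rounded to one decimal.
fold_reduction = {
    name: round(wild_type_be3 / value, 1) for name, value in variant_means.items()
}
```

On these means the reduction ranges from about 6-fold (BE3 R132E) to about 24-fold (YE1-BE3).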
Table 6. Summary of HiSeq X Ten sequencing
Importantly, the number of SNVs in embryos injected with any of the four CBE variants did not differ significantly from that in unedited control embryos (14 SNVs on average, close to the spontaneous mutation rate) (Figure 2a, Figure 7, and Table 7). In addition, the CBE variants showed significantly fewer mutations than wild-type BE3, and none of the SNVs detected in the CBE variant groups overlapped with the off-target sites predicted by Cas-OFFinder and CRISPOR (Figure 2b, Figures 6 and 8). These findings indicate that these CBE variants produce fewer sgRNA-independent DNA off-target edits.
Table 7. Off-target SNVs called by the different algorithms
The inventors also used RNA-seq to evaluate the off-target effects of these variants on the transcriptome of transfected HEK293T cells. Compared with wild-type BE3, three variants (BE3 R132E, YE1-BE3, and FE1-BE3) showed significantly reduced RNA off-target editing at 36 h after transfection (Figures 2c and 2d). In contrast, RNA off-target editing by the BE3 R126E variant was not reduced at 36 h after transfection but was significantly reduced at 72 h. Compared with GFP-transfected control cells, the number of SNVs was significantly increased in the wild-type BE3 and BE3 R126E groups but not in the BE3 R132E, YE1-BE3, or FE1-BE3 groups (Figures 2c, 2d, 9, and 10). Taken together, these results indicate that the BE3 R132E, YE1-BE3, and FE1-BE3 variants are high-fidelity base editors with significantly lower DNA and RNA off-target effects than BE3.
Although the three BE3 variants described above (BE3 R132E, YE1-BE3 and FE1-BE3) significantly reduce off-target effects, their on-target editing efficiency is lower than that of BE3-hA3A (Figure 3a and Figure 11). BE3 (hA3A Y130F) carries a substitution of tyrosine 130 with phenylalanine (Y130F) in human APOBEC3A, and this mutation was observed to significantly reduce the number of off-target SNVs. The inventors used GOTI to analyze the off-target effects of BE3-hA3A but found that BE3-hA3A was markedly toxic to embryos (Figure 5).
Seeking a base editor with both high on-target efficiency and high fidelity, the inventors introduced the Y130F mutation into BE3-hA3A. The resulting BE3-hA3A Y130F editor had high on-target editing efficiency (Figure 3a) but still produced a large number of DNA off-target SNVs (409±86) (Figure 3b and Figure 11).
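The substitution notation used here (for example Y130F, meaning the tyrosine at position 130 is replaced by phenylalanine) can be made concrete with a short sketch. The helper `apply_substitution` and the toy peptide are hypothetical; the sequence is not APOBEC3A and is used only to show how such a notation maps onto a protein sequence.

```python
import re


def apply_substitution(protein, mutation):
    """Apply a point substitution written as <wild type><1-based position><new residue>.

    Raises ValueError if the notation is malformed, the position is out of range,
    or the residue found at that position is not the stated wild-type residue.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognized mutation notation: {mutation!r}")

    wild_type, replacement = match.group(1), match.group(3)
    index = int(match.group(2)) - 1  # convert the 1-based position to a 0-based index

    if index < 0 or index >= len(protein):
        raise ValueError(f"position out of range for {mutation!r}")
    if protein[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {index + 1}, found {protein[index]}"
        )
    return protein[:index] + replacement + protein[index + 1:]


# Toy peptide (hypothetical): tyrosine at position 5 is changed to phenylalanine.
toy_peptide = "MKTAYIAK"
print(apply_substitution(toy_peptide, "Y5F"))  # prints MKTAFIAK
```

Checking the wild-type residue before substituting guards against off-by-one errors in residue numbering, which are a common source of mistakes when a mutation is transferred between constructs.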
Next, building on the BE3-FNLS editor, the inventors constructed a high-fidelity variant, YE1-BE3-FNLS, in which a nuclear localization signal peptide is added at both the C-terminus and the N-terminus, and codon-optimized its DNA sequence for expression in human cells. The codon-optimized DNA sequence is (SEQ ID NO: 2):
In addition to this new YE1-BE3-FNLS variant, the inventors tested the on-target editing efficiency and bystander editing of BE3, YE1-BE3, BE3-hA3A, BE3-hA3A Y130F and BE3-FNLS at 21 target sites in HEK293T cells. YE1-BE3-FNLS had the highest on-target efficiency, 70.7±5.2% (Figure 3d). Notably, YE1-BE3-FNLS had the lowest indel level among the tested variants (0.8±0.2%) and also the lowest level of other bystander edits (0.6±0.4%) (Figure 3d-e). Moreover, compared with BE3, YE1-BE3-FNLS significantly reduced both DNA and RNA off-target activity, to levels comparable to those of the unedited control group (Figure 3b-c). Because the GOTI method assesses the sgRNA-independent off-target effects of gene-editing proteins, the inventors also examined the sgRNA-dependent off-target effects of YE1-BE3-FNLS. These did not differ significantly between cells transfected with YE1-BE3-FNLS and cells treated with the other BE3 variants (Figure 12). This new base editor variant therefore meets the inventors' two criteria: high on-target editing efficiency and high fidelity.
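Values such as 70.7±5.2% summarize measurements across many target sites. The text does not state whether the ± term is a standard deviation or a standard error of the mean, so the sketch below simply assumes the standard error and uses hypothetical per-site efficiencies; the function name `mean_and_sem` is likewise an illustration, not part of the original work.

```python
import statistics
from math import sqrt


def mean_and_sem(values):
    """Return the mean and the standard error of the mean (SEM) of the values.

    SEM is computed as the sample standard deviation divided by sqrt(n).
    At least two values are required.
    """
    if len(values) < 2:
        raise ValueError("at least two values are needed to estimate the SEM")
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / sqrt(len(values))
    return mean, sem


# Hypothetical editing efficiencies (%) at three target sites.
efficiencies = [60.0, 70.0, 80.0]
mean, sem = mean_and_sem(efficiencies)
print(f"{mean:.1f} ± {sem:.1f}%")
```

If the reported spread were a standard deviation instead, the division by `sqrt(len(values))` would be dropped.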
All documents mentioned in the present invention are incorporated by reference in this application, as if each document were individually incorporated by reference. In addition, it should be understood that, after reading the above teachings of the present invention, those skilled in the art may make various changes or modifications to the present invention, and such equivalent forms likewise fall within the scope defined by the claims appended to this application.
Claims (22)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2020/074561 WO2021155607A1 (en) | 2020-02-07 | 2020-02-07 | Modified cytosine base editor and application thereof |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021155607A1 (en) | 2021-08-12 |
Family
ID=77199722
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/074561 Ceased WO2021155607A1 (en) | Modified cytosine base editor and application thereof | 2020-02-07 | 2020-02-07 |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2021155607A1 (en) |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107164377A (en) * | 2017-06-12 | 2017-09-15 | Wang Xiaoping (王小平) | Gene knockout method and its application based on base editor |
| CN108513575A (en) * | 2015-10-23 | 2018-09-07 | President and Fellows of Harvard College | Nucleobase editors and their uses |
| WO2018165629A1 (en) * | 2017-03-10 | 2018-09-13 | President And Fellows Of Harvard College | Cytosine to guanine base editor |
| CN108822217A (en) * | 2018-02-23 | 2018-11-16 | ShanghaiTech University | A kind of gene base editing machine |
| CN109321584A (en) * | 2017-12-27 | 2019-02-12 | East China Normal University | A simple qualitative/quantitative reporting system for detecting the efficiency of single-base gene editing technology |
| WO2019042284A1 (en) * | 2017-09-01 | 2019-03-07 | Shanghaitech University | Fusion proteins for improved precision in base editing |
| WO2019126762A2 (en) * | 2017-12-22 | 2019-06-27 | The Broad Institute, Inc. | Cas12a systems, methods, and compositions for targeted rna base editing |
| WO2019126709A1 (en) * | 2017-12-22 | 2019-06-27 | The Broad Institute, Inc. | Cas12b systems, methods, and compositions for targeted dna base editing |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Bai et al. | CRISPR/Cas9-mediated precise genome modification by a long ssDNA template in zebrafish | |
| Zhou et al. | Programmable base editing of the sheep genome revealed no genome-wide off-target mutations | |
| CN110804628B (en) | High-specificity off-target-free single-base gene editing tool | |
| WO2021042470A1 (en) | Use of combination of grna targets in constructing cell lines of porcine models of haemophilia a, b and ab | |
| US20230416710A1 (en) | Engineered and chimeric nucleases | |
| CN109072258A (en) | Replicative transposition subsystem | |
| CN110300802A (en) | Compositions and base editing methods for base editing in animal embryos | |
| CN113249362B (en) | Modified cytosine base editor and application thereof | |
| CN116716298A (en) | A guided editing system and site-directed modification method of target gene sequence | |
| KR102699756B1 (en) | Prime editing-based gene editing composition with improved editing efficiency and use thereof | |
| CN115703842A (en) | Base editor for efficient and highly accurate cytosine C to guanine G conversion | |
| CN117701542A (en) | Cytosine deaminase SfSddA, base editor comprising same and application | |
| Zuo et al. | High-fidelity base editor with no detectable genome-wide off-target effects | |
| WO2024119461A1 (en) | Compositions and methods for detecting target cleavage sites of crispr/cas nucleases and dna translocation | |
| WO2021155607A1 (en) | Modified cytosine base editor and application thereof | |
| CN109943589A (en) | A single base mutation method and system used | |
| CN109504707A (en) | The restorative procedure in the iPSCs Mitochondrial DNA Mutation site based on mitoTALENs | |
| CN114875033A (en) | sgRNA, CRISPR/Cas reagent and application thereof | |
| CN116970590B (en) | Super mini-gene editor smaller than 380 amino acids and application thereof | |
| CN120210166A (en) | Cytosine deaminase AbSddA, base editor containing the same and application thereof | |
| US20250051773A1 (en) | Dnazyme and use thereof | |
| CN120464610A (en) | Cytosine deaminase SvSddA, base editor containing the same and applications thereof | |
| CN120591242A (en) | Cytosine deaminase AiSddA, base editor containing same and applications thereof | |
| CN114262708B (en) | Kit and method for producing FecB gene g.A746G site-directed mutagenesis sheep | |
| CN111471680A (en) | Reconstructed eggs of zebrafish model and construction method and application thereof, construction method of zebrafish model |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 20917487; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 20917487; Country of ref document: EP; Kind code of ref document: A1 |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 20917487; Country of ref document: EP; Kind code of ref document: A1 |
| | 32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 10/07/2023) |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 20917487; Country of ref document: EP; Kind code of ref document: A1 |