[go: up one dir, main page]

WO2024235228A1 - Rna polymerase mutants with improved performance - Google Patents

Rna polymerase mutants with improved performance Download PDF

Info

Publication number
WO2024235228A1
WO2024235228A1 PCT/CN2024/093147 CN2024093147W WO2024235228A1 WO 2024235228 A1 WO2024235228 A1 WO 2024235228A1 CN 2024093147 W CN2024093147 W CN 2024093147W WO 2024235228 A1 WO2024235228 A1 WO 2024235228A1
Authority
WO
WIPO (PCT)
Prior art keywords
amino acid
rna polymerase
mutant
mutated
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/093147
Other languages
French (fr)
Chinese (zh)
Inventor
杨广宇
秦伟彤
黑墨翰
罗漫杰
徐灿
聂挺
余佩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hzymes Biotechnology Co Ltd
Original Assignee
Hzymes Biotechnology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202310558557.6A external-priority patent/CN118956809A/en
Priority claimed from CN202410499488.0A external-priority patent/CN120829885A/en
Application filed by Hzymes Biotechnology Co Ltd filed Critical Hzymes Biotechnology Co Ltd
Publication of WO2024235228A1 publication Critical patent/WO2024235228A1/en
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)

Definitions

  • the invention relates to an RNA polymerase mutant with improved performance, belonging to the technical field of genetic engineering and enzyme engineering.
  • T7 RNA polymerase is derived from Escherichia coli bacteriophage T7 and is a DNA-dependent single-subunit RNA polymerase that highly specifically recognizes the T7 promoter sequence.
  • the enzyme has a wide range of applications in in vitro RNA synthesis, including the synthesis of gRNA, siRNA, and isotope-labeled or non-isotope-labeled RNA probes, and the synthesis of capped mRNA with the participation of Cap analog for intracellular or in vivo expression and mRNA vaccine and drug research.
  • it can also be used for isothermal amplification, such as sequence-specific nucleic acid in vitro amplification (NASBA) and transcription-mediated amplification (TMA).
  • NASBA sequence-specific nucleic acid in vitro amplification
  • TMA transcription-mediated amplification
  • wild-type T7 RNAP has some shortcomings in practical applications, such as the generation of byproducts during RNA synthesis, including short fragments generated during transcription initiation, 3' end extension products, and dsRNA byproducts, which can affect the effects of mRNA vaccine and drug research.
  • the wild-type T7 RNA polymerase currently on the market cannot fully meet the growing research needs and needs to be optimized and improved to achieve better application value.
  • T7 RNAP some technologies have been used to improve T7 RNAP, such as mutants with improved enzyme activity, mutants with reduced dsRNA contaminants, and mutants with improved thermal stability.
  • US20150024435 A1 provides a mutant of T7 RNA polymerase with improved enzyme activity
  • CN 111212905 A provides a T7 RNAP mutant that reduces dsRNA contaminants and concatenated transcripts produced during in vitro transcription reactions
  • CN 112831484 B provides a method suitable for the synthesis of RNA containing internal termination signals and RNA with hairpin structures at the ends, which can synthesize Cas9 sgRNA with higher purity than T7 RNAP-WT
  • CN 108779446 A and US 9193959 B2 provide T7 RNAP mutants with improved thermal stability.
  • these mutants in the prior art often need to be improved in performance, such as insufficient performance improvement, balance between thermal stability and activity, or failure to effectively reduce byproducts.
  • the amino acid at position 217 is mutated to L, A or V, preferably, mutated to L;
  • the amino acid at position 121 is mutated to M, L or S, preferably, mutated to M;
  • the amino acid at position 223 is mutated to M, P, L, V or F, preferably, mutated to M;
  • the amino acid at position 632 is mutated to G or E, preferably, mutated to G;
  • amino acid at position 369 is mutated to T or E;
  • the RNA polymerase mutant also includes mutations at one or more of the following positions corresponding to the amino acid sequence of the parent RNA polymerase: position 430, position 633, position 786, position 744, position 724, position 43, and position 723.
  • the amino acid at position 430 is mutated to a non-polar hydrophobic amino acid; alternatively, it is mutated to P, L, V, M, F; preferably, it is mutated to P;
  • the amino acid at position 723 is mutated to E or S; preferably, mutated to E;
  • the amino acid at position 724 is mutated to N.
  • the RNA polymerase mutant comprises at least any one of the following mutations that is equivalent to or corresponds to the parent polymerase mutant: I217L, I217A, I217V, S397W, S397A, S397P, T121M, T121L, T121S, S223M, S223P, S223L, S223V, S223F, R632G, R632E, M369E, M369T, N579A, N579W, and N579D mutations.
  • the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P and Q786L, named I217L/S397W/S430P/S633P/Q786L (hereinafter referred to as M7).
  • the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, N579W, named I217L/S397W/S430P/S633P/Q786L/N579W.
  • the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, C723E, A724N, named I217L/S397W/S430P, S633P/Q786L/C723E/A724N.
  • the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, Q744R, S43Y, and is named I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y (hereinafter referred to as M8).
  • the amino acid sequence of the RNA polymerase mutant is such as SEQ ID NO.2 (corresponding to I217L), SEQ ID NO.3 (corresponding to S397W), SEQ ID NO.4 (corresponding to T121M), SEQ ID NO.5 (corresponding to S223M), SEQ ID NO.6 (corresponding to R632G), SEQ ID NO.7 (corresponding to I217L/S397W/S430P/S633P/Q786L) or SEQ ID NO.
  • SEQ ID NO.8 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y), SEQ ID NO.79 (corresponding to M369T), SEQ ID NO.80 (corresponding to N579W), SEQ ID NO.81 (corresponding to I217L/S397W/S430P/S633P/Q786L/N579W), SEQ ID NO.82 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R), and SEQ ID NO.83 (corresponding to I217L/S397W/S430P/S633P/Q786L/C723E/A724N).
  • the second object of the present invention is to provide a polynucleotide encoding the aforementioned RNA polymerase mutant.
  • the synthesis method comprises using an enzyme system comprising the RNA polymerase mutant to catalyze the synthesis of polynucleotides or using the whole cells of the above-mentioned cells to catalyze the synthesis of polynucleotides.
  • the synthesis method comprises in vitro transcription synthesis, isothermal amplification or transcription-mediated amplification.
  • the sixth object of the present invention is to provide an application of the above-mentioned RNA polymerase mutant or the above-mentioned polynucleotide synthesis method in at least one of mRNA vaccine preparation, nucleic acid drug preparation, gene editing and protein expression system construction, and the protein expression system construction includes but is not limited to in vivo protein expression system construction or cell-free protein expression in vitro translation system construction.
  • a seventh object of the present invention is to provide a kit for polynucleotide synthesis, which comprises the aforementioned RNA polymerase mutant.
  • the eighth object of the present invention is to provide a method for improving at least one performance of an RNA polymerase; the performance is selected from the group consisting of improved enzyme activity, improved thermal stability, improved 3' end consistency, higher in vitro transcription yield, fewer dsRNA byproducts and lower immunogenicity, the method comprising: based on the amino acid sequence corresponding to the parent RNA polymerase, one or more amino acid mutations are present at positions 121, 223, 632, 217, 397, 369 and 579, or one or more amino acid mutations occur at amino acid residues at equivalent positions in the parent RNA polymerase; the amino acid sequence of the parent RNA polymerase has at least 80% identity with the amino acids shown in SEQ ID NO.1 and has RNA polymerase activity.
  • the amino acid sequence of the parent RNA polymerase when the mutation site contains position 121, the amino acid sequence of the parent RNA polymerase has at least 90% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity; when the mutation site contains position 369, the amino acid sequence of the parent RNA polymerase has at least 98.41% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity.
  • the present invention provides a series of RNA polymerase mutants with improved performance, which have at least any of the following performance improvements: enzyme activity, thermal stability, 3' end consistency, dsRNA byproducts and immunogenicity, etc., and can meet various needs of practical applications and research.
  • the amino acid at position 223 was mutated to construct single-point mutants S223M, S223L, and S223V, which increased the enzyme activity by more than 18% compared with the wild-type WT.
  • S223M had the highest enzyme activity, which increased by 25% compared with the wild-type WT.
  • the amino acid at position 632 was mutated to construct single-point mutants R632G and R632E, which showed an enzyme activity increase of more than 20% compared with the wild-type WT.
  • R632G had the highest enzyme activity, which increased by 22% compared with the wild-type WT.
  • FIG4A shows the dsRNA content detection results of IVT products of T7 RNAP wild type and single mutants S430P, S633P, Q786L, I217L, S397W, combined mutants I217L/S397W/S430P/S633P/Q786L, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, and I217L/S397W/S430P/S633P/Q786L/C723E/A724N;
  • Figure 5B shows the 3' end consistency test results of the IVT product of T7 RNAP mutant Q744R at 37°C;
  • Figure 5C shows the 3' end consistency test results of the IVT product of T7 RNAP mutant S43Y at 37°C;
  • Figure 5D shows the 3' end consistency test results of the IVT product of T7 RNAP mutant G47A+884G at 37°C;
  • Figure 5J shows the 3’ end consistency test results of the IVT product of T7 RNAP mutant S397W at 50°C;
  • Figure 7E is a sequence conservation analysis of T7 RNAP mutation sites 520 to 650 in the amino acid sequences of RNA polymerases from different sources.
  • the term "corresponding to” refers to a manner of determining a specific amino acid in a sequence where a specific amino acid sequence is referenced. For example, for the purposes of the present invention, when a specific amino acid position is referenced, a skilled person can compare another amino acid sequence with the amino acid sequence that has been referenced to determine which specific amino acid may be of interest in the other amino acid sequence. Alternative alignment methods may be used and are well known to those skilled in the art.
  • Wild-type enzyme when referring to an amino acid sequence or a nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a natural or naturally occurring sequence.
  • naturally occurring refers to any substance (e.g., a protein, amino acid, or nucleic acid sequence) found in nature.
  • non-naturally occurring refers to any substance not found in nature (e.g., a recombinant nucleic acid and protein sequence produced in a laboratory, or a modification of a wild-type sequence).
  • wild-type enzyme and “parent enzyme” are used interchangeably.
  • Sequence identity The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter "sequence identity”.
  • Recombinant When used in reference to a cell, nucleic acid, protein, or vector, the term “recombinant” means that it has been modified from its native state. Thus, for example, a recombinant cell expresses genes not found in the native (non-recombinant) form of the cell, or expresses native genes at different levels or under different conditions than found in nature.
  • a recombinant nucleic acid differs from a native sequence in one or more nucleotides and/or is operably linked to a heterologous sequence (e.g., a heterologous promoter in an expression vector).
  • a recombinant protein may differ from a native sequence in one or more amino acids and/or be fused to a heterologous sequence.
  • a vector comprising a nucleic acid encoding a polypeptide is a recombinant vector.
  • the term “recombinant” is synonymous with "genetically modified" and "transgenic.”
  • Droplet microfluidic cell sorting is based on a microfluidic ultra-high throughput fluorescence-activated droplet sorting platform (Fluorescence-Activated Droplet Sorting, FADS), and the FADS screening process includes three steps: Droplet encapsulation and incubation of single cells, incubation and reaction, and ultra-high throughput screening of single cell droplets after reaction.
  • FADS can encapsulate and culture single cells, use specific fluorescent substrates to perform fluorescence quantification of the production of secondary metabolites or specific enzymes of single cells, and use the screening platform to sort and collect droplets whose fluorescence intensity meets the screening threshold, and finally obtain a large number of candidate droplets.
  • This screening method has the advantages of high throughput and strong specificity.
  • the heat treatment condition is 40-50° C. for 5-10 min, preferably 45° C. for 7 min.
  • the target cells are the 0.4% to 0.45% of droplet-encapsulated cells that exhibit the highest viability.
  • the amplified polynucleotide encoding T7 RNA polymerase and the nucleotide sequence are as shown in SEQ ID NO.15, and the expressed amino acid sequence is the wild-type T7 RNA polymerase as shown in SEQ ID NO.1.
  • the method further includes at least one step of: isolating a target fragment from a target cell sorted by a droplet microfluidic cell sorting method, transforming the fragment into another cell, and screening the cell again after culturing the fragment.
  • T7 RNA polymerase mutants expressed by positive cells are collected, the system containing the T7 RNA polymerase mutants is heat-treated, and then the enzyme activity is determined, and the positive clones whose enzyme activity is at least 1.5 times that of the wild type are sequenced.
  • T7 RNA polymerase mutants were screened using the following protocol:
  • the obtained mixed plasmid mutant library was electroporated into fresh E. coli BL21 (DE3) competent cells to obtain a mutant library containing about 107 clones.
  • E. coli was induced and expressed, and the single cell and reaction system were encapsulated using the FADS platform.
  • the single cell encapsulation rate was controlled at about 10% to improve the accuracy of screening.
  • the droplets were treated at a temperature of about 45°C for an appropriate time, and the activity was reduced to about one-third of the original to achieve the effect of high temperature treatment, and then incubated at 37°C for 3h.
  • the most suitable laser spot position, sorting speed and other conditions are determined by sorting FITC and ink. Then the incubated microdroplets are injected into the detection and sorting chip. The droplets of about 0.42% showing the highest activity are collected in a 1.5mL centrifuge tube, and the positive gene is recovered by PCR. The target fragment is treated with BamHI and HindIII double enzymes and reconstructed into the pQE-80L linearized vector. The ligated vector is transformed into the BL21 (DE3) competent state again, and the single clone grown on the plate is picked for rescreening.
  • the crude enzyme solution was transferred to a 384-well plate and the enzyme activity was measured using the STAR system (iSpinach aptamer-based monitoring of transcription activity in real-time, STAR, see application number: CN202211274914.8, the invention name is "A detection method for real-time monitoring of in vitro transcribed RNA synthesis", the introduction and use of the system are detailed in CN115896213A text). Then the mutants whose residual activity after heat treatment is at least 1.5 times that of the wild type are sequenced, and the distribution of mutation points is analyzed by sequencing results.
  • STAR system iSpinach aptamer-based monitoring of transcription activity in real-time, STAR, see application number: CN202211274914.8, the invention name is "A detection method for real-time monitoring of in vitro transcribed RNA synthesis", the introduction and use of the system are detailed in CN115896213A text.
  • a recombinant expression vector carrying a nucleotide sequence encoding a T7 RNAP mutant was constructed and transformed into BL21 (DE3) Escherichia coli competent cells to obtain recombinant strains of T7 RNP wild type and mutant, induced expression and prepared T7 RNAP mutants by protein purification. Then, a single-stranded RNA aptamer was synthesized by in vitro transcription reaction. The single-stranded RNA can bind to a specific chemical small molecule DFHBI to produce a significant fluorescence signal enhancement. The relative activity of different mutants and T7 RNAP-WT was determined by monitoring the change of real-time fluorescence during the reaction.
  • Tm values of different mutants were determined by differential scanning fluorimetry by combining the T7 RNAP mutant protein solution with Sypro Orange dye.
  • the application performance of T7 RNAP mutants is achieved by analyzing the products synthesized by in vitro transcription (IVT), including IVT yield, product purity, dsRNA content in the product and 3' end consistency index of the product.
  • IVT in vitro transcription
  • the present invention also provides a T7 RNA polymerase mutant screened out by the above screening process.
  • the T7 RNA polymerase mutant described herein is a polypeptide that has at least one amino acid residue difference compared to the wild-type T7 RNA polymerase.
  • the amino acid sequence of the wild-type T7 RNA polymerase is shown in SEQ ID NO.1.
  • the T7 RNA polymerase mutant provided by the present invention has at least one of the following amino acid residues: corresponding to the amino acid sequence shown in SEQ ID NO.1, position 217 is L (leucine), position 397 is A (alanine) or W (tryptophan), and position 579 is A (alanine) or W (tryptophan).
  • the T7 RNA polymerase mutant containing the aforementioned amino acid residues has good thermal stability.
  • the amino acid sequence corresponding to SEQ ID NO.1 as described herein refers to the amino acid residue determined by the position number of the position in the amino acid sequence shown in SEQ ID NO.1 when the T7 RNA polymerase mutant is aligned with the amino acid sequence shown in SEQ ID NO.1.
  • the position number of a certain amino acid residue in the amino acid sequence shown in SEQ ID NO.1 of the T7 RNA polymerase mutant corresponding to the amino acid sequence shown in SEQ ID NO.1 is different from the position number of the position in the T7 RNA polymerase mutant.
  • an amino acid residue is located at the i-th position in the amino acid sequence shown in SEQ ID NO.1 and is located at the i-N position in the amino acid sequence of the T7 RNA polymerase mutant.
  • the amino acid residue is defined by its position in the amino acid sequence corresponding to SEQ ID NO.1, rather than its position in the mutant.
  • the T7 RNA polymerase mutant provided by the present invention has at least 80% identity with SEQ ID NO.1, for example, it may be but is not limited to at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • Identity or similar terms herein refer to the percentage of identical nucleotides or amino acids between the sequences to be compared under optimal alignment. The percentage is purely statistical, and the differences between the two sequences may be (but not necessarily) randomly distributed over the entire length of the sequences to be compared. The comparison of two sequences is typically performed by identifying local regions of corresponding sequences relative to fragments or "comparison windows" after optimal alignment.
  • a T7 RNA polymerase mutant having at least 80% identity with SEQ ID NO.1 contains at least one of the amino acid residues at position 217 being L, at position 397 being A or W, and at position 579 being A or W.
  • the T7 RNA polymerase mutant contains an amino acid residue corresponding to L at position 217 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the T7 RNA polymerase mutant contains an amino acid residue that is A or W at position 397 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1, wherein preferably position 397 is W.
  • the T7 RNA polymerase mutant contains an amino acid residue that is A or W at position 579 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1, wherein preferably position 579 is W.
  • the T7 RNA polymerase mutant removes at least one of the aforementioned amino acid residues and also includes at least one of the following: corresponding to the amino acid sequence shown in SEQ ID NO.1, at least one of the positions 369, 430, 633, 786, 744, 723 and 724 is different from SEQ ID NO.1.
  • the 633rd position corresponding to the amino acid sequence shown in SEQ ID NO.1 is a non-polar hydrophobic amino acid, further preferably P (proline), L (leucine), V (valine), M (methionine) or F (phenylalanine); further preferably P, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the mutation at position 633 is beneficial to improve the stability of T7 RNA polymerase.
  • the 744th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is R (arginine), P (proline) or L (leucine); more preferably R; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the mutation at position 744 is beneficial to the improvement of T7 RNA polymerase activity.
  • the 786th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is Y (tyrosine) or a non-polar hydrophobic amino acid; more preferably Y, L (leucine), M (methionine), W (tryptophan) or F (phenylalanine); more preferably L; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the mutation at position 786 is beneficial to improve the stability of T7 RNA polymerase.
  • the 723rd position corresponding to the amino acid sequence shown in SEQ ID NO.1 is E (glutamic acid) or S (serine); more preferably E; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the mutation at position 723 is beneficial to improve protein aggregation.
  • the 369th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is T (threonine) or E (glutamate); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the 579th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is W (tryptophan) or A (alanine); further preferably W; and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the 430th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is P (proline); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the 724th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is N (asparagine); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.
  • the T7 RNA polymerase mutant is selected from any one of the following:
  • the amino acid sequence of the T7 RNA polymerase mutant is SEQ ID NO.2 (corresponding to I217L), SEQ ID NO.3 (corresponding to S397W), SEQ ID NO.4 (corresponding to T121M), SEQ ID NO.5 (corresponding to S223M), SEQ ID NO.6 (corresponding to R632G), SEQ ID NO.7 (corresponding to I217L/S397W/S430P/S633P/Q786L) or SEQ ID NO.8 (corresponding to I217L/S397W/S430P/S633P/Q786L /Q744R/S43Y), SEQ ID NO.79 (corresponding to M369T), SEQ ID NO.80 (corresponding to N579W), SEQ ID NO.81 (corresponding to I217L/S397W/S430P/S633P/Q786L/N579W), SEQ ID NO.82 (corresponding to I217L/S397W/S430P/
  • SEQ ID NO.2 is a T7 RNA polymerase mutant with a single point mutation I217L, whose Tm value is 50.5 ⁇ 0.21°C. The residual activity after treatment at 50°C for 5 minutes is 52%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 134%.
  • SEQ ID NO.3 is a T7 RNA polymerase mutant with a single point mutation S397W, whose Tm value is 49.2 ⁇ 0.13°C. The residual activity after treatment at 50°C for 5 minutes is 47%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 162%.
  • SEQ ID NO.80 is a T7 RNA polymerase mutant with a single point mutation N579W, whose Tm value is 48.0 ⁇ 0.23°C and the residual activity after treatment at 50°C for 5 minutes is 38%.
  • SEQ ID NO.7 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P and Q786L, with a Tm value of 54.52 ⁇ 0.20°C and a residual activity of 58% after heat treatment at 58°C for 15min.
  • SEQ ID NO.81 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L and N579W, with a Tm value of 55.56 ⁇ 0.16°C and a residual activity of 65% after heat treatment at 58°C for 15 minutes.
  • SEQ ID NO.82 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L and Q744R.
  • the Tm value is 54.50 ⁇ 0.18°C.
  • the residual activity after heat treatment at 58°C for 15min is 87%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 195%.
  • SEQ ID NO.83 is a polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L, C723E and A724N.
  • the Tm value is 54.54 ⁇ 0.23°C.
  • the residual activity after heat treatment at 58°C for 15min is 81%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 183%.
  • the present invention also provides a polynucleotide encoding the above-mentioned T7 RNA polymerase mutant.
  • polynucleotide refers to a polymeric form of nucleotides of any length, including ribonucleotides and/or deoxyribonucleotides.
  • examples of polynucleotides include, but are not limited to, single-stranded, double-stranded, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or polymers containing purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural or derived nucleotide bases.
  • the polynucleotide encodes the T7 RNA polymerase mutant described above, optionally encoding the sense strand or the antisense strand.
  • the polynucleotide may be naturally occurring, synthetic, recombinant, or any combination thereof.
  • the terms "polynucleotide” and “nucleic acid” are used interchangeably herein.
  • the present invention also provides a vector, which carries a polynucleotide encoding the above-mentioned T7 RNA polymerase mutant.
  • the vector is well known to those skilled in the art, including but not limited to: plasmid; phagemid; cosmid; artificial chromosome, such as yeast artificial chromosome (YAC), bacterial artificial chromosome (BAC) or P1-derived artificial chromosome (PAC); bacteriophage such as ⁇ phage or M13 phage and animal virus, etc.
  • Animal viruses that can be used as vectors include but are not limited to retroviruses (including lentiviruses), adenoviruses, adeno-associated viruses, herpes viruses (such as herpes simplex virus), poxviruses, baculoviruses, papillomaviruses, and papillomaviruses.
  • the vector of the present invention contains regulatory elements commonly used in genetic engineering, such as enhancers, promoters, internal ribosome entry sites (IRES) and other expression control elements (such as transcription termination signals, or polyadenylation signals and poly-U sequences, etc.).
  • the present invention further provides a cell, which carries the aforementioned polynucleotide or the aforementioned vector, or expresses the aforementioned T7 RNA polymerase mutant.
  • the cell may refer to a single cell, a cell line, or a cell culture.
  • the cell herein includes its progeny, which may not necessarily be completely identical to the primary cell due to natural, accidental or intentional mutations, and may differ from the primary cell in morphology and/or in genomic DNA.
  • the cell may be a natural cell or a transformant.
  • the present invention also provides a method for synthesizing a polynucleotide, wherein the method uses the RNA polymerase mutant of the present invention.
  • the method for synthesizing the polynucleotide comprises using an enzyme system comprising the aforementioned T7 RNA polymerase mutant to catalyze a synthesis reaction.
  • the synthesis method comprises in vitro transcription synthesis (In Vitro Transcription, IVT), isothermal amplification, such as sequence-specific nucleic acid in vitro amplification (Nucleic Acid Specific-Based Amplification, NASBA) or transcription-mediated amplification (Transcription mediated amplification, TMA)
  • the present invention also provides the use of the aforementioned T7 RNA polymerase mutant or the aforementioned polynucleotide synthesis method in any one of the following: at least one of mRNA vaccine preparation, nucleic acid drug preparation, gene editing and protein expression system construction, and the protein expression system construction includes but is not limited to in vivo protein expression system construction or cell-free protein expression in vitro translation system construction.
  • the present invention also provides a kit for polynucleotide synthesis, which comprises the aforementioned T7 RNA polymerase mutant.
  • kit for polynucleotide synthesis provided by the present invention may also comprise other detection reagents or consumables acceptable in the art, and those skilled in the art may select corresponding reagents and/or consumables based on the contents well known in the art and recorded in various general and more specific teaching materials, references, process manuals, product descriptions, standard documents and equipment manuals.
  • kits include, but are not limited to, one or more of buffers, salts, metal ions, dNTPs, enzymes, primers, probes, fluorescent dyes, luminescent substrates, reference substances, quality control substances and calibrators.
  • the wild-type T7 RNA polymerase gene nucleotide sequence (corresponding amino acid sequence such as SEQ ID NO.1) was optimized according to the Escherichia coli codon to obtain the nucleotide sequence SEQ ID NO.15, and then the optimized nucleotide sequence was inserted between the two restriction endonuclease sites of BamHI and HindIII of the vector pQE-80L (Qiagen, item number HG-VYQ0254), that is, the recombinant vector pQE-80L-T7 RNAP containing the wild-type T7 RNAP gene was obtained.
  • the plasmid structure is shown in Figure 1.
  • the recombinant vector pQE-80L-T7 RNAP was used as a template to amplify the T7 RNAP gene, and the degree of mutation was mainly achieved by changing the Mn 2+ concentration in the reaction system. 0.1mM, 0.2mM, 0.3mM, 0.4mM, 0.5mM, 0.6mM Mn 2+ concentration gradients were set for gene amplification, and the mutation rate was optimal with 1 to 3 mutation points per gene.
  • the amplification system was as follows: pQE-80L-T7 RNAP plasmid (0.2-1 ng/ ⁇ L), dATP (0.25 mM), dGTP (0.25 mM), dCTP (1.05 mM), dTTP (1.05 mM), DreamTaq Buffer (Takara), DreamTaq (Takara, 0.1 U/ ⁇ L), MnCl 2 (0.1 mM-0.6 mM), T7 RNAP-F (0.2 ⁇ M), T7 RNAP-R (0.2 ⁇ M), and the total PCR reaction system was 50 ⁇ L.
  • the target fragment size was analyzed by agarose gel electrophoresis, recovered by the kit, and then double-digested with BamHI and HindIII.
  • the digested product was then purified and recovered, and ligated with the linearized pQE-80L vector (double-digested with BamHI and HindIII) overnight under the action of T4 DNA ligase.
  • the recovered ligation product was electrotransformed into electrocompetent E. coli 10G, and the library capacity was calculated based on the clones grown, and the library capacity was more than 10 6.
  • the frequency of introduction under 0.2 mM Mn 2+ was 1.7 base mutations/gene, and the frequency of introduction under 0.3 mM Mn 2+ was 3.5 base mutations/gene. Therefore, in this example, the mutant plasmids obtained under 0.2 mM Mn 2+ and 0.3 mM Mn 2+ were mixed to obtain a mutant library in which an average of 2.6 mutant bases were introduced into each gene.
  • the obtained mixed plasmid mutant library was electroporated into fresh E. coli BL21 (DE3) competent cells to obtain a clone mutant library containing about 10 7.
  • the whole screening process is shown in Figure 2.
  • E. coli was induced and expressed, and the single cell and reaction system were encapsulated using the FADS platform (droplet microfluidic cell sorting based on microfluidic ultra-high throughput fluorescence-activated droplet sorting platform, Fluorescence-Activated Droplet Sorting, FADS).
  • the reaction system was published in patent CN 115896213 A, including single clones, 40-100nM DNA template ispinach, 100 ⁇ M DFHBI, 0.5-6mM NTPs, 30mM MgCl 2 , 5Mm NaCl, 20mM DTT, 0.002U/ ⁇ L inorganic pyrophosphatase, and 0.2U/ ⁇ L mouse ribonuclease inhibitor.
  • the single-cell encapsulation rate is controlled at about 10% to improve the accuracy of screening.
  • the droplets need to be heat-treated at 45°C for 7 minutes, and the activity will drop to about one-third of the original to achieve the effect of high-temperature treatment, and then incubated at 37°C for 3 hours.
  • the optimal laser spot position, sorting speed and other conditions are determined by sorting FITC and ink. The incubated microdroplets are then re-injected into the detection and sorting chip.
  • the PMT was set to 0.43V
  • the small and large thresholds of sorting were set to 0.2V and 0.6V respectively
  • the sorting voltage was 1.6V.
  • the droplets of about 0.42% showing the highest activity were collected in a 1.5mL centrifuge tube, and the positive genes were recovered by PCR.
  • the target fragments were double-digested with SacI and HindIII, reconstructed into the pQE-80L linearized vector, and the ligated vector was transformed into BL21 (DE3) competent cells again, and the single clones grown on the plate were picked for rescreening.
  • the enzyme activity was measured using the STAR system (iSpinach aptamer-based monitoring of transcription activity in real-time, STAR, see application number: CN202211274914.8, the invention name is "A detection method for real-time monitoring of in vitro transcribed synthetic RNA", the introduction and use of the system are detailed in CN115896213A text).
  • Mutants whose residual activity after heat treatment is at least 1.5 times that of the wild type were sequenced. Analysis of the sequencing results revealed that the frequencies of mutations at positions 217, 397, 121, 223, 632, 369 and 579 corresponding to the parental RNA polymerase (i.e., wild-type T7 RNAP, as shown in the amino acid sequence SEQ ID NO.1) were relatively high.
  • the wild-type T7 RNAP shown in the amino acid sequence SEQ ID NO.1 was determined as the parent, and the 217th amino acid, the 397th amino acid, the 121st amino acid, the 223rd amino acid and the 632nd amino acid of the parent amino acid sequence were used as key sites to design mutants and combination mutants with amino acid sequences as shown in SEQ ID NO.2 to 8; mutants S430P, S633P, Q786L, Q744R, S43Y and mutant combinations reported in the prior art were designed as controls, and the amino acid sequences were as shown in SEQ ID NO.9 to 14, namely the following mutants:
  • S430P (US7507567B2), S633P (US7507567B2) and Q786L (CN102177236B) are mutation points reported in the prior art to improve the thermal stability of T7 RNAP
  • Q744R (US20150024435A1) is a mutation point reported in the prior art to improve the activity of T7 RNAP
  • S43Y (CN112831484B) is a mutation point reported in the prior art to reduce the RdRp activity of T7 RNAP
  • G47A+884G refers to the addition of an additional amino acid G at the C-terminus of the mutant amino acid sequence on the basis of G47A (total length 883), so that the total length of the mutant is 884.
  • This mutation method (CN111212905A) can reduce dsRNA contaminants and concatenated transcripts generated during in vitro transcription reactions.
  • the specific steps for constructing the mutant vector are as follows: reverse transcribe the above amino acid sequences (SEQ ID NO.1-14, SEQ ID NO.79-83) and optimize them according to the Escherichia coli codon to obtain nucleotide sequences (SEQ ID NO.15-28, SEQ ID NO.84-88); then insert the nucleotide sequences between the two restriction endonuclease sites of BamHI and HindIII of the vector pQE-80L to form the recombinant expression vectors of wild-type T7 RNAP and mutant T7 RNAP.
  • T7 RNAP-WT for expressing wild type T7 RNAP
  • T7 RNAP-I217L for expressing single point mutant I217L
  • T7 RNAP-S397W for expressing single point mutant S397W
  • 7 RNAP-T121M for expressing mutant T121M
  • T7 RNAP-S223M for expressing mutant S223M
  • T7 RNAP-R632G for expressing mutant R632G
  • T7 RNAP-I217L/S397W/S430P/S633P/Q786L for expressing combined mutant M7: I217L/S397W/S430P/S633P/Q786L
  • the wild-type and mutant recombinant strains prepared in the previous step were placed in LB medium containing ampicillin resistance and cultured in a shaking incubator at 37°C until the OD600 value was close to 1.2. Then, isopropyl- ⁇ -D-thiogalactopyranoside (IPTG) with a final concentration of 1 mM was added to induce expression in a shaking incubator at 25°C overnight (16 hours). The bacterial precipitate was then collected by centrifugation at 4°C and 5000 rpm for 20 min.
  • IPTG isopropyl- ⁇ -D-thiogalactopyranoside
  • the bacteria were fully resuspended in a lysis buffer containing 300 mM NaCl, 20 mM Tris HCl (pH 7.5), 0.5 mg/mL lysozyme, and 0.5 mM DTT, and the protein lysate was centrifuged in a high-speed refrigerated centrifuge at 4°C and 18000 rpm for 1 h. The supernatant was then separated and filtered to remove impurities. The filtered supernatant can be temporarily stored on ice for subsequent nickel column purification.
  • the nickel column Before the protein passes through the nickel column, it is necessary to equilibrate the nickel column with 10 times the volume of elution buffer (20mM Tris HCl (PH7.5), 300mM NaCl, 0.5mM DTT), and then add the previously filtered protein solution to the nickel column. After all the protein solution has passed through the nickel column filler, first use a certain concentration of Triton X-ray to wash the impurities, and then use different gradients of imidazole solution (50mM 100mM 150mM-200mM-500mM) to elute, and use several test tubes to number them in order of outflow to collect the outflow.
  • elution buffer 20mM Tris HCl (PH7.5), 300mM NaCl, 0.5mM DTT
  • the enzyme storage buffer consisted of 50mM Tris-HCl, 100mM NaCl, 20mM 2-Mercaptoethanol, 1mM EDTA, 50% Glycerol, 0.1% Triton X-100, pH 7.9.
  • the wild-type T7 RNA and the mutant T7 RNA prepared in Example 2 were taken and their enzyme activities were tested.
  • the reaction system for real-time fluorescence detection of the relative enzyme activity of T7 RNAP mutants is shown in Table 3.
  • Table 3 For specific experimental methods, please refer to the Chinese patent application with application number: CN202211274914.8, and the invention name is "A detection method for real-time monitoring of RNA synthesized by in vitro transcription", in which the added protein concentration and volume of the T7 RNAP mutant are consistent with the wild type with known enzyme activity and protein concentration.
  • the reaction is carried out in a 96-well plate, and the fluorescence value generated by the binding of the aptamer produced by the reaction and DFHBI is measured in real time by an enzyme reader.
  • the reaction system is determined according to the experimental needs.
  • the reaction conditions are 37°C.
  • the enzyme reader When the enzyme reader is used for real-time measurement, it takes 1 hour to measure a value every 1 minute. After the reaction is completed, the Origin software is used to draw a curve graph, and the standard curve of the protein concentration and fluorescence intensity of the wild type is calculated at the time with the best linear relationship, as shown in Figure 3, so as to calculate the relative enzyme activity of the mutant. All reagents in the reaction system need to be prepared with Nuclease-free H 2 O, and the consumables used are all RNase-free grade to prevent the generated RNA from being degraded by RNase.
  • the relative enzyme activity detection of mutants is shown in Table 4-1, Table 4-2, and Table 4-3.
  • the enzyme activities of single point mutant I217L, single point mutant N579W, single point mutant S397W, single point mutant T121M, single point mutant S223M, and single point mutant R632G at 37°C are significantly improved compared with T7 RNAP-WT, and are significantly better than the single point mutants S430P, G47A+884G, single point mutant S633P, and single point mutant Q786L reported in the prior art.
  • the enzyme activity of the combined mutant M8 is significantly higher than that of WT and the single point mutants S430P, S633P, Q744R, Q786L, S43Y, and G47A+884G mutants reported in the prior art.
  • the relative activities of the combined mutants M7 and I217L/S397W/S430P/S633P/Q786L/N579W at 37°C were not different from those of T7 RNAP-WT.
  • the residual activity of the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N was significantly higher than that of T7 RNAP-WT, and was significantly higher than the residual activity of T7-RNAP-WT, single point mutant S430P, single point mutant S633P and single point mutant Q786L after treatment at 50°C for 5 min.
  • the residual relative activity after heat treatment indicated that the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R and the combined mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N had higher thermal stability than the existing technology.
  • T7 RNAP protein solution 0.2 mg/mL
  • 2 ⁇ L Sypro Orange dye 100 ⁇
  • PBS 0.1 mg/mL
  • the Tm of a protein is an indicator of thermal stability. Generally speaking, the higher the Tm, the more stable the protein.
  • the in vitro transcription (IVT) reaction system is shown in Table 6.
  • the reaction system can be scaled up in equal proportions. After the system is configured, it is placed at 37°C and 50°C for 2 hours, respectively. Then, 10U DNaseI is added to each reaction according to 1 ⁇ g template and reacted at 37°C for 30 minutes to digest the DNA template. The IVT product is then recovered by LiCl precipitation, the RNA concentration is measured, and the mRNA yield ( ⁇ g) corresponding to 1 ⁇ g template DNA is calculated.
  • the mRNA yield ( ⁇ g) corresponding to 1 ⁇ g template DNA is shown in Table 7-1.
  • the IVT yield of the combined mutant M8 of the present invention is significantly improved compared with the wild-type WT and the single-point mutants S430P, S430P, Q786L, and G47A+884G reported in the prior art, and is comparable to the IVT yields of the mutants Q744R and S43Y reported in the prior art, and is also somewhat improved compared to the combined mutant M7 IVT yield; at the same time, under 50°C reaction conditions, the WT and Q744R reported in the prior art are significantly improved compared to the combined mutant M7.
  • the mutants 744R, S43Y, and G74A+884G produced almost no IVT products, indicating that WT, Q744R, S43Y, and G47A+884G could not tolerate the reaction at 50°C; the IVT yield of the combined mutant I217L/S397W/S430P/S633P/Q786L/N579W at 50°C increased by 60% compared with the IVT yield of WT at 37°C; the combined mutant I217L/S397W/S430P/S633P/Q786L/N579W at 50°C increased by 60% compared with the IVT yield of WT at 37°C.
  • the IVT yield of L/Q744R at 37°C or 50°C was significantly increased compared with WT and single-point mutants I217L, S397W, S430P, S633P, and Q786L.
  • the IVT yield of the combined mutant I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37°C was increased by about 20% compared with WT and single-point mutants I217L, S430P, S633P, and Q786L.
  • the IVT yield at 50°C was increased by about 120% compared with the IVT yield of WT and single point mutants I217L, S430P, S633P, and Q786L at 37°C; the IVT yield of the combined mutant M8 at 50°C was significantly higher than that of WT and mutants reported in the prior art, indicating that the combined mutants M8 and M7 have better IVT application performance, can produce higher yields of in vitro transcription products, can tolerate 50°C reactions, and have good thermal stability.
  • the product purities of the single mutants S430P, S633P, Q786L, I217L, S397W, the combined mutants M7, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, and I217L/S397W/S430P/S633P/Q786L/C723E/A724N are shown in Table 7-2.
  • the purity of the IVT product of N579W at 37°C was significantly improved compared with WT and single point mutants I217L, S397W, S430P, S633P, and Q786L, and the product purity improvement rate was all above 10%; the purity of the product of the combination mutants I217L/S397W/S430P/S633P/Q786L/Q744R and the combination mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37°C was significantly improved compared with WT and single point mutants I217L, S397W, S430P, S633P, and Q786L, and the product purity improvement rate was all above 5%.
  • the purity of the IVT product of P/Q786L/C723E/A724N at 50°C is within 5% relative to the purity of the IVT product of WT and single point mutants I217L, S397W, S430P, S633P, Q786L at 37°C and single point mutants I217L, S397W, S430P, S633P, Q786L at 50°C, and it is considered that there is no significant difference. This shows that the purity of the IVT product of the T7 RNAP
  • the wild type WT, single mutants Q744R, S43Y, G47+884G, T121M, S223M, R632G, S430P, S633P, Q786L, I217L, S397W, combined mutants M7 and M8 were tested using our company's self-developed dsRNA content ELISA quantitative detection kit.
  • the purified IVT product RNA samples obtained in 3 were tested for dsRNA content.
  • the specific operation method is as follows: (1) The dsRNA standard was diluted with STE Buffer to 8 concentrations of 2pg/ ⁇ L, 1pg/ ⁇ L, 0.5pg/ ⁇ L, 0.25pg/ ⁇ L, 0.125pg/ ⁇ L, 0.0625pg/ ⁇ L, 0.0312pg/ ⁇ L, and 0pg/ ⁇ L. The purified RNA samples to be tested were diluted to 3 different concentrations respectively.
  • FIG4A The experimental results are shown in FIG4A , wherein the dsRNA content of the combined mutant M8 of the present invention in the unit mass IVT product of the conventional 37° C. reaction of the wild-type WT, Q744R, S43Y and G47A+884G is significantly lower than that of the wild-type WT and the single point mutants Q744R, S43Y, S430P, S633P, Q786L and M7 combined mutants reported in the prior art; in the unit mass IVT product of the in vitro transcription reaction at 50° C., the combined mutant M The dsRNA content of 8 is significantly lower than that produced by the reaction at 37°C.
  • the dsRNA content of the IVT product per unit mass under the reaction conditions of 50°C is significantly reduced compared to the single point mutants S430P, S633P, and Q786L reported in the prior art, and the dsRNA content of its 50°C in vitro transcription product can reach the same level as the G47A+884G mutant reported in CN111212905A that can reduce dsRNA contaminants and concatenated transcripts in the in vitro transcription product.
  • the dsRNA content level of the combined mutant M8 of the present invention per unit mass of the IVT product is lower than that of the combined mutant M7.
  • Wild type WT single mutants S430P, S633P, Q786L, I217L, S397W, combined mutants I217L/S397W/S430P/S633P/Q786L, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, I217L/S397W/S430P/S633P/Q786L
  • the dsRNA content in the IVT product of /C723E/A724N was detected by dot blot.
  • the IVT RNA sample recovered by LiCl precipitation obtained in 3 was added dropwise to the positively charged Hybond N membrane (GE Amersham), and then the membrane was blocked in 5% (w/v) skim milk powder TBS-T buffer (20mM Tris, pH 7.4, 150mM NaCl, 0.1% v/v Tween-20). Then J2 anti-dsRNA antibody was diluted 1:5000; incubated overnight at 4°C. The blot was detected with HRP-conjugated secondary antibody (Cell Signaling Technologies). A dsRNA standard with a known concentration was used as a positive control.
  • the 3' end consistency detection of IVT products is achieved through RNase T1 digestion experiment.
  • RNase T1 can specifically degrade single-stranded RNA at the G residue, which forms the corresponding intermediate 2', 3'-cyclic phosphate (-mP) salt, cleaving the phosphodiester bond between the 3'-guanosine residue and the 5'-OH residue of the adjacent nucleotide; the reaction products are 3'-GMP and oligonucleotides containing terminal 3'-GMP. Then, the molecular weight of the 3' end product is analyzed by LC-MS and compared with the theoretical molecular weight.
  • -mP 2', 3'-cyclic phosphate
  • the 3' end of the normal mRNA product is a G base ending with -OH
  • the 3' end G base of the heterogeneous product ends with -mP
  • the molecular weight of the heterogeneous product is 80 more than that of the normal product.
  • the 3' end consistency (%) is calculated by analyzing the proportion of the theoretical molecular weight product.
  • Figures 5A to 5J The experimental results are shown in Figures 5A to 5J.
  • the molecular weight of the pure product with 3' end consistency is 9663.1, and the molecular weight of the 3' end heterogeneous conjugation product is 9743.1.
  • Figures 5A to 5J are the results of LC-MS analysis of 3' end products of IVT products of wild type WT (37°C), Q774R (37°C), S43Y (37°C), G47A+884G (37°C), M7 (37°C), M7 (50°C), M8 (37°C), M8 (50°C), I217L (50°C), and S397W (50°C).
  • Table 8 The detailed data analysis is shown in Table 8.
  • the immune response of cells to dsRNA byproducts may lead to adverse consequences.
  • the purified IVT product RNA sample was used for cell response detection.
  • the specific operation method is as follows: (1) Thaw the cryotube containing the Raw cell suspension and mix it evenly, and add the cell suspension to a culture flask or culture dish containing culture medium for overnight culture. When the RAW cell density reaches 80%-90%, subculture is performed. (2) Inoculate cells: Prepare IVT product-lipid complexes and transfect cells with cationic liposome transfection reagent Lipofectamine 3000.
  • IFN- ⁇ and IL-6 are cytokines produced by the body's immune cells. They are immune response factors produced by immune cells through immune response when the body is infected. The levels of IFN- ⁇ and IL-6 detected in this study reflect the immunogenicity of the T7 RNA polymerase IVT products.
  • T7 RNAP mutants were prepared, and their activity changes were detected (the activity changes were detected using the relative activity detection method of Example 3, and the Tm values of each mutant were determined using the differential scanning fluorimetry of Example 3.
  • the Tm value reflects the stability of each mutant protein. The higher the Tm value, the more stable the protein).
  • T7 RNAP mutants with mutation sites Q786M, Q786F, I217A, I217V, S397P, N579A, and M369E have higher thermal stability than T7 RNAP-WT.
  • I121L, I121S, S223L, S223V, and R632E mutants have no obvious advantage in thermal stability over wild-type T7 RNAP, they have higher enzyme activity. These mutants will have broad application prospects in in vitro transcription, mRNA vaccine and drug research and production, gene editing, isothermal amplification, in vivo protein expression, or cell-free protein expression in vitro translation systems.
  • RNA polymerases from different sources with an amino acid identity of more than 80% were compared, and the results are shown in Figures 7A to 7E. It was found that some sequences in the sequence are highly conserved, including positions 217, 223, 397, 579, and 632. When the amino acid substitutions at positions 217, 223, 397, 579, and 632 described in the present application are transferred to the corresponding positions in other closely related phage RNA polymerases (SEQ ID NO. 29 to SEQ ID NO. 62), they are expected to have the same effect.
  • the closely related bacteriophage RNA polymerase amino acid sequences SEQ ID NO. 29 to SEQ ID NO.
  • the amino acid substitutions at positions 121 and 369 are expected to have the same effect when transferred to the corresponding positions in other closely related phage RNA polymerases.
  • the amino acid sequences of the closely related phage RNA polymerases at position 121 (SEQ ID NO.29 to SEQ ID NO.39, SEQ ID NO.41 to SEQ ID NO.42, SEQ ID NO.44 to SEQ ID NO.45, SEQ ID NO.47, SEQ ID NO.49 to SEQ ID NO.53) have 97.96%, 98.07%, 98.64%, 98.07%, 98.66%, 98.67%, 98.68%, 98.69%, 98.70%, 98.71%, 98.72%, 98.73%, 98.74%, 98.76%, 98.77%, 98.78%, 98.80%, 98.81%, 98.82%, 98.83%, 98.84%, 98.86
  • the closely related bacteriophage RNA polymerase amino acid sequence SEQ ID NO.44 at position 369 has a sequence identity of 98.41% with the wild-type amino acid sequence of T7 bacteriophage RNA polymerase shown in SEQ ID NO.1 in Examples 1-4.
  • amino acid sequence of the wild-type parent T7 RNA polymerase of the present invention (SEQ ID NO.1):

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Zoology (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Mycology (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

Provided are a series of RNA polymerase mutants with improved performance. The mutants, compared with an amino acid sequence of a parental RNA polymerase, have one or more amino acid mutations at positions 121, 223, 632, 217, 397, 369, and 579 or have one or more amino acid mutations that occur at amino acid residues at equivalent positions in the parental RNA polymerase. The mutants are at least improved in terms of any of the following: enzymatic activity, thermal stability, 3' terminal consistency, in-vitro transcription yield, dsRNA byproducts, immunogenicity, and the like, and can meet various requirements of practical application and research.

Description

一种性能提升的RNA聚合酶突变体A RNA polymerase mutant with improved performance 技术领域Technical Field

本发明涉及一种性能提升的RNA聚合酶突变体,属于基因工程及酶工程技术领域。The invention relates to an RNA polymerase mutant with improved performance, belonging to the technical field of genetic engineering and enzyme engineering.

背景技术Background Art

T7 RNA聚合酶(T7 RNA polymerase,T7 RNAP)来源于大肠杆菌噬菌体T7,是一种高度特异识别T7启动子序列的DNA依赖的单亚基RNA聚合酶。该酶在体外RNA合成方面具有广泛应用,包括合成gRNA、siRNA以及同位素标记或非同位素标记的RNA探针,在Cap analog参与下合成Capped mRNA,用于细胞内或体内表达以及mRNA疫苗和药物研究。此外,它还可用于等温扩增,如序列特异性核酸体外扩增(NASBA)以及转录介导的扩增(TMA)等。T7 RNA polymerase (T7 RNAP) is derived from Escherichia coli bacteriophage T7 and is a DNA-dependent single-subunit RNA polymerase that highly specifically recognizes the T7 promoter sequence. The enzyme has a wide range of applications in in vitro RNA synthesis, including the synthesis of gRNA, siRNA, and isotope-labeled or non-isotope-labeled RNA probes, and the synthesis of capped mRNA with the participation of Cap analog for intracellular or in vivo expression and mRNA vaccine and drug research. In addition, it can also be used for isothermal amplification, such as sequence-specific nucleic acid in vitro amplification (NASBA) and transcription-mediated amplification (TMA).

但野生型的T7 RNAP在实际应用中存在一些不足,如合成RNA过程中可能产生副产物,包括转录起始过程中产生的短片段、3'末端延伸产物和dsRNA副产物,这些副产物会影响mRNA疫苗和药物研究的效果。随着mRNA药物及分子诊断技术的发展,目前市场上以野生型为主的T7 RNA聚合酶无法完全满足日益增长的研究需求,需要进行优化改进以实现更好的应用价值。However, wild-type T7 RNAP has some shortcomings in practical applications, such as the generation of byproducts during RNA synthesis, including short fragments generated during transcription initiation, 3' end extension products, and dsRNA byproducts, which can affect the effects of mRNA vaccine and drug research. With the development of mRNA drugs and molecular diagnostic technology, the wild-type T7 RNA polymerase currently on the market cannot fully meet the growing research needs and needs to be optimized and improved to achieve better application value.

目前已有一些技术对T7 RNAP进行了改进,如提高酶活性的突变体、减少dsRNA污染物的突变体以及提高热稳定性的突变体。例如,US20150024435 A1提供了酶活性提高的T7 RNA聚合酶的突变体,CN 111212905 A提供了减少在体外转录反应期间产生的dsRNA污染物和连缀转录物的T7 RNAP突变体;以及CN 112831484 B提供了一种适用于内部含有终止信号的RNA以及末端形成发夹结构的RNA的合成,相比于T7 RNAP-WT能够合成纯度更高的Cas9 sgRNA;CN 108779446 A和US 9193959 B2提供了热稳定性提高的T7 RNAP突变体。然而,现有技术中这些突变体往往性能上有待提升,例如性能提升不够明显、热稳定性和活性的平衡问题或者副产物的未能有效减少。At present, some technologies have been used to improve T7 RNAP, such as mutants with improved enzyme activity, mutants with reduced dsRNA contaminants, and mutants with improved thermal stability. For example, US20150024435 A1 provides a mutant of T7 RNA polymerase with improved enzyme activity, CN 111212905 A provides a T7 RNAP mutant that reduces dsRNA contaminants and concatenated transcripts produced during in vitro transcription reactions; and CN 112831484 B provides a method suitable for the synthesis of RNA containing internal termination signals and RNA with hairpin structures at the ends, which can synthesize Cas9 sgRNA with higher purity than T7 RNAP-WT; CN 108779446 A and US 9193959 B2 provide T7 RNAP mutants with improved thermal stability. However, these mutants in the prior art often need to be improved in performance, such as insufficient performance improvement, balance between thermal stability and activity, or failure to effectively reduce byproducts.

因此,本发明旨在开发性能提升的T7 RNAP突变体,该突变体至少具有如下任意性能的提升:酶活性、热稳定性、3'末端一致性、体外转录产量、dsRNA副产物和免疫原性等,以满足实际应用和研究的多种需求。Therefore, the present invention aims to develop a T7 RNAP mutant with improved performance, which has at least any of the following performance improvements: enzyme activity, thermal stability, 3' end consistency, in vitro transcription yield, dsRNA byproducts and immunogenicity, etc., so as to meet the various needs of practical applications and research.

发明内容Summary of the invention

为解决上述问题,本发明提供了一系列性能提升的RNA聚合酶突变体,该突变体至少具有如下任意性能的提升:酶活性、热稳定性、3'末端一致性、体外转录产量、dsRNA副产物和免疫原性等,能够满足实际应用和研究的多种需求。To solve the above problems, the present invention provides a series of RNA polymerase mutants with improved performance, which have at least any of the following performance improvements: enzyme activity, thermal stability, 3' end consistency, in vitro transcription yield, dsRNA byproducts and immunogenicity, etc., and can meet various needs of practical applications and research.

本发明的第一个目的是提供一种性能提升的RNA聚合酶突变体,所述RNA聚合酶突变体是在对应于亲本RNA聚合酶的氨基酸序列的基础上,第121、223、632、217、397、369、579位的位置具有一个或者多个氨基酸突变,或者在亲本RNA聚合酶中等价位置的氨基酸残基处发生了一个或者多个氨基酸突变;所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少80%的同一性且具有RNA聚合酶活性;The first object of the present invention is to provide an RNA polymerase mutant with improved performance, wherein the RNA polymerase mutant has one or more amino acid mutations at positions 121, 223, 632, 217, 397, 369, and 579, or one or more amino acid mutations occur at amino acid residues at equivalent positions in the parent RNA polymerase, based on the amino acid sequence corresponding to the parent RNA polymerase; the amino acid sequence of the parent RNA polymerase has at least 80% identity with the amino acids shown in SEQ ID NO.1 and has RNA polymerase activity;

在一种实施方式中,所述性能选自酶活性、热稳定性、3’末端一致性、体外转录产量、dsRNA副产物和免疫原性中的一种或多种。In one embodiment, the property is selected from one or more of enzyme activity, thermal stability, 3' end consistency, in vitro transcription yield, dsRNA byproducts and immunogenicity.

在一种实施方式中,当突变位点含第121位时,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少90%的同一性且具有RNA聚合酶活性;当突变位点含第369位时,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少98.41%的同一性且具有RNA聚合酶活性。In one embodiment, when the mutation site contains position 121, the amino acid sequence of the parent RNA polymerase has at least 90% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity; when the mutation site contains position 369, the amino acid sequence of the parent RNA polymerase has at least 98.41% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity.

在一种实施方式中,所述亲本RNA聚合酶为野生型T7 RNA聚合酶的氨基酸序列如SEQ ID NO.1所示。In one embodiment, the parent RNA polymerase is a wild-type T7 RNA polymerase whose amino acid sequence is shown in SEQ ID NO.1.

在一种实施方式中,所述与SEQ ID NO.1所示的氨基酸具有至少80%的同一性的亲本RNA聚合酶,来自不同的大肠杆菌噬菌体或其他噬菌体,包括但不限于:Escherichia phage T7、Enterobacteria phage vB EcoP IMME390、Enterococcus phage EFA-2、Escherichia phage 13a、Escherichia phage 64795ec1、Escherichia phage ClCC 80001、Escherichia phage Ebrios、Escherichia phage EG1、Escherichia phage HZ2R8、Escherichia phage HZP2、Escherichia phage JacobBurckhardt、Escherichia phage JeanTinguely、Escherichia phage LL2、Escherichia phage N30、Escherichia phage NC-A、Klebsiella phage Patroon、Salmonella phage 3A_8767、Salmonella phage Vi06、Serratia phage 2050H2、Serratia phage Pila、Serratia phage SlM9-3Y、Escherichia phage JB01、Yersinia phage phiA1122、Yersinia phage vB_YpYeO9、Yersinia phage YpP-R、Aeromonas phage PZL-Ah1、Citrobacter phage SH2、Escherichia phage E-2、Yersinia phage phiYe-F10、Citrobacter phage phiCFP-1、Citrobacter phage SH1、Salmonella phage phiSG-JL2、Yersinia phage phiYeO3-12、Klebsiella phage 31、Enterobacteria phage T3中的任一一种RNA聚合酶的氨基酸序列。上述亲本RNA聚合酶的氨基酸序列的第223、632、217、397、579位为保守序列。In one embodiment, the parent RNA polymerase having at least 80% identity with the amino acid shown in SEQ ID NO.1 is from different Escherichia coli phages or other phages, including but not limited to: Escherichia phage T7, Enterobacteria phage vB EcoP IMME390, Enterococcus phage EFA-2, Escherichia phage 13a, Escherichia phage 64795ec1, Escherichia phage ClCC 80001, Escherichia phage Ebrios, Escherichia phage EG1, Escherichia phage HZ2R8, Escherichia phage HZP2, Escherichia phage JacobBu rckhardt, Escherichia phage JeanTinguely, Escherichia phage LL2, Escherichia phage N30, Escherichia phage NC-A, Klebsiella p hage Patroon、Salmonella phage 3A_8767、Salmonella phage Vi06、Serratia phage 2050H2、Serratia phage Pila、Serratia phage S lM9-3Y, Escherichia phage JB01, Yersinia phage phiA1122, Yersinia phage vB_YpYeO9, Yersinia phage YpP-R, Aeromonas phage PZL The amino acid sequence of any one of the RNA polymerases selected from the group consisting of Citrobacter phage SH2, Escherichia phage E-2, Yersinia phage phiYe-F10, Citrobacter phage phiCFP-1, Citrobacter phage SH1, Salmonella phage phiSG-JL2, Yersinia phage phiYeO3-12, Klebsiella phage 31, and Enterobacteria phage T3. The 223rd, 632nd, 217th, 397th, and 579th positions of the amino acid sequences of the above parent RNA polymerases are conserved sequences.

可选地,所述亲本RNA聚合酶的氨基酸序列的登录号为QZB83517.1、YP 009814072.1、QJT70354.1、YP 002003938.1、YP 009291482.1、YP 009152465.1、YP 009799383.1、YP  009795804.1、YP 009798010.1、YP 009820203.1、QXV80637.1、QXV80862.1、YP 009812318.1、YP 009813771.1、YP 009820155.1、QBQ72872.1、YP 009804754.1、YP 004306655.1、YP 009791749.1、QFG06779.1、YP 009786924.1、QOV06378.1、NP 848264.1、WMX18675.1、AFK13398.1、QMP81608.1、YP_009289300.1、YP_009226209.1、YP_009785350.1、YP_009205657.1、YP_009286631.1、YP_001949750.1、NP_052071.1、QGH73719.1、NP_523301.1中的任一一种,分别对应SEQ ID NO.1、SEQ ID NO.29~62;其中SEQ ID NO.29~62与SEQ ID NO.1的氨基酸相比,分别具有97.96%、98.07%、98.64%、98.07%、98.19%、90.26%、97.51%、97.40%、97.40%、98.19%、98.53%、82.35%、98.75%、97.73%、82.47%、98.41%、92.98%、82.47%、98.41%、82.47%、98.30%、98.41%、98.19%、98.53%、89.92%、82.69%、82.58%、82.47%、82.47%、82.35%、82.47%、82.24%、82.13%、82.01%的序列同一性。Optionally, the accession numbers of the amino acid sequence of the parent RNA polymerase are QZB83517.1, YP 009814072.1, QJT70354.1, YP 002003938.1, YP 009291482.1, YP 009152465.1, YP 009799383.1, YP 009795804.1, YP 009798010.1, YP 009820203.1, QXV80637.1, QXV80862.1, YP 009812318.1, YP 009813771.1, YP 009820155.1, QBQ72872.1, YP 009804754.1, YP 004306655.1, YP 009791749.1, QFG06779.1, YP 009786924.1, QOV06378.1, NP 848264.1, WMX18675.1, AFK13398.1, QMP81608.1, YP_009289300.1, YP_009226209.1, YP_009785350.1, YP_009205657.1, YP_009286631.1, YP_001949750.1, NP_052071.1, QGH73719.1, NP_523301.1, corresponding to SEQ ID NO.1, SEQ ID NO.29-62, respectively; wherein SEQ ID NO.29-62 and SEQ ID Compared with the amino acid of NO.1, it has 97.96%, 98.07%, 98.64%, 98.07%, 98.19%, 90.26%, 97.51%, 97.40%, 97.40%, 98.19%, 98.53%, 82.35%, 98.75%, 97.73%, 82.47%, 98.41%, 92 .98%, 82.47%, 98.41%, 82.47%, 98.30%, 98.41%, 98.19%, 98.53%, 89.92%, 82.69%, 82.58%, 82.47%, 82.47%, 82.35%, 82.47%, 82.24%, 82.13%, 82.01% sequence identity.

可选地,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少90%的同一性且具有RNA聚合酶活性;可选地,所述亲本RNA聚合酶的氨基酸序列分别对应SEQ ID NO.1、SEQ ID NO.30~39、SEQ ID NO.41~42、SEQ ID NO.44~45、SEQ ID NO.47、SEQ ID NO.49~53,其氨基酸序列的第121、223、632、217、397、579位为保守序列。Optionally, the amino acid sequence of the parent RNA polymerase has at least 90% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity; optionally, the amino acid sequence of the parent RNA polymerase corresponds to SEQ ID NO.1, SEQ ID NO.30-39, SEQ ID NO.41-42, SEQ ID NO.44-45, SEQ ID NO.47, SEQ ID NO.49-53, respectively, and the 121st, 223rd, 632nd, 217th, 397th, and 579th positions of its amino acid sequence are conserved sequences.

可选地,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少98.41%的同一性且具有RNA聚合酶活性;可选地,所述亲本RNA聚合酶的氨基酸序列如SEQ ID NO.1或SEQ ID NO.44所示,这两条序列中第121、223、632、217、397、369、579位的氨基酸一致。Optionally, the amino acid sequence of the parent RNA polymerase has at least 98.41% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity; optionally, the amino acid sequence of the parent RNA polymerase is as shown in SEQ ID NO.1 or SEQ ID NO.44, and the amino acids at positions 121, 223, 632, 217, 397, 369, and 579 in these two sequences are consistent.

在一种实施方式中,所述第217位氨基酸突变为L、A或V,优选地,突变为L;In one embodiment, the amino acid at position 217 is mutated to L, A or V, preferably, mutated to L;

所述第397位氨基酸突变为W、A或P,优选地,突变为W;The amino acid at position 397 is mutated to W, A or P, preferably, to W;

所述第121位氨基酸突变为M、L或S,优选地,突变为M;The amino acid at position 121 is mutated to M, L or S, preferably, mutated to M;

所述第223位氨基酸突变为M、P、L、V或F,优选地,突变为M;The amino acid at position 223 is mutated to M, P, L, V or F, preferably, mutated to M;

所述第632位氨基酸突变为G或E,优选地,突变为G;The amino acid at position 632 is mutated to G or E, preferably, mutated to G;

所述第369位氨基酸突变为T或E;The amino acid at position 369 is mutated to T or E;

所述第579位氨基酸突变为A、W或D;优选地,突变为W。The amino acid at position 579 mutates to A, W or D; preferably, mutates to W.

在一种实施方式中,所述RNA聚合酶突变体还包括对应于亲本RNA聚合酶的氨基酸序列的如下一个或者多个位点的突变:第430位、633位、第786位、第744位、第724位、第43位、第723位。In one embodiment, the RNA polymerase mutant also includes mutations at one or more of the following positions corresponding to the amino acid sequence of the parent RNA polymerase: position 430, position 633, position 786, position 744, position 724, position 43, and position 723.

在一种实施方式中,所述第430位氨基酸突变为非极性疏水性氨基酸;可选地,突变为P、L、V、M、F;优选地,突变为P; In one embodiment, the amino acid at position 430 is mutated to a non-polar hydrophobic amino acid; alternatively, it is mutated to P, L, V, M, F; preferably, it is mutated to P;

所述第633位氨基酸突变为非极性疏水性氨基酸;可选地,突变为P、L、V、M或F、A、I、W;优选地,突变为P;The amino acid at position 633 is mutated to a non-polar hydrophobic amino acid; optionally, it is mutated to P, L, V, M or F, A, I, W; preferably, it is mutated to P;

所述第786位氨基酸突变为非极性疏水性氨基酸或Y;可选地,突变为P、I、A、V、Y、L、M、W或F;优选地,突变为L;The amino acid at position 786 is mutated to a non-polar hydrophobic amino acid or Y; alternatively, it is mutated to P, I, A, V, Y, L, M, W or F; preferably, it is mutated to L;

所述第744位氨基酸突变为R、P或L;优选地,突变为R;The amino acid at position 744 is mutated to R, P or L; preferably, mutated to R;

所述第43位氨基酸突变为Y或A;优选地,突变为Y;The amino acid at position 43 mutates to Y or A; preferably, mutates to Y;

所述第723位氨基酸突变为E或S;优选地,突变为E;The amino acid at position 723 is mutated to E or S; preferably, mutated to E;

所述第724位氨基酸突变为N。The amino acid at position 724 is mutated to N.

在一种实施方式中,所述RNA聚合酶突变体至少包含等价于或者对应于亲本聚合酶突变体的如下任一一种突变:I217L、I217A、I217V、S397W、S397A、S397P、T121M、T121L、T121S、S223M、S223P、S223L、S223V、S223F、R632G、R632E、M369E、M369T、N579A、N579W、N579D突变。In one embodiment, the RNA polymerase mutant comprises at least any one of the following mutations that is equivalent to or corresponds to the parent polymerase mutant: I217L, I217A, I217V, S397W, S397A, S397P, T121M, T121L, T121S, S223M, S223P, S223L, S223V, S223F, R632G, R632E, M369E, M369T, N579A, N579W, and N579D mutations.

在一种实施方式中,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P和Q786L,命名为I217L/S397W/S430P/S633P/Q786L(以下简称M7)。In one embodiment, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P and Q786L, named I217L/S397W/S430P/S633P/Q786L (hereinafter referred to as M7).

在一种实施方式中,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、N579W,命名为I217L/S397W/S430P/S633P/Q786L/N579W。In one embodiment, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, N579W, named I217L/S397W/S430P/S633P/Q786L/N579W.

在一种实施方式中,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、Q744R,命名为I217L/S397W/S430P/S633P/Q786L/Q744R。In one embodiment, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, Q744R, named I217L/S397W/S430P/S633P/Q786L/Q744R.

在一种实施方式中,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、C723E、A724N,命名为I217L/S397W/S430P、S633P/Q786L/C723E/A724N。In one embodiment, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, C723E, A724N, named I217L/S397W/S430P, S633P/Q786L/C723E/A724N.

在一种实施方式中,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、Q744R、S43Y,命名为I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y(以下简称M8)。In one embodiment, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, Q744R, S43Y, and is named I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y (hereinafter referred to as M8).

在一种实施方式中,所述RNA聚合酶突变体的氨基酸序列如SEQ ID NO.2(对应I217L)、SEQ ID NO.3(对应S397W)、SEQ ID NO.4(对应T121M)、SEQ ID NO.5(对应S223M)、SEQ ID NO.6(对应R632G)、SEQ ID NO.7(对应I217L/S397W/S430P/S633P/Q786L)或SEQ  ID NO.8(对应I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y)、SEQ ID NO.79(对应M369T)、SEQ ID NO.80(对应N579W)、SEQ ID NO.81(对应I217L/S397W/S430P/S633P/Q786L/N579W)、SEQ ID NO.82(对应I217L/S397W/S430P/S633P/Q786L/Q744R)、SEQ ID NO.83(对应I217L/S397W/S430P/S633P/Q786L/C723E/A724N)所示。In one embodiment, the amino acid sequence of the RNA polymerase mutant is such as SEQ ID NO.2 (corresponding to I217L), SEQ ID NO.3 (corresponding to S397W), SEQ ID NO.4 (corresponding to T121M), SEQ ID NO.5 (corresponding to S223M), SEQ ID NO.6 (corresponding to R632G), SEQ ID NO.7 (corresponding to I217L/S397W/S430P/S633P/Q786L) or SEQ ID NO. SEQ ID NO.8 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y), SEQ ID NO.79 (corresponding to M369T), SEQ ID NO.80 (corresponding to N579W), SEQ ID NO.81 (corresponding to I217L/S397W/S430P/S633P/Q786L/N579W), SEQ ID NO.82 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R), and SEQ ID NO.83 (corresponding to I217L/S397W/S430P/S633P/Q786L/C723E/A724N).

本发明的第二个目的是提供一种编码前述RNA聚合酶突变体的多核苷酸。The second object of the present invention is to provide a polynucleotide encoding the aforementioned RNA polymerase mutant.

在一种实施方式中,所述多核苷酸的序列如SEQ ID NO.16(对应I217L)、SEQ ID NO.17(对应S397W)、SEQ ID NO.18(对应T121M)、SEQ ID NO.19(对应S223M)、SEQ ID NO.20(对应R632G)、SEQ ID NO.21(对应I217L/S397W/S430P/S633P/Q786L)、SEQ ID NO.22(对应I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y)所示。In one embodiment, the sequence of the polynucleotide is as shown in SEQ ID NO.16 (corresponding to I217L), SEQ ID NO.17 (corresponding to S397W), SEQ ID NO.18 (corresponding to T121M), SEQ ID NO.19 (corresponding to S223M), SEQ ID NO.20 (corresponding to R632G), SEQ ID NO.21 (corresponding to I217L/S397W/S430P/S633P/Q786L), and SEQ ID NO.22 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y).

本发明的第三个目的是提供一种载体,该载体携带前述多核苷酸。The third object of the present invention is to provide a vector carrying the aforementioned polynucleotide.

可选地,所述载体为pQE-80L,还可以为pET-28a(+)或pPIC9K等。Optionally, the vector is pQE-80L, or may be pET-28a(+) or pPIC9K, etc.

本发明的第四个目的是提供一种细胞,该细胞携带前述多核苷酸或前述载体,或表达前述RNA聚合酶突变体。所述细胞可以指单个细胞,也可以指细胞系,或细胞培养物。本文中细胞包括其后代,后代可以由于天然的、偶然的或故意的突变不一定与原代细胞完全相同,在形态学上和/或在基因组DNA上与原代细胞存在差异。所述细胞可以为天然细胞或转化体。The fourth object of the present invention is to provide a cell, which carries the aforementioned polynucleotide or the aforementioned vector, or expresses the aforementioned RNA polymerase mutant. The cell may refer to a single cell, a cell line, or a cell culture. Cells herein include their offspring, which may not necessarily be identical to the primary cell due to natural, accidental or deliberate mutations, and may differ from the primary cell in morphology and/or in genomic DNA. The cell may be a natural cell or a transformant.

本发明的第五个目的是提供一种多核苷酸的合成方法,所述方法用到本发明的RNA聚合酶突变体。A fifth object of the present invention is to provide a method for synthesizing a polynucleotide, wherein the method uses the RNA polymerase mutant of the present invention.

在一种实施方式中,所述合成方法包括使用包含所述的RNA聚合酶突变体的酶体系催化合成多核苷酸或者上述细胞的全细胞催化合成多核苷酸。In one embodiment, the synthesis method comprises using an enzyme system comprising the RNA polymerase mutant to catalyze the synthesis of polynucleotides or using the whole cells of the above-mentioned cells to catalyze the synthesis of polynucleotides.

可选地,所述合成方法包括体外转录合成、等温扩增或转录介导的扩增。Optionally, the synthesis method comprises in vitro transcription synthesis, isothermal amplification or transcription-mediated amplification.

本发明的第六个目的是提供一种上述RNA聚合酶突变体或上述多核苷酸的合成方法在mRNA疫苗制备、核酸药物制备、基因编辑和蛋白表达系统构建中的至少一种中的应用,蛋白表达系统构建包括但不限于体内蛋白质表达系统构建或无细胞蛋白表达体外翻译系统构建。The sixth object of the present invention is to provide an application of the above-mentioned RNA polymerase mutant or the above-mentioned polynucleotide synthesis method in at least one of mRNA vaccine preparation, nucleic acid drug preparation, gene editing and protein expression system construction, and the protein expression system construction includes but is not limited to in vivo protein expression system construction or cell-free protein expression in vitro translation system construction.

本发明的第七个目的是提供一种用于多核苷酸合成的试剂盒,该试剂盒包含前述RNA聚合酶突变体。A seventh object of the present invention is to provide a kit for polynucleotide synthesis, which comprises the aforementioned RNA polymerase mutant.

本发明的第八个目的是提供一种提高RNA聚合酶的至少一个性能的方法;所述性能选自酶活性提升、热稳定性提升、3'末端一致性提升、更高的体外转录产量、更少的dsRNA副产物和更低的免疫原性,所述方法为:在对应于亲本RNA聚合酶的氨基酸序列的基础上,第121、223、632、217、397、369、579位的位置具有一个或者多个氨基酸突变,或者在亲本RNA聚合酶中等价位置的氨基酸残基处发生了一个或者多个氨基酸突变;所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少80%的同一性且具有RNA聚合酶活性。The eighth object of the present invention is to provide a method for improving at least one performance of an RNA polymerase; the performance is selected from the group consisting of improved enzyme activity, improved thermal stability, improved 3' end consistency, higher in vitro transcription yield, fewer dsRNA byproducts and lower immunogenicity, the method comprising: based on the amino acid sequence corresponding to the parent RNA polymerase, one or more amino acid mutations are present at positions 121, 223, 632, 217, 397, 369 and 579, or one or more amino acid mutations occur at amino acid residues at equivalent positions in the parent RNA polymerase; the amino acid sequence of the parent RNA polymerase has at least 80% identity with the amino acids shown in SEQ ID NO.1 and has RNA polymerase activity.

在一种实施方式中,当突变位点含第121位时,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少90%的同一性且具有RNA聚合酶活性;当突变位点含第369位时,所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少98.41%的同一性且具有RNA聚合酶活性。In one embodiment, when the mutation site contains position 121, the amino acid sequence of the parent RNA polymerase has at least 90% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity; when the mutation site contains position 369, the amino acid sequence of the parent RNA polymerase has at least 98.41% identity with the amino acid shown in SEQ ID NO.1 and has RNA polymerase activity.

本发明的有益效果:Beneficial effects of the present invention:

本发明提供了一系列性能提升的RNA聚合酶突变体,该突变体至少具有如下任意性能的提升:酶活性、热稳定性、3'末端一致性、dsRNA副产物和免疫原性等,能够满足实际应用和研究的多种需求。The present invention provides a series of RNA polymerase mutants with improved performance, which have at least any of the following performance improvements: enzyme activity, thermal stability, 3' end consistency, dsRNA byproducts and immunogenicity, etc., and can meet various needs of practical applications and research.

其中:in:

(1)本发明在氨基酸序列如SEQ ID NO.1所示RNA聚合酶的基础上,对第121位氨基酸进行突变,构建得到单点突变体T121M、T121A、T121V,相较于野生型WT,酶活性提高47%以上;其中T121M酶活性最高,相较于野生型WT提高了123%;(1) Based on the RNA polymerase with the amino acid sequence shown in SEQ ID NO.1, the present invention mutates the 121st amino acid to construct single-point mutants T121M, T121A, and T121V, which have an enzyme activity increased by more than 47% compared with the wild-type WT; among them, T121M has the highest enzyme activity, which is increased by 123% compared with the wild-type WT;

对第217位氨基酸进行突变,构建得到单点突变体I217L、I217A、I217V,相较于野生型WT,酶活性提高47%以上;其中,I217L酶活性最高,相较于野生型WT提高了34%;The amino acid at position 217 was mutated to construct single-point mutants I217L, I217A, and I217V, which increased the enzyme activity by more than 47% compared with the wild-type WT. Among them, I217L had the highest enzyme activity, which increased by 34% compared with the wild-type WT.

对第223位氨基酸进行突变,构建得到单点突变体S223M、S223L、S223V,相较于野生型WT,酶活性提高18%以上;其中,S223M酶活性最高,相较于野生型WT提高了25%;The amino acid at position 223 was mutated to construct single-point mutants S223M, S223L, and S223V, which increased the enzyme activity by more than 18% compared with the wild-type WT. Among them, S223M had the highest enzyme activity, which increased by 25% compared with the wild-type WT.

对第397位氨基酸进行突变,构建得到单点突变体S397W、S397P,相较于野生型WT,酶活性提高36%以上;其中,S397W酶活性最高,相较于野生型WT提高了62%;The single-point mutants S397W and S397P were constructed by mutating the amino acid at position 397. The enzyme activity of S397W was increased by more than 36% compared with the wild-type WT. Among them, the enzyme activity of S397W was the highest, which was increased by 62% compared with the wild-type WT.

对第632位氨基酸进行突变,构建得到单点突变体R632G、R632E,相较于野生型WT,酶活性提高20%以上;其中,R632G酶活性最高,相较于野生型WT提高了22%;The amino acid at position 632 was mutated to construct single-point mutants R632G and R632E, which showed an enzyme activity increase of more than 20% compared with the wild-type WT. Among them, R632G had the highest enzyme activity, which increased by 22% compared with the wild-type WT.

对第579位氨基酸进行突变,构建得到单点突变体N579W,相较于野生型WT,酶活性提高46%;The amino acid at position 579 was mutated to construct a single-point mutant N579W, which showed an enzyme activity increase of 46% compared with the wild-type WT;

其中,单点突变体I217L、单点突变体S397W、单点突变体T121M、单点突变体S223M和单点突变体R632G在37℃下的相对酶活性显著优于现有技术已报道的单点突变体S430P、单点突变体G47A+884G、单点突变体S633P和单点突变体Q786L。Among them, the relative enzyme activities of the single point mutant I217L, single point mutant S397W, single point mutant T121M, single point mutant S223M and single point mutant R632G at 37°C are significantly better than those of the single point mutant S430P, single point mutant G47A+884G, single point mutant S633P and single point mutant Q786L reported in the prior art.

(2)本发明在单点突变的基础上组合突变,构建得到组合突变体I217L/S397W/S430P/S633P/Q786L(即M7)、I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y(即M8)、I217L/S397W/S430P/S633P/Q786L/Q744R、I217L/S397W/S430P/S633P/Q786L/C723E/A724N,相较于野生型WT,I217L/S397W/S430P/S633P/Q786L/Q744R的酶活性提高了95%,I217L/S397W/S430P/S633P/Q786L/C723E/A724N的酶活性提高了83%,M8的酶活性提高了336%;M7、M8、I217L/S397W/S430P/S633P/Q786L/Q744R和I217L/S397W/S430P/S633P/Q786L/C723E/A724N的Tm值明显高于野生型和现有技术中报道的其他突变体。(2) The present invention combines mutations based on single-point mutations to construct combined mutants I217L/S397W/S430P/S633P/Q786L (i.e., M7), I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y (i.e., M8), I217L/S397W/S430P/S633P/Q786L/Q744R, and I217L/S397W/S430P/S633P/Q786L/C723E/A724N. Compared with the wild-type WT, I217L/S397W/ The enzyme activity of S430P/S633P/Q786L/Q744R increased by 95%, the enzyme activity of I217L/S397W/S430P/S633P/Q786L/C723E/A724N increased by 83%, and the enzyme activity of M8 increased by 336%; the Tm values of M7, M8, I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N were significantly higher than those of the wild type and other mutants reported in the prior art.

(3)本发明的组合突变体M8,在50℃下的体外转录产量和3'末端一致性产物明显优于野生型WT和现有技术中报道的其他突变体,表明组合突变体M8具有更高的IVT产量和3'末端一致性;I217L/S397W/S430P/S633P/Q786L/Q744R和I217L/S397W/S430P/S633P/Q786L/C723E/A724N在37℃下的IVT产量较野生型WT分别提高43%、10.7%,纯度分别达到79.4%、77.4%。(3) The combined mutant M8 of the present invention has significantly better in vitro transcription yield and 3' end consistency product at 50°C than the wild-type WT and other mutants reported in the prior art, indicating that the combined mutant M8 has higher IVT yield and 3' end consistency; the IVT yields of I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37°C are 43% and 10.7% higher than those of the wild-type WT, respectively, and the purities reach 79.4% and 77.4%, respectively.

(4)本发明的组合突变体M8、I217L/S397W/S430P/S633P/Q786L/Q744R和I217L/S397W/S430P/S633P/Q786L/C723E/A724N的dsRNA转录副产物含量明显低于野生型WT和现有技术中报道的突变体以及M7组合突变体,M8组合突变体在50℃反应条件下单位质量IVT产物的dsRNA含量较低,可以达到CN 111212905A中报道的能够减少体外转录产物中dsRNA污染物和连缀转录物的G47A+884G突变体的相同水平。(4) The combined mutants M8, I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N of the present invention have significantly lower dsRNA transcription byproduct contents than the wild-type WT, mutants reported in the prior art, and the M7 combined mutant. The M8 combined mutant has a lower dsRNA content per unit mass of IVT product under reaction conditions of 50°C, which can reach the same level as the G47A+884G mutant reported in CN 111212905A that can reduce dsRNA contaminants and concatenated transcripts in in vitro transcription products.

(5)本发明的组合突变体M8的体外转录产物较野生型具有更低的免疫原性。(5) The in vitro transcription product of the combined mutant M8 of the present invention has lower immunogenicity than the wild type.

(6)本发明通过使用FADS平台对单细胞及反应体系进行包裹,成功建立了高通量T7RNAP突变体库筛选平台,利用该平台,本发明筛选到了如下T7 RNA聚合酶突变体,该T7RNA聚合酶突变体对应于SEQ ID NO.1所示氨基酸序列,存在如下中的至少一种氨基酸残基:217位为L,397位为A或W,和,579位为A或W;且,与SEQ ID NO.1至少具有80%同一性。本发明提供的T7 RNA聚合酶突变体具有很好的热稳定性,以及较好的温度适应范围,不仅能够在常规的37℃条件下反应,还可以在高温50℃条件下反应。本发明提供的T7 RNAP突变体在37℃条件下还具有较好的应用性能,含有多个突变位点的T7 RNA聚合酶组合突变体,体外转录产物纯度优于野生型T7 RNA聚合酶或已报到的T7 RNA聚合酶突变体。本发明提供的T7 RNA聚合酶突变体在50℃条件下仍具有非常好的应用性能,合成RNA的产量明显优于野生型和已报到的突变体,产物纯度不差于现有技术,并且合成的mRNA具有较少的dsRNA副产物,将会产生更低的免疫源性,产物的3'一致性相对于现有技术有较明显的改善,这些指标对于mRNA疫苗及药物的研究生产十分有利。(6) The present invention successfully established a high-throughput T7 RNAP mutant library screening platform by using the FADS platform to encapsulate single cells and reaction systems. Using this platform, the present invention screened the following T7 RNA polymerase mutants, which correspond to the amino acid sequence shown in SEQ ID NO.1, and have at least one of the following amino acid residues: L at position 217, A or W at position 397, and A or W at position 579; and have at least 80% identity with SEQ ID NO.1. The T7 RNA polymerase mutant provided by the present invention has good thermal stability and a good temperature adaptability range. It can react not only under conventional 37°C conditions, but also under high temperature 50°C conditions. The T7 RNAP mutant provided by the present invention also has good application performance under 37°C conditions. The T7 RNA polymerase combination mutant containing multiple mutation sites has a purity of in vitro transcription product that is better than that of the wild-type T7 RNA polymerase or the reported T7 RNA polymerase mutant. The T7 RNA polymerase mutant provided by the present invention still has very good application performance under 50°C conditions, the yield of synthetic RNA is significantly better than that of the wild type and reported mutants, the product purity is not worse than the existing technology, and the synthesized mRNA has fewer dsRNA by-products, which will produce lower immunogenicity, and the 3' consistency of the product is significantly improved compared with the existing technology. These indicators are very beneficial for the research and production of mRNA vaccines and drugs.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the specific implementation methods of the present invention or the technical solutions in the prior art, the drawings required for use in the specific implementation methods or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are some implementation methods of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying creative work.

图1为T7 RNAP重组表达载体示意图;Figure 1 is a schematic diagram of the T7 RNAP recombinant expression vector;

图2为基于适配体的荧光激活的微液滴分选流程图;FIG2 is a flow chart of microdroplet sorting based on fluorescence activation of aptamers;

图3为实时荧光法测定不同突变体的相对活性时T7 RNAP野生型的蛋白浓度随荧光强度的变化图;FIG3 is a graph showing the change in protein concentration of wild-type T7 RNAP with fluorescence intensity when the relative activities of different mutants were determined by real-time fluorescence method;

图4A为T7 RNAP野生型及单突变体S430P、S633P、Q786L、I217L、S397W、组合突变体I217L/S397W/S430P/S633P/Q786L、I217L/S397W/S430P/S633P/Q786L/N579W、I217L/S397W/S430P/S633P/Q786L/Q744R、I217L/S397W/S430P/S633P/Q786L/C723E/A724N的IVT产物的dsRNA含量检测结果;FIG4A shows the dsRNA content detection results of IVT products of T7 RNAP wild type and single mutants S430P, S633P, Q786L, I217L, S397W, combined mutants I217L/S397W/S430P/S633P/Q786L, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, and I217L/S397W/S430P/S633P/Q786L/C723E/A724N;

图4B为T7 RNAP野生型及单突变体I217L、S397W、S430P、S633P、Q786L、组合突变体I217L/S397W/S430P/S633P/Q786L、组合突变体I217L/S397W/S430P/S633P/Q786L/N579W、组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R、组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N的IVT产物的dsRNA含量检测结果;Figure 4B shows the dsRNA content detection results of IVT products of T7 RNAP wild type and single mutants I217L, S397W, S430P, S633P, Q786L, combined mutants I217L/S397W/S430P/S633P/Q786L, combined mutants I217L/S397W/S430P/S633P/Q786L/N579W, combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R, and combined mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N;

图5A为T7 RNAP野生型37℃条件下IVT产物的3'末端一致性检测结果;Figure 5A shows the 3' end consistency test results of the IVT product of T7 RNAP wild type at 37°C;

图5B为T7 RNAP突变体Q744R在37℃条件下IVT产物的3'末端一致性检测结果;Figure 5B shows the 3' end consistency test results of the IVT product of T7 RNAP mutant Q744R at 37°C;

图5C为T7 RNAP突变体S43Y在37℃条件下IVT产物的3'末端一致性检测结果;Figure 5C shows the 3' end consistency test results of the IVT product of T7 RNAP mutant S43Y at 37°C;

图5D为T7 RNAP突变体G47A+884G在37℃条件下IVT产物的3'末端一致性检测结果;Figure 5D shows the 3' end consistency test results of the IVT product of T7 RNAP mutant G47A+884G at 37°C;

图5E为T7 RNAP突变体M7:I217L/S397W/S430P/S633P/Q786L在37℃条件下IVT产物的3'末端一致性检测结果;Figure 5E shows the 3' end consistency test results of the IVT product of T7 RNAP mutant M7: I217L/S397W/S430P/S633P/Q786L at 37°C;

图5F为T7 RNAP突变体M7:I217L/S397W/S430P/S633P/Q786L在50℃条件下IVT产物的3'末端一致性检测结果;Figure 5F shows the 3' end consistency test results of the IVT product of T7 RNAP mutant M7: I217L/S397W/S430P/S633P/Q786L at 50°C;

图5G为T7 RNAP突变体M8:I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y在37℃条件下IVT产物的3'末端一致性检测结果;Figure 5G shows the 3' end consistency test results of the IVT product of T7 RNAP mutant M8: I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y at 37°C;

图5H为T7 RNAP突变体M8:I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y在50℃条件下IVT产物的3’末端一致性检测结果;Figure 5H shows the 3’ end consistency test results of the IVT product of T7 RNAP mutant M8: I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y at 50°C;

图5I为T7 RNAP突变体I217L在50℃条件下IVT产物的3’末端一致性检测结果;Figure 5I shows the 3’ end consistency test results of the IVT product of T7 RNAP mutant I217L at 50°C;

图5J为T7 RNAP突变体S397W在50℃条件下IVT产物的3’末端一致性检测结果;Figure 5J shows the 3’ end consistency test results of the IVT product of T7 RNAP mutant S397W at 50°C;

图6A为T7 RNAP野生型及突变体M8的IVT产物细胞免疫应答因子IFN-α的检测结果;Figure 6A shows the detection results of IFN-α, a cellular immune response factor, in the IVT products of T7 RNAP wild type and mutant M8;

图6B为T7 RNAP野生型及突变体M8的IVT产物细胞免疫应答因子IL-6的检测结果;Figure 6B shows the detection results of IL-6, a cellular immune response factor, in the IVT products of wild-type T7 RNAP and mutant M8;

图7A为T7 RNAP突变位点第1~129位在不同来源RNA聚合酶氨基酸序列中的序列保守性分析;FIG7A is a sequence conservation analysis of T7 RNAP mutation sites 1 to 129 in RNA polymerase amino acid sequences from different sources;

图7B为T7 RNAP突变位点第130~259位在不同来源RNA聚合酶氨基酸序列中的序列保守性分析;FIG7B is a sequence conservation analysis of T7 RNAP mutation sites 130 to 259 in RNA polymerase amino acid sequences from different sources;

图7C为T7 RNAP突变位点第260~389位在不同来源RNA聚合酶氨基酸序列中的序列保守性分析;FIG7C is a sequence conservation analysis of T7 RNAP mutation sites 260 to 389 in RNA polymerase amino acid sequences from different sources;

图7D为T7 RNAP突变位点第390~519位在不同来源RNA聚合酶氨基酸序列中的序列保守性分析;FIG7D is a sequence conservation analysis of T7 RNAP mutation sites 390 to 519 in RNA polymerase amino acid sequences from different sources;

图7E为T7 RNAP突变位点第520~650位在不同来源RNA聚合酶氨基酸序列中的序列保守性分析。Figure 7E is a sequence conservation analysis of T7 RNAP mutation sites 520 to 650 in the amino acid sequences of RNA polymerases from different sources.

具体实施方式DETAILED DESCRIPTION

下面将结合实施例对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solution of the present invention will be clearly and completely described below in conjunction with the embodiments. Obviously, the described embodiments are part of the embodiments of the present invention, rather than all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

本发明涉及的部分定义或术语:Some definitions or terms related to the present invention:

性能提升:术语“性能提升”意指与相对于亲本有所改善的变体相关的特征。这种性能提升包括但不限于以下一种或多种:RNA聚合酶突变体酶活性提升、热稳定性提升,RNA聚合酶突变体在体外转录(IVT)反应中具有更高的体外转录产量、3'末端一致性提升、更少的dsRNA副产物,或者RNA聚合酶突变体产生的RNA转录物具有更低的免疫原性等。Improved performance: The term "improved performance" refers to characteristics associated with a variant that is improved relative to the parent. Such improved performance includes, but is not limited to, one or more of the following: improved enzymatic activity of the RNA polymerase mutant, improved thermal stability, higher in vitro transcription yield of the RNA polymerase mutant in an in vitro transcription (IVT) reaction, improved 3' end consistency, fewer dsRNA byproducts, or RNA transcripts produced by the RNA polymerase mutant have lower immunogenicity, etc.

对应于:如本文使用的,术语“对应于”是指确定序列中的特定氨基酸(其中参考了特定氨基酸序列)的方式。例如出于本发明的目的,当参考特定氨基酸位置时,技术人员能够将另一氨基酸序列与所述已经被参考的氨基酸序列进行比对,从而确定哪一个特定氨基酸可能在所述另一氨基酸序列中是感兴趣的。可使用可替代的比对方法,并且这些方法为本领域技术人员所熟知。Corresponding to: As used herein, the term "corresponding to" refers to a manner of determining a specific amino acid in a sequence where a specific amino acid sequence is referenced. For example, for the purposes of the present invention, when a specific amino acid position is referenced, a skilled person can compare another amino acid sequence with the amino acid sequence that has been referenced to determine which specific amino acid may be of interest in the other amino acid sequence. Alternative alignment methods may be used and are well known to those skilled in the art.

突变体:如本文使用的,当涉及本发明的变体的使用时,术语“突变体”或“多肽变体”或“多肽”或“RNA聚合酶突变体”意指具有RNA聚合酶酶活性的、在相对于“亲本”RNA聚合酶的一个或多个(例如若干个)位置包含改变(即取代、插入和/或缺失)的多肽。取代意指用不同的氨基酸替代占据某一位置的氨基酸;缺失意指去除占据某一位置的氨基酸;而插入意指在邻接并且紧随占据某一位置的氨基酸之后添加氨基酸。在描述本发明的变体中,为了便于参考,对以下所述的命名法进行了改编。采用了已接受的IUPAC单字母或三字母的氨基酸缩写。取代:对于氨基酸取代,使用以下命名法:原始氨基酸、位置、被取代的氨基酸。因此,将在位置49处的谷氨酸被精氨酸取代表示为“E49R”。多个突变通过符号(“/”)分开,例如“E49R/M358F”代表在位置49和位置358处的谷氨酸(E)和蛋氨酸(M)分别被精氨酸(R)和苯丙氨酸(F)取代。Mutant: As used herein, the term "mutant" or "polypeptide variant" or "polypeptide" or "RNA polymerase mutant" when used in relation to variants of the present invention means a polypeptide having RNA polymerase enzymatic activity comprising an alteration (i.e., a substitution, insertion and/or deletion) at one or more (e.g., several) positions relative to a "parent" RNA polymerase. Substitution means replacing the amino acid occupying a position with a different amino acid; deletion means removing the amino acid occupying a position; and insertion means adding an amino acid adjacent to and immediately following the amino acid occupying a position. In describing the variants of the present invention, the nomenclature described below has been adapted for ease of reference. Accepted IUPAC single-letter or three-letter abbreviations for amino acids are used. Substitution: For amino acid substitutions, the following nomenclature is used: original amino acid, position, substituted amino acid. Thus, a substitution of glutamic acid at position 49 by arginine is represented as "E49R". Multiple mutations are separated by a symbol ("/"), for example "E49R/M358F" represents that glutamic acid (E) and methionine (M) at position 49 and position 358 are replaced by arginine (R) and phenylalanine (F), respectively.

亲本或亲本RNA聚合酶:如本文使用的术语“亲本”RNA聚合酶意指进行改变以产生本发明的RNA聚合酶突变体的RNA聚合酶。该术语还指本发明的突变体与之进行比较的多肽。亲本可以是天然存在(野生型)多肽,或者它可以甚至是通过任何适合的手段制备的其变体。例如,亲本蛋白质可以是在氨基酸序列方面已经进行过修饰或改变的天然存在多肽的变体。因此,亲本RNA聚合酶可具有一个或多个(或一个或几个)氨基酸取代、缺失和/或插入。因此,亲本RNA聚合酶可为亲本RNA聚合酶的变体。亲本也可以是等位基因变体,其是由占据相同染色体基因座的基因的两种或更多种可替代形式中的任一种编码的多肽。Parent or parent RNA polymerase: The term "parent" RNA polymerase as used herein means an RNA polymerase that is altered to produce an RNA polymerase mutant of the present invention. The term also refers to a polypeptide to which a mutant of the present invention is compared. A parent can be a naturally occurring (wild-type) polypeptide, or it can even be a variant thereof prepared by any suitable means. For example, a parent protein can be a variant of a naturally occurring polypeptide that has been modified or altered in terms of amino acid sequence. Thus, a parent RNA polymerase can have one or more (or one or several) amino acid substitutions, deletions and/or insertions. Thus, a parent RNA polymerase can be a variant of a parent RNA polymerase. A parent can also be an allelic variant, a polypeptide encoded by any of two or more alternative forms of a gene occupying the same chromosomal locus.

野生型酶:提及氨基酸序列或核酸序列时术语“野生型”意指该氨基酸序列或核酸序列是天然或天然存在的序列。如本文所用,术语“天然存在的”是指在自然界中发现的任何物质(例如蛋白质、氨基酸或核酸序列)。相反,术语“非天然存在的”是指在自然界中未发现的任何物质(例如,在实验室中产生的重组核酸和蛋白质序列、或野生型序列的修饰)。当亲本酶不是变体酶时,术语“野生型酶”和“亲本酶”可互换地使用。Wild-type enzyme: The term "wild-type" when referring to an amino acid sequence or a nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a natural or naturally occurring sequence. As used herein, the term "naturally occurring" refers to any substance (e.g., a protein, amino acid, or nucleic acid sequence) found in nature. In contrast, the term "non-naturally occurring" refers to any substance not found in nature (e.g., a recombinant nucleic acid and protein sequence produced in a laboratory, or a modification of a wild-type sequence). When the parent enzyme is not a variant enzyme, the terms "wild-type enzyme" and "parent enzyme" are used interchangeably.

序列同一性:两个氨基酸序列之间或两个核苷酸序列之间的关联度通过参数“序列同一性”来描述。Sequence identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter "sequence identity".

出于本发明的目的,使用尼德勒曼–翁施算法(Needleman-Wunsch algorithm)(Needleman和Wunsch,1970,J.Mol.Biol.[分子生物学杂志]48:443-453)来确定两个氨基酸序列之间的序列同一性,该算法如EMBOSS软件包(EMBOSS:The European MolecularBiology Open Software Suite[欧洲分子生物学开放软件套件],Rice等人,2000,TrendsGenet.[遗传学趋势]16:276-277)(优选5.0.0版本或更新版本)的Needle程序中所实施的。使用的参数可以是空位开放罚分10、空位延伸罚分0.5、以及EBLOSUM62(BLOSUM62的EMBOSS版本)取代矩阵。使用Needle标记的“最长同一性”的输出(使用-nobrief选项获得)作为同一性百分比并且如下计算: For the purposes of the present invention, the sequence identity between two amino acid sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J.Mol.Biol. [Journal of Molecular Biology] 48: 443-453), as implemented in the Needle program of the EMBOSS software package (EMBOSS: The European Molecular Biology Open Software Suite [European Molecular Biology Open Software Suite], Rice et al., 2000, Trends Genet. [Genetics Trend] 16: 276-277) (preferably 5.0.0 version or later). The parameters used can be a gap opening penalty of 10, a gap extension penalty of 0.5, and an EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. The output of the "longest identity" marked with Needle (obtained using the -nobrief option) is used as the identity percentage and is calculated as follows:

(相同的残基x 100)/(比对长度-比对中的空位总数)(number of identical residues x 100)/(length of alignment - total number of gaps in the alignment)

可替代地,所使用的参数可以是空位开放罚分10、空位延伸罚分0.5和EDNAFULL(NCBI NUC4.4的EMBOSS版本)取代矩阵。使用Needle标记的“最长同一性”的输出(使用-nobrief选项获得)作为同一性百分比并且如下计算:Alternatively, the parameters used can be a gap opening penalty of 10, a gap extension penalty of 0.5 and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. The output of the "longest identity" of the Needle tag (obtained using the -nobrief option) is used as the identity percentage and is calculated as follows:

(相同的脱氧核糖核苷酸×100)/(比对长度–比对中的空位总数)(number of identical deoxyribonucleotides × 100)/(length of alignment – total number of gaps in alignment)

表达:如本文使用的术语“表达”是指涉及变体产生的任何步骤,包括但不限于,转录、转录后修饰、翻译、翻译后修饰以及分泌。Expression: As used herein, the term "expression" refers to any step involved in the production of a variant including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.

表达载体:如本文使用的术语“表达载体”是指线性或环状DNA分子,该分子包含编码变体的多核苷酸并且可操作地连接至提供用于其表达的控制序列。Expression vector: As used herein, the term "expression vector" refers to a linear or circular DNA molecule that comprises a polynucleotide encoding a variant and is operably linked to control sequences that provide for its expression.

宿主细胞:术语“宿主细胞”意指易于用包含本发明的多核苷酸的核酸构建体或表达载体进行转化、转染、转导等的任何细胞类型。术语“宿主细胞”涵盖由于复制期间出现的突变而与亲本细胞不同的亲本细胞的任何后代、连同重组宿主细胞、分离的宿主细胞(例如分离的重组宿主细胞)、异源宿主细胞。Host cell: The term "host cell" means any cell type that is susceptible to transformation, transfection, transduction, etc. with a nucleic acid construct or expression vector comprising a polynucleotide of the present invention. The term "host cell" encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication, as well as recombinant host cells, isolated host cells (e.g., isolated recombinant host cells), heterologous host cells.

重组:当用于提及细胞、核酸、蛋白质或载体时,术语“重组”意指已经从其天然状态经修饰。因此,例如,重组细胞表达在天然(非重组)形式的细胞内未发现的基因,或与在自然界中发现的相比,以不同水平表达或在不同条件下表达天然基因。重组核酸与天然序列的差异在于一个或多个核苷酸和/或与异源序列(例如,表达载体中的异源启动子)可操作地连接。重组蛋白与天然序列的差异可以在于一个或多个氨基酸和/或与异源序列融合。包含编码多肽的核酸的载体是重组载体。术语“重组”与“遗传修饰的”和“转基因的”同义。Recombinant: When used in reference to a cell, nucleic acid, protein, or vector, the term "recombinant" means that it has been modified from its native state. Thus, for example, a recombinant cell expresses genes not found in the native (non-recombinant) form of the cell, or expresses native genes at different levels or under different conditions than found in nature. A recombinant nucleic acid differs from a native sequence in one or more nucleotides and/or is operably linked to a heterologous sequence (e.g., a heterologous promoter in an expression vector). A recombinant protein may differ from a native sequence in one or more amino acids and/or be fused to a heterologous sequence. A vector comprising a nucleic acid encoding a polypeptide is a recombinant vector. The term "recombinant" is synonymous with "genetically modified" and "transgenic."

下面将结合实施例对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solution of the present invention will be clearly and completely described below in conjunction with the embodiments. Obviously, the described embodiments are part of the embodiments of the present invention, rather than all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

需要说明的是,本文中氨基酸残基的简写同本领域的一般定义,本文在一些记载中仍在简写后的括号中注明其全称。本文中“氨基酸残基”和“氨基酸”可以互相替换使用。本文中氨基酸序列从氮端起始。It should be noted that the abbreviations of amino acid residues herein are the same as the general definitions in the art, and the full names are still indicated in brackets after the abbreviations in some records herein. "Amino acid residue" and "amino acid" can be used interchangeably herein. The amino acid sequence herein starts from the nitrogen terminus.

在一些可选的实施方式中,提供了一种T7 RNA聚合酶突变体的筛选方法,包括在含有锰离子的反应体系中扩增编码T7 RNA聚合酶的多核苷酸,然后将携带有扩增产物的载体转化至细胞,诱导表达后采用液滴微流控细胞分选方法分选靶细胞;反应体系中锰离子浓度为0.1~0.6mM,优选为0.2mM~0.3mM。液滴微流控细胞分选基于微流控超高通量荧光激活液滴分选平台(Fluorescence-Activated Droplet Sorting,FADS),FADS筛选过程包括三个步骤: 单细胞的液滴包裹和孵育、孵育和反应,以及反应后单细胞液滴的超高通量筛选。FADS可对单细胞进行封装培养,利特异性荧光底物对单细胞的次级代谢产物、或特异性酶类的产量进行荧光定量,通过筛选平台对荧光强度符合筛选阈值的液滴进行分选收集,最终获得大量备选液滴。该筛选方法的通量高、特异性强的优点。In some optional embodiments, a method for screening T7 RNA polymerase mutants is provided, comprising amplifying a polynucleotide encoding T7 RNA polymerase in a reaction system containing manganese ions, then transforming a vector carrying the amplified product into cells, and inducing expression to sort target cells using a droplet microfluidic cell sorting method; the manganese ion concentration in the reaction system is 0.1 to 0.6 mM, preferably 0.2 mM to 0.3 mM. Droplet microfluidic cell sorting is based on a microfluidic ultra-high throughput fluorescence-activated droplet sorting platform (Fluorescence-Activated Droplet Sorting, FADS), and the FADS screening process includes three steps: Droplet encapsulation and incubation of single cells, incubation and reaction, and ultra-high throughput screening of single cell droplets after reaction. FADS can encapsulate and culture single cells, use specific fluorescent substrates to perform fluorescence quantification of the production of secondary metabolites or specific enzymes of single cells, and use the screening platform to sort and collect droplets whose fluorescence intensity meets the screening threshold, and finally obtain a large number of candidate droplets. This screening method has the advantages of high throughput and strong specificity.

在可选的实施方式中,热处理的条件为40~50℃处理5~10min,优选为45℃处理7min。In an optional embodiment, the heat treatment condition is 40-50° C. for 5-10 min, preferably 45° C. for 7 min.

在可选的实施方式中,靶细胞为表现出最高活力的0.4%~0.45%的液滴包裹的细胞。In an alternative embodiment, the target cells are the 0.4% to 0.45% of droplet-encapsulated cells that exhibit the highest viability.

在可选的实施方式中,所述扩增编码T7 RNA聚合酶的多核苷酸的和核苷酸序列如SEQ ID NO.15所示,其表达氨基酸序列如SEQ ID NO.1所示的野生型T7 RNA聚合酶。In an optional embodiment, the amplified polynucleotide encoding T7 RNA polymerase and the nucleotide sequence are as shown in SEQ ID NO.15, and the expressed amino acid sequence is the wild-type T7 RNA polymerase as shown in SEQ ID NO.1.

在可选的实施方式中,还包括至少一次:分离经液滴微流控细胞分选方法分选得到的靶细胞中的目的片段,转化至另一细胞中,培养后再次筛选。In an optional embodiment, the method further includes at least one step of: isolating a target fragment from a target cell sorted by a droplet microfluidic cell sorting method, transforming the fragment into another cell, and screening the cell again after culturing the fragment.

在可选的实施方式中,收集阳性细胞表达的T7 RNA聚合酶突变体,对含有T7 RNA聚合酶突变体的体系进行热处理,然后进行酶活测定,将酶活至少是野生型1.5倍的阳性克隆测序。In an optional embodiment, T7 RNA polymerase mutants expressed by positive cells are collected, the system containing the T7 RNA polymerase mutants is heat-treated, and then the enzyme activity is determined, and the positive clones whose enzyme activity is at least 1.5 times that of the wild type are sequenced.

在某一次的实施方式中,采用如下方案筛选了T7 RNA聚合酶突变体:In one embodiment, T7 RNA polymerase mutants were screened using the following protocol:

首先,以含有野生型T7 RNAP基因的重组载体pQE-80L-T7 RNAP为模板进行T7 RNAP基因(SEQ ID NO.15)的PCR扩增,通过改变反应体系中Mn2+浓度来获得不同突变程度的扩增产物;然后将产物与载体pQE-80L连接构建重组表达载体,电转至感受态E.coli 10G中,根据长出的克隆子计算库容量达106以上,优选出引入突变频率为2~3个碱基每个基因的Mn2+浓度为0.2mM-0.3mM。First, PCR amplification of the T7 RNAP gene (SEQ ID NO.15) was performed using the recombinant vector pQE-80L-T7 RNAP containing the wild-type T7 RNAP gene as a template, and amplification products with different mutation degrees were obtained by changing the Mn 2+ concentration in the reaction system; then the product was connected with the vector pQE-80L to construct a recombinant expression vector, which was electroporated into competent E.coli 10G, and the library capacity was calculated to be more than 106 based on the grown clones, and the Mn 2+ concentration for introducing a mutation frequency of 2 to 3 bases per gene was preferably 0.2mM-0.3mM.

然后,将得到的混合质粒突变体文库电转入新鲜的E.coli BL21(DE3)感受态中,获得含107左右的克隆子的突变体文库。按照蛋白表达的步骤对大肠杆菌进行诱导、表达并使用FADS平台对单细胞及反应体系进行包裹,单细胞包裹率控制在10%左右以提高筛选的准确度。体外孵育过程中,将液滴在45℃左右的温度下处理适当时间,活性降至原来的三分之一左右,以达到高温处理的效果,之后再在37℃下孵育3h。Then, the obtained mixed plasmid mutant library was electroporated into fresh E. coli BL21 (DE3) competent cells to obtain a mutant library containing about 107 clones. According to the steps of protein expression, E. coli was induced and expressed, and the single cell and reaction system were encapsulated using the FADS platform. The single cell encapsulation rate was controlled at about 10% to improve the accuracy of screening. During in vitro incubation, the droplets were treated at a temperature of about 45°C for an appropriate time, and the activity was reduced to about one-third of the original to achieve the effect of high temperature treatment, and then incubated at 37°C for 3h.

利用FADS对突变体文库进行分选前,先通过对FITC和墨水的分选来确定最适的激光斑位置、分选速度等条件。之后将孵育后的微液滴注入检测、分选芯片中。将表现出最高活力的0.42%左右的液滴收集于1.5mL离心管中,阳性基因通过PCR回收,通过对目的片段进行BamHI、HindIII双酶切处理,重新构建至pQE-80L线性化载体,再次将连接载体转化至BL21(DE3)感受态中,挑取平板上长出的单克隆以进行复筛。Before using FADS to sort the mutant library, the most suitable laser spot position, sorting speed and other conditions are determined by sorting FITC and ink. Then the incubated microdroplets are injected into the detection and sorting chip. The droplets of about 0.42% showing the highest activity are collected in a 1.5mL centrifuge tube, and the positive gene is recovered by PCR. The target fragment is treated with BamHI and HindIII double enzymes and reconstructed into the pQE-80L linearized vector. The ligated vector is transformed into the BL21 (DE3) competent state again, and the single clone grown on the plate is picked for rescreening.

在分选过程中,还对不同批次的转化突变体文库进行几轮分选,将得到的阳性基因进行混合回收,以确保能分选到更多的有益突变点。从上一轮得到的克隆子中,挑取新鲜的单克隆于96孔板中,以野生型BL21(DE3)-T7 RNAP单克隆为对照。进行诱导、表达之后,利用细胞裂解液裂解制备粗酶液,将粗酶液于45℃下热处理5min后,迅速置于冰上,再将粗酶液转移至384孔板中,利用STAR体系(基于iSpinach适配体的实时转录活性监测体系(iSpinach aptamer-based monitoring of transcriptionactivity in real-time,STAR),参见申请号:CN202211274914.8,发明名称为“一种实时监测体外转录合成RNA的检测方法”的中国专利申请,该体系的介绍及使用方法详见CN115896213A文本)进行酶活测定。然后将热处理之后残余活性是野生型至少1.5倍的突变体进行测序,通过测序结果分析突变点的分布。During the sorting process, several rounds of sorting were performed on different batches of transformed mutant libraries, and the obtained positive genes were mixed and recovered to ensure that more beneficial mutation points could be sorted. From the clones obtained in the previous round, fresh single clones were picked in a 96-well plate, and the wild-type BL21 (DE3)-T7 RNAP single clone was used as a control. After induction and expression, the crude enzyme solution was prepared by lysing the cell with cell lysis solution, and the crude enzyme solution was heat treated at 45°C for 5 minutes, and then quickly placed on ice. The crude enzyme solution was transferred to a 384-well plate and the enzyme activity was measured using the STAR system (iSpinach aptamer-based monitoring of transcription activity in real-time, STAR, see application number: CN202211274914.8, the invention name is "A detection method for real-time monitoring of in vitro transcribed RNA synthesis", the introduction and use of the system are detailed in CN115896213A text). Then the mutants whose residual activity after heat treatment is at least 1.5 times that of the wild type are sequenced, and the distribution of mutation points is analyzed by sequencing results.

根据上述突变点的分布构建携带编码T7 RNAP突变体的核苷酸序列的重组表达载体,并转化到BL21(DE3)大肠杆菌感受态细胞中,获得T7 RNP野生型和突变体的重组菌株,诱导表达并通过蛋白纯化制备T7 RNAP突变体。然后,通过体外转录反应合成单链RNA适配体,该单链RNA能够与特定的化学小分子DFHBI结合产生显著的荧光信号增强,通过监测反应过程中实时荧光的变化,来测定不同突变体与T7 RNAP-WT的相对活性。进一步的,通过T7 RNAP突变体蛋白溶液与Sypro Orange染料结合,利用差示扫描荧光法测定不同突变体的Tm值。T7 RNAP突变体的应用性能通过分析体外转录反应(In vitro transcription,IVT)合成的产物来实现,包括IVT产量,产物纯度,产物中的dsRNA含量以及产物的3'端一致性指标。According to the distribution of the above mutation points, a recombinant expression vector carrying a nucleotide sequence encoding a T7 RNAP mutant was constructed and transformed into BL21 (DE3) Escherichia coli competent cells to obtain recombinant strains of T7 RNP wild type and mutant, induced expression and prepared T7 RNAP mutants by protein purification. Then, a single-stranded RNA aptamer was synthesized by in vitro transcription reaction. The single-stranded RNA can bind to a specific chemical small molecule DFHBI to produce a significant fluorescence signal enhancement. The relative activity of different mutants and T7 RNAP-WT was determined by monitoring the change of real-time fluorescence during the reaction. Further, the Tm values of different mutants were determined by differential scanning fluorimetry by combining the T7 RNAP mutant protein solution with Sypro Orange dye. The application performance of T7 RNAP mutants is achieved by analyzing the products synthesized by in vitro transcription (IVT), including IVT yield, product purity, dsRNA content in the product and 3' end consistency index of the product.

在上述实施方式中,筛选出了多个能够提高T7 RNA聚合酶突变体热稳定性的突变位点,因此,本发明还提供了通过上述筛选流程筛选出的T7 RNA聚合酶突变体。本文所述的T7RNA聚合酶突变体,是对照野生型T7 RNA聚合酶,至少存在一个氨基酸残基差异的多肽。在可选的实施方式中,对照的野生型T7 RNA聚合酶的氨基酸序列如SEQ ID NO.1所示。In the above embodiment, multiple mutation sites that can improve the thermal stability of the T7 RNA polymerase mutant are screened out. Therefore, the present invention also provides a T7 RNA polymerase mutant screened out by the above screening process. The T7 RNA polymerase mutant described herein is a polypeptide that has at least one amino acid residue difference compared to the wild-type T7 RNA polymerase. In an optional embodiment, the amino acid sequence of the wild-type T7 RNA polymerase is shown in SEQ ID NO.1.

本发明提供的T7 RNA聚合酶突变体存在如下中的至少一种氨基酸残基:对应于SEQ ID NO.1所示氨基酸序列,217位为L(亮氨酸),397位为A(丙氨酸)或W(色氨酸),和,579位为A(丙氨酸)或W(色氨酸)。含有前述氨基酸残基的T7 RNA聚合酶突变体具有很好的热稳定性。The T7 RNA polymerase mutant provided by the present invention has at least one of the following amino acid residues: corresponding to the amino acid sequence shown in SEQ ID NO.1, position 217 is L (leucine), position 397 is A (alanine) or W (tryptophan), and position 579 is A (alanine) or W (tryptophan). The T7 RNA polymerase mutant containing the aforementioned amino acid residues has good thermal stability.

本文中所述的对应于SEQ ID NO.1所示氨基酸序列,指的是当将T7 RNA聚合酶突变体与SEQ ID NO.1所示氨基酸序列进行比对,以SEQ ID NO.1所示氨基酸序列中位点的位点序数确定的氨基酸残基。比如当T7 RNA聚合酶突变体的氮端一个或几个氨基酸残基的缺失,T7 RNA聚合酶突变体对应于SEQ ID NO.1所示氨基酸序列中的某一氨基酸残基在SEQ ID NO.1中位点的序数和在T7 RNA聚合酶突变体的位点的序数不同,例如,当T7 RNA聚合酶突变体的氮端缺失N个氨基酸残基时,一氨基酸残基在SEQ ID NO.1所示氨基酸序列中位于第i位,在T7 RNA聚合酶突变体的氨基酸序列中位于第i-N位。本文中以氨基酸残基在对应于SEQ ID NO.1所示氨基酸序列中的位置对其进行限定,而非在突变体中的位置。The amino acid sequence corresponding to SEQ ID NO.1 as described herein refers to the amino acid residue determined by the position number of the position in the amino acid sequence shown in SEQ ID NO.1 when the T7 RNA polymerase mutant is aligned with the amino acid sequence shown in SEQ ID NO.1. For example, when one or more amino acid residues are missing at the nitrogen end of the T7 RNA polymerase mutant, the position number of a certain amino acid residue in the amino acid sequence shown in SEQ ID NO.1 of the T7 RNA polymerase mutant corresponding to the amino acid sequence shown in SEQ ID NO.1 is different from the position number of the position in the T7 RNA polymerase mutant. For example, when the nitrogen end of the T7 RNA polymerase mutant lacks N amino acid residues, an amino acid residue is located at the i-th position in the amino acid sequence shown in SEQ ID NO.1 and is located at the i-N position in the amino acid sequence of the T7 RNA polymerase mutant. In this article, the amino acid residue is defined by its position in the amino acid sequence corresponding to SEQ ID NO.1, rather than its position in the mutant.

并且,本发明提供的T7 RNA聚合酶突变体与SEQ ID NO.1至少具有80%同一性,例如可以为但不限于为与SEQ ID NO.1至少80%、85%、90%或95%同一性。本文中“同一性”或类似术语是指,在最佳比对下,待比较序列之间相同核苷酸或氨基酸之百分比。该百分比为纯粹统计学的,且两个序列之间的差异可能为(但不一定)随机分布于待比较序列之整个长度上。两个序列之比较通常藉由在最佳比对之后,相对于片段或“比较窗口”以鉴别对应序列之局部区。Furthermore, the T7 RNA polymerase mutant provided by the present invention has at least 80% identity with SEQ ID NO.1, for example, it may be but is not limited to at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1. "Identity" or similar terms herein refer to the percentage of identical nucleotides or amino acids between the sequences to be compared under optimal alignment. The percentage is purely statistical, and the differences between the two sequences may be (but not necessarily) randomly distributed over the entire length of the sequences to be compared. The comparison of two sequences is typically performed by identifying local regions of corresponding sequences relative to fragments or "comparison windows" after optimal alignment.

可以理解的是,与SEQ ID NO.1至少具有80%同一性的T7 RNA聚合酶突变体含有前述217位为L,397位为A或W,和,579位为A或W中的至少一个氨基酸残基。It is understood that a T7 RNA polymerase mutant having at least 80% identity with SEQ ID NO.1 contains at least one of the amino acid residues at position 217 being L, at position 397 being A or W, and at position 579 being A or W.

在可选的实施方式中,所述T7 RNA聚合酶突变体含有对应于SEQ ID NO.1所示氨基酸序列第217位为L的氨基酸残基,且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。In an optional embodiment, the T7 RNA polymerase mutant contains an amino acid residue corresponding to L at position 217 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.

在可选的实施方式中,所述T7 RNA聚合酶突变体含有对应于SEQ ID NO.1所示氨基酸序列第397位为A或W的氨基酸残基,且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性,其中优选第397位为W。In an optional embodiment, the T7 RNA polymerase mutant contains an amino acid residue that is A or W at position 397 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1, wherein preferably position 397 is W.

在可选的实施方式中,所述T7 RNA聚合酶突变体含有对应于SEQ ID NO.1所示氨基酸序列第579位为A或W的氨基酸残基,且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性,其中优选第579位为W。In an optional embodiment, the T7 RNA polymerase mutant contains an amino acid residue that is A or W at position 579 of the amino acid sequence shown in SEQ ID NO.1, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1, wherein preferably position 579 is W.

在可选的实施方式中,所述T7 RNA聚合酶突变体除去前述氨基酸残基中的至少一种,还包括如下中的至少一种:对应于SEQ ID NO.1所示氨基酸序列,第369位、第430位、633位、786位、744位、723位和724位至少一个位点与SEQ ID NO.1不同。In an optional embodiment, the T7 RNA polymerase mutant removes at least one of the aforementioned amino acid residues and also includes at least one of the following: corresponding to the amino acid sequence shown in SEQ ID NO.1, at least one of the positions 369, 430, 633, 786, 744, 723 and 724 is different from SEQ ID NO.1.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第633位为非极性疏水性氨基酸,进一步优选为P(脯氨酸)、L(亮氨酸)、V(缬氨酸)、M(蛋氨酸)或F(苯丙氨酸);更进一步优选为P,且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。第633位突变有利于T7 RNA聚合酶稳定性提高。In an optional embodiment, the 633rd position corresponding to the amino acid sequence shown in SEQ ID NO.1 is a non-polar hydrophobic amino acid, further preferably P (proline), L (leucine), V (valine), M (methionine) or F (phenylalanine); further preferably P, and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1. The mutation at position 633 is beneficial to improve the stability of T7 RNA polymerase.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第744位为R(精氨酸)、P(脯氨酸)或L(亮氨酸);进一步优选为R;且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。第744位突变有利于T7 RNA聚合酶活性提高。 In an optional embodiment, the 744th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is R (arginine), P (proline) or L (leucine); more preferably R; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1. The mutation at position 744 is beneficial to the improvement of T7 RNA polymerase activity.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第786位为Y(酪氨酸)或非极性疏水性氨基酸;进一步优选为Y、L(亮氨酸)、M(蛋氨酸)、W(色氨酸)或F(苯丙氨酸);更进一步优选为L;且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。第786位突变有利于T7 RNA聚合酶稳定性提高。In an optional embodiment, the 786th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is Y (tyrosine) or a non-polar hydrophobic amino acid; more preferably Y, L (leucine), M (methionine), W (tryptophan) or F (phenylalanine); more preferably L; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1. The mutation at position 786 is beneficial to improve the stability of T7 RNA polymerase.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第723位为E(谷氨酸)或S(丝氨酸);进一步优选为E;且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。第723位突变有利于改善蛋白聚集。In an optional embodiment, the 723rd position corresponding to the amino acid sequence shown in SEQ ID NO.1 is E (glutamic acid) or S (serine); more preferably E; and has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1. The mutation at position 723 is beneficial to improve protein aggregation.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第369位为T(苏氨酸)或E(谷氨酸);且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。In an optional embodiment, the 369th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is T (threonine) or E (glutamate); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第579位为W(色氨酸)或A(丙氨酸);进一步优选为W;且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。In an optional embodiment, the 579th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is W (tryptophan) or A (alanine); further preferably W; and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第430位为P(脯氨酸);且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。In an optional embodiment, the 430th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is P (proline); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.

在可选的实施方式中,对应于SEQ ID NO.1所示氨基酸序列的第724位为N(天冬酰胺);且与SEQ ID NO.1至少具有80%、85%、90%或95%同一性。In an optional embodiment, the 724th position corresponding to the amino acid sequence shown in SEQ ID NO.1 is N (asparagine); and it has at least 80%, 85%, 90% or 95% identity with SEQ ID NO.1.

在可选的实施方式中,所述T7 RNA聚合酶突变体选自如下中的任意一种:In an optional embodiment, the T7 RNA polymerase mutant is selected from any one of the following:

(a)在SEQ ID NO.1所示氨基酸序列中至少存在I217L突变;即SEQ ID NO.1的第217位由异亮氨酸突变为亮氨酸。(a) There is at least an I217L mutation in the amino acid sequence shown in SEQ ID NO.1; that is, position 217 of SEQ ID NO.1 mutates from isoleucine to leucine.

(b)在SEQ ID NO.1所示氨基酸序列中至少存在S397W突变;即SEQ ID NO.1的第397位由丝氨酸突变为色氨酸。(b) There is at least S397W mutation in the amino acid sequence shown in SEQ ID NO.1; that is, position 397 of SEQ ID NO.1 mutates from serine to tryptophan.

(c)在SEQ ID NO.1所示氨基酸序列中至少存在N579W突变;即SEQ ID NO.1的第579位由天冬酰胺突变为色氨酸。(c) There is at least N579W mutation in the amino acid sequence shown in SEQ ID NO.1; that is, position 579 of SEQ ID NO.1 mutates from asparagine to tryptophan.

(d)在SEQ ID NO.1所示氨基酸序列中至少存在I217L、S397W、S430P、S633P和Q786L突变;即SEQ ID NO.1的第217位由异亮氨酸突变为亮氨酸,第397位由丝氨酸突变为色氨酸,第430位由丝氨酸突变为脯氨酸,第633位由丝氨酸突变为脯氨酸,第786位谷氨酰胺由突变为亮氨酸。(d) at least I217L, S397W, S430P, S633P and Q786L mutations exist in the amino acid sequence shown in SEQ ID NO.1; that is, position 217 of SEQ ID NO.1 mutates from isoleucine to leucine, position 397 mutates from serine to tryptophan, position 430 mutates from serine to proline, position 633 mutates from serine to proline, and position 786 of glutamine mutates from leucine.

(e)在SEQ ID NO.1所示氨基酸序列中至少存在I217L、S397W、S430P、S633P、Q786L和N579W突变;即SEQ ID NO.1的第217位由异亮氨酸突变为亮氨酸,第397位由丝氨酸突变为色氨酸,第430位由丝氨酸突变为脯氨酸,第633位由丝氨酸突变为脯氨酸,第786位谷氨酰胺由突变为亮氨酸,第579位由天冬酰胺突变为色氨酸。(e) in the amino acid sequence shown in SEQ ID NO.1, there are at least I217L, S397W, S430P, S633P, Q786L and N579W mutations; that is, position 217 of SEQ ID NO.1 mutates from isoleucine to leucine, position 397 mutates from serine to tryptophan, position 430 mutates from serine to proline, position 633 mutates from serine to proline, position 786 mutates from glutamine to leucine, and position 579 mutates from asparagine to tryptophan.

(f)在SEQ ID NO.1所示氨基酸序列中至少存在I217L、S397W、S430P、S633P、Q786L和Q744R突变;即SEQ ID NO.1的第217位由异亮氨酸突变为亮氨酸,第397位由丝氨酸突变为色氨酸,第430位由丝氨酸突变为脯氨酸,第633位由丝氨酸突变为脯氨酸,第786位谷氨酰胺由突变为亮氨酸,第744位由谷氨酰胺突变为精氨酸。(f) in the amino acid sequence shown in SEQ ID NO.1, there are at least I217L, S397W, S430P, S633P, Q786L and Q744R mutations; that is, position 217 of SEQ ID NO.1 mutates from isoleucine to leucine, position 397 mutates from serine to tryptophan, position 430 mutates from serine to proline, position 633 mutates from serine to proline, position 786 mutates from glutamine to leucine, and position 744 mutates from glutamine to arginine.

(g)在SEQ ID NO.1所示氨基酸序列中至少存在I217L、S397W、S430P、S633P、Q786L、C723E和A724N突变。即SEQ ID NO.1的第217位由异亮氨酸突变为亮氨酸,第397位由丝氨酸突变为色氨酸,第430位由丝氨酸突变为脯氨酸,第633位由丝氨酸突变为脯氨酸,第786位谷氨酰胺由突变为亮氨酸,第723位由半胱氨酸突变为谷氨酸,第724位由丙氨酸突变为天冬酰胺。(g) at least I217L, S397W, S430P, S633P, Q786L, C723E and A724N mutations exist in the amino acid sequence shown in SEQ ID NO.1. That is, the 217th position of SEQ ID NO.1 mutates from isoleucine to leucine, the 397th position mutates from serine to tryptophan, the 430th position mutates from serine to proline, the 633rd position mutates from serine to proline, the 786th position of glutamine mutates from leucine, the 723rd position mutates from cysteine to glutamic acid, and the 724th position of alanine mutates from asparagine.

在可选的实施方式中,所述T7 RNA聚合酶突变体的氨基酸序列SEQ ID NO.2(对应I217L)、SEQ ID NO.3(对应S397W)、SEQ ID NO.4(对应T121M)、SEQ ID NO.5(对应S223M)、SEQ ID NO.6(对应R632G)、SEQ ID NO.7(对应I217L/S397W/S430P/S633P/Q786L)或SEQ ID NO.8(对应I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y)所示、SEQ ID NO.79(对应M369T)、SEQ ID NO.80(对应N579W)、SEQ ID NO.81(对应I217L/S397W/S430P/S633P/Q786L/N579W)、SEQ ID NO.82(对应I217L/S397W/S430P/S633P/Q786L/Q744R)、SEQ ID NO.83(对应I217L/S397W/S430P/S633P/Q786L/C723E/A724N)所示。In an optional embodiment, the amino acid sequence of the T7 RNA polymerase mutant is SEQ ID NO.2 (corresponding to I217L), SEQ ID NO.3 (corresponding to S397W), SEQ ID NO.4 (corresponding to T121M), SEQ ID NO.5 (corresponding to S223M), SEQ ID NO.6 (corresponding to R632G), SEQ ID NO.7 (corresponding to I217L/S397W/S430P/S633P/Q786L) or SEQ ID NO.8 (corresponding to I217L/S397W/S430P/S633P/Q786L /Q744R/S43Y), SEQ ID NO.79 (corresponding to M369T), SEQ ID NO.80 (corresponding to N579W), SEQ ID NO.81 (corresponding to I217L/S397W/S430P/S633P/Q786L/N579W), SEQ ID NO.82 (corresponding to I217L/S397W/S430P/S633P/Q786L/Q744R), and SEQ ID NO.83 (corresponding to I217L/S397W/S430P/S633P/Q786L/C723E/A724N).

SEQ ID NO.2为单点突变I217L的T7 RNA聚合酶突变体,其Tm值为50.5±0.21℃,在50℃处理5min后的残余活性为52%,以野生型的酶活性为100%作为参考,该突变体活性是134%。SEQ ID NO.2 is a T7 RNA polymerase mutant with a single point mutation I217L, whose Tm value is 50.5±0.21℃. The residual activity after treatment at 50℃ for 5 minutes is 52%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 134%.

SEQ ID NO.3为单点突变S397W的T7 RNA聚合酶突变体,其Tm值为49.2±0.13℃,在50℃处理5min后的残余活性为47%,以野生型的酶活性为100%作为参考,该突变体活性是162%。SEQ ID NO.3 is a T7 RNA polymerase mutant with a single point mutation S397W, whose Tm value is 49.2±0.13℃. The residual activity after treatment at 50℃ for 5 minutes is 47%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 162%.

SEQ ID NO.80为单点突变N579W的T7 RNA聚合酶突变体,其Tm值为48.0±0.23℃,在50℃处理5min后的残余活性为38%。SEQ ID NO.80 is a T7 RNA polymerase mutant with a single point mutation N579W, whose Tm value is 48.0±0.23℃ and the residual activity after treatment at 50℃ for 5 minutes is 38%.

SEQ ID NO.7为组合突变I217L、S397W、S430P、S633P和Q786L的T7 RNA聚合酶突变体,Tm值为54.52±0.20℃,在58℃热处理15min后的残余活性为58%,SEQ ID NO.7 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P and Q786L, with a Tm value of 54.52±0.20℃ and a residual activity of 58% after heat treatment at 58℃ for 15min.

SEQ ID NO.81为组合突变I217L、S397W、S430P、S633P、Q786L和N579W的T7 RNA聚合酶突变体,Tm值为55.56±0.16℃,在58℃热处理15min后的残余活性为65%, SEQ ID NO.81 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L and N579W, with a Tm value of 55.56±0.16°C and a residual activity of 65% after heat treatment at 58°C for 15 minutes.

SEQ ID NO.82为组合突变I217L、S397W、S430P、S633P、Q786L和Q744R的T7 RNA聚合酶突变体,Tm值为54.50±0.18℃,在58℃热处理15min后的残余活性为87%,以野生型的酶活性为100%作为参考,该突变体活性是195%。SEQ ID NO.82 is a T7 RNA polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L and Q744R. The Tm value is 54.50±0.18℃. The residual activity after heat treatment at 58℃ for 15min is 87%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 195%.

SEQ ID NO.83为组合突变I217L、S397W、S430P、S633P、Q786L和C723E和A724N聚合酶突变体,Tm值为54.54±0.23℃,在58℃热处理15min后的残余活性为81%,以野生型的酶活性为100%作为参考,该突变体活性是183%。SEQ ID NO.83 is a polymerase mutant with combined mutations I217L, S397W, S430P, S633P, Q786L, C723E and A724N. The Tm value is 54.54±0.23℃. The residual activity after heat treatment at 58℃ for 15min is 81%. Taking the enzyme activity of the wild type as 100% as a reference, the activity of this mutant is 183%.

根据本发明的另一个方面,本发明还提供了编码上述T7 RNA聚合酶突变体的多核苷酸。According to another aspect of the present invention, the present invention also provides a polynucleotide encoding the above-mentioned T7 RNA polymerase mutant.

本文中“多核苷酸”指的是任何长度的核苷酸的聚合物形式,多核苷酸包括核糖核苷酸和/或脱氧核糖核苷酸。多核苷酸的实例包括但不限于单链、双链或多链DNA或RNA、基因组DNA、cDNA、DNA-RNA杂合体或者包含嘌呤和嘧啶碱基或其他天然、化学或生化修饰、非天然或衍生的核苷酸碱基的聚合物。多核苷酸编码上述T7 RNA聚合酶突变体,编码可选地为编码正义链或反义链。多核苷酸可以是天然存在的、合成的、重组的或它们的任意组合。术语“多核苷酸”和“核酸”本文中可互换地使用。As used herein, "polynucleotide" refers to a polymeric form of nucleotides of any length, including ribonucleotides and/or deoxyribonucleotides. Examples of polynucleotides include, but are not limited to, single-stranded, double-stranded, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or polymers containing purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural or derived nucleotide bases. The polynucleotide encodes the T7 RNA polymerase mutant described above, optionally encoding the sense strand or the antisense strand. The polynucleotide may be naturally occurring, synthetic, recombinant, or any combination thereof. The terms "polynucleotide" and "nucleic acid" are used interchangeably herein.

根据本发明的另一个方面,本发明还提供了一种载体,该载体携带编码上述T7 RNA聚合酶突变体的多核苷酸。所述载体是本领域技术人员公知的,包括但不限于:质粒;噬菌粒;柯斯质粒;人工染色体,例如酵母人工染色体(YAC)、细菌人工染色体(BAC)或P1来源的人工染色体(PAC);噬菌体如λ噬菌体或M13噬菌体及动物病毒等。可用作载体的动物病毒包括但不限于,逆转录酶病毒(包括慢病毒)、腺病毒、腺相关病毒、疱疹病毒(如单纯疱疹病毒)、痘病毒、杆状病毒、乳头瘤病毒、乳头多瘤空泡病毒。在一些实施方式中,本发明所述载体中包含基因工程中常用的调控元件,例如增强子、启动子、内部核糖体进入位点(IRES)和其他表达控制元件(例如转录终止信号,或者多腺苷酸化信号和多聚U序列等)。According to another aspect of the present invention, the present invention also provides a vector, which carries a polynucleotide encoding the above-mentioned T7 RNA polymerase mutant. The vector is well known to those skilled in the art, including but not limited to: plasmid; phagemid; cosmid; artificial chromosome, such as yeast artificial chromosome (YAC), bacterial artificial chromosome (BAC) or P1-derived artificial chromosome (PAC); bacteriophage such as λ phage or M13 phage and animal virus, etc. Animal viruses that can be used as vectors include but are not limited to retroviruses (including lentiviruses), adenoviruses, adeno-associated viruses, herpes viruses (such as herpes simplex virus), poxviruses, baculoviruses, papillomaviruses, and papillomaviruses. In some embodiments, the vector of the present invention contains regulatory elements commonly used in genetic engineering, such as enhancers, promoters, internal ribosome entry sites (IRES) and other expression control elements (such as transcription termination signals, or polyadenylation signals and poly-U sequences, etc.).

根据本发明的另一个方面,本发明还提供了一种细胞,该细胞携带前述多核苷酸或前述载体,或表达前述T7 RNA聚合酶突变体。所述细胞可以指单个细胞,也可以指细胞系,或细胞培养物。本文中细胞包括其后代,后代可以由于天然的、偶然的或故意的突变不一定与原代细胞完全相同,在形态学上和/或在基因组DNA上与原代细胞存在差异。所述细胞可以为天然细胞或转化体。According to another aspect of the present invention, the present invention further provides a cell, which carries the aforementioned polynucleotide or the aforementioned vector, or expresses the aforementioned T7 RNA polymerase mutant. The cell may refer to a single cell, a cell line, or a cell culture. The cell herein includes its progeny, which may not necessarily be completely identical to the primary cell due to natural, accidental or intentional mutations, and may differ from the primary cell in morphology and/or in genomic DNA. The cell may be a natural cell or a transformant.

根据本发明的另一个方面,本发明还提供了一种多核苷酸的合成方法,所述方法用到本发明的RNA聚合酶突变体。该多核苷酸的合成方法包括使用包含前述T7 RNA聚合酶突变体的酶体系催化合成反应。所述合成方法包括体外转录合成(In Vitro Transcription,IVT)、等温扩增,例如序列特异性核酸体外扩增(Nucleic Acid Specific-Based Amplification,NASBA)或转录介导的扩增(Transcription mediated amplification,TMA)According to another aspect of the present invention, the present invention also provides a method for synthesizing a polynucleotide, wherein the method uses the RNA polymerase mutant of the present invention. The method for synthesizing the polynucleotide comprises using an enzyme system comprising the aforementioned T7 RNA polymerase mutant to catalyze a synthesis reaction. The synthesis method comprises in vitro transcription synthesis (In Vitro Transcription, IVT), isothermal amplification, such as sequence-specific nucleic acid in vitro amplification (Nucleic Acid Specific-Based Amplification, NASBA) or transcription-mediated amplification (Transcription mediated amplification, TMA)

根据本发明的另一个方面,本发明还提供了前述T7 RNA聚合酶突变体或前述多核苷酸的合成方法在如下任意一项中的应用:mRNA疫苗制备、核酸药物制备、基因编辑和蛋白表达系统构建中的至少一种,蛋白表达系统构建包括但不限于体内蛋白质表达系统构建或无细胞蛋白表达体外翻译系统构建。According to another aspect of the present invention, the present invention also provides the use of the aforementioned T7 RNA polymerase mutant or the aforementioned polynucleotide synthesis method in any one of the following: at least one of mRNA vaccine preparation, nucleic acid drug preparation, gene editing and protein expression system construction, and the protein expression system construction includes but is not limited to in vivo protein expression system construction or cell-free protein expression in vitro translation system construction.

根据本发明的另一个方面,本发明还提供了一种用于多核苷酸合成的试剂盒,该试剂盒包含前述T7 RNA聚合酶突变体。可以理解的是,本发明提供的用于多核苷酸合成的试剂盒还可以包含本领域可接受的其他的检测试剂或耗材,本领域技术人员可以根据本领域众所周知,且如各种一般和更具体的教材、参考文献、工艺手册、商品说明、标准文件和设备说明书等记载的内容选择相应的试剂和/或耗材。试剂盒中具体的可选的试剂和耗材包括但不限于缓冲液、盐、金属离子、dNTPs、酶、引物、探针、荧光染料、发光底物、对照品、质控品和校准品中的一种或多种。According to another aspect of the present invention, the present invention also provides a kit for polynucleotide synthesis, which comprises the aforementioned T7 RNA polymerase mutant. It is understandable that the kit for polynucleotide synthesis provided by the present invention may also comprise other detection reagents or consumables acceptable in the art, and those skilled in the art may select corresponding reagents and/or consumables based on the contents well known in the art and recorded in various general and more specific teaching materials, references, process manuals, product descriptions, standard documents and equipment manuals. Specific optional reagents and consumables in the kit include, but are not limited to, one or more of buffers, salts, metal ions, dNTPs, enzymes, primers, probes, fluorescent dyes, luminescent substrates, reference substances, quality control substances and calibrators.

下面通过具体的实施例进一步说明本发明,但是,应当理解为,这些实施例仅仅是用于更详细地说明之用,而不应理解为用于以任何形式限制本发明。The present invention is further described below by means of specific examples. However, it should be understood that these examples are only used for more detailed description and should not be construed as limiting the present invention in any form.

以下对本发明的优选实施例进行说明,应当理解实施The following describes the preferred embodiments of the present invention.

例是为了更好地解释本发明,不用于限制本发明。The examples are for better explanation of the present invention and are not used to limit the present invention.

实施例1:突变文库构建与筛选Example 1: Construction and screening of mutation library

1、突变文库构建1. Construction of mutation library

将野生型T7 RNA聚合酶基因核苷酸序列(对应的氨基酸序列如SEQ ID NO.1)按照大肠杆菌密码子优化得到核苷酸序列SEQ ID NO.15,再将优化后的核苷酸序列插入到载体pQE-80L(Qiagen,货号HG-VYQ0254)的BamHI、HindIII两个限制性内切酶位点之间,即得到含有野生型T7 RNAP基因的重组载体pQE-80L-T7 RNAP,质粒结构如图1所示,以重组载体pQE-80L-T7 RNAP为模板进行T7 RNAP基因的扩增,突变程度主要通过改变反应体系中Mn2+浓度来实现。分别设置0.1mM、0.2mM、0.3mM、0.4mM、0.5mM、0.6mM Mn2+浓度梯度进行基因扩增,突变率以每条基因1~3个突变点为最佳。The wild-type T7 RNA polymerase gene nucleotide sequence (corresponding amino acid sequence such as SEQ ID NO.1) was optimized according to the Escherichia coli codon to obtain the nucleotide sequence SEQ ID NO.15, and then the optimized nucleotide sequence was inserted between the two restriction endonuclease sites of BamHI and HindIII of the vector pQE-80L (Qiagen, item number HG-VYQ0254), that is, the recombinant vector pQE-80L-T7 RNAP containing the wild-type T7 RNAP gene was obtained. The plasmid structure is shown in Figure 1. The recombinant vector pQE-80L-T7 RNAP was used as a template to amplify the T7 RNAP gene, and the degree of mutation was mainly achieved by changing the Mn 2+ concentration in the reaction system. 0.1mM, 0.2mM, 0.3mM, 0.4mM, 0.5mM, 0.6mM Mn 2+ concentration gradients were set for gene amplification, and the mutation rate was optimal with 1 to 3 mutation points per gene.

扩增体系如下:pQE-80L-T7 RNAP质粒(0.2~1ng/μL),dATP(0.25mM),dGTP(0.25mM),dCTP(1.05mM),dTTP(1.05mM),DreamTaq Buffer(Takara),DreamTaq(Takara,0.1U/μL),MnCl2(0.1mM-0.6mM),T7 RNAP-F(0.2μM),T7 RNAP-R(0.2μM),PCR总反应体系为50μL。 The amplification system was as follows: pQE-80L-T7 RNAP plasmid (0.2-1 ng/μL), dATP (0.25 mM), dGTP (0.25 mM), dCTP (1.05 mM), dTTP (1.05 mM), DreamTaq Buffer (Takara), DreamTaq (Takara, 0.1 U/μL), MnCl 2 (0.1 mM-0.6 mM), T7 RNAP-F (0.2 μM), T7 RNAP-R (0.2 μM), and the total PCR reaction system was 50 μL.

其中,扩增引物T7 RNAP-F和T7 RNAP-R的DNA序列如表1所示,扩增PCR程序如表2所示:Among them, the DNA sequences of amplification primers T7 RNAP-F and T7 RNAP-R are shown in Table 1, and the amplification PCR program is shown in Table 2:

表1突变文库构建扩增引物序列
Table 1 Primer sequences for construction of mutation library

表2 PCR扩增程序
Table 2 PCR amplification program

目的片段大小通过琼脂糖凝胶电泳分析,试剂盒回收,随后通过BamHI和HindIII双酶切处理,然后纯化回收酶切产物,在T4 DNA ligase的作用下与线性化pQE-80L载体(经BamHI、HindIII双酶切后)过夜连接。The target fragment size was analyzed by agarose gel electrophoresis, recovered by the kit, and then double-digested with BamHI and HindIII. The digested product was then purified and recovered, and ligated with the linearized pQE-80L vector (double-digested with BamHI and HindIII) overnight under the action of T4 DNA ligase.

将回收后的连接产物,电转至电转感受态E.coli 10G中,根据长出的克隆子计算库容量,库容量均达106以上。在0.2mM Mn2+条件下引入频率为1.7个碱基突变/基因,在0.3mM Mn2+条件下引入频率为3.5个碱基突变/基因。因此本实施例将0.2mM Mn2+和0.3mM Mn2+条件下获得的突变质粒进行混合,得到每条基因平均被引入2.6个突变碱基的突变体文库。The recovered ligation product was electrotransformed into electrocompetent E. coli 10G, and the library capacity was calculated based on the clones grown, and the library capacity was more than 10 6. The frequency of introduction under 0.2 mM Mn 2+ was 1.7 base mutations/gene, and the frequency of introduction under 0.3 mM Mn 2+ was 3.5 base mutations/gene. Therefore, in this example, the mutant plasmids obtained under 0.2 mM Mn 2+ and 0.3 mM Mn 2+ were mixed to obtain a mutant library in which an average of 2.6 mutant bases were introduced into each gene.

2、突变体库筛选2. Screening of mutant library

将得到的混合质粒突变体文库电转入新鲜的E.coli BL21(DE3)感受态中,获得含107左右的克隆子突变体文库。整个筛选流程如图2所示,按照蛋白表达的步骤对大肠杆菌进行诱导、表达并使用FADS平台(液滴微流控细胞分选基于微流控超高通量荧光激活液滴分选平台,Fluorescence-Activated Droplet Sorting,FADS)对单细胞及反应体系进行包裹,所述反应体系与在专利CN 115896213 A中公布,包含单克隆子、40-100nM DNA模板ispinach、100μM DFHBI、0.5-6mM的NTPs、30mM MgCl2、5Mm NaCl、20mM DTT、0.002U/μL无机焦磷酸酶、0.2U/μL鼠源核糖核酸酶抑制剂。单细胞包裹率控制在10%左右以提高筛选的准确度。体外孵育过程中,需将液滴在45℃下热处理7min,活性会降至原来的三分之一左右,以达到高温处理的效果,之后再在37℃下孵育3h。利用FADS对突变体文库进行分选前,通过对FITC和墨水的分选来确定最适的激光斑位置、分选速度等条件。之后将孵育后的微液滴重注入检测、分选芯片中。The obtained mixed plasmid mutant library was electroporated into fresh E. coli BL21 (DE3) competent cells to obtain a clone mutant library containing about 10 7. The whole screening process is shown in Figure 2. According to the steps of protein expression, E. coli was induced and expressed, and the single cell and reaction system were encapsulated using the FADS platform (droplet microfluidic cell sorting based on microfluidic ultra-high throughput fluorescence-activated droplet sorting platform, Fluorescence-Activated Droplet Sorting, FADS). The reaction system was published in patent CN 115896213 A, including single clones, 40-100nM DNA template ispinach, 100μM DFHBI, 0.5-6mM NTPs, 30mM MgCl 2 , 5Mm NaCl, 20mM DTT, 0.002U/μL inorganic pyrophosphatase, and 0.2U/μL mouse ribonuclease inhibitor. The single-cell encapsulation rate is controlled at about 10% to improve the accuracy of screening. During in vitro incubation, the droplets need to be heat-treated at 45°C for 7 minutes, and the activity will drop to about one-third of the original to achieve the effect of high-temperature treatment, and then incubated at 37°C for 3 hours. Before using FADS to sort the mutant library, the optimal laser spot position, sorting speed and other conditions are determined by sorting FITC and ink. The incubated microdroplets are then re-injected into the detection and sorting chip.

根据突变体文库产生的信号分布情况,将PMT设置为0.43V、分选的小阈值和大阈值分别设置为0.2V和0.6V、分选电压为1.6V,收集表现出最高活力的0.42%左右的液滴于1.5mL离心管中,将阳性基因通过PCR回收,通过对目的片段进行SacI、HindIII双酶切处理,重新构建至pQE-80L线性化载体,再次将连接载体转化至BL21(DE3)感受态中,挑取平板上长出的单克隆以进行复筛。实际在分选过程中,我们对不同批次的转化突变体文库,进行了几轮分选,将得到的阳性基因进行混合回收,以确保能分选到更多有益的突变。According to the signal distribution of the mutant library, the PMT was set to 0.43V, the small and large thresholds of sorting were set to 0.2V and 0.6V respectively, and the sorting voltage was 1.6V. The droplets of about 0.42% showing the highest activity were collected in a 1.5mL centrifuge tube, and the positive genes were recovered by PCR. The target fragments were double-digested with SacI and HindIII, reconstructed into the pQE-80L linearized vector, and the ligated vector was transformed into BL21 (DE3) competent cells again, and the single clones grown on the plate were picked for rescreening. In the actual sorting process, we carried out several rounds of sorting on different batches of transformed mutant libraries, and mixed and recovered the obtained positive genes to ensure that more beneficial mutations could be sorted.

从上一轮得到的克隆子中,挑取约393个新鲜的单克隆于96孔板中,以三个野生型BL21(DE3)-T7 RNAP单克隆为对照。进行诱导、表达之后,利用细胞裂解液裂解制备粗酶液,将粗酶液于45℃下热处理5min后,迅速置于冰上,再将粗酶液转移至384孔板中,利用STAR体系(基于iSpinach适配体的实时转录活性监测体系(iSpinach aptamer-based monitoring of transcriptionactivity in real-time,STAR),参见申请号:CN202211274914.8,发明名称为“一种实时监测体外转录合成RNA的检测方法”的中国专利申请,该体系的介绍及使用方法详见CN115896213A文本)进行酶活性测定。将热处理之后残余活性是野生型至少1.5倍的突变体进行测序。通过测序结果分析发现,对应于亲本RNA聚合酶(即野生型T7 RNAP,氨基酸序列SEQ ID NO.1所示)的第217位、第397位、第121位、第223位、第632位、第369位和第579位突变点出现的频率相对较高。From the clones obtained in the previous round, about 393 fresh monoclones were picked in a 96-well plate, and three wild-type BL21 (DE3)-T7 RNAP monoclones were used as controls. After induction and expression, the crude enzyme solution was prepared by lysing the cell lysate, and the crude enzyme solution was heat-treated at 45°C for 5 minutes, quickly placed on ice, and then transferred to a 384-well plate. The enzyme activity was measured using the STAR system (iSpinach aptamer-based monitoring of transcription activity in real-time, STAR, see application number: CN202211274914.8, the invention name is "A detection method for real-time monitoring of in vitro transcribed synthetic RNA", the introduction and use of the system are detailed in CN115896213A text). Mutants whose residual activity after heat treatment is at least 1.5 times that of the wild type were sequenced. Analysis of the sequencing results revealed that the frequencies of mutations at positions 217, 397, 121, 223, 632, 369 and 579 corresponding to the parental RNA polymerase (i.e., wild-type T7 RNAP, as shown in the amino acid sequence SEQ ID NO.1) were relatively high.

实施例2:制备T7 RNAPExample 2: Preparation of T7 RNAP

1、表达T7 RNA聚合酶突变体的重组载体的构建1. Construction of recombinant vector expressing T7 RNA polymerase mutant

根据实施例1得到的结果,确定以氨基酸序列SEQ ID NO.1所示的野生型T7 RNAP为亲本,在亲本氨基酸序列的第217位氨基酸、第397位氨基酸,第121位氨基酸、第223位和第632位氨基酸作为关键位点,设计氨基酸序列如SEQ ID NO.2~8所示的突变体与组合突变体;设计现有技术已报道过的突变体S430P、S633P、Q786L、Q744R、S43Y及其突变组合作为对照,氨基酸序列如SEQ ID NO.9~14所示,即如下突变体:According to the results obtained in Example 1, the wild-type T7 RNAP shown in the amino acid sequence SEQ ID NO.1 was determined as the parent, and the 217th amino acid, the 397th amino acid, the 121st amino acid, the 223rd amino acid and the 632nd amino acid of the parent amino acid sequence were used as key sites to design mutants and combination mutants with amino acid sequences as shown in SEQ ID NO.2 to 8; mutants S430P, S633P, Q786L, Q744R, S43Y and mutant combinations reported in the prior art were designed as controls, and the amino acid sequences were as shown in SEQ ID NO.9 to 14, namely the following mutants:

单点突变体I217L(SEQ ID NO.2);Single point mutant I217L (SEQ ID NO.2);

单点突变体S397W(SEQ ID NO.3);Single point mutant S397W (SEQ ID NO.3);

单点突变体T121M(SEQ ID NO.4);Single point mutant T121M (SEQ ID NO.4);

单点突变体S223M(SEQ ID NO.5);Single point mutant S223M (SEQ ID NO.5);

单点突变体R632G(SEQ ID NO.6);Single point mutant R632G (SEQ ID NO.6);

组合突变体I217L/S397W/S430P/S633P/Q786L(SEQ ID NO.7); Combined mutant I217L/S397W/S430P/S633P/Q786L (SEQ ID NO. 7);

组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y(SEQ ID NO.8);Combined mutant I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y (SEQ ID NO. 8);

单点突变体S430P(SEQ ID NO.9);Single point mutant S430P (SEQ ID NO.9);

单点突变体S633P(SEQ ID NO.10);Single point mutant S633P (SEQ ID NO.10);

单点突变体Q786L(SEQ ID NO.11);Single point mutant Q786L (SEQ ID NO.11);

单点突变体Q744R(SEQ ID NO.12);Single point mutant Q744R (SEQ ID NO.12);

单点突变体S43Y(SEQ ID NO.13);Single point mutant S43Y (SEQ ID NO.13);

单点突变体G47A+884G(SEQ ID NO.14);Single point mutant G47A+884G (SEQ ID NO.14);

单点突变体M369T(SEQ ID NO.79);Single point mutant M369T (SEQ ID NO.79);

单点突变体N579W(SEQ ID NO.80);Single point mutant N579W (SEQ ID NO.80);

组合突变体I217L/S397W/S430P/S633P/Q786L/N579W(SEQ ID NO.81);Combined mutant I217L/S397W/S430P/S633P/Q786L/N579W (SEQ ID NO.81);

组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R(SEQ ID NO.82);Combined mutant I217L/S397W/S430P/S633P/Q786L/Q744R (SEQ ID NO.82);

组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N(SEQ ID NO.83)。Combined mutant I217L/S397W/S430P/S633P/Q786L/C723E/A724N (SEQ ID NO.83).

其中,S430P(US7507567B2)、S633P(US7507567B2)和Q786L(CN102177236B)是现有技术中报道的可以提高T7 RNAP热稳定性的突变点,Q744R(US20150024435A1)是现有技术中报道的可以提高T7 RNAP活性的突变点,S43Y(CN112831484B)是现有技术中报道可以降低T7 RNAP的RdRp活性的突变点,G47A+884G指的是在G47A(全长883个)的基础上,在突变体氨基酸序列的C末端额外增加一个氨基酸G,使得突变体全长为884个,该突变方法(CN111212905A)能够减少在体外转录反应期间产生的dsRNA污染物和连缀转录物。Among them, S430P (US7507567B2), S633P (US7507567B2) and Q786L (CN102177236B) are mutation points reported in the prior art to improve the thermal stability of T7 RNAP, Q744R (US20150024435A1) is a mutation point reported in the prior art to improve the activity of T7 RNAP, S43Y (CN112831484B) is a mutation point reported in the prior art to reduce the RdRp activity of T7 RNAP, G47A+884G refers to the addition of an additional amino acid G at the C-terminus of the mutant amino acid sequence on the basis of G47A (total length 883), so that the total length of the mutant is 884. This mutation method (CN111212905A) can reduce dsRNA contaminants and concatenated transcripts generated during in vitro transcription reactions.

突变载体构建具体步骤如下:分别将上述氨基酸序列(SEQ ID NO.1~14、SEQ ID NO.79~83)进行反转录后按照大肠杆菌密码子优化得到核苷酸序列(SEQ ID NO.15~28、SEQ ID NO.84~88);再将核苷酸序列插入到载体pQE-80L的BamHI、HindIII两个限制性内切酶位点之间,即形成野生型T7 RNAP和突变体T7 RNAP的重组表达载体。The specific steps for constructing the mutant vector are as follows: reverse transcribe the above amino acid sequences (SEQ ID NO.1-14, SEQ ID NO.79-83) and optimize them according to the Escherichia coli codon to obtain nucleotide sequences (SEQ ID NO.15-28, SEQ ID NO.84-88); then insert the nucleotide sequences between the two restriction endonuclease sites of BamHI and HindIII of the vector pQE-80L to form the recombinant expression vectors of wild-type T7 RNAP and mutant T7 RNAP.

2、表达T7 RNA聚合酶突变体的重组微生物细胞的构建2. Construction of recombinant microbial cells expressing T7 RNA polymerase mutants

将上一步构建得到的重组表达载体转化到BL21(DE3)大肠杆菌感受态细胞中,获得T7 RNAP野生型和突变体的重组菌株,分别命名为:T7 RNAP-WT(用于表达野生型T7 RNAP)、T7 RNAP-I217L(用于表达单点突变体I217L)、T7 RNAP-S397W(用于表达单点突变体S397W)、7 RNAP-T121M(用于表达突变体T121M)、T7 RNAP-S223M(用于表达突变体S223M)、T7 RNAP-R632G(用于表达突变体R632G)、T7 RNAP-I217L/S397W/S430P/S633P/Q786L(用于表达组合突变体M7: I217L/S397W/S430P/S633P/Q786L)、T7 RNAP-I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y(用于表达组合突变体M8:I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y),以及对照组重组菌T7 RNAP-S430P(用于表达单点突变体S430P)、T7 RNAP-S633P(用于表达单点突变体S633P)、T7 RNAP-Q786L(用于表达单点突变体Q786L)、T7 RNAP-G47A+884G(用于表达突变体G47A+884G)、T7 RNAP-M369T(用于表达单点突变体M369T)、T7 RNAP-N579W(用于表达单点突变体N579W)、T7 RNAP-I217L/S397W/S430P/S633P/Q786L/N579W(用于表达组合突变体I217L/S397W/S430P/S633P/Q786L/N579W)、T7 RNAP-I217L/S397W/S430P/S633P/Q786L/Q744R用于表达组合突变体(I217L/S397W/S430P/S633P/Q786L/Q744R)、I217L/S397W/S430P/S633P/Q786L/C723E/A724N(用于表达组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N)。The recombinant expression vector constructed in the previous step was transformed into BL21 (DE3) Escherichia coli competent cells to obtain recombinant strains of T7 RNAP wild type and mutants, which were named as: T7 RNAP-WT (for expressing wild type T7 RNAP), T7 RNAP-I217L (for expressing single point mutant I217L), T7 RNAP-S397W (for expressing single point mutant S397W), 7 RNAP-T121M (for expressing mutant T121M), T7 RNAP-S223M (for expressing mutant S223M), T7 RNAP-R632G (for expressing mutant R632G), T7 RNAP-I217L/S397W/S430P/S633P/Q786L (for expressing combined mutant M7: I217L/S397W/S430P/S633P/Q786L), T7 RNAP-I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y (used to express the combined mutant M8: I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y), and the control recombinant bacteria T7 RNAP-S430P (used to express the single point mutant S430P), T7 RNAP-S633P (used to express the single point mutant S633P), T7 RNAP-Q786L (used to express the single point mutant Q786L), T7 RNAP-G47A+884G (used to express the mutant G47A+884G), T7 RNAP-M369T (used to express the single point mutant M369T), T7 RNAP-N579W (used to express the single point mutant N579W), T7 RNAP-I217L/S397W/S430P/S633P/Q786L/N579W (used to express the combined mutant I217L/S397W/S430P/S633P/Q786L/N579W), T7 RNAP-I217L/S397W/S430P/S633P/Q786L/Q744R was used to express the combined mutant (I217L/S397W/S430P/S633P/Q786L/Q744R), I217L/S397W/S430P/S633P/Q786L/C723E/A724N (used to express the combined mutant I217L/S397W/S430P/S633P/Q786L/C723E/A724N).

3、T7 RNAP的制备3. Preparation of T7 RNAP

将上一步制备得到的野生型和突变体重组菌株置于含有氨苄青霉素抗性的LB培养基中于37℃摇床培养至OD600值接近1.2,然后加入终浓度为1mM的异丙基-β-D-硫代吡喃半乳糖苷(IPTG)于25℃摇床诱导表达过夜(16小时),随后于4℃、5000rpm离心20min收集菌体沉淀,再将菌体充分重悬于含有300mM NaCl、20mM Tris HCl(pH 7.5)、0.5mg/mL溶菌酶、0.5mM DTT的裂解液中用均质机破碎裂解,蛋白裂解液通过高速冷冻离心机于4℃条件18000rpm离心1h,随后分离上清并对其过滤以去除杂质,过滤后的上清可暂时于冰上保存以待进行后续的镍柱纯化。蛋白过镍柱前需用10倍体积的洗脱缓冲液(20mM Tris HCl(PH7.5)、300mM NaCl、0.5mM DTT)平衡镍柱,随后将之前过滤的蛋白液加入镍柱中,待所有的蛋白液全部通过镍柱填料后,先用一定浓度的曲拉通洗杂,然后用不同梯度的咪唑溶液(50mM 100mM 150mM-200mM-500mM)去洗脱,并用若干试管按流出先后顺序编号后收集流出液,上述所有操作均需在冰上或4℃条件下进行,然后通过SDS-PAGE电泳及考马斯亮蓝染色检测所有洗脱下来的蛋白流出液。最后综合选择浓度更高、纯度更优的蛋白收集液进行超滤离心,将蛋白置换到酶储存Buffer中保存,所述酶储存Buffer成分为50mM Tris-HCl,100mM NaCl,20mM 2-Mercaptoethanol,1mM EDTA,50%Glycerol,0.1%Triton X-100,pH 7.9。The wild-type and mutant recombinant strains prepared in the previous step were placed in LB medium containing ampicillin resistance and cultured in a shaking incubator at 37°C until the OD600 value was close to 1.2. Then, isopropyl-β-D-thiogalactopyranoside (IPTG) with a final concentration of 1 mM was added to induce expression in a shaking incubator at 25°C overnight (16 hours). The bacterial precipitate was then collected by centrifugation at 4°C and 5000 rpm for 20 min. The bacteria were fully resuspended in a lysis buffer containing 300 mM NaCl, 20 mM Tris HCl (pH 7.5), 0.5 mg/mL lysozyme, and 0.5 mM DTT, and the protein lysate was centrifuged in a high-speed refrigerated centrifuge at 4°C and 18000 rpm for 1 h. The supernatant was then separated and filtered to remove impurities. The filtered supernatant can be temporarily stored on ice for subsequent nickel column purification. Before the protein passes through the nickel column, it is necessary to equilibrate the nickel column with 10 times the volume of elution buffer (20mM Tris HCl (PH7.5), 300mM NaCl, 0.5mM DTT), and then add the previously filtered protein solution to the nickel column. After all the protein solution has passed through the nickel column filler, first use a certain concentration of Triton X-ray to wash the impurities, and then use different gradients of imidazole solution (50mM 100mM 150mM-200mM-500mM) to elute, and use several test tubes to number them in order of outflow to collect the outflow. All the above operations must be performed on ice or at 4°C, and then all the eluted protein outflows are detected by SDS-PAGE electrophoresis and Coomassie Brilliant Blue staining. Finally, the protein collection liquid with higher concentration and better purity was selected for ultrafiltration and centrifugation, and the protein was replaced into enzyme storage buffer for storage. The enzyme storage buffer consisted of 50mM Tris-HCl, 100mM NaCl, 20mM 2-Mercaptoethanol, 1mM EDTA, 50% Glycerol, 0.1% Triton X-100, pH 7.9.

实施例3:实时荧光法测定不同突变体酶活性检测Example 3: Real-time fluorescence assay for enzyme activity of different mutants

取实施例2制备得到的T7 RNA野生型和T7 RNA突变体,对其进行酶活性检测。 The wild-type T7 RNA and the mutant T7 RNA prepared in Example 2 were taken and their enzyme activities were tested.

1、相对酶活性检测1. Relative enzyme activity detection

实时荧光检测T7 RNAP突变体相对酶活性反应体系如表3所示,具体实验方法参见参见申请号:CN202211274914.8,发明名称为“一种实时监测体外转录合成RNA的检测方法”的中国专利申请,其中T7 RNAP突变体的加入蛋白浓度和体积与已知酶活性和蛋白浓度的野生型保持一致,该反应在96孔板中进行,通过酶标仪实时测定反应产生的适配体与DFHBI结合产生的荧光值。反应体系依据实验需要而定,反应条件为37℃,用酶标仪实时测定时需测定1个小时,每隔1min测定一个数值。反应结束后,后用Origin软件绘制曲线图,取线性关系最好的时间计算野生型的蛋白浓度与荧光强度的标准曲线,如图3所示,从而计算出突变体的相对酶活性。该反应体系中的所有试剂均需用Nuclease-free H2O配制,同时所使用的耗材均为RNase-free级别,以防止产生的RNA被RNase降解。The reaction system for real-time fluorescence detection of the relative enzyme activity of T7 RNAP mutants is shown in Table 3. For specific experimental methods, please refer to the Chinese patent application with application number: CN202211274914.8, and the invention name is "A detection method for real-time monitoring of RNA synthesized by in vitro transcription", in which the added protein concentration and volume of the T7 RNAP mutant are consistent with the wild type with known enzyme activity and protein concentration. The reaction is carried out in a 96-well plate, and the fluorescence value generated by the binding of the aptamer produced by the reaction and DFHBI is measured in real time by an enzyme reader. The reaction system is determined according to the experimental needs. The reaction conditions are 37°C. When the enzyme reader is used for real-time measurement, it takes 1 hour to measure a value every 1 minute. After the reaction is completed, the Origin software is used to draw a curve graph, and the standard curve of the protein concentration and fluorescence intensity of the wild type is calculated at the time with the best linear relationship, as shown in Figure 3, so as to calculate the relative enzyme activity of the mutant. All reagents in the reaction system need to be prepared with Nuclease-free H 2 O, and the consumables used are all RNase-free grade to prevent the generated RNA from being degraded by RNase.

表3实时荧光法测定不同突变体的相对酶活性反应体系
Table 3 Reaction system for determining relative enzyme activity of different mutants by real-time fluorescence method

突变体的相对酶活性检测如表4-1、表4-2、表4-3所示,单点突变体I217L、单点突变体N579W、单点突变体S397W、单点突变体T121M、单点突变体S223M和单点突变体R632G在37℃下的酶活性相对于T7 RNAP-WT有明显提升,并且明显优于现有技术中报道的单点突变体S430P、G47A+884G、单点突变体S633P和单点突变体Q786L。组合突变体M8的酶活性明显高于WT和现有技术中报道的单点突变体S430P、单点突变体S633P、单点突变体Q744R、单点突变体Q786L、单点突变体S43Y以及G47A+884G突变体。组合突变体M7和I217L/S397W/S430P/S633P/Q786L/N579W在37℃下的相对活性与T7 RNAP-WT无差异,组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R和组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N在37℃下的相对活性明显优于T7RNAP-WT,并且58℃热处理15min后的残余活性明显高于T7 RNAP-WT,并且明显高于T7-RNAP-WT、单点突变体S430P、单点突变体S633P和单点突变体Q786L在50℃处理5min后的残余活性,热处理后残余相对活性表明组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R和组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N相对于现有技术具有更高的热稳定性。The relative enzyme activity detection of mutants is shown in Table 4-1, Table 4-2, and Table 4-3. The enzyme activities of single point mutant I217L, single point mutant N579W, single point mutant S397W, single point mutant T121M, single point mutant S223M, and single point mutant R632G at 37°C are significantly improved compared with T7 RNAP-WT, and are significantly better than the single point mutants S430P, G47A+884G, single point mutant S633P, and single point mutant Q786L reported in the prior art. The enzyme activity of the combined mutant M8 is significantly higher than that of WT and the single point mutants S430P, S633P, Q744R, Q786L, S43Y, and G47A+884G mutants reported in the prior art. The relative activities of the combined mutants M7 and I217L/S397W/S430P/S633P/Q786L/N579W at 37°C were not different from those of T7 RNAP-WT. The relative activities of the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37°C were significantly better than those of T7 RNAP-WT, and the residual activity after heat treatment at 58°C for 15 min was significantly higher than that of T7 RNAP-WT. The residual activity of the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R and I217L/S397W/S430P/S633P/Q786L/C723E/A724N was significantly higher than that of T7 RNAP-WT, and was significantly higher than the residual activity of T7-RNAP-WT, single point mutant S430P, single point mutant S633P and single point mutant Q786L after treatment at 50°C for 5 min. The residual relative activity after heat treatment indicated that the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R and the combined mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N had higher thermal stability than the existing technology.

表4-1相对酶活性检测
Table 4-1 Relative enzyme activity detection

表4-2相对酶活性检测

Table 4-2 Relative enzyme activity detection

表4-3相对酶活性检测
Table 4-3 Relative enzyme activity detection

2、差示扫描荧光法测定T7 RNAP的Tm值2. Determination of Tm value of T7 RNAP by differential scanning fluorimetry

分别取18μL T7 RNAP蛋白质溶液(0.2mg/mL)与2μL Sypro Orange染料混合(100×),蛋白质溶液与Sypro Orange染料分别用PBS稀释到所需浓度。先打开qPCR仪预热20min,盖子温度设置为100℃,样品温度设置起点为25℃,升温速率为0.5℃/s,5s等待时间,然后循环120次到85℃。设置滤波片为Sypro Orange通道,10次采样取平均值。蛋白质的Tm是热稳定性的指标,一般而言,Tm越高,蛋白质越稳定。Take 18 μL T7 RNAP protein solution (0.2 mg/mL) and 2 μL Sypro Orange dye (100×) respectively, and dilute the protein solution and Sypro Orange dye to the required concentration with PBS. First turn on the qPCR instrument to preheat for 20 minutes, set the lid temperature to 100°C, set the sample temperature starting point to 25°C, the heating rate to 0.5°C/s, 5s waiting time, and then cycle 120 times to 85°C. Set the filter to the Sypro Orange channel and take the average of 10 samples. The Tm of a protein is an indicator of thermal stability. Generally speaking, the higher the Tm, the more stable the protein.

结果如表5所示,单点突变体M369T和单点突变体N579W的Tm值均高于WT;单点突变体I217L、单点突变体S397W、组合突变体M7和M8、组合突变体I217L/S397W/S430P/S633P/Q786L/N579W、组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R和组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N的Tm值明显高于野生型和现有技术中报道的其他突变体(WT、S430P、S633P、Q786L),说明上述组合突变体相比于现有技术中报道的单点突变体及T7 RNAP-WT具有更高的热稳定性。The results are shown in Table 5. The Tm values of single point mutants M369T and N579W were higher than those of WT; single point mutant I217L, single point mutant S397W, combined mutants M7 and M8, combined mutants I217L/S397W/S430P/S633P/Q786L/N579W, and combined mutants I217L/S397W/S430P/S633P/Q786L The Tm values of I217L/S397W/S430P/S633P/Q786L/C723E/A724N were significantly higher than those of the wild type and other mutants (WT, S430P, S633P, Q786L) reported in the prior art, indicating that the above-mentioned combined mutants have higher thermal stability than the single point mutants and T7 RNAP-WT reported in the prior art.

表5 T7 RNAP野生型及突变体的Tm值(℃)

Table 5 Tm values of wild type and mutants of T7 RNAP (℃)

3、不同温度下的IVT产量和产物纯度对比3. Comparison of IVT yield and product purity at different temperatures

体外转录(IVT)反应体系如表6所示,反应体系可等比例放大,体系配置完成以后,分别置于37℃和50℃条件下反应2h,然后每个反应按照1μg模板加入10U DNaseI在37℃条件下反应30min消化DNA模板,然后通过LiCl沉淀法回收IVT产物,测定RNA浓度,计算出1μg模板DNA对应的mRNA产量(μg)。The in vitro transcription (IVT) reaction system is shown in Table 6. The reaction system can be scaled up in equal proportions. After the system is configured, it is placed at 37°C and 50°C for 2 hours, respectively. Then, 10U DNaseI is added to each reaction according to 1μg template and reacted at 37°C for 30 minutes to digest the DNA template. The IVT product is then recovered by LiCl precipitation, the RNA concentration is measured, and the mRNA yield (μg) corresponding to 1μg template DNA is calculated.

表6体外转录反应体系
Table 6 In vitro transcription reaction system

1μg模板DNA对应的mRNA产量(μg)如表7-1所示。在常规37℃反应条件下,本发明的组合突变体M8的IVT产量相对于野生型WT以及现有技术中报道的单点突变体S430P、单点突变体S430P、单点突变体Q786L、G47A+884G有明显提高,与现有技术中报道的突变体Q744R和S43Y的IVT产量相当,并且相比于组合突变体M7IVT产量也有一定提高;同时在50℃反应条件下,WT和现有技术中报道的Q744R、S43Y、G74A+884G突变体几乎无IVT产物生成,说明WT、Q744R、S43Y、G47A+884G不耐受50℃反应;组合突变体I217L/S397W/S430P/S633P/Q786L/N579W在50℃反应IVT产量相对于WT在37℃反应IVT产量提高了60%;组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R无论是在37℃,还是50℃反应的IVT产量都相对于WT和单点突变体I217L、S397W、S430P、S633P、Q786L都明显提升;组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N在37℃反应IVT产量相对于WT和单点突变体I217L、S430P、S633P、Q786L均提高了20%左右,在50℃反应IVT产量相对于WT和单点突变体I217L、S430P、S633P、Q786L在37℃反应IVT产量提高了120%左右;组合突变体M8在50℃下的IVT产量明显高于WT和现有技术中报道的突变体,表明组合突变体M8突变体、M7突变体具有更好的IVT应用性能,能够生产更高产量的体外转录产物,能够耐受50℃反应,并且具有较好的热稳定性。The mRNA yield (μg) corresponding to 1μg template DNA is shown in Table 7-1. Under conventional 37°C reaction conditions, the IVT yield of the combined mutant M8 of the present invention is significantly improved compared with the wild-type WT and the single-point mutants S430P, S430P, Q786L, and G47A+884G reported in the prior art, and is comparable to the IVT yields of the mutants Q744R and S43Y reported in the prior art, and is also somewhat improved compared to the combined mutant M7 IVT yield; at the same time, under 50°C reaction conditions, the WT and Q744R reported in the prior art are significantly improved compared to the combined mutant M7. The mutants 744R, S43Y, and G74A+884G produced almost no IVT products, indicating that WT, Q744R, S43Y, and G47A+884G could not tolerate the reaction at 50°C; the IVT yield of the combined mutant I217L/S397W/S430P/S633P/Q786L/N579W at 50°C increased by 60% compared with the IVT yield of WT at 37°C; the combined mutant I217L/S397W/S430P/S633P/Q786L/N579W at 50°C increased by 60% compared with the IVT yield of WT at 37°C. The IVT yield of L/Q744R at 37℃ or 50℃ was significantly increased compared with WT and single-point mutants I217L, S397W, S430P, S633P, and Q786L. The IVT yield of the combined mutant I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37℃ was increased by about 20% compared with WT and single-point mutants I217L, S430P, S633P, and Q786L. Right, the IVT yield at 50°C was increased by about 120% compared with the IVT yield of WT and single point mutants I217L, S430P, S633P, and Q786L at 37°C; the IVT yield of the combined mutant M8 at 50°C was significantly higher than that of WT and mutants reported in the prior art, indicating that the combined mutants M8 and M7 have better IVT application performance, can produce higher yields of in vitro transcription products, can tolerate 50°C reactions, and have good thermal stability.

表7-1 IVT产量
Table 7-1 IVT production

单突变体S430P、S633P、Q786L、I217L、S397W、组合突变体M7、I217L/S397W/S430P/S633P/Q786L/N579W、I217L/S397W/S430P/S633P/Q786L/Q744R和I217L/S397W/S430P/S633P/Q786L/C723E/A724N的产物纯度如表7-2所示,组合突变体I217L/S397W/S430P/S633P/Q786L和组合突变体I217L/S397W/S430P/S633P/Q786L/N579W在37℃反应IVT产物纯度相对于WT和单点突变体I217L、S397W、S430P、S633P、Q786L有明显提升,产物纯度提升率均在10%以上;组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R和组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N在37℃反应的产物纯度相对于WT和单点突变体I217L、S397W、S430P、S633P、Q786L有较明显提升,产物纯度提升率均在5%以上。组合突变体I217L/S397W/S430P/S633P/Q786L、组合突变体I217L/S397W/S430P/S633P/Q786L/N579W、组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R和组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N在50℃反应的IVT产物纯度相对于WT和单点突变体I217L、S397W、S430P、S633P、Q786L在37℃反应以及单点突变体I217L、S397W、S430P、S633P、Q786L在50℃反应的IVT产物纯度差异在5%以内,认为无明显差异。说明本发明提供的T7 RNAP突变体的IVT产物纯度相对于现有技术有一定程度提高。The product purities of the single mutants S430P, S633P, Q786L, I217L, S397W, the combined mutants M7, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, and I217L/S397W/S430P/S633P/Q786L/C723E/A724N are shown in Table 7-2. The purity of the IVT product of N579W at 37°C was significantly improved compared with WT and single point mutants I217L, S397W, S430P, S633P, and Q786L, and the product purity improvement rate was all above 10%; the purity of the product of the combination mutants I217L/S397W/S430P/S633P/Q786L/Q744R and the combination mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 37°C was significantly improved compared with WT and single point mutants I217L, S397W, S430P, S633P, and Q786L, and the product purity improvement rate was all above 5%. The combined mutants I217L/S397W/S430P/S633P/Q786L, the combined mutants I217L/S397W/S430P/S633P/Q786L/N579W, the combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R, and the combined mutants I217L/S397W/S430P/S633 The purity of the IVT product of P/Q786L/C723E/A724N at 50°C is within 5% relative to the purity of the IVT product of WT and single point mutants I217L, S397W, S430P, S633P, Q786L at 37°C and single point mutants I217L, S397W, S430P, S633P, Q786L at 50°C, and it is considered that there is no significant difference. This shows that the purity of the IVT product of the T7 RNAP mutant provided by the present invention is improved to a certain extent compared with the prior art.

表7-2 IVT产物纯度
Table 7-2 IVT product purity

4、不同温度下IVT产物中的dsRNA含量对比 4. Comparison of dsRNA content in IVT products at different temperatures

野生型WT、单突变体Q744R、S43Y、G47+884G、T121M、S223M、R632G、S430P、S633P、Q786L、I217L、S397W、组合突变体M7、M8利用本公司自研产品dsRNA含量ELISA法定量检测试剂盒检测,将3获得的纯化后的IVT产物RNA样品进行dsRNA含量检测,具体操作方法如下:(1)将dsRNA标准品用STE Buffer稀释至2pg/μL、1pg/μL、0.5pg/μL、0.25pg/μL、0.125pg/μL、0.0625pg/μL、0.0312pg/μL、0pg/μL共8个浓度,将纯化后待测RNA样品分别稀释3个不同的浓度梯度10ng/μL、5ng/μL、1ng/μL;(2)取稀释后标准品和待测样品各100μL加入包被有dsRNA抗体的酶标板微孔中,室温下振板500rpm反应60min;(3)加入洗涤液250μL重复洗板4次,每孔中加入工作浓度的生物素化检测抗体100μL,室温下振板500rpm反应60min;(4)洗板后每孔加入工作浓度的HRP-链霉亲和素100μL,500rpm反应30min;(4)最后加入底物显色液100μL室温避光反应30min后每孔加入终止液50μL,并立即进行上机检测,设定酶标仪双波长为450nm/650nm,以标准品拟合的曲线计算得到待测样品的dsRNA含量。The wild type WT, single mutants Q744R, S43Y, G47+884G, T121M, S223M, R632G, S430P, S633P, Q786L, I217L, S397W, combined mutants M7 and M8 were tested using our company's self-developed dsRNA content ELISA quantitative detection kit. The purified IVT product RNA samples obtained in 3 were tested for dsRNA content. The specific operation method is as follows: (1) The dsRNA standard was diluted with STE Buffer to 8 concentrations of 2pg/μL, 1pg/μL, 0.5pg/μL, 0.25pg/μL, 0.125pg/μL, 0.0625pg/μL, 0.0312pg/μL, and 0pg/μL. The purified RNA samples to be tested were diluted to 3 different concentrations respectively. Gradient 10ng/μL, 5ng/μL, 1ng/μL; (2) Take 100μL of the diluted standard and the sample to be tested and add them to the microwells of the ELISA plate coated with dsRNA antibodies, and shake the plate at 500rpm for 60min at room temperature; (3) Add 250μL of washing solution and repeat the washing for 4 times, add 100μL of biotinylated detection antibody at working concentration to each well, and shake the plate at 500rpm for 60min at room temperature; (4) After washing, add 100μL of HRP-streptavidin at working concentration to each well, and react at 500rpm for 30min; (4) Finally, add 100μL of substrate colorimetric solution and react at room temperature in the dark for 30min, then add 50μL of stop solution to each well, and immediately perform on-machine detection, set the dual wavelength of the ELISA reader to 450nm/650nm, and calculate the dsRNA content of the sample to be tested using the curve fitted by the standard.

实验结果如图4A所示,其中野生型WT、Q744R、S43Y以及G47A+884G在常规37℃反应的单位质量IVT产物中,本发明的组合突变体M8的dsRNA含量明显低于野生型WT和现有技术中报道的单点突变体Q744R、单点突变体S43Y、单点突变体S430P、单点突变体S633P、单点突变体Q786L以及M7组合突变体;在50℃体外转录反应的单位质量IVT产物中,本发明组合突变体M8的dsRNA含量相比于37℃反应产生的dsRNA含量明显降低,在50℃反应条件下单位质量IVT产物的dsRNA含量相比于现有技术中报道单点突变体S430P、单点突变体S633P、Q786L均有明显减少,并且其50℃体外转录产物dsRNA含量可以达到CN111212905A中报道的能够减少体外转录产物中dsRNA污染物和连缀转录物的G47A+884G突变体的相同水平。在50℃条件下本发明组合突变体M8相比于组合突变体M7单位质量IVT产物的dsRNA含量水平更低。由于野生型WT、突变体Q744R、S43Y、G47A+884G、T121M、S223M、R632G在50℃下几乎无IVT产物产生,因此其50℃的dsRNA含量未在结果中进行展示。The experimental results are shown in FIG4A , wherein the dsRNA content of the combined mutant M8 of the present invention in the unit mass IVT product of the conventional 37° C. reaction of the wild-type WT, Q744R, S43Y and G47A+884G is significantly lower than that of the wild-type WT and the single point mutants Q744R, S43Y, S430P, S633P, Q786L and M7 combined mutants reported in the prior art; in the unit mass IVT product of the in vitro transcription reaction at 50° C., the combined mutant M The dsRNA content of 8 is significantly lower than that produced by the reaction at 37°C. The dsRNA content of the IVT product per unit mass under the reaction conditions of 50°C is significantly reduced compared to the single point mutants S430P, S633P, and Q786L reported in the prior art, and the dsRNA content of its 50°C in vitro transcription product can reach the same level as the G47A+884G mutant reported in CN111212905A that can reduce dsRNA contaminants and concatenated transcripts in the in vitro transcription product. Under the conditions of 50°C, the dsRNA content level of the combined mutant M8 of the present invention per unit mass of the IVT product is lower than that of the combined mutant M7. Since the wild type WT, mutants Q744R, S43Y, G47A+884G, T121M, S223M, and R632G produce almost no IVT products at 50°C, their dsRNA content at 50°C is not shown in the results.

野生型WT、单突变体S430P、S633P、Q786L、I217L、S397W、组合突变体I217L/S397W/S430P/S633P/Q786L、I217L/S397W/S430P/S633P/Q786L/N579W、I217L/S397W/S430P/S633P/Q786L/Q744R、I217L/S397W/S430P/S633P/Q786L/C723E/A724N的IVT产物中的dsRNA含量通过dot blot法检测,将3获得的LiCl沉淀回收的IVT RNA样品滴加到带正电荷的Hybond N membrane(GE Amersham),然后膜在5%(w/v)脱脂奶粉TBS-T缓冲液(20mM Tris,pH 7.4,150mM NaCl,0.1%v/v Tween-20)中封闭。再用J2抗dsRNA抗体1:5000稀释;在4℃过夜孵育。用HRP偶联的二抗(Cell Signaling Technologies)检测印迹。以已知浓度的dsRNA标品作为阳性对照。Wild type WT, single mutants S430P, S633P, Q786L, I217L, S397W, combined mutants I217L/S397W/S430P/S633P/Q786L, I217L/S397W/S430P/S633P/Q786L/N579W, I217L/S397W/S430P/S633P/Q786L/Q744R, I217L/S397W/S430P/S633P/Q786L The dsRNA content in the IVT product of /C723E/A724N was detected by dot blot. The IVT RNA sample recovered by LiCl precipitation obtained in 3 was added dropwise to the positively charged Hybond N membrane (GE Amersham), and then the membrane was blocked in 5% (w/v) skim milk powder TBS-T buffer (20mM Tris, pH 7.4, 150mM NaCl, 0.1% v/v Tween-20). Then J2 anti-dsRNA antibody was diluted 1:5000; incubated overnight at 4°C. The blot was detected with HRP-conjugated secondary antibody (Cell Signaling Technologies). A dsRNA standard with a known concentration was used as a positive control.

实验结果如图4B所示,mRNA上样量一致时,通过dot blot分析,单点突变体I217L、S397W、S430P、S633P、Q786L、组合突变体I217L/S397W/S430P/S633P/Q786L、组合突变体I217L/S397W/S430P/S633P/Q786L/N579W、组合突变体I217L/S397W/S430P/S633P/Q786L/Q744R、组合突变体I217L/S397W/S430P/S633P/Q786L/C723E/A724N在50℃反应的IVT产物中的dsRNA副产物含量明显低于WT在37℃反应的IVT产物中的dsRNA副产物含量。说明前述实施例提供的T7 RNAP突变体通过高温反应,IVT产物中的dsRNA含量相对于现有技术中的野生型有明显改善。The experimental results are shown in Figure 4B. When the mRNA loading amount was consistent, dot blot analysis showed that the single point mutants I217L, S397W, S430P, S633P, Q786L, the combined mutants I217L/S397W/S430P/S633P/Q786L, and the combined mutants I217L/S397W/S430P/S633P/Q786L/N579 The dsRNA byproduct content in the IVT products of W, combined mutants I217L/S397W/S430P/S633P/Q786L/Q744R, and combined mutants I217L/S397W/S430P/S633P/Q786L/C723E/A724N at 50°C was significantly lower than the dsRNA byproduct content in the IVT product of WT at 37°C. This indicates that the T7 RNAP mutants provided in the above examples have significantly improved dsRNA content in the IVT products compared to the wild type in the prior art through high temperature reaction.

5、不同温度下IVT产物的3'末端一致性检测5. 3' end consistency detection of IVT products at different temperatures

IVT产物的3'末端一致性检测通过RNase T1消化实验实现。RNase T1可特异性降解在G残基处的单链RNA,其通过形成相应的中间体2',3'-环磷酸(-mP)盐,切割相邻核苷酸的3'-鸟苷残基和5'-OH残基之间的磷酸二酯键;反应产物为3'-GMP和含末端3'-GMP的寡核苷酸。然后,通过LC-MS分析3'末端产物的分子量,并与理论分子量相比如果正常的mRNA产物的3'末端为G碱基以-OH结尾,那么异质性的产物的3'末端G碱基以-mP结尾,异质性产物相对于正常产物的分子量多80。通过分析理论分子量产物所占比例算出3'末端一致性(%)。The 3' end consistency detection of IVT products is achieved through RNase T1 digestion experiment. RNase T1 can specifically degrade single-stranded RNA at the G residue, which forms the corresponding intermediate 2', 3'-cyclic phosphate (-mP) salt, cleaving the phosphodiester bond between the 3'-guanosine residue and the 5'-OH residue of the adjacent nucleotide; the reaction products are 3'-GMP and oligonucleotides containing terminal 3'-GMP. Then, the molecular weight of the 3' end product is analyzed by LC-MS and compared with the theoretical molecular weight. If the 3' end of the normal mRNA product is a G base ending with -OH, then the 3' end G base of the heterogeneous product ends with -mP, and the molecular weight of the heterogeneous product is 80 more than that of the normal product. The 3' end consistency (%) is calculated by analyzing the proportion of the theoretical molecular weight product.

实验结果如图5A~图5J所示,3'末端一致性纯净产物的分子量为9663.1,3'末端异质性连缀产物的分子量为9743.1,图5A~图5J分别为野生型WT(37℃)、Q774R(37℃)、S43Y(37℃)、G47A+884G(37℃)、M7(37℃)、M7(50℃)、M8(37℃)、M8(50℃)、I217L(50℃)、S397W(50℃)的IVT产物LC-MS分析3'末端产物的结果,详细数据分析如表8所示,由于野生型WT、突变体Q744R、S43Y、G47A+884G在50℃下几乎无IVT产物产生,因此其50℃的3'末端一致性未在结果中进行展示。本发明组合突变体M8在37℃反应的3'末端一致性产物明显优于WT、单点突变体Q744R和组合突变体M7,组合突变体M8在50℃反应的3'末端一致性产物明显优于单点突变体S43Y和突变体G47A+884G在37℃的反应产物。说明本发明提供的T7 RNAP组合突变体M8不仅能够耐受50℃反应,而且IVT产物的3'末端一致性相比现有技术有明显改善。The experimental results are shown in Figures 5A to 5J. The molecular weight of the pure product with 3' end consistency is 9663.1, and the molecular weight of the 3' end heterogeneous conjugation product is 9743.1. Figures 5A to 5J are the results of LC-MS analysis of 3' end products of IVT products of wild type WT (37°C), Q774R (37°C), S43Y (37°C), G47A+884G (37°C), M7 (37°C), M7 (50°C), M8 (37°C), M8 (50°C), I217L (50°C), and S397W (50°C). The detailed data analysis is shown in Table 8. Since the wild type WT, mutants Q744R, S43Y, and G47A+884G almost no IVT products are produced at 50°C, their 3' end consistency at 50°C is not shown in the results. The 3' end consistency product of the combined mutant M8 of the present invention at 37°C is significantly better than that of WT, single point mutant Q744R and combined mutant M7, and the 3' end consistency product of the combined mutant M8 at 50°C is significantly better than that of the single point mutant S43Y and mutant G47A+884G at 37°C. This indicates that the T7 RNAP combined mutant M8 provided by the present invention can not only withstand 50°C reaction, but also the 3' end consistency of the IVT product is significantly improved compared with the prior art.

表8 IVT产物3'末端一致性

Table 8 IVT product 3' end consistency

“/”表示野生型WT或突变体在50℃下几乎无IVT产物产生,因此其50℃的3'末端一致性未在结果中进行展示。“/” indicates that the wild-type WT or mutant produced almost no IVT product at 50°C, so its 3' end identity at 50°C was not shown in the results.

6、不同温度下IVT产物的细胞应答检测6. Cellular response detection of IVT products at different temperatures

细胞对dsRNA副产物的免疫反应可能会导致不良后果。以纯化后IVT产物RNA样品进行细胞应答检测,具体操作方法如下:(1)将含有Raw细胞悬液的冻存管解冻并混合均匀,将该细胞悬液加入含有培养基的培养瓶或培养皿中过夜培养。当RAW细胞密度达到80%-90%时进行传代培养。(2)接种细胞:准备IVT产物-脂质复合物,利用阳离子脂质体转染试剂Lipofectamine 3000转染细胞。其中,Polycytidylic acid(Poly(I:C))和Resiquimod(R848)作为阳性对照,仅转染试剂作为阴性对照,无转染处理作为空白对照。(3)培养48小时后收集血清,利用IFN alpha Mouse ELISA Kit(Thermo Scientific)和小鼠IL-6双抗夹心ELISA检测试剂盒(Proteintech Group)对细胞血清中的IFN-α和IL-6细胞因子定量检测,分别以标准品拟合的曲线计算得到待测样品的实测浓度,最后获得T7 RNAP突变体不同温度下IVT产物的IL-6和IFN-α表达情况。IFN-α和IL-6是机体免疫细胞产生的细胞因子,是机体受到侵染时免疫细胞通过免疫应答反应而产生的免疫应答因子,本研究中检测的IFN-α和IL-6水平的高低反映了T7 RNA聚合酶IVT产物的免疫原性强弱。The immune response of cells to dsRNA byproducts may lead to adverse consequences. The purified IVT product RNA sample was used for cell response detection. The specific operation method is as follows: (1) Thaw the cryotube containing the Raw cell suspension and mix it evenly, and add the cell suspension to a culture flask or culture dish containing culture medium for overnight culture. When the RAW cell density reaches 80%-90%, subculture is performed. (2) Inoculate cells: Prepare IVT product-lipid complexes and transfect cells with cationic liposome transfection reagent Lipofectamine 3000. Among them, Polycytidylic acid (Poly (I: C)) and Resiquimod (R848) are used as positive controls, transfection reagent alone is used as negative control, and no transfection treatment is used as blank control. (3) After 48 hours of culture, serum was collected and IFN-α and IL-6 cytokines in cell serum were quantitatively detected using IFN alpha Mouse ELISA Kit (Thermo Scientific) and mouse IL-6 double antibody sandwich ELISA detection kit (Proteintech Group). The measured concentrations of the samples to be tested were calculated using the curves fitted by the standard samples, and finally the IL-6 and IFN-α expression of IVT products of T7 RNAP mutants at different temperatures was obtained. IFN-α and IL-6 are cytokines produced by the body's immune cells. They are immune response factors produced by immune cells through immune response when the body is infected. The levels of IFN-α and IL-6 detected in this study reflect the immunogenicity of the T7 RNA polymerase IVT products.

实验结果如图6A、图6B所示,通过对野生型WT和突变体M8:I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y的IVT产物所引起的细胞内IFN-α和IL-6因子水平比较发现,在四种不同转录模板中,M8在37℃和50℃下的IVT产物所引起的IFN-α和IL-6因子水平均比野生型WT低,表明本发明提供的的T7 RNA聚合酶突变体M8通过体外转录反应得到的mRNA相比于野生型WT具有更低的免疫原性。The experimental results are shown in Figures 6A and 6B. By comparing the intracellular IFN-α and IL-6 factor levels caused by the IVT products of the wild-type WT and mutant M8: I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y, it was found that among the four different transcription templates, the IFN-α and IL-6 factor levels caused by the IVT products of M8 at 37°C and 50°C were lower than those of the wild-type WT, indicating that the mRNA obtained by the in vitro transcription reaction of the T7 RNA polymerase mutant M8 provided by the present invention has lower immunogenicity than that of the wild-type WT.

实施例4:性能提升的T7 RNAP突变体Example 4: T7 RNAP mutants with improved performance

根据实施例2、实施例3的方法,基于野生型T7 RNAP的氨基酸序列(SEQ ID NO.1所示)的第217位氨基酸、第397位氨基酸,第121位氨基酸、第223位和第632位氨基酸进行部分定点突变,并制备得到T7 RNAP突变体,检测其活性变化(利用实施例3的相对活性检测方法检测活性变化,利用实施例3的差示扫描荧光法测定各个突变体的Tm值,Tm值大小反映了各突变体蛋白稳定性的高低,Tm值越高,蛋白质越稳定)。According to the methods of Example 2 and Example 3, partial site-directed mutations were performed on the 217th amino acid, the 397th amino acid, the 121st amino acid, the 223rd amino acid and the 632nd amino acid based on the amino acid sequence of the wild-type T7 RNAP (shown in SEQ ID NO.1), and T7 RNAP mutants were prepared, and their activity changes were detected (the activity changes were detected using the relative activity detection method of Example 3, and the Tm values of each mutant were determined using the differential scanning fluorimetry of Example 3. The Tm value reflects the stability of each mutant protein. The higher the Tm value, the more stable the protein).

表9-1不同位点部分定点突变
Table 9-1 Site-directed mutagenesis at different sites

表9-2不同位点部分定点突变

Table 9-2 Site-directed mutations at different sites

结果如表9-1、表9-2所示,结果表明,第786位突变位M或F,第217位突变为A或V,第397位突变为P,第579位突变为A或D后,在37℃下的酶活性和Tm值均高于现有技术中报道的突变体S430P、S633P、Q786L、G47A+884G及T7 RNAP-WT;第121位突变为L或S、第223位突变为L或V、第632位点突变为E后,在37℃下的酶活性相对于现有技术中报道的突变体及T7 RNAP-WT均有明显提高,第369位突变为E后Tm值相对于野生型T7 RNAP及现有技术中报道的突变体也有一定提高。上述结果说明Q786M、Q786F、I217A、I217V、S397P、N579A、M369E突变位点的T7 RNAP突变体相对于T7 RNAP-WT具有更高的热稳定性。I121L、I121S、S223L、S223V、R632E突变体相对于野生型T7 RNAP虽然热稳定性无明显优势,但是具有更高的酶活性,这些突变体将在体外转录、mRNA疫苗和药物研究生产、基因编辑、等温扩增、体内蛋白质表达或无细胞蛋白表达体外翻译系统等方面具有广泛应用前景。The results are shown in Table 9-1 and Table 9-2. The results indicate that after the 786th position mutated to M or F, the 217th position mutated to A or V, the 397th position mutated to P, and the 579th position mutated to A or D, the enzyme activity and Tm value at 37°C were higher than those of the mutants S430P, S633P, Q786L, G47A+884G and T7 RNAP-WT reported in the prior art; after the 121st position mutated to L or S, the 223rd position mutated to L or V, and the 632nd position mutated to E, the enzyme activity at 37°C was significantly improved compared with the mutants reported in the prior art and T7 RNAP-WT, and the Tm value after the 369th position mutated to E was also improved to a certain extent compared with the wild-type T7 RNAP and the mutants reported in the prior art. The above results show that the T7 RNAP mutants with mutation sites Q786M, Q786F, I217A, I217V, S397P, N579A, and M369E have higher thermal stability than T7 RNAP-WT. Although the I121L, I121S, S223L, S223V, and R632E mutants have no obvious advantage in thermal stability over wild-type T7 RNAP, they have higher enzyme activity. These mutants will have broad application prospects in in vitro transcription, mRNA vaccine and drug research and production, gene editing, isothermal amplification, in vivo protein expression, or cell-free protein expression in vitro translation systems.

实施例5:序列相似度高的T7-RNA突变对应位点Example 5: T7-RNA mutation sites with high sequence similarity

比对了不同来源的、氨基酸一致性为80%以上的RNA聚合酶,结果如图7A~图7E所示,发现序列中部分序列是高度保守的,这部分序列中包括第217位点、第223位点,第397位,第579位、第632位点。当本申请所述第217位点、第223位点、第397位点、第579位、第632位点的氨基酸取代转移至其他密切相关的噬菌体RNA聚合酶(SEQ ID NO.29~SEQ ID NO.62)中的对应位置时,预期具有相同的作用。所述密切相关的噬菌体RNA聚合酶氨基酸序列(SEQ ID NO.29~SEQ ID NO.62)与实施例1-4中的SEQ ID NO.1所示的T7噬菌体RNA聚合酶野生型氨基酸序列分别具有97.96%、98.07%、98.64%、98.07%、98.19%、90.26%、97.51%、97.40%、97.40%、98.19%、98.53%、82.35%、98.75%、97.73%、82.47%、98.41%、92.98%、82.47%、98.41%、82.47%、98.30%、98.41%、98.19%、98.53%、89.92%、82.69%、82.58%、82.47%、82.47%、82.35%、82.47%、82.24%、82.13%、82.01%的序列同一性。RNA polymerases from different sources with an amino acid identity of more than 80% were compared, and the results are shown in Figures 7A to 7E. It was found that some sequences in the sequence are highly conserved, including positions 217, 223, 397, 579, and 632. When the amino acid substitutions at positions 217, 223, 397, 579, and 632 described in the present application are transferred to the corresponding positions in other closely related phage RNA polymerases (SEQ ID NO. 29 to SEQ ID NO. 62), they are expected to have the same effect. The closely related bacteriophage RNA polymerase amino acid sequences (SEQ ID NO. 29 to SEQ ID NO. 62) have 97.96%, 98.07%, 98.64%, 98.07%, 98.19%, 90.26%, 97.51%, 97.40%, 97.40%, 98.19%, 98.53% similarity with the wild-type amino acid sequence of T7 bacteriophage RNA polymerase shown in SEQ ID NO. 1 in Examples 1-4, respectively. %, 82.35%, 98.75%, 97.73%, 82.47%, 98.41%, 92.98%, 82.47%, 98.41%, 82.47%, 98.30%, 98.41%, 98.19%, 98.53%, 89.92%, 82.69%, 82.58%, 82.47%, 82.47%, 82.35%, 82.47%, 82.24%, 82.13%, 82.01% sequence identity.

第121位点、第369位点的氨基酸取代转移至其他密切相关的噬菌体RNA聚合酶中的对应位置时,预期具有相同的作用。第121位点所述密切相关的噬菌体RNA聚合酶氨基酸序列(SEQ ID NO.29~SEQ ID NO.39、SEQ ID NO.41~SEQ ID NO.42、SEQ ID NO.44~SEQ ID NO.45、SEQ ID NO.47、SEQ ID NO.49~SEQ ID NO.53)与实施例1-4中的SEQ ID NO.1所示的T7噬菌体RNA聚合酶野生型氨基酸序列分别具有97.96%、98.07%、98.64%、98.07%、 98.19%、90.26%、97.51%、97.40%、97.40%、98.19%、98.535%、98.75%、97.73%、98.41%、98.30%、98.41%、98.19%、98.53%、89.92%的序列同一性。第369位点所述密切相关的噬菌体RNA聚合酶氨基酸序列SEQ ID NO.44与实施例1-4中的SEQ ID NO.1所示的T7噬菌体RNA聚合酶野生型氨基酸序列具有98.41%的序列同一性。The amino acid substitutions at positions 121 and 369 are expected to have the same effect when transferred to the corresponding positions in other closely related phage RNA polymerases. The amino acid sequences of the closely related phage RNA polymerases at position 121 (SEQ ID NO.29 to SEQ ID NO.39, SEQ ID NO.41 to SEQ ID NO.42, SEQ ID NO.44 to SEQ ID NO.45, SEQ ID NO.47, SEQ ID NO.49 to SEQ ID NO.53) have 97.96%, 98.07%, 98.64%, 98.07%, 98.66%, 98.67%, 98.68%, 98.69%, 98.70%, 98.71%, 98.72%, 98.73%, 98.74%, 98.76%, 98.77%, 98.78%, 98.80%, 98.81%, 98.82%, 98.83%, 98.84%, 98.86%, 98.87%, 98.83%, 98.84%, 98.85 ...3%, 98.83%, 98.83%, 98.83%, 98.8 98.19%, 90.26%, 97.51%, 97.40%, 97.40%, 98.19%, 98.535%, 98.75%, 97.73%, 98.41%, 98.30%, 98.41%, 98.19%, 98.53%, 89.92% sequence identity. The closely related bacteriophage RNA polymerase amino acid sequence SEQ ID NO.44 at position 369 has a sequence identity of 98.41% with the wild-type amino acid sequence of T7 bacteriophage RNA polymerase shown in SEQ ID NO.1 in Examples 1-4.

本发明涉及的野生型亲本T7RNA聚合酶的氨基酸序列(SEQ ID NO.1):
The amino acid sequence of the wild-type parent T7 RNA polymerase of the present invention (SEQ ID NO.1):

虽然本发明已以较佳实施例公开如上,但其并非用以限定本发明,任何熟悉此技术的人,在不脱离本发明的精神和范围内,都可做各种的改动与修饰,因此本发明的保护范围应该以权利要求书所界定的为准。 Although the present invention has been disclosed as above in the form of a preferred embodiment, it is not intended to limit the present invention. Anyone familiar with this technology can make various changes and modifications without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention should be based on the definition of the claims.

Claims (11)

一种性能提升的RNA聚合酶突变体,其特征在于,所述RNA聚合酶突变体是在对应于亲本RNA聚合酶的氨基酸序列的基础上,第121、223、632、217、397、369、579位的位置具有一个或者多个氨基酸突变,或者在亲本RNA聚合酶中等价位置的氨基酸残基处发生了一个或者多个氨基酸突变;所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少80%的同一性且具有RNA聚合酶活性。A RNA polymerase mutant with improved performance, characterized in that the RNA polymerase mutant has one or more amino acid mutations at positions 121, 223, 632, 217, 397, 369, and 579 based on the amino acid sequence corresponding to the parent RNA polymerase, or one or more amino acid mutations occur at the amino acid residues at equivalent positions in the parent RNA polymerase; the amino acid sequence of the parent RNA polymerase has at least 80% identity with the amino acids shown in SEQ ID NO.1 and has RNA polymerase activity. 根据权利要求1所述RNA聚合酶突变体,其特征在于,所述RNA聚合酶突变体还在第430位、633位、第786位、第744位、第724位、第43位或第723位的位置具有一个或者多个氨基酸突变。The RNA polymerase mutant according to claim 1, characterized in that the RNA polymerase mutant also has one or more amino acid mutations at position 430, 633, 786, 744, 724, 43 or 723. 根据权利要求1~2任一所述RNA聚合酶突变体,其特征在于,对应于SEQ ID NO.1所示氨基酸序列,The RNA polymerase mutant according to any one of claims 1 to 2, characterized in that, corresponding to the amino acid sequence shown in SEQ ID NO.1, 所述第217位氨基酸突变为L、A或V;优选地,突变为L;The amino acid at position 217 is mutated to L, A or V; preferably, mutated to L; 所述第397位氨基酸突变为W、A或P;优选地,突变为W;The amino acid at position 397 is mutated to W, A or P; preferably, mutated to W; 所述第121位氨基酸突变为M、L或S;优选地,突变为M;The amino acid at position 121 is mutated to M, L or S; preferably, it is mutated to M; 所述第223位氨基酸突变为M、P、L、V或F;优选地,突变为M;The amino acid at position 223 is mutated to M, P, L, V or F; preferably, mutated to M; 所述第632位氨基酸突变为G或E;优选地,突变为G;The amino acid at position 632 is mutated to G or E; preferably, mutated to G; 所述第369位氨基酸突变为T或E;The amino acid at position 369 is mutated to T or E; 所述第579位氨基酸突变为A、W或D;优选地,突变为W。The amino acid at position 579 mutates to A, W or D; preferably, mutates to W. 根据权利要求2所述RNA聚合酶突变体,其特征在于,对应于SEQ ID NO.1所示氨基酸序列,The RNA polymerase mutant according to claim 2, characterized in that, corresponding to the amino acid sequence shown in SEQ ID NO.1, 所述第430位氨基酸突变为非极性疏水性氨基酸;可选地,突变为P、L、V、M、F;优选地,突变为P;The amino acid at position 430 is mutated to a non-polar hydrophobic amino acid; optionally, it is mutated to P, L, V, M, or F; preferably, it is mutated to P; 所述第633位氨基酸突变为非极性疏水性氨基酸;可选地,突变为P、L、V、M或F、A、I、W;优选地,突变为P;The amino acid at position 633 is mutated to a non-polar hydrophobic amino acid; optionally, it is mutated to P, L, V, M or F, A, I, W; preferably, it is mutated to P; 所述第786位氨基酸突变为非极性疏水性氨基酸或Y;可选地,突变为P、I、A、V、Y、L、M、W或F;优选地,突变为L;The amino acid at position 786 is mutated to a non-polar hydrophobic amino acid or Y; alternatively, it is mutated to P, I, A, V, Y, L, M, W or F; preferably, it is mutated to L; 所述第744位氨基酸突变为R、P或L;优选地,突变为R;The amino acid at position 744 is mutated to R, P or L; preferably, mutated to R; 所述第43位氨基酸突变为Y或A;优选地,突变为Y;The amino acid at position 43 is mutated to Y or A; preferably, mutated to Y; 所述第723位氨基酸突变为E或S;优选地,突变为E;The amino acid at position 723 is mutated to E or S; preferably, mutated to E; 所述第724位氨基酸突变为N;The amino acid at position 724 is mutated to N; 优选地,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P和Q786L,命名为I217L/S397W/S430P/S633P/Q786L; Preferably, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P and Q786L, named I217L/S397W/S430P/S633P/Q786L; 优选地,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、N579W,命名为I217L/S397W/S430P/S633P/Q786L/N579W;Preferably, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, N579W, named I217L/S397W/S430P/S633P/Q786L/N579W; 优选地,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、Q744R,命名为I217L/S397W/S430P/S633P/Q786L/Q744R;Preferably, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, Q744R, named I217L/S397W/S430P/S633P/Q786L/Q744R; 优选地,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、C723E、A724N,命名为I217L/S397W/S430P、S633P/Q786L/C723E/A724N;Preferably, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, C723E, A724N, named I217L/S397W/S430P, S633P/Q786L/C723E/A724N; 优选地,所述RNA聚合酶突变体至少同时存在等价于或者对应于亲本聚合酶突变体的如下突变:I217L、S397W、S430P、S633P、Q786L、Q744R、S43Y,命名为I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y。Preferably, the RNA polymerase mutant simultaneously has at least the following mutations equivalent to or corresponding to the parent polymerase mutant: I217L, S397W, S430P, S633P, Q786L, Q744R, S43Y, named I217L/S397W/S430P/S633P/Q786L/Q744R/S43Y. 一种编码权利要求1~4任一所述RNA聚合酶突变体的多核苷酸。A polynucleotide encoding the RNA polymerase mutant according to any one of claims 1 to 4. 一种携带权利要求5所述多核苷酸的载体或表达权利要求1~4任一所述RNA聚合酶突变体的细胞。A vector carrying the polynucleotide of claim 5 or a cell expressing the RNA polymerase mutant of any one of claims 1 to 4. 一种多核苷酸的合成方法,其特征在于,使用权利要求1~4任一所述RNA聚合酶突变体;其中,合成方法包括体外转录合成、等温扩增或转录介导的扩增。A method for synthesizing a polynucleotide, characterized by using the RNA polymerase mutant described in any one of claims 1 to 4; wherein the synthesis method includes in vitro transcription synthesis, isothermal amplification or transcription-mediated amplification. 权利要求1~4任一所述RNA聚合酶突变体或权利要求6所述多核苷酸的合成方法在mRNA疫苗制备、核酸药物制备、基因编辑或蛋白表达系统构建中的应用。Application of the RNA polymerase mutant according to any one of claims 1 to 4 or the method for synthesizing the polynucleotide according to claim 6 in the preparation of mRNA vaccines, the preparation of nucleic acid drugs, gene editing or the construction of protein expression systems. 一种多核苷酸合成试剂盒,其特征在于,包含权利要求1~4任一所述RNA聚合酶突变体。A polynucleotide synthesis kit, characterized in that it comprises the RNA polymerase mutant according to any one of claims 1 to 4. 一种提高RNA聚合酶的至少一个性能的方法,所述性能选自酶活性、热稳定性、3'末端一致性、体外转录产量、dsRNA副产物和免疫原性,其特征在于,在对应于亲本RNA聚合酶的氨基酸序列的基础上,第121、223、632、217、397、369、579位的位置具有一个或者多个氨基酸突变,或者在亲本RNA聚合酶中等价位置的氨基酸残基处发生了一个或者多个氨基酸突变;所述亲本RNA聚合酶的氨基酸序列与SEQ ID NO.1所示的氨基酸具有至少80%的同一性且具有RNA聚合酶活性。A method for improving at least one performance of an RNA polymerase, wherein the performance is selected from enzyme activity, thermal stability, 3' end consistency, in vitro transcription yield, dsRNA byproducts and immunogenicity, characterized in that, based on the amino acid sequence corresponding to a parent RNA polymerase, one or more amino acid mutations occur at positions 121, 223, 632, 217, 397, 369, and 579, or one or more amino acid mutations occur at amino acid residues at equivalent positions in the parent RNA polymerase; the amino acid sequence of the parent RNA polymerase has at least 80% identity with the amino acids shown in SEQ ID NO.1 and has RNA polymerase activity. T7 RNA聚合酶突变体的筛选方法,其特征在于,包括在含有0.1~0.6mM锰离子的反应体系中扩增编码T7 RNA聚合酶的多核苷酸,然后将携带有扩增产物的载体转化至细胞,对细胞诱导表达后采用液滴微流控细胞分选方法分选靶细胞;在液滴微流控细胞分选的体外孵育步骤中,对液滴进行热处理; The method for screening T7 RNA polymerase mutants is characterized by comprising amplifying a polynucleotide encoding T7 RNA polymerase in a reaction system containing 0.1 to 0.6 mM manganese ions, then transforming a vector carrying the amplified product into cells, inducing expression in the cells and sorting target cells using a droplet microfluidic cell sorting method; in the in vitro incubation step of the droplet microfluidic cell sorting, heat treating the droplets; 优选地,所述热处理的条件为40~50℃处理5~10min;Preferably, the heat treatment is carried out at 40-50°C for 5-10 minutes; 优选地,所述热处理的条件为45℃处理7min;Preferably, the heat treatment condition is 45°C for 7 minutes; 优选地,反应体系中锰离子浓度为0.2mM~0.3mM;Preferably, the manganese ion concentration in the reaction system is 0.2 mM to 0.3 mM; 优选地,所述靶细胞为表现出最高活力的0.4%~0.45%的液滴包裹的细胞;Preferably, the target cells are 0.4% to 0.45% of the droplet-encapsulated cells that exhibit the highest viability; 优选地,所述扩增编码T7 RNA聚合酶的多核苷酸的和核苷酸序列如SEQ ID NO.15所示;Preferably, the amplified polynucleotide encoding T7 RNA polymerase and the nucleotide sequence are shown as SEQ ID NO.15; 优选地,还包括至少一次:分离经液滴微流控细胞分选方法分选得到的靶细胞中的目的片段,转化至另一细胞中,培养后再次筛选;Preferably, the method further comprises at least one step: separating the target fragments from the target cells sorted by the droplet microfluidic cell sorting method, transforming the fragments into another cell, and screening the cells again after culturing; 优选地,收集阳性细胞表达的T7 RNA聚合酶突变体,对含有T7 RNA聚合酶突变体的体系进行热处理,然后进行酶活测定,将酶活至少是野生型1.5倍的阳性克隆测序。 Preferably, the T7 RNA polymerase mutants expressed by the positive cells are collected, the system containing the T7 RNA polymerase mutants is heat-treated, and then the enzyme activity is determined, and the positive clones whose enzyme activity is at least 1.5 times that of the wild type are sequenced.
PCT/CN2024/093147 2023-05-15 2024-05-14 Rna polymerase mutants with improved performance Pending WO2024235228A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN202310558557.6A CN118956809A (en) 2023-05-15 2023-05-15 T7 RNA polymerase mutant with improved thermal stability and its application
CN202310558557.6 2023-05-15
CN202410499488.0A CN120829885A (en) 2024-04-24 2024-04-24 A mutant RNA polymerase with improved performance
CN202410499488.0 2024-04-24

Publications (1)

Publication Number Publication Date
WO2024235228A1 true WO2024235228A1 (en) 2024-11-21

Family

ID=93518738

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/093147 Pending WO2024235228A1 (en) 2023-05-15 2024-05-14 Rna polymerase mutants with improved performance

Country Status (1)

Country Link
WO (1) WO2024235228A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119752838A (en) * 2024-12-03 2025-04-04 湖南艾科瑞生物工程有限公司 Heat-resistant high-yield T7 RNA polymerase mutant and preparation method and application thereof

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009113718A1 (en) * 2008-03-11 2009-09-17 東ソー株式会社 Rna polymerase mutant having improved functions
CN102177236A (en) * 2008-08-08 2011-09-07 东曹株式会社 RNA polymerase mutants with improved function
US20150024435A1 (en) * 2011-10-06 2015-01-22 Commissariat à I'Energie Atomique et aux Energies Alternatives Mutated t7 rna polymerases
CN106459879A (en) * 2014-03-20 2017-02-22 德克萨斯大学系统董事会 T7 RNA polymerase variants with extended substrate range and increased transcription yield
CN112831484A (en) * 2021-01-13 2021-05-25 华中科技大学 T7-RNA polymerase mutants and their applications
US20210309976A1 (en) * 2019-02-20 2021-10-07 Modernatx, Inc. Rna polymerase variants for co-transcriptional capping

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009113718A1 (en) * 2008-03-11 2009-09-17 東ソー株式会社 Rna polymerase mutant having improved functions
CN102177236A (en) * 2008-08-08 2011-09-07 东曹株式会社 RNA polymerase mutants with improved function
US20150024435A1 (en) * 2011-10-06 2015-01-22 Commissariat à I'Energie Atomique et aux Energies Alternatives Mutated t7 rna polymerases
CN106459879A (en) * 2014-03-20 2017-02-22 德克萨斯大学系统董事会 T7 RNA polymerase variants with extended substrate range and increased transcription yield
US20210309976A1 (en) * 2019-02-20 2021-10-07 Modernatx, Inc. Rna polymerase variants for co-transcriptional capping
CN112831484A (en) * 2021-01-13 2021-05-25 华中科技大学 T7-RNA polymerase mutants and their applications

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119752838A (en) * 2024-12-03 2025-04-04 湖南艾科瑞生物工程有限公司 Heat-resistant high-yield T7 RNA polymerase mutant and preparation method and application thereof

Similar Documents

Publication Publication Date Title
US20210340509A1 (en) Reverse transcriptase with increased enzyme activity and application thereof
CN109423496A (en) Nucleic acid construct for endogenously expressing RNA polymerase in cell
CN107012130A (en) A kind of glucose oxidase mutant and its encoding gene and application
WO2023232075A1 (en) Rna polymerase fusion protein and use thereof
JP2025029079A (en) Protein translation using circular RNA and its applications
CN118792276A (en) T7 RNA polymerase mutant and its preparation method and application
WO2024235228A1 (en) Rna polymerase mutants with improved performance
CN118956809A (en) T7 RNA polymerase mutant with improved thermal stability and its application
CN115261363B (en) Method for measuring RNA deaminase activity of APOBEC3A and RNA high-activity APOBEC3A variant
CN110283244A (en) Segmentation green fluorescent protein system for protein-interacting detection
AU2015289386A1 (en) Methods and compositions for expression of polypeptides in a cell
CN116590251A (en) Alcohol oxidase mutant and its coding gene and application
CN113215005A (en) In-vitro cell-free protein synthesis system (D2P system), kit and application thereof
CN117187210B (en) Mutant Bst DNA polymerase large fragment and preparation method thereof
CN106399352A (en) Folding factors for adjusting expression of target protein and application of folding factors
US20250019675A1 (en) T7 rna polymerase variants for rna synthesis
CN118290560A (en) A mutant of eIF2α, mutant cell, preparation method and application thereof
CN120829885A (en) A mutant RNA polymerase with improved performance
CN111778270B (en) Method for reflecting in vitro cell-free protein expression level by integrating luminescent reporter gene
CN111378706B (en) Method for changing in vitro protein synthesis capacity through Edc3 gene knockout and application thereof
CN115011578B (en) A kind of enhanced M-MLV reverse transcriptase mutant and its application
CN114807145B (en) Leader sequence for improving cap-independent translation efficiency and application thereof
WO2024244010A1 (en) Dna polymerase mutant for sequencing
CN115960924A (en) Preparation method and application of recombinant JO-1 antigen
CN120400113A (en) A method for reducing impurity proteins in target protein synthesis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24806578

Country of ref document: EP

Kind code of ref document: A1