WO2025140227A1 - Modified aminoacyl-trna synthetase, nucleic acid construct and genetically engineered strain - Google Patents
Modified aminoacyl-trna synthetase, nucleic acid construct and genetically engineered strain Download PDFInfo
- Publication number
- WO2025140227A1 WO2025140227A1 PCT/CN2024/142051 CN2024142051W WO2025140227A1 WO 2025140227 A1 WO2025140227 A1 WO 2025140227A1 CN 2024142051 W CN2024142051 W CN 2024142051W WO 2025140227 A1 WO2025140227 A1 WO 2025140227A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- trna synthetase
- aminoacyl
- homology
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/645—Fungi ; Processes using fungi
Definitions
- the present invention relates to the field of biotechnology, and preferably to an aminoacyl-tRNA synthetase and a nucleic acid construct thereof, a genetically engineered strain, and an application thereof in cell-free synthesis of non-natural amino acid proteins.
- the reaction can be theoretically promoted by increasing the concentration of the main components in the reaction system.
- the unnatural amino acid is transported to the ribosome by the aminoacyl-tRNA formed by the catalysis of a specific aminoacyl-tRNA synthetase, and the unnatural amino acid is assembled in the peptide chain under the catalysis of the ribosome by pairing with the codon.
- the binding ability of the unnatural aminoacyl-tRNA to the transport protein and the matching degree with the ribosome are weaker than those of the classical aminoacyl-tRNA, and the reaction is not carried out. It is necessary to promote the reaction by increasing the substrate concentration.
- the orthogonal aminoacyl-tRNA synthetase used for the introduction of unnatural amino acids is usually obtained by mutating the substrate binding site of the natural aminoacyl-tRNA synthetase, and the change in substrate specificity often requires the price of decreased catalytic activity. In order to compensate for this decreased activity, a high concentration of enzyme or tRNA or unnatural amino acid is required to make the reaction proceed in a direction that is conducive to peptide chain synthesis.
- the commonly used orthogonal translation system pylrs-tRNA CUA pyl comes from the archaea Methanosarcina mazei and Methanosarcina barkeri PylRS (MmPylRS, MbPylRS). Due to its special structure, the recognition of tRNA by pylrs is independent of the anticodon loop and has substrate recognition plasticity. Through directed evolution, the use of pylrs-tRNA CUA pyl has enabled the introduction of more than 200 unnatural amino acids.
- MmPylRS and MbPylRS have an N-terminal domain with low solubility, which leads to their poor solubility. Even after codon optimization, the expression level of MmPylRS and MbPylRS in E. coli is difficult to increase, and the protein concentration after concentration still cannot exceed 8 mg/ml. The advantages of the cell-free in vitro protein translation system cannot be fully utilized, and the introduction efficiency of non-natural amino acids is very low, which is not suitable for actual production applications.
- Methanomethylophilus alvus PylRS is an aminoacyl-tRNA synthetase. This synthetase belongs to the nucleotide class and its full name is pyrrolysyl-tRNA synthetase, commonly referred to as PylRS.
- PylRS pyrrolysyl-tRNA synthetase
- the main function of PylRS is to help prokaryotes synthesize a special amino acid - pyrrolysine (Pyl), which does not exist in any known eukaryotic organisms [1] .
- Methanomethylophilus alvus is a methanogenic archaeon originally isolated from the stomach of bovine ruminants. PylRS was discovered by sequence analysis of the M. alvus genome and can be used to modify foreign proteins and express new biological activities, so it has important application value in the field of synthetic biology.
- MaPylRS Methanomethylophilus alvus
- MaPylRS is mainly used to modify exogenous proteins in prokaryotes such as Escherichia coli [2] , and there are also reports of successful applications in eukaryotic organisms [3] .
- MaPylRS when MaPylRS is expressed in Saccharomyces cerevisiae [4] , it can achieve efficient site-specific insertion of target non-natural amino acids into target proteins.
- Saccharomyces cerevisiae [4] it can achieve efficient site-specific insertion of target non-natural amino acids into target proteins.
- MaPylRS exists in the cell in the form of an exogenous plasmid, and there is no eukaryotic cell system that can directly recombinantly express MaPylRS in its genome stably and efficiently [5] .
- Protein Expression Systems in vitro refers to the use of cell lysate and other non-in vivo conditions to synthesize proteins. It is only necessary to add DNA or RNA templates, RNA polymerase and necessary amino acids, ATP and auxiliary factors to the reaction system to complete the synthesis of the target protein. Since the complex metabolic steps of the entire cell are not required, the system can achieve rapid and efficient large-scale protein production. Due to its high flexibility and simplified process, it has become one of the widely used tools in the biomedical field and scientific research [6] . At present, the commercial in vitro protein expression systems that are frequently used include Escherichia coli system (E.
- CRISPR/Cas Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR associated
- CRISPR/Cas Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR associated
- This technology can precisely cut DNA, allowing researchers to increase or decrease specific parts of genes to study their functions, and can also be used to treat certain genetic diseases [13] .
- a specific gRNA sequence needs to be paired with the Cas9 nuclease protein and delivered to the target cell. When the gRNA binds to Cas9, it will be carried to the target genome.
- the gRNA controls the Cas9 enzyme to find the PAM sequence (i.e. "protospacer adjacent motif") on the genome and recognize the sequence 20bp upstream of it.
- the Cas9 enzyme then creates a double-stranded nick 3 bases upstream of the PAM.
- the broken DNA chains can be connected through the mechanism of homologous recombination (HDR), thereby achieving the purpose of genetic modification [14] .
- HDR homologous recombination
- the pCAS plasmid widely used in the prior art has both the Cas9 gene sequence and the gRNA element [17] , which can achieve genome modification of S. cerevisiae through a single transformation.
- Kluyveromyces is a type of yeast fungus, belonging to S. cerevisiae. It has a wide range of industrial applications, especially in the food and beverage industry.
- Kluyveromyces is widely used in the fermentation process of food and beverages such as dairy products, wine, and beer. It can decompose sugars and produce a variety of enzymes, promote the fermentation process and give the food a good taste and flavor. Compared with other brewing yeasts, Kluyveromyces has some unique characteristics, such as higher temperature adaptability and acid resistance.
- the present invention designs a histidine tag at the N-terminus and C-terminus of the MaPylRS gene, and a thrombin restriction site for convenient tag removal. It is found that adding a sequence at the N-terminus or C-terminus not only promotes the stability of the enzyme, makes it easier to purify, but also further improves the catalytic activity of the enzyme.
- A4 is an aminoacyl-tRNA synthetase
- the tag protein is selected from T7 tag, CBP tag, CMyc tag, FLAG tag, Spot tag, C tag, Avi tag, Streg tag, SUMO tag, GST tag, MBP tag or a combination thereof; preferably T7 tag.
- a his tag is connected to the N-terminus or C-terminus of the aminoacyl-tRNA synthetase.
- a his tag, a thrombin cleavage site, and a tag protein are connected to the N-terminus of the aminoacyl-tRNA synthetase in order from the N-terminus to the C-terminus.
- the coding sequence of the recombinant aminoacyl-tRNA synthetase comprises any one of SEQ ID NO:4, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63 or an active fragment thereof, or each of which has ⁇ 85% homology (preferably, ⁇ 90% homology; preferably ⁇ 95% homology; most preferably, ⁇ 97% homology, such as 98% or more, 99% or more) with the nucleotide sequence shown above and each of which has the same activity as any one of SEQ ID NO:4, SEQ ID NO:61, SEQ ID NO:62 or SEQ ID NO:63.
- the second aspect of the present invention provides a nucleic acid construct encoding the recombinant aminoacyl-tRNA synthetase described in the first aspect of the present invention.
- the structure of formula II can be from 5' to 3' or from 3' to 5':
- the encoding product of Z1-Z2-Z3 is connected to the N-terminus or C-terminus of the encoding product of Z4.
- the encoding product of Z1-Z2-Z3 is connected to the N-terminus of the encoding product of Z4.
- the amino acid sequence encoded by Z1 is HHHHHH; more preferably, Z1 encodes a sequence comprising HHHHHH or an active fragment thereof, or a nucleotide having ⁇ 85% homology with the nucleotide sequence shown above (preferably, ⁇ 90% homology; preferably ⁇ 95% homology; most preferably, ⁇ 97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
- the amino acid sequence encoded by Z2 is LVPRGS; more preferably, Z2 encodes a sequence containing LVPRGS or an active fragment thereof, or a nucleotide having ⁇ 85% homology with the nucleotide sequence shown above (preferably, ⁇ 90% homology; preferably ⁇ 95% homology; most preferably, ⁇ 97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
- the amino acid sequence encoded by the Z3 is SEQ ID NO:59; more preferably, the Z code contains the sequence shown in SEQ ID NO:59 or an active fragment thereof, or is a nucleotide having ⁇ 85% homology with the nucleotide sequence shown above (preferably, ⁇ 90% homology; preferably ⁇ 95% homology; most preferably, ⁇ 97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
- the nucleic acid construct further comprises a promoter.
- nucleic acid structure comprises the structure described in Formula III:
- the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1 or TIF11.
- the third aspect of the present invention provides a vector, wherein the vector contains the nucleic acid construct provided by the second aspect of the present invention.
- the fourth aspect of the present invention provides a genetically engineered strain, wherein one or more sites of the genome of the genetically engineered strain are integrated with the nucleic acid construct described in the second aspect of the present invention.
- the site is selected from Lys1-5, glpA or UPF1.
- the nucleic acid construct further comprises a promoter and a terminator.
- the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1, TIF11, GAL1, GAL7 or GAL10.
- the fifth aspect of the present invention provides a method for synthesizing proteins incorporated with non-natural amino acids, using the recombinant aminoacyl-tRNA synthetase described in the first aspect of the present invention, or using the genetically engineered strain described in the fourth aspect of the present invention to provide the recombinant aminoacyl-tRNA synthetase.
- the seventh aspect of the present invention provides a cell-free system for synthesizing proteins containing non-natural amino acids, characterized in that the cell-free system at least includes a cell extract, and the cell extract is derived from the genetically engineered strain provided in the fourth aspect of the present invention.
- the cell extract is selected from any one or combination of the following sources: Escherichia coli, Kluyveromyces lactis, wheat germ cells, insect cells, rabbit reticulocytes, CHO cells, COS cells, VERO cells, BHK cells, human fibrosarcoma HT1080 cells, or a combination thereof.
- the cell extract is derived from yeast cells.
- the cell-free system further comprises: a non-natural amino acid, an orthogonal tRNA and a template comprising a target protein gene sequence, wherein the codon encoding the amino acid in the target protein gene sequence is mutated.
- the non-natural amino acid has a structural formula of the compound of formula (2) or a salt thereof.
- n is selected from a natural number of 1-20
- R1 is selected from substituted or unsubstituted C5-C60 aryl or heteroaryl, substituted or unsubstituted C1-C20 alkyl, substituted or unsubstituted C2-C20 alkenyl or substituted or unsubstituted C2-C20 alkynyl
- A is selected from O or -CH2- .
- the R 1 is selected from substituted or unsubstituted C2-C10 alkenyl.
- the R 1 is selected from substituted or unsubstituted C2-C6 alkenyl.
- the R 1 is selected from substituted or unsubstituted C2-C20 alkynyl.
- the R 1 is selected from substituted or unsubstituted C2-C10 alkynyl.
- the R 1 is selected from substituted or unsubstituted C2-C6 alkynyl.
- the A is selected from O.
- the A is selected from -CH 2 -.
- the substituents are common substituents in the art, such as aryl, heteroaryl, alkyl, cycloalkyl, aryloxy, heteroaryloxy, alkyloxy, cycloalkyloxy, hydroxyl, thiol, ester, carboxyl, cyano, halogen, nitro, sulfonic acid, azido, alkenyl, alkynyl, phosphate, etc.
- the target protein is selected from: luciferin, luciferase (such as firefly luciferase), fluorescent protein (such as green fluorescent protein, yellow fluorescent protein), aminoacyl-tRNA synthetase, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, variable region of antibody, luciferase mutation, ⁇ -amylase, enterobactin A, hepatitis C virus E2 glycoprotein, insulin precursor, interferon ⁇ A, cytokine, interferon ⁇ 2b, interleukin-1 ⁇ , lysozyme, serum albumin, single-chain antibody fragment (scFV), thyroxine transporter, tyrosinase, xylanase or a combination thereof.
- luciferin such as firefly luciferase
- fluorescent protein such as green fluorescent protein, yellow fluorescent protein
- aminoacyl-tRNA synthetase such as green fluorescent protein
- the eighth aspect of the present invention provides a method for preparing the genetically engineered strain described in the fourth aspect of the present invention, characterized in that the nucleic acid construct described in the second aspect of the present invention is transferred or integrated into a cell through transformation, transfection or gene editing technology.
- the nucleic acid construct is integrated into the genome of the cell via the active site.
- the nucleic acid construct further comprises a promoter and a terminator.
- the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1 or TIF11.
- the site is selected from Lys1-5, glpA or UPF1.
- the tenth aspect of the present invention provides a method for in vitro synthesis of proteins containing non-natural amino acids, characterized in that the method is prepared using the cell-free system described in the sixth or seventh aspect of the present invention or the kit described in the ninth aspect of the present invention.
- N-terminal T7 tag and thrombin cleavage site of the natural or modified MaPylRS synthetase will further promote the stability of the enzyme, make it easier to purify, and increase the catalytic activity of the enzyme, which is beneficial to improve the efficiency of the introduction of non-natural amino acids.
- the present invention integrates the aforementioned designed MaPylRS recombinant expression structure into the cell genome through CRISPR/Cas9 combined with efficient transformation technology, thereby achieving the stable existence of MaPylRS in the cell genome and the continuous expression of MaPylRS protein.
- the Kluyveromyces strain inserted with the recombinant MaPylRS was prepared into an in vitro expression system, which achieved the site-specific insertion of non-natural amino acids into the exogenous target protein, greatly simplified the preparation steps, saved costs, and increased the stability of the synthesized protein with the non-natural amino acid insertion.
- Figure 1 shows the construction diagram of long N-terminal mapylrs
- Figure 2 shows the N-his mapylrs construction diagram
- FIG. 3 shows the construction diagram of C-his mapylrs
- Figure 4 shows the electrophoresis of the lysis supernatant of the three proteins after induction, where 1 represents long N-terminal mapylrs, 2 represents N-his mapylrs, and 3 represents C-his mapylrs.
- the mapylrs with longer N-terminal have higher expression levels and higher protein solubility.
- the highest protein concentration can reach 120 mg/mL (3.4 mM).
- FIG8 shows a schematic diagram of the structure of the pKM-CAS1.0-K1Lys1-5 plasmid.
- FIG. 9 shows a schematic diagram of the structure of the pKM-MaPylRS plasmid.
- FIG. 10 shows a schematic diagram of the structure of the pKM-CAS1.0-KlglpA plasmid.
- In vitro protein synthesis reaction refers to the reaction of synthesizing protein in an in vitro cell-free synthesis system, including at least the translation process. Including but not limited to IVT reaction (in vitro translation reaction), IVTT reaction (in vitro transcription and translation reaction), IVDTT reaction (in vitro replication transcription and translation reaction). In the present invention, IVTT reaction is preferred.
- IVTT reaction corresponding to IVTT system, is the process of transcribing and translating DNA into protein (Protein) in vitro.
- D2P system D-to-P system
- D_to_P system DNA-to-Protein system
- the corresponding in vitro protein synthesis method is also called D2P method, D-to-P method, D_to_P method, DNA-to-Protein method.
- Cell-free system refers to a method of in vitro protein synthesis that is not secretory expression through intact cells. It should be noted that in the in vitro cell-free protein synthesis system of the present invention, it is also allowed to add cell components to promote the reaction, but the added cells are not primarily intended to secrete and express exogenous target proteins. In addition, in the intact cell-free D2P system constructed under the guidance of the present invention, a small amount of intact cells are intentionally added (for example, the protein content provided by them is not more than 30wt% compared to the protein content provided by the cell extract). Such a “circumvention" method is also included in the scope of protection of the present invention.
- Exogenous protein "target protein”, “target protein”, “target translation product” have the same meaning and can be translated as “objective protein”, “interested protein”, “objective translated product”, “interested protein product”, etc., and can be used interchangeably in the present invention.
- D2P DNA-to-Protein, from DNA template to protein product.
- D2P technology D2P system, D2P method, D2P kit, etc.
- the expression system of the present invention can be used interchangeably, all referring to the in vitro protein expression system of the present invention, and other descriptions can also be used, such as: protein in vitro synthesis system, in vitro protein synthesis system, cell-free system, cell-free system, cell-free protein synthesis system, cell-free in vitro protein synthesis system, in vitro cell-free protein synthesis system, in vitro cell-free synthesis system, CFS system (cell-free system), CFPS system (cell-free protein synthesis system) and other descriptions.
- the reaction mechanism can include an in vitro translation system (which can be abbreviated as IVT system, a mR2P system), an in vitro transcription translation system (which can be abbreviated as IVTT system, a D2P system), an in vitro replication transcription translation system (which can be abbreviated as IVDTT system, a D2P system), etc.
- the IVTT system is preferred.
- the in vitro protein synthesis system a "protein synthesis factory" ("Protein Factory” or "proteinfactory” or “Proteinfactory”).
- the in vitro protein synthesis system provided by the present invention adopts an open description method for its components.
- the cell-free protein synthesis system of the present invention uses exogenous DNA, mRNA or a combination thereof as a nucleic acid template for protein synthesis, and achieves in vitro synthesis of the target protein by artificially controlling the addition of substrates and transcription, translation-related protein factors and other substances required for protein synthesis.
- protein and “protein” have the same meaning, are both translated as protein, and can be used interchangeably.
- system and “system” are both translated as system and can be used interchangeably.
- protein synthesis amount protein expression amount
- protein expression yield protein expression yield
- cell extract, cell extract, cell lysate, cell disruption and cell lysis product have the same meaning and can be used interchangeably.
- cell extract, cell lysate and the like can be used as descriptions.
- energy system energy system, energy system, and energy supply system have the same meaning and can be used interchangeably.
- Energy regeneration system and energy regeneration system have the same meaning and can be used interchangeably.
- the energy regeneration system is a preferred embodiment or component of the energy system.
- the present invention provides a cell-free protein synthesis system, which at least includes a cell extract or a cell lysate.
- the cell-free protein synthesis system further comprises one or more components selected from the following group: a substrate for synthesizing RNA, a substrate for synthesizing protein, polyethylene glycol or its analogues, magnesium ions, potassium ions, a buffer, RNA polymerase, an energy regeneration system, dithiothreitol, and an optional aqueous solvent.
- the substrate for synthesizing RNA includes: nucleoside monophosphate, nucleoside triphosphate or a combination thereof.
- the substrate for synthesizing protein includes: 20 natural amino acids and unnatural amino acids.
- the magnesium ions are derived from a magnesium ion source, and the magnesium ion source is selected from the following group: magnesium acetate, magnesium glutamate, or a combination thereof.
- the potassium ions are derived from a potassium ion source, and the potassium ion source is selected from the following group: potassium acetate, potassium glutamate, or a combination thereof.
- the energy regeneration system is selected from the following group: one or a combination of creatine phosphate/creatine phosphate enzyme system, glycolysis pathway and its intermediate energy system.
- the energy regeneration system comprises a glucose/phosphate system
- the phosphate is selected from the following group: tripotassium phosphate, triammonium phosphate, trisodium phosphate, dipotassium hydrogen phosphate, diammonium hydrogen phosphate, disodium hydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate, sodium dihydrogen phosphate, or a combination thereof.
- the buffer is selected from the following group: 4-hydroxyethylpiperazineethanesulfonic acid, tris(hydroxymethyl)aminomethane, or a combination thereof.
- the polyethylene glycol includes polyethylene glycol with a molecular weight (Da) of 200-10000, such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc., preferably, polyethylene glycol with a molecular weight of 3000-10000.
- Da molecular weight
- the RNA polymerase is not particularly limited and can be selected from one or more RNA polymerases.
- a typical RNA polymerase is T7 RNA polymerase.
- the in vitro protein synthesis system includes: cell extract, 4-hydroxyethylpiperazineethanesulfonic acid, potassium acetate, magnesium acetate, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytosine triphosphate (CTP), thymidine triphosphate (TTP), amino acid mixture, creatine phosphate, dithiothreitol (DTT), creatine phosphate kinase, and RNA polymerase.
- cell extract 4-hydroxyethylpiperazineethanesulfonic acid
- potassium acetate magnesium acetate
- ATP adenosine triphosphate
- GTP guanosine triphosphate
- CTP cytosine triphosphate
- TTP thymidine triphosphate
- amino acid mixture amino acid mixture
- creatine phosphate dithiothreitol (DTT)
- DTT dithiothreitol
- DTT creatine phosphat
- the cell extract does not contain intact cells, and a typical cell extract includes ribosomes, aminoacyl-tRNA synthetases, initiation factors and elongation factors required for protein synthesis, and termination release factors for protein translation.
- the cell extract also contains some other proteins derived from the cytoplasm of the cell, especially soluble proteins.
- the proportion of the cell extract in the in vitro cell-free protein synthesis system is not particularly limited.
- the cell extract accounts for 20-70% of the in vitro cell-free protein synthesis system, preferably 30-60%, and more preferably 40-50%.
- the protein content of the cell extract is 20-100 mg/mL, preferably 50-100 mg/mL.
- the method for determining the protein content is the Coomassie Brilliant Blue determination method.
- the present invention also provides a vector or vector combination, wherein the vector contains the nucleic acid construct of the present invention.
- the vector is selected from: bacterial plasmid, bacteriophage, yeast plasmid, animal cell vector, shuttle vector; the vector is a transposon vector.
- Methods for preparing recombinant vectors are well known to those of ordinary skill in the art. As long as they can replicate and be stable in the host, any plasmid and vector can be used.
- the template DNA is a nucleotide sequence encoding any target protein to be synthesized, which may be an original sequence, an artificially synthesized sequence, or an artificially modified sequence.
- the template DNA may be used to synthesize the corresponding RNA and/or protein.
- the preparation method of the cell extract is not limited.
- a preferred preparation method is
- the solid-liquid separation method is not particularly limited, and a preferred method is centrifugation.
- the centrifugation is performed in a liquid state.
- the centrifugation temperature is not particularly limited.
- the centrifugation is performed at 1-10°C, more preferably, at 2-6°C.
- the washing treatment method is not particularly limited.
- a preferred washing treatment method is to use a washing liquid at a pH of 7-8 (preferably 7.4).
- the washing liquid is not particularly limited.
- the washing liquid is selected from the following group: potassium 4-hydroxyethylpiperazineethanesulfonate, potassium acetate, magnesium acetate, or a combination thereof.
- the N-terminus of the Mapylrs protein has one more Gly than the reported sequence, and the C-terminus has an affinity purification tag 6*His (Figure 3):
- the amplified product was digested with DnpI, ligated and transformed into DH5 ⁇ competent cells, and the plasmid was extracted and sequenced.
- gRNA-F2 AATCGAAGTAACTCTAGCCATCGG (SEQ ID NO: 27);
- gRNA-R2 AAACCCGATGGCTAGAGTTACTTC (SEQ ID NO: 28)
- Dilute the primers to 10 ⁇ M add 10 ⁇ L of gRNA-F2 and gRNA-R2 to the PCR tube, centrifuge to the bottom of the tube after mixing, and anneal according to the following procedure:
- the donor Donor i.e. the homologous recombination sequence of the KlglpA site and the MaPylRS expression structure.
- the promoter of MaPylRS was KlPGK1 promoter and the terminator was ScCYC1 terminator.
- the donor DNA construction and transformation method are as follows
- DW14-1 and DW14-2 are two sets of parallel experiments, and the promoter is TIF11; the promoter of DW14-3 is TEF1; DW14-4 and DW14-5 are two sets of parallel experiments, and the promoter is ADH1; DW14-6 and DW14-7 are two sets of parallel experiments, and the promoter is GAP1; DW14-8 and DW14-9 are two sets of parallel experiments, and the promoter is HXK4; the promoter of DW14-10 is PGK1.
- Example 5 The reaction systems obtained after the modified MaPylRS in Example 6 and the original MaPylRS (Example 5) were integrated into yeast to compare the activity effects of ncaa introduction.
- Example 9 For specific measurement conditions, see Example 9, and for specific results, see Figures 15 to 17, where sl-3 represents the reaction system of yeast integrated with the original MaPylRS, and sl-9 represents the reaction system of yeast integrated with the modified MaPylRS of the present application.
- Orthogonal tRNA and Prock were added to the systems, and a system without adding ncaa (i.e., non) was used as a control.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
本发明涉及一种生物技术领域,较佳地,涉及一种氨酰-tRNA合成酶及其核酸构建物、基因工程菌株以及在无细胞合成非天然氨基酸蛋白质中的应用。The present invention relates to the field of biotechnology, and preferably to an aminoacyl-tRNA synthetase and a nucleic acid construct thereof, a genetically engineered strain, and an application thereof in cell-free synthesis of non-natural amino acid proteins.
无论是蛋白质结构功能研究还是抗体偶联药物的生产,越来越多的情况需要在蛋白质多肽链中引入新的功能团。这可以利用蛋白质正交翻译体系,将具有特殊化学功能团的非天然氨基酸按照预设的遗传密码子编入肽链中。虽然非天然氨基酸导入可以在细胞内实现,但是由于与经典蛋白质翻译体系存在竞争,同时非天然氨基酸的特殊化学活性侧链基团常使其无法自由穿过细胞膜,要实现含有非天然氨基酸蛋白的大规模生产,无细胞体外蛋白质翻译体系具有无可替代的优势。Whether it is protein structure and function research or the production of antibody-drug conjugates, more and more cases require the introduction of new functional groups into protein polypeptide chains. This can be achieved by using the protein orthogonal translation system to encode non-natural amino acids with special chemical functional groups into the peptide chain according to the preset genetic codon. Although the introduction of non-natural amino acids can be achieved in cells, due to competition with the classical protein translation system and the special chemically active side chain groups of non-natural amino acids, they often cannot freely pass through the cell membrane. To achieve large-scale production of proteins containing non-natural amino acids, the cell-free in vitro protein translation system has irreplaceable advantages.
无细胞体外蛋白质翻译体系因为没有细胞膜的束缚,因此理论上可以通过提高反应体系中主要成分的浓度推动反应进行。非天然氨基酸通过特定的氨酰-tRNA合成酶催化形成的氨酰-tRNA,转运到核糖体,通过与密码子配对,在核糖体的催化下将非天然氨基酸装配在肽链中。其中,非天然氨基酰-tRNA与转运蛋白的结合能力以及与核糖体的匹配度相对于经典氨酰-tRNA较弱,不利用反应的进行。需要通过提高底物浓度促进反应进行。非天然氨基酸导入所使用的正交氨酰tRNA合成酶通常是通过对天然的氨酰-tRNA合成酶底物结合位点进行突变得到的,底物特异性的改变常需要付出催化活性下降的代价。为了补偿这种活性下降需要使用高浓度的酶或tRNA或非天然氨基酸,使反应向有利于肽链合成方向进行。Because the cell-free in vitro protein translation system is not bound by the cell membrane, the reaction can be theoretically promoted by increasing the concentration of the main components in the reaction system. The unnatural amino acid is transported to the ribosome by the aminoacyl-tRNA formed by the catalysis of a specific aminoacyl-tRNA synthetase, and the unnatural amino acid is assembled in the peptide chain under the catalysis of the ribosome by pairing with the codon. Among them, the binding ability of the unnatural aminoacyl-tRNA to the transport protein and the matching degree with the ribosome are weaker than those of the classical aminoacyl-tRNA, and the reaction is not carried out. It is necessary to promote the reaction by increasing the substrate concentration. The orthogonal aminoacyl-tRNA synthetase used for the introduction of unnatural amino acids is usually obtained by mutating the substrate binding site of the natural aminoacyl-tRNA synthetase, and the change in substrate specificity often requires the price of decreased catalytic activity. In order to compensate for this decreased activity, a high concentration of enzyme or tRNA or unnatural amino acid is required to make the reaction proceed in a direction that is conducive to peptide chain synthesis.
常用的正交翻译体系pylrs-tRNACUA pyl来自于古细菌Methanosarcina mazei和Methanosarcina barkeri PylRS(MmPylRS,MbPylRS)。由于其特殊结构,pylrs对tRNA的识别不依赖于反密码子环,且具有底物识别可塑性,通过定向进化,利用pylrs–tRNACUA pyl已经可以实现超过200种非天然氨基酸的导入。The commonly used orthogonal translation system pylrs-tRNA CUA pyl comes from the archaea Methanosarcina mazei and Methanosarcina barkeri PylRS (MmPylRS, MbPylRS). Due to its special structure, the recognition of tRNA by pylrs is independent of the anticodon loop and has substrate recognition plasticity. Through directed evolution, the use of pylrs-tRNA CUA pyl has enabled the introduction of more than 200 unnatural amino acids.
但是无论是MmPylRS还是MbPylRS的结构中存在一个溶解度较低的N-端结构域,导致它们可溶性差。即使经过密码子优化MmPylRS与MbPylRS在大肠杆菌中的表达量也很难提高,经过浓缩后蛋白浓度仍然无法超过8mg/ml。不能充分发挥无细胞体外蛋白翻译体系的优势,非天然氨基酸的导入效率非常低,不适合于实际生产应用。However, both MmPylRS and MbPylRS have an N-terminal domain with low solubility, which leads to their poor solubility. Even after codon optimization, the expression level of MmPylRS and MbPylRS in E. coli is difficult to increase, and the protein concentration after concentration still cannot exceed 8 mg/ml. The advantages of the cell-free in vitro protein translation system cannot be fully utilized, and the introduction efficiency of non-natural amino acids is very low, which is not suitable for actual production applications.
Methanomethylophilus alvus PylRS(PylRS)是一种氨酰-tRNA合成酶。这种合成酶属于核苷酸类别,其全称为pyrrolysyl-tRNA synthetase,通常简称为PylRS。PylRS的主要作用是帮助原核生物合成使用特殊氨基酸-吡咯赖氨酸(Pyl),这种氨基酸在所有已知的真核生物中都不存在[1]。Methanomethylophilus alvus PylRS (PylRS) is an aminoacyl-tRNA synthetase. This synthetase belongs to the nucleotide class and its full name is pyrrolysyl-tRNA synthetase, commonly referred to as PylRS. The main function of PylRS is to help prokaryotes synthesize a special amino acid - pyrrolysine (Pyl), which does not exist in any known eukaryotic organisms [1] .
Methanomethylophilus alvus是一种产甲烷的古菌,最初从牛反刍动物胃中分离出来。PylRS是通过对M.alvus基因组进行序列分析来发现的,可以用来改造外来蛋白质、表达出新的生物活性,因此在合成生物学领域具有重要应用价值。Methanomethylophilus alvus is a methanogenic archaeon originally isolated from the stomach of bovine ruminants. PylRS was discovered by sequence analysis of the M. alvus genome and can be used to modify foreign proteins and express new biological activities, so it has important application value in the field of synthetic biology.
最近Chin等人发现了来自于Methanomethylophilus alvus的pylrs(MaPylRS),与MmPylRS以及MbPylRS,mapylrs具有相似的催化结构域以及tRNA结合结构域,但是缺失N-端结构域。Recently, Chin et al. discovered a pylrs (MaPylRS) from Methanomethylophilus alvus, which has similar catalytic domains and tRNA binding domains as MmPylRS, MbPylRS and mapylrs, but lacks the N-terminal domain.
目前的科学研究中,MaPylRS主要应用于大肠杆菌(Escherichia coli)等原核生物中对外源蛋白质的改造[2],在真核生物中也有成功应用的报道[3]。据最新研究成果,MaPylRS在酿酒酵母(Saccharomyces cerevisiae)[4]中表达时,能实现靶非天然氨基酸高效定点插入目标蛋白。但无论是大肠杆菌还是酵母中,MaPylRS均是以外源质粒的方式存在于细胞中的,尚无直接重组至其基因组中稳定高效率表达MaPylRS的真核细胞系统[5]。In current scientific research, MaPylRS is mainly used to modify exogenous proteins in prokaryotes such as Escherichia coli [2] , and there are also reports of successful applications in eukaryotic organisms [3] . According to the latest research results, when MaPylRS is expressed in Saccharomyces cerevisiae [4] , it can achieve efficient site-specific insertion of target non-natural amino acids into target proteins. However, whether in Escherichia coli or yeast, MaPylRS exists in the cell in the form of an exogenous plasmid, and there is no eukaryotic cell system that can directly recombinantly express MaPylRS in its genome stably and efficiently [5] .
蛋白质体外合成系统(Protein Expression Systems in vitro)指使用细胞裂解液等非体内条件下进行蛋白质合成。只需在反应体系中添加DNA或RNA模板,RNA聚合酶及必要的氨基酸、ATP以及辅助因子等组分来完成目标蛋白质的合成。由于无需进行整个细胞复杂代谢步骤,该体系能够实现快速高效的蛋白质大规模生产。由于其较高的灵活性和简化流程,已经成为生物医药领域和科研中广泛使用的工具之一[6]。目前,经常实验的商业化体外蛋白表达系统包括大肠杆菌系统(E.coli extract,ECE)[7]、兔网织红细胞(Rabbit reticulocyte Lysate,RRL)[8]、麦胚(Wheat germ extract,WGE)[9]、昆虫(Insect cell extract,ICE)[10]和人源系统[11]。Protein Expression Systems in vitro refers to the use of cell lysate and other non-in vivo conditions to synthesize proteins. It is only necessary to add DNA or RNA templates, RNA polymerase and necessary amino acids, ATP and auxiliary factors to the reaction system to complete the synthesis of the target protein. Since the complex metabolic steps of the entire cell are not required, the system can achieve rapid and efficient large-scale protein production. Due to its high flexibility and simplified process, it has become one of the widely used tools in the biomedical field and scientific research [6] . At present, the commercial in vitro protein expression systems that are frequently used include Escherichia coli system (E. coli extract, ECE) [7] , rabbit reticulocyte lysate (Rabbit reticulocyte lysate, RRL) [8] , wheat germ extract (WGE) [9] , insect cell extract (ICE) [10] and human system [11] .
因为缺失溶解度低的N-端结构域,mapylrs在大肠杆菌中更容易表达,最终表达产物可以浓缩到40mg/ml也不会产生沉淀。适用于无细胞非天然氨基酸导入体系。Because of the lack of the low-solubility N-terminal domain, mapylrs is easier to express in E. coli, and the final expression product can be concentrated to 40 mg/ml without precipitation. It is suitable for cell-free non-natural amino acid introduction system.
CRISPR/Cas(Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR associated)是一种广泛存在于细菌和古细菌中的免疫系统。作为生物体防御机制的一部分,它可以识别并消灭入侵宿主细胞的外源DNA 或RNA,已被发展为高效、准确的基因编辑工具[12]。该技术可以精确地剪切DNA,使得研究人员能够增加或减少基因的特定部分来研究它们的功能,也可用于治疗某些遗传性疾病[13]。在使用CRISPR/Cas9系统进行基因编辑时,需要将特定的gRNA序列与Cas9核酸酶蛋白配对,并输送到目标细胞之中。当gRNA与Cas9结合后,将会被运载至目标基因组上。在DNA双链切口的形成过程中,gRNA调控Cas9酶发现基因组上的PAM序列(即"protospacer adjacent motif"),并识别其上游20bp的序列。然后,Cas9酶会在PAM上游3个碱基的位置产生一个双链切口。同时,如果提供了供体DNA(donor DNA),则断裂的DNA链可以通过同源重组(HDR)的机制连接,从而实现基因改造的目的[14]。利用CRISPR/Cas9系统对酿酒酵母(S.cerevisiae)进行基因组改造的实例有很多,包括基因点突变、基因敲除和基因插入[15,16]。例如,在现有技术中广泛采用的pCAS质粒上同时具有Cas9基因序列及gRNA元件[17],能够通过一次转化即可实现酿酒酵母的基因组改造。克鲁维酵母(Kluyveromyces)是一类酵母真菌,属于酿酒酵母的一种。其在工业上具有广泛的应用,特别是在食品和饮料工业方面。克鲁维酵母被广泛用于乳制品、葡萄酒、啤酒等食品饮料的发酵过程中。它能够分解糖类和产生多种酶,促进发酵过程的进行并赋予食品良好的口感和风味。与其他酿酒酵母相比,克鲁维酵母具有一些独特的特性,例如较高的温度适应性和耐酸性。这使得它在特定条件下更适合某些工业应用,例如高温发酵和酸性环境。此外,克鲁维酵母作为宿主系统用于表达药用蛋白也有很多优势。第一,通过合适的基因表达载体和启动子序列设计,克鲁维酵母可以实现外源基因高水平表达。第二,克鲁维酵母拥有一套完善的蛋白质折叠和修饰系统,能够正确地折叠复杂的药用蛋白,并进行必要的糖基化修饰。第三,克鲁维酵母易于培养,并且能够在胞内进行可溶性表达,从而方便目标蛋白的提取和纯化,工程化生产成本较低。总体而言,克鲁维酵母是一种潜力巨大的宿主系统,可以用于高效表达药用蛋白。CRISPR/Cas (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR associated) is an immune system widely found in bacteria and archaea. As part of the organism's defense mechanism, it can identify and eliminate foreign DNA or RNA that invades host cells, and has been developed into an efficient and accurate gene editing tool [12] . This technology can precisely cut DNA, allowing researchers to increase or decrease specific parts of genes to study their functions, and can also be used to treat certain genetic diseases [13] . When using the CRISPR/Cas9 system for gene editing, a specific gRNA sequence needs to be paired with the Cas9 nuclease protein and delivered to the target cell. When the gRNA binds to Cas9, it will be carried to the target genome. During the formation of the double-stranded DNA nick, the gRNA controls the Cas9 enzyme to find the PAM sequence (i.e. "protospacer adjacent motif") on the genome and recognize the sequence 20bp upstream of it. The Cas9 enzyme then creates a double-stranded nick 3 bases upstream of the PAM. At the same time, if donor DNA is provided, the broken DNA chains can be connected through the mechanism of homologous recombination (HDR), thereby achieving the purpose of genetic modification [14] . There are many examples of using the CRISPR/Cas9 system to modify the genome of S. cerevisiae, including gene point mutation, gene knockout and gene insertion [15,16] . For example, the pCAS plasmid widely used in the prior art has both the Cas9 gene sequence and the gRNA element [17] , which can achieve genome modification of S. cerevisiae through a single transformation. Kluyveromyces is a type of yeast fungus, belonging to S. cerevisiae. It has a wide range of industrial applications, especially in the food and beverage industry. Kluyveromyces is widely used in the fermentation process of food and beverages such as dairy products, wine, and beer. It can decompose sugars and produce a variety of enzymes, promote the fermentation process and give the food a good taste and flavor. Compared with other brewing yeasts, Kluyveromyces has some unique characteristics, such as higher temperature adaptability and acid resistance. This makes it more suitable for certain industrial applications under specific conditions, such as high-temperature fermentation and acidic environments. In addition, Kluyveromyces has many advantages as a host system for expressing pharmaceutical proteins. First, through the design of appropriate gene expression vectors and promoter sequences, Kluyveromyces can achieve high-level expression of exogenous genes. Second, Kluyveromyces has a complete protein folding and modification system that can correctly fold complex pharmaceutical proteins and perform necessary glycosylation modifications. Third, Kluyveromyces is easy to culture and can be expressed soluble intracellularly, which facilitates the extraction and purification of target proteins, and the cost of engineered production is low. Overall, Kluyveromyces is a host system with great potential that can be used to efficiently express pharmaceutical proteins.
参考文献References
1.Yanagisawa T,Seki E,Tanabe H,Fujii Y,Sakamoto K,Yokoyama S.Crystal Structure of Pyrrolysyl-tRNA Synthetase from a Methanogenic Archaeon ISO4-G1 and Its Structure-Based Engineering for Highly-Productive Cell-Free Genetic Code Expansion with Non-Canonical Amino Acids.Int J Mol Sci.2023Mar 26;24(7):6256.doi:10.3390/ijms24076256.PMID:37047230;PMCID:PMC10094482.1.Yanagisawa T, Seki E, Tanabe H, Fujii Y, Sakamoto K, Yokoyama S. Crystal Structure of Pyrrolysyl-tRNA Synthetase from a Methanogenic Archaeon ISO4-G1 and Its Structure-Based Engineerin g for Highly-Productive Cell-Free Genetic Code Expansion with Non-Canonical Amino Acids.In t J Mol Sci.2023Mar 26;24(7):6256.doi:10.3390/ijms24076256.PMID:37047230; PMCID:PMC10094482.
2.Kobayashi T,Yanagisawa T,Sakamoto K,Yokoyama S.Recognition of non-alpha-amino substrates by pyrrolysyl-tRNA synthetase.J Mol Biol.2009Feb 6;385(5):1352-60.doi:10.1016/j.jmb.2008.11.059.Epub 2008Dec 11.PMID:19100747.2.Kobayashi T,Yanagisawa T,Sakamoto K,Yokoyama S.Recognition of non-alpha-amino substrates by pyrrolysyl-tRN A synthesis.J Mol Biol.2009Feb 6;385(5):1352-60.doi:10.1016/j.jmb.2008.11.059.Epub 2008Dec 11.PMID:19100747.
3.Beránek V,Willis JCW,Chin JW.An Evolved Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase/tRNA Pair Is Highly Active and Orthogonal in Mammalian Cells.Biochemistry.2019 Feb 5;58(5):387-390.doi:10.1021/acs.biochem.8b00808.Epub 2018 Sep 27.PMID:30260626;PMCID:PMC6365905.3.Beránek V, Willis JCW, Chin JW.An Evolved Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase/tRNA Pair Is Highly Active and Orthogonal in Mammalian Cells.Biochemistry.2019 Feb 5;58(5):387-390.doi:10.1021/acs.biochem.8b00808.Epub 2018 Sep 27.PMID:30260626;PMCID:PMC6365905.
4.Stieglitz JT,Lahiri P,Stout MI,Van Deventer JA.Exploration of Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase Activity in Yeast.ACS Synth Biol.2022 May 20;11(5):1824-1834.doi:10.1021/acssynbio.2c00001.Epub 2022 Apr 13.PMID:35417129;PMCID:PMC10112046.4.Stieglitz JT, Lahiri P, Stout MI, Van Deventer JA. Exploration of Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase Activity in Yeast.ACS Synth Biol.2022 May 20;11(5):1824-1834.doi:10.1021/acssynbio.2c00001.Epub 2022 Apr 13.PMID:35417129;PMCID:PMC10112046.
5.Avila-Crump S,Hemshorn ML,Jones CM,Mbengi L,Meyer K,Griffis JA,Jana S,Petrina GE,Pagar VV,Karplus PA,Petersson EJ,Perona JJ,Mehl RA,Cooley RB.Generating Efficient Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetases for Structurally Diverse Non-Canonical Amino Acids.ACS Chem Biol.2022 Dec 16;17(12):3458-3469.doi:10.1021/acschembio.2c00639.Epub 2022 Nov 16.PMID:36383641;PMCID:PMC9833845.5.Avila-Crump S, Hemshorn ML, Jones CM, Mbengi L, Meyer K, Griffis JA, Jana S, Petrina GE, Pagar VV, Karplus PA, Petersson EJ, Perona JJ, Mehl RA, Cooley RB. Generating Efficient Methanomethylophilus alvus Pyrr olysyl-tRNA Synthetases for Structurally Diverse Non-Canonical Amino Acids.ACS Chem Biol.2022 Dec 16;17(12):3458-3469.doi:10.1021/acschembio.2c00639.Epub 2022 Nov 16.PMID:36383641; PMCID:PMC9833845.
6.Craig D,Howell MT,Gibbs CL,Hunt T,Jackson RJ.Plasmid cDNA-directed protein synthesis in a coupled eukaryotic in vitro transcription-translation system.Nucleic Acids Res.1992 Oct 11;20(19):4987-95.doi:10.1093/nar/20.19.4987.PMID:1383935;PMCID:PMC334274.6.Craig D,Howell MT,Gibbs CL,Hunt T,Jackson RJ.Plasmid cDNA-directed protein synthesis in a coupled eukaryotic in vitro transcr iption-translation system.Nucleic Acids Res.1992 Oct 11;20(19):4987-95.doi:10.1093/nar/20.19.4987.PMID:1383935; PMCID:PMC334274.
7.Kim DM,Kigawa T,Choi CY,Yokoyama S.A highly efficient cell-free protein synthesis system from Escherichia coli.Eur J Biochem.1996 Aug 1;239(3):881-6.doi:10.1111/j.1432-1033.1996.0881u.x.PMID:8774739.7.Kim DM,Kigawa T,Choi CY,Yokoyama S.A highly efficient cell-free protein synthesis system from Esch erichia coli.Eur J Biochem.1996 Aug 1;239(3):881-6.doi:10.1111/j.1432-1033.1996.0881u.x.PMID:8774739.
8.Jackson RJ,Hunt T.Preparation and use of nuclease-treated rabbit reticulocyte lysates for the translation of eukaryotic messenger RNA.Methods Enzymol.1983;96:50-74.doi:10.1016/s0076-6879(83)96008-1.PMID:6656641.8.Jackson RJ,Hunt T.Preparation and use of nuclease-treated rabbit reticulocyte lysates for the translatio n of eukaryotic messenger RNA.Methods Enzymol.1983;96:50-74.doi:10.1016/s0076-6879(83)96008-1.PMID:6656641.
9.Erickson AH,Blobel G.Cell-free translation of messenger RNA in a wheat germ system.Methods Enzymol.1983;96:38-50.doi:10.1016/s0076-6879(83)96007-x.PMID:6656637.9.Erickson AH, Blobel G.Cell-free translation of messenger RNA in a wheat germ system.Methods Enzymol.1983;96:38-50.doi:10.1016/s0076-6879(83)96007-x.PMID:6656637.
10.Ezure T,Suzuki T,Higashide S,Shintani E,Endo K,Kobayashi S,Shikata M,Ito M,Tanimizu K,Nishimura O.Cell-free protein synthesis system prepared from insect cells by freeze-thawing.Biotechnol Prog.2006 Nov-Dec;22(6):1570-7.doi:10.1021/bp060110v.PMID:17137303.10.Ezure T, Suzuki T, Higashide S, Shintani E, Endo K, Kobayashi S, Shikata M, Ito M, Tanimizu K, Nishimura O.Cell-free protein synthesis system prepared from inspect cells by freeze-thawing.Biotechnol Prog.2006 Nov-Dec;22(6):1570-7.doi:10.1021/bp060110v.PMID:17137303.
11.Mikami S,Kobayashi T,Masutani M,Yokoyama S,Imataka H.A human cell-derived in vitro coupled transcription/translation system optimized for production of recombinant proteins.Protein Expr Purif.2008Dec;62(2):190-8.doi:10.1016/j.pep.2008.09.002.Epub 2008Sep 11.PMID:18814849.11.Mikami S,Kobayashi T,Masutani M,Yokoyama S,Imataka H.A human cell-derived in vitro coupled transcription/translation system optimized for production of recombinant proteins.Protein Expr Purif.2008Dec;62(2):190-8.doi:10.1016/j.pep.2008.09.002.Epub 2008Sep 11.PMID:18814849.
12.Sander JD,Joung JK.CRISPR-Cas systems for editing,regulating and targeting genomes.Nat Biotechnol.2014Apr;32(4):347-55.doi:10.1038/nbt.2842.Epub 2014Mar 2.PMID:24584096;PMCID:PMC4022601.12.Sander JD,Joung JK.CRISPR-Cas systems for editing,regulating and targeting genomes.Nat Biot echnol.2014Apr;32(4):347-55.doi:10.1038/nbt.2842.Epub 2014Mar 2.PMID:24584096;PMCID:PMC4022601.
13.Jinek M,Chylinski K,Fonfara I,Hauer M,Doudna JA,Charpentier E.A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity.Science.2012Aug 17;337(6096):816-21.doi:10.1126/science.1225829.Epub 2012Jun 28.PMID:22745249;PMCID:PMC6286148.13. Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E.A programmable dual-RNA-guided DNA endonuclease in adaptive ba cterial immunity.Science.2012Aug 17;337(6096):816-21.doi:10.1126/science.1225829.Epub 2012Jun 28.PMID:22745249;PMCID:PMC6286148.
14.Bibikova M,Carroll D,Segal DJ,Trautman JK,Smith J,Kim YG,Chandrasegaran S.Stimulation of homologous recombination through targeted cleavage by chimeric nucleases.Mol Cell Biol.2001Jan;21(1):289-97.doi:10.1128/MCB.21.1.289-297.2001.PMID:11113203;PMCID:PMC88802.14.Bibikova M, Carroll D, Segal DJ, Trautman JK, Smith J, Kim YG, Chandrasegaran S. Stimulation of homologous recombination through target ed cleavage by chimeric nucleases.Mol Cell Biol.2001Jan;21(1):289-97.doi:10.1128/MCB.21.1.289-297.2001.PMID:11113203;PMCID:PMC88802.
15.DiCarlo JE,Norville JE,Mali P,Rios X,Aach J,Church GM.Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems.Nucleic Acids Res.2013Apr;41(7):4336-43.doi:10.1093/nar/gkt135.Epub 2013Mar 4.PMID:23460208;PMCID:PMC3627607.15.DiCarlo JE,Norville JE,Mali P,Rios X,Aach J,Church GM.Genome engineering in Saccharomyces cerevisiae using CRISPR-Ca s systems.Nucleic Acids Res.2013Apr;41(7):4336-43.doi:10.1093/nar/gkt135.Epub 2013Mar 4.PMID:23460208;PMCID:PMC3627607.
16.Bao Z,Xiao H,Liang J,Zhang L,Xiong X,Sun N,Si T,Zhao H.Homology-integrated CRISPR-Cas(HI-CRISPR)system for one-step multigene disruption in Saccharomyces cerevisiae.ACS Synth Biol.2015May 15;4(5):585-94.doi:10.1021/sb500255k.Epub 2014Sep 19.PMID:25207793.16.Bao Z, Xiao H, Liang J, Zhang L, Xiong X, Sun N, Si T, Zhao H.Homology-integrated CRISPR-Cas(HI-CRISPR) system for one-step multigene disruption in Saccharomyces cerevisiae.ACS Synth Biol.2015May 15;4(5):585-94.doi:10.1021/sb500255k.Epub 2014Sep 19.PMID:25207793.
17.Ryan OW,Skerker JM,Maurer MJ,Li X,Tsai JC,Poddar S,Lee ME,DeLoache W,Dueber JE,Arkin AP,Cate JH.Selection of chromosomal DNA libraries using a multiplex CRISPR system.Elife.2014Aug 19;3:e03703.doi:10.7554/eLife.03703.PMID:25139909;PMCID:PMC4161972.17.Ryan OW,Skerker JM,Maurer MJ,Li X,Tsai JC,Poddar S,Lee ME,DeLoache W,Dueber JE,Arkin AP,Cate JH.Selection of chromosomal D NA libraries using a multiplex CRISPR system.Elife.2014Aug 19;3:e03703.doi:10.7554/eLife.03703.PMID:25139909;PMCID:PMC4161972.
为了克服现有技术的缺陷,本发明通过在MaPylRS基因的N-端与C-端分别设计了组氨酸标签,以及方便标签切除的thrombin酶切位点。结果发现在N端或C端增加序列的设置,不仅促进酶的稳定性,更易纯化,而且还进一步提高了酶的催化活性。In order to overcome the defects of the prior art, the present invention designs a histidine tag at the N-terminus and C-terminus of the MaPylRS gene, and a thrombin restriction site for convenient tag removal. It is found that adding a sequence at the N-terminus or C-terminus not only promotes the stability of the enzyme, makes it easier to purify, but also further improves the catalytic activity of the enzyme.
另外,在现有的文献报导及商业试剂盒中,都是通过外源手动加入MaPylRS蛋白质,或转化/转染等导入含有其表达结构的质粒。对实验结果判断、实验结果复杂性及稳定性、目标蛋白生产成本等都有不好的影响。本发明通过CRISPR/Cas9高效基因编辑系统,将MaPylRS基因整合到克鲁维酵母基因组中,实现MaPylRS基因在酵母体内稳定高效表达,无需使用抗生素筛选及维护,并在此基础上实现目标蛋白的非天然氨基酸插入。In addition, in existing literature reports and commercial kits, MaPylRS protein is manually added exogenously, or a plasmid containing its expression structure is introduced by transformation/transfection, etc. This has a negative impact on the judgment of experimental results, the complexity and stability of experimental results, and the production cost of the target protein. The present invention integrates the MaPylRS gene into the Kluyveromyces genome through the CRISPR/Cas9 efficient gene editing system, achieves stable and efficient expression of the MaPylRS gene in yeast, does not require antibiotic screening and maintenance, and achieves the insertion of non-natural amino acids into the target protein on this basis.
本发明第一发明提供一种重组的氨酰-tRNA合成酶具有式I所述的结构:The first invention of the present invention provides a recombinant aminoacyl-tRNA synthetase having the structure described in Formula I:
A1-A2-A3-A4(I):A1-A2-A3-A4(I):
式I中,In Formula I,
A1为为不存在或为组氨酸标签;A1 is absent or a histidine tag;
A2为不存在或为凝血酶切位点;A2 is absent or is a thrombin cleavage site;
A3为不存在或为标签蛋白;A3 is absent or a tagged protein;
A4为氨酰-tRNA合成酶;A4 is an aminoacyl-tRNA synthetase;
“-”独立地为键或氨基酸连接序列,"-" is independently a bond or an amino acid linking sequence,
且A1~A3至少有一个存在,And at least one of A1~A3 exists,
所述A1~A4之间的连接,既可以是从N端到C端,也可以是从C端到N端。The connection between A1 to A4 can be from the N-terminus to the C-terminus or from the C-terminus to the N-terminus.
进一步优选地,所述氨酰-tRNA合成酶选自天然或突变的Pyl-tRNA合成酶(PylRS)、Leu-tRNA合成酶(LeuRS)、Tyr-tRNA合成酶(TyrRS)、Phe-tRNA合成酶(PheRS)或TrP-tRNA合成酶(TrpRS)。Further preferably, the aminoacyl-tRNA synthetase is selected from natural or mutant Pyl-tRNA synthetase (PylRS), Leu-tRNA synthetase (LeuRS), Tyr-tRNA synthetase (TyrRS), Phe-tRNA synthetase (PheRS) or TrP-tRNA synthetase (TrpRS).
进一步优选地,所述氨酰-tRNA合成酶选自天然或突变的MaPylRS、MmPylRS、MbPylRS、EcTyrRS、MjTyrRS、EcLeuRS、ScPheRS、ScTrpRS或BsTrpRS。Further preferably, the aminoacyl-tRNA synthetase is selected from natural or mutant MaPylRS, MmPylRS, MbPylRS, EcTyrRS, MjTyrRS, EcLeuRS, ScPheRS, ScTrpRS or BsTrpRS.
进一步优选地,所述氨酰-tRNA合成酶选自天然或突变的MaPylRS。Further preferably, the aminoacyl-tRNA synthetase is selected from natural or mutated MaPylRS.
进一步优选地,所述氨酰-tRNA合成酶的序列为SEQ ID NO:60。Further preferably, the sequence of the aminoacyl-tRNA synthetase is SEQ ID NO:60.
进一步优选的,所述氨酰-tRNA合成酶包含SEQ ID NO:60所示的序列或其活性片段,或者为具有与SEQ ID NO:60所示氨基酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且具有与SEQ ID NO:60序列相同活性的多肽。Further preferably, the aminoacyl-tRNA synthetase comprises the sequence shown in SEQ ID NO:60 or an active fragment thereof, or is a polypeptide having ≥85% homology (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as 98% or more, 99% or more) with the amino acid sequence shown in SEQ ID NO:60 and having the same activity as the sequence of SEQ ID NO:60.
进一步优选地,所述组氨酸标签的结构为n×His,其中1≦n≦50;优选2≦n≦30;进一步优选地,5≦n≦20;更优选6≦n≦10。More preferably, the structure of the histidine tag is n×His, wherein 1≦n≦50; preferably 2≦n≦30; more preferably, 5≦n≦20; and more preferably 6≦n≦10.
进一步优选地,所述标签蛋白选自T7标签、CBP标签、CMyc标签、FLAG标签、Spot标签、C标签、Avi标签、Streg标签、SUMO标签、GST标签、MBP标签或其组合;优选T7标签。Further preferably, the tag protein is selected from T7 tag, CBP tag, CMyc tag, FLAG tag, Spot tag, C tag, Avi tag, Streg tag, SUMO tag, GST tag, MBP tag or a combination thereof; preferably T7 tag.
进一步优选地,A4的N端或C端连接A1-A2-A3:Further preferably, the N-terminus or C-terminus of A4 is connected to A1-A2-A3:
进一步优选地,在氨酰-tRNA合成酶的N端或C端连接有his标签。More preferably, a his tag is connected to the N-terminus or C-terminus of the aminoacyl-tRNA synthetase.
进一步优选地,在氨酰-tRNA合成酶的N端按从N端到C端的顺序连接有凝血酶切位点、标签蛋白。More preferably, a thrombin cleavage site and a tag protein are connected to the N-terminus of the aminoacyl-tRNA synthetase in order from the N-terminus to the C-terminus.
进一步优选地,在氨酰-tRNA合成酶的N端按从N端到C端的顺序连接有his标签、凝血酶切位点、标签蛋白。Further preferably, a his tag, a thrombin cleavage site, and a tag protein are connected to the N-terminus of the aminoacyl-tRNA synthetase in order from the N-terminus to the C-terminus.
进一步优选地,所述重组氨酰-tRNA合成酶的氨基酸序列选自以下的任意一种或多种:SEQ ID NO:1~SEQ ID NO:3或SEQ ID NO:64。Further preferably, the amino acid sequence of the recombinant aminoacyl-tRNA synthetase is selected from any one or more of the following: SEQ ID NO: 1 to SEQ ID NO: 3 or SEQ ID NO: 64.
进一步优选地,所述的重组的氨酰-tRNA合成酶包含SEQ ID NO:1~3、SEQ ID NO:64任意一条所述的序列或其活性片段,或者为各自分别具有与SEQ ID NO:1~3、SEQ ID NO:64任意一条所述所示氨基酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且具有与SEQ ID NO:1~3、SEQ ID NO:64任意一条所述序列相同活性的多肽。Further preferably, the recombinant aminoacyl-tRNA synthetase comprises the sequence described in any one of SEQ ID NO:1-3, SEQ ID NO:64 or an active fragment thereof, or is a polypeptide having ≥85% homology (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as 98% or more, 99% or more) with the amino acid sequence described in any one of SEQ ID NO:1-3, SEQ ID NO:64 and having the same activity as the sequence described in any one of SEQ ID NO:1-3, SEQ ID NO:64.
进一步优选地,所述重组氨酰-tRNA合成酶的编码序列选自以下的任意一种或多种:SEQ ID NO:4、SEQ ID NO:61、SEQ ID NO:62或SEQ ID NO:63。Further preferably, the coding sequence of the recombinant aminoacyl-tRNA synthetase is selected from any one or more of the following: SEQ ID NO: 4, SEQ ID NO: 61, SEQ ID NO: 62 or SEQ ID NO: 63.
进一步优选地,所述的重组的氨酰-tRNA合成酶的编码序列包含SEQ ID NO:4、SEQ ID NO:61、SEQ ID NO:62、SEQ ID NO:63任意一条所述的序列或其活性片段,或者为各自分别具有与上述所示核苷酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且各自分别具有与SEQ ID NO:4、SEQ ID NO:61、SEQ ID NO:62或SEQ ID NO:63任意一条所述序列相同活性的核苷酸。Further preferably, the coding sequence of the recombinant aminoacyl-tRNA synthetase comprises any one of SEQ ID NO:4, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63 or an active fragment thereof, or each of which has ≥85% homology (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as 98% or more, 99% or more) with the nucleotide sequence shown above and each of which has the same activity as any one of SEQ ID NO:4, SEQ ID NO:61, SEQ ID NO:62 or SEQ ID NO:63.
本发明第二方面提供一种编码本发明第一方面所述的重组的氨酰-tRNA合成酶的核酸构建物。The second aspect of the present invention provides a nucleic acid construct encoding the recombinant aminoacyl-tRNA synthetase described in the first aspect of the present invention.
进一步优选地,所述核酸构建物至少含有结构如式II所述的核酸序列:Z1-Z2-Z3-Z4,式中,Z1~Z4分别为用于构成所述构建物的元件;“-”独立地为键或核苷酸连接序列;Z1为不存在或为组氨酸标签的编码序列,Z2为不存在或为凝血酶切位点的编码序列,Z3为不存在或为标签蛋白的编码序列,Z4为氨酰-tRNA合成酶的编码序列;其中Z1~Z3至少有一个存在。Further preferably, the nucleic acid construct contains at least a nucleic acid sequence as described in Formula II: Z1-Z2-Z3-Z4, wherein Z1 to Z4 are respectively elements used to constitute the construct; "-" is independently a bond or a nucleotide connection sequence; Z1 is absent or is a coding sequence for a histidine tag, Z2 is absent or is a coding sequence for a thrombin cleavage site, Z3 is absent or is a coding sequence for a tag protein, and Z4 is a coding sequence for an aminoacyl-tRNA synthetase; wherein at least one of Z1 to Z3 is present.
式II的结构,即可以是从5’至3’,也可以是从3’至5’:The structure of formula II can be from 5' to 3' or from 3' to 5':
也即,所述Z1-Z2-Z3的编码产物与Z4的编码产物的N端或C端相连。That is, the encoding product of Z1-Z2-Z3 is connected to the N-terminus or C-terminus of the encoding product of Z4.
更优选地,所述Z1-Z2-Z3的编码产物与Z4的编码产物的N端相连。More preferably, the encoding product of Z1-Z2-Z3 is connected to the N-terminus of the encoding product of Z4.
进一步优选地,所述Z1编码的氨基酸序列为HHHHHH;更优选地,所述Z1编码包含HHHHHH的序列或其活性片段,或者为具有与上述所示核苷酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且具有与上述序列序列相同活性的核苷酸。Further preferably, the amino acid sequence encoded by Z1 is HHHHHH; more preferably, Z1 encodes a sequence comprising HHHHHH or an active fragment thereof, or a nucleotide having ≥85% homology with the nucleotide sequence shown above (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
进一步优选地,所述Z2编码的氨基酸序列为LVPRGS;更优选地,所述Z2编码包含LVPRGS的序列或其活性片段,或者为具有与上述所示核苷酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且具有与上述序列序列相同活性的核苷酸。Further preferably, the amino acid sequence encoded by Z2 is LVPRGS; more preferably, Z2 encodes a sequence containing LVPRGS or an active fragment thereof, or a nucleotide having ≥85% homology with the nucleotide sequence shown above (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
进一步优选地,所述Z3编码的氨基酸序列为SEQ ID NO:59;更优选地,所述Z编码包含SEQ ID NO:59所示的序列或其活性片段,或者为具有与上述所示核苷酸序列≥85%同源性(优选地,≥90%的同源性;等优选地≥95%的同源性;最优选地,≥97%的同源性,如98%以上,99%以上)且具有与上述序列序列相同活性的核苷酸。Further preferably, the amino acid sequence encoded by the Z3 is SEQ ID NO:59; more preferably, the Z code contains the sequence shown in SEQ ID NO:59 or an active fragment thereof, or is a nucleotide having ≥85% homology with the nucleotide sequence shown above (preferably, ≥90% homology; preferably ≥95% homology; most preferably, ≥97% homology, such as above 98%, above 99%) and having the same activity as the above sequence.
进一步优选地,所述的核酸构建物还包括启动子。More preferably, the nucleic acid construct further comprises a promoter.
进一步优选地,所述所述核酸结构物含有式III所述的结构:Further preferably, the nucleic acid structure comprises the structure described in Formula III:
Z5-Z1-Z2-Z3-Z4。Z5-Z1-Z2-Z3-Z4.
进一步优选地,所述启动子选自PGK1、GAP1、ADH1、HXK1、GAPDH1、TEF1或TIF11。Further preferably, the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1 or TIF11.
本发明第三方面提供一种载体,所述载体含有本发明第二方面所提供的核酸构建物编。The third aspect of the present invention provides a vector, wherein the vector contains the nucleic acid construct provided by the second aspect of the present invention.
本发明第四方面提供一种基因工程菌株,所述基因工程菌株的基因组的一个或多个位点整合有本发明第二方面所述的核酸构建物。The fourth aspect of the present invention provides a genetically engineered strain, wherein one or more sites of the genome of the genetically engineered strain are integrated with the nucleic acid construct described in the second aspect of the present invention.
进一步优选地,所述位点选自Lys1-5、glpA或UPF1。Further preferably, the site is selected from Lys1-5, glpA or UPF1.
进一步优选地,所述核酸构建物进一步包含启动子和终止子。More preferably, the nucleic acid construct further comprises a promoter and a terminator.
进一步优选地,所述启动子选自PGK1、GAP1、ADH1、HXK1、GAPDH1、TEF1、TIF11、GAL1、GAL7或GAL10。Further preferably, the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1, TIF11, GAL1, GAL7 or GAL10.
进一步优选地,所述终止子选自CYC1、GPM1、TDH2或ACT1。Further preferably, the terminator is selected from CYC1, GPM1, TDH2 or ACT1.
进一步优选地,所述基因工程菌株中含有本发明第一方面提供的重组氨酰-tRNA合成酶。Further preferably, the genetically engineered strain contains the recombinant aminoacyl-tRNA synthetase provided by the first aspect of the present invention.
进一步优选地,所述基因工程菌株中含有本发明第三方面提供的载体。Further preferably, the genetically engineered strain contains the vector provided in the third aspect of the present invention.
进一步优选地,所述的菌株来源于哺乳动物细胞、植物细胞、酵母细胞、昆虫细胞、原核细胞之一或其任意组合。Further preferably, the strain is derived from mammalian cells, plant cells, yeast cells, insect cells, prokaryotic cells or any combination thereof.
本发明第五方面提供一种用于合成非天然氨基酸掺入的蛋白质的方法,采用本发明第一方面所述的重组氨酰-tRNA合成酶、或采用本发明第四方面所述的基因工程菌株提供重组氨酰-tRNA合成酶。The fifth aspect of the present invention provides a method for synthesizing proteins incorporated with non-natural amino acids, using the recombinant aminoacyl-tRNA synthetase described in the first aspect of the present invention, or using the genetically engineered strain described in the fourth aspect of the present invention to provide the recombinant aminoacyl-tRNA synthetase.
本发明第六方面提供一种合成含有非天然氨基酸的蛋白质的无细胞体系,其特征在于,所述无细胞体系至少包括:(a)细胞提取物,以及(b)本发明第一方面提供的重组氨酰-tRNA合成酶或本发明第二方面提供的核酸构建物或本发明第三方面提供的载体中的一种或多种;所述细胞提取物来源于哺乳动物细胞、植物细胞、酵母细胞、昆虫细胞、原核细胞之一或其任意组合。The sixth aspect of the present invention provides a cell-free system for synthesizing proteins containing unnatural amino acids, characterized in that the cell-free system at least comprises: (a) a cell extract, and (b) one or more of the recombinant aminoacyl-tRNA synthetase provided by the first aspect of the present invention, the nucleic acid construct provided by the second aspect of the present invention, or the vector provided by the third aspect of the present invention; the cell extract is derived from one of mammalian cells, plant cells, yeast cells, insect cells, prokaryotic cells, or any combination thereof.
本发明第七方面提供一种合成含有非天然氨基酸的蛋白质的无细胞体系,其特征在于,所述无细胞体系至少包括细胞提取物,所述细胞提取物来自于本发明第四方面提供所述的基因工程菌株。The seventh aspect of the present invention provides a cell-free system for synthesizing proteins containing non-natural amino acids, characterized in that the cell-free system at least includes a cell extract, and the cell extract is derived from the genetically engineered strain provided in the fourth aspect of the present invention.
进一步优选地,所述细胞提取物,选自以下任一种来源或组合:大肠杆菌、乳酸克鲁维酵母、麦胚细胞、昆虫细胞、兔网织红细胞、CHO细胞、COS细胞、VERO细胞、BHK细胞、人纤维肉瘤HT1080细胞、或者其组合。Further preferably, the cell extract is selected from any one or combination of the following sources: Escherichia coli, Kluyveromyces lactis, wheat germ cells, insect cells, rabbit reticulocytes, CHO cells, COS cells, VERO cells, BHK cells, human fibrosarcoma HT1080 cells, or a combination thereof.
进一步优选的,所述的细胞提取物来源于酵母细胞。Further preferably, the cell extract is derived from yeast cells.
进一步的,所述酵母细胞选自毕赤酵母、芬兰毕赤酵母(Pichia finlandica)、喜海藻糖毕赤酵母(Pichia trehalophila)、科克拉马毕赤酵母(Pichia koclamae)、膜醭毕赤酵母(Pichia membranaefaciens)、微小毕赤酵母(Pichia minuta)、甲醇诱导型酵母(Ogataeaminuta)、林氏毕赤酵母(Pichia lindneri)、仙人掌毕赤酵母(Pichia opuntiae)、耐热毕赤酵母(Pichia thermotolerans)、柳毕赤酵母(Pichia salictaria)、松栎毕赤酵母(Pichia guercuum)、皮杰普毕赤酵母(Pichia pijperi)、树干毕赤酵母(Pichiastiptis)、甲醇毕赤酵母(Pichia methanolica)、毕赤酵母菌(Pichia sp.)、酿酒酵母(Saccharomyces cerevisiae)、啤酒酵母、甘蔗糖蜜酵母、酵母菌(Saccharomyces sp.)、多形汉逊酵母(Hansenulapolymorpha)、产朊假丝酵母、克鲁维酵母之一或其组合。Furthermore, the yeast cell is selected from Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membraneaefaciens, Pichia minuta, Ogataeaminuta, Pichia lindneri, Pichia opuntiae, Pichia thermotolerans ), Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces molasses, Saccharomyces sp., Hansenula polymorpha, Candida utilis, Kluyveromyces, or a combination thereof.
进一步的,更进一步优选地,所述的克鲁维酵母进一步包括:乳酸克鲁维酵母(Kluyveromyces,K.lactis)、马克斯克鲁维酵母(Kluyveromyces marxianus)、、多布克鲁维酵母(Kluyveromyces dobzhanskii)、海泥克鲁维酵母(Kluyveromyces aestuarii)、非发酵克鲁维酵母(Kluyveromyces nonfermentans)、威克海姆克鲁维酵母(Kluyveromyces wickerhamii)、耐热克鲁维酵母(Kluyveromyces thermotolerans)、脆壁克鲁维酵母(Kluyveromyces fragilis)、湖北克鲁维酵母(Kluyveromyces hubeiensis)、多孢克鲁维酵母(Kluyveromyces polysporus)、暹罗克鲁维酵母(Kluyveromyces siamensis)、亚罗克鲁维酵母(Kluyveromyces yarrowii)之一或其组合;较佳地,所述的酵母细胞为克鲁维酵母细胞,更佳地为乳酸克鲁维酵母细胞。Furthermore, more preferably, the Kluyveromyces further includes: Kluyveromyces lactis (Kluyveromyces, K.lactis), Kluyveromyces marxianus, Kluyveromyces dobzhanskii, Kluyveromyces aestuarii, Kluyveromyces nonfermentans, Kluyveromyces wickerhamii, Kluyveromyces thermotolerant The yeast cell is one of Kluyveromyces thermotolerans, Kluyveromyces fragilis, Kluyveromyces hubeiensis, Kluyveromyces polysporus, Kluyveromyces siamensis and Kluyveromyces yarrowii, or a combination thereof; preferably, the yeast cell is a Kluyveromyces cell, more preferably a Kluyveromyces lactis cell.
进一步优选地,所述无细胞体系还包括:非天然氨基酸、正交tRNA和包含目的蛋白基因序列的模板,所述目的蛋白基因序列中的编码氨基酸的密码子被突变。Further preferably, the cell-free system further comprises: a non-natural amino acid, an orthogonal tRNA and a template comprising a target protein gene sequence, wherein the codon encoding the amino acid in the target protein gene sequence is mutated.
进一步优选地,所述的非天然氨基酸的结构式为式(2)化合物或其盐形式。More preferably, the non-natural amino acid has a structural formula of the compound of formula (2) or a salt thereof.
其中n选自1-20的自然数,R1选自取代或未取代的C5-C60的芳基或杂芳基、取代或未取代的C1-C20的烷基、取代或未取代的C2-C20的烯基或取代或未取代的C2-C20的炔基,A选自O或-CH2-。 wherein n is selected from a natural number of 1-20, R1 is selected from substituted or unsubstituted C5-C60 aryl or heteroaryl, substituted or unsubstituted C1-C20 alkyl, substituted or unsubstituted C2-C20 alkenyl or substituted or unsubstituted C2-C20 alkynyl, and A is selected from O or -CH2- .
在另一优选例中,n选自1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20.
在另一优选例中,n选自1-10的自然数。In another preferred embodiment, n is a natural number selected from 1-10.
在另一优选例中,n选自1、2、3、4、5、6、7、8、9或10。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10.
在另一优选例中,n选自1-6的自然数。In another preferred embodiment, n is a natural number selected from 1-6.
在另一优选例中,n选自1、2、3、4、5或6。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5 or 6.
在另一优选例中,所述的R1选自取代或未取代的C5-C30的芳基或杂芳基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C5-C30 aryl or heteroaryl.
在另一优选例中,所述的R1选自取代或未取代的苯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted phenyl.
在另一优选例中,所述的R1选自取代或未取代的C2-C20的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C20 alkenyl.
在另一优选例中,所述的R1选自取代或未取代的C2-C10的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C10 alkenyl.
在另一优选例中,所述的R1选自取代或未取代的C2-C6的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C6 alkenyl.
在另一优选例中,所述的R1选自或取代或未取代的C2-C20的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C20 alkynyl.
在另一优选例中,所述的R1选自或取代或未取代的C2-C10的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C10 alkynyl.
在另一优选例中,所述的R1选自或取代或未取代的C2-C6的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C6 alkynyl.
在另一优选例中,所述的A选自O。In another preferred embodiment, the A is selected from O.
在另一优选例中,所述的A选自-CH2-。In another preferred embodiment, the A is selected from -CH 2 -.
在另一优选例中,所述的取代基为本领域常见的取代基团,如芳基、杂芳基、烷基、环烷基、芳基氧基、杂芳基氧基、烷基氧基、环烷基氧基、羟基、巯基、酯基、羧基、氰基、卤素、硝基、磺酸基、叠氮基、烯基、炔基、磷酸基等。In another preferred embodiment, the substituents are common substituents in the art, such as aryl, heteroaryl, alkyl, cycloalkyl, aryloxy, heteroaryloxy, alkyloxy, cycloalkyloxy, hydroxyl, thiol, ester, carboxyl, cyano, halogen, nitro, sulfonic acid, azido, alkenyl, alkynyl, phosphate, etc.
在另一优选例中,所述的非天然氨基酸的结构式选自如下的之一或组合,或其盐形式:
In another preferred embodiment, the structural formula of the non-natural amino acid is selected from one or a combination of the following, or a salt form thereof:
进一步优选地,所述的目的蛋白选自:荧光素蛋白、荧光素酶(如萤火虫荧光素酶)、荧光蛋白(如绿色荧光蛋白、黄色荧光蛋白)、氨酰tRNA合成酶、甘油醛-3-磷酸脱氢酶、过氧化氢酶、肌动蛋白、抗体的可变区域、萤光素酶突变、α-淀粉酶、肠道菌素A、丙型肝炎病毒E2糖蛋白、胰岛素前体、干扰素αA、细胞因子,干扰素α2b、白细胞介素-1β、溶菌酶素、血清白蛋白、单链抗体段(scFV)、甲状腺素运载蛋白、酪氨酸酶、木聚糖酶或其组合。Further preferably, the target protein is selected from: luciferin, luciferase (such as firefly luciferase), fluorescent protein (such as green fluorescent protein, yellow fluorescent protein), aminoacyl-tRNA synthetase, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, variable region of antibody, luciferase mutation, α-amylase, enterobactin A, hepatitis C virus E2 glycoprotein, insulin precursor, interferon αA, cytokine, interferon α2b, interleukin-1β, lysozyme, serum albumin, single-chain antibody fragment (scFV), thyroxine transporter, tyrosinase, xylanase or a combination thereof.
进一步优选地,所述的目的蛋白包括野生型蛋白、突变型蛋白或重组型蛋白。Further preferably, the target protein includes a wild-type protein, a mutant protein or a recombinant protein.
本发明第八方面提供一种本发明第四方面所述的基因工程菌株的制备方法,其特征在于,通过转化、转染或基因编辑技术将本发明第二方面所述的核酸构建物转入或整合到细胞中。The eighth aspect of the present invention provides a method for preparing the genetically engineered strain described in the fourth aspect of the present invention, characterized in that the nucleic acid construct described in the second aspect of the present invention is transferred or integrated into a cell through transformation, transfection or gene editing technology.
进一步优选地,通过基因编辑技术将本发明第二方面所述的核酸构建物整合到细胞的基因组中。Further preferably, the nucleic acid construct described in the second aspect of the present invention is integrated into the genome of the cell by gene editing technology.
进一步优选地,将核酸构建物通过活性位点整合到细胞的基因组中。Further preferably, the nucleic acid construct is integrated into the genome of the cell via the active site.
进一步优选地,所述核酸构建物进一步包含启动子和终止子。More preferably, the nucleic acid construct further comprises a promoter and a terminator.
进一步优选地,所述启动子选自PGK1、GAP1、ADH1、HXK1、GAPDH1、TEF1或TIF11。Further preferably, the promoter is selected from PGK1, GAP1, ADH1, HXK1, GAPDH1, TEF1 or TIF11.
进一步优选地,所述终止子选自CYC1、GPM1、TDH2或ACT1。Further preferably, the terminator is selected from CYC1, GPM1, TDH2 or ACT1.
进一步优选地,所述位点选自Lys1-5、glpA或UPF1。Further preferably, the site is selected from Lys1-5, glpA or UPF1.
本发明第九方面提供一种试剂盒,其特征在于,其特征在于,所述试剂盒含有本发明第六方面或第七方面所述的反应体系。The ninth aspect of the present invention provides a kit, characterized in that the kit contains the reaction system described in the sixth aspect or the seventh aspect of the present invention.
本发明第十方面提供一种体外合成含有非天然氨基酸的蛋白质的方法,其特征在于,采用本发明第六方面或第七方面所述的无细胞体系或本发明第九方面所述的试剂盒进行制备。The tenth aspect of the present invention provides a method for in vitro synthesis of proteins containing non-natural amino acids, characterized in that the method is prepared using the cell-free system described in the sixth or seventh aspect of the present invention or the kit described in the ninth aspect of the present invention.
本发明的优势在于:The advantages of the present invention are:
(1)在天然或改造的MaPylRS合成酶的N-端与C-端分别设计了含有His tag的结构,会进一步促进酶的稳定性,更易纯化,而且提高了酶的催化活性,有利提高非天然氨基酸的导入效率;(1) The His tag structures were designed at the N-terminus and C-terminus of the natural or modified MaPylRS synthetase, which further promoted the stability of the enzyme, made it easier to purify it, and increased the catalytic activity of the enzyme, which was beneficial to improving the efficiency of the introduction of non-natural amino acids.
(2)在天然或改造的MaPylRS合成酶的N-端连接His tag、T7标签和thrombin酶切位点,会进一步促进酶的稳定性,更易纯化,而且提高了酶的催化活性,有利提高非天然氨基酸的导入效率;(2) Connecting a His tag, T7 tag, and thrombin restriction site to the N-terminus of the natural or modified MaPylRS synthetase will further promote the stability of the enzyme, make it easier to purify, and increase the catalytic activity of the enzyme, which is beneficial to improving the efficiency of the introduction of non-natural amino acids;
(3)在天然或改造的MaPylRS合成酶的N-端T7标签和thrombin酶切位点,会进一步促进酶的稳定性,更易纯化,而且提高了酶的催化活性,有利提高非天然氨基酸的导入效率(3) The N-terminal T7 tag and thrombin cleavage site of the natural or modified MaPylRS synthetase will further promote the stability of the enzyme, make it easier to purify, and increase the catalytic activity of the enzyme, which is beneficial to improve the efficiency of the introduction of non-natural amino acids.
(4)本发明通过CRISPR/Cas9,结合高效的转化技术,将前述设计的MaPylRS重组的表达结构整合到细胞基因组中,实现了MaPylRS在细胞基因组内的稳定存在和MaPylRS蛋白质的持续表达。(4) The present invention integrates the aforementioned designed MaPylRS recombinant expression structure into the cell genome through CRISPR/Cas9 combined with efficient transformation technology, thereby achieving the stable existence of MaPylRS in the cell genome and the continuous expression of MaPylRS protein.
(5)将插入重组MaPylRS的克鲁维酵母菌株制备成体外表达系统,实现了外源目标蛋白的定点非天然氨基酸插入、大大简化了制备步骤,节约了成本,并增加了合成插入了非天然氨基酸蛋白质的稳定性。(5) The Kluyveromyces strain inserted with the recombinant MaPylRS was prepared into an in vitro expression system, which achieved the site-specific insertion of non-natural amino acids into the exogenous target protein, greatly simplified the preparation steps, saved costs, and increased the stability of the synthesized protein with the non-natural amino acid insertion.
图1表示的是long N-terminal mapylrs构建图Figure 1 shows the construction diagram of long N-terminal mapylrs
图2表示的是N-his mapylrs构建图Figure 2 shows the N-his mapylrs construction diagram
图3表示的是C-his mapylrs构建图Figure 3 shows the construction diagram of C-his mapylrs
图4表示的是三种蛋白经过诱导后裂解上清的电泳图,其中1表示的是long N-terminal mapylrs,2表示的是N-his mapylrs,3表示的是C-his mapylrs,具有更长的N-terminal的mapylrs蛋白表达量更高,并且蛋白的可溶性也更高。蛋白浓度最高可以达到120mg/mL(3.4mM)。Figure 4 shows the electrophoresis of the lysis supernatant of the three proteins after induction, where 1 represents long N-terminal mapylrs, 2 represents N-his mapylrs, and 3 represents C-his mapylrs. The mapylrs with longer N-terminal have higher expression levels and higher protein solubility. The highest protein concentration can reach 120 mg/mL (3.4 mM).
图5表示的是三种形式构建表达的蛋白在相同浓度下导入非天然氨基酸的活性比较,三种形式的mapylrs均可催化含有非天然氨基酸蛋白表达。图中倒三角形代表的是未添加非天然氨基酸,圆点代表的是添加非天然氨基酸。nhis代表N-his mapylrs,chis代表C-his mapylrs;N-terminal代表long N-terminal mapylrs。Figure 5 shows the comparison of the activities of proteins constructed and expressed in three forms when unnatural amino acids were introduced at the same concentration. All three forms of MAPYRS can catalyze the expression of proteins containing unnatural amino acids. The inverted triangle in the figure represents the absence of unnatural amino acids, and the dot represents the addition of unnatural amino acids. Nhis represents N-his MAPYRS, chis represents C-his MAPYRS; N-terminal represents long N-terminal MAPYRS.
图6表示的是原始mapylrs(指未重组的mapylrs)与重组的mapylrs(long N-terminal mapylrs和N-his mapylrs)导入非天然氨基酸的活性比较,图中nhis代表N-his mapylrs,long terminal代表long N-terminal mapylrs,每个对比柱状图中左侧部分代表的是添加非天然氨基酸,右侧部分代表的是添加非天然氨基酸。Figure 6 shows the comparison of the activities of original mapylrs (referring to unrecombinant mapylrs) and recombinant mapylrs (long N-terminal mapylrs and N-his mapylrs) in introducing non-natural amino acids. In the figure, nhis represents N-his mapylrs, long terminal represents long N-terminal mapylrs, and the left part of each comparison bar represents the addition of non-natural amino acids, and the right part represents the addition of non-natural amino acids.
图7表示的是原始mapylrs与重组的mapylrs(long N-terminal mapylrs和N-his mapylrs)导入非天然氨基酸的RFP/GFP比较,图中nhis代表N-his mapylrs,long terminal代表long N-terminal mapylrs,no tag代表原始mapylrs。Figure 7 shows the RFP/GFP comparison between the original mapylrs and the recombinant mapylrs (long N-terminal mapylrs and N-his mapylrs) after the introduction of non-natural amino acids. In the figure, nhis represents N-his mapylrs, long terminal represents long N-terminal mapylrs, and no tag represents the original mapylrs.
图8表示的是pKM-CAS1.0-KlLys1-5质粒结构示意图。FIG8 shows a schematic diagram of the structure of the pKM-CAS1.0-K1Lys1-5 plasmid.
图9表示的是pKM-MaPylRS质粒结构示意图。FIG. 9 shows a schematic diagram of the structure of the pKM-MaPylRS plasmid.
图10表示的是pKM-CAS1.0-KlglpA质粒结构示意图。FIG. 10 shows a schematic diagram of the structure of the pKM-CAS1.0-KlglpA plasmid.
图11表示的是pKM-CAS1.0-KlUPF1质粒结构示意图。FIG. 11 shows a schematic diagram of the structure of the pKM-CAS1.0-K1UPF1 plasmid.
图12表示的是实施例9中测得的RFP活性图。FIG. 12 is a graph showing the RFP activity measured in Example 9.
图13表示的是实施例9中测得的GFP活性图。FIG. 13 is a graph showing the GFP activity measured in Example 9.
图14表示的是实施例9中测得的RFP/GFP活性图。FIG. 14 is a graph showing the RFP/GFP activity measured in Example 9.
图15表示的是实施例10中测得的GFP活性图。FIG. 15 is a graph showing the GFP activity measured in Example 10.
图16表示的是实施例10中测得的RFP活性图。FIG. 16 is a graph showing the RFP activity measured in Example 10.
图17表示的是实施例10中测得的RFP/GFP活性图。FIG. 17 is a graph showing the RFP/GFP activity measured in Example 10.
本文的附图中,出现“non”或“control”的,均指反应中为加入非天然氨基酸。In the figures herein, the words “non” or “control” refer to the addition of non-natural amino acids in the reaction.
需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。It should be noted that, in the absence of conflict, the embodiments and features in the embodiments of the present application may be combined with each other.
下面结合具体实施方式和实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,优先按照、参考上文所述的具体实施方式指引的条件,然后可按照常规条件,或按照制造厂商所建议的条件。The present invention is further described below in conjunction with specific embodiments and examples. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods in the following examples that do not specify specific conditions are preferably followed or referred to the conditions indicated in the specific embodiments described above, and then can be followed according to conventional conditions or according to the conditions recommended by the manufacturer.
除非另外说明,否则本发明中提及的百分比和份数是重量百分比和重量份数。Unless otherwise specified, the percentages and parts mentioned in the present invention are percentages and parts by weight.
如无特别说明,则本发明实施例中所用的材料和试剂均为市售产品。Unless otherwise specified, the materials and reagents used in the examples of the present invention are all commercially available products.
本申请中的温度单位如无特殊说明,均为摄氏度(℃)。Unless otherwise specified, the temperature units in this application are all degrees Celsius (°C).
名词和术语Nouns and terms
以下是针对本发明所采用的部分相关“名词”、“术语”的含义进行的解释或说明,以便于更好地理解本发明。相应的解释或说明适用于本发明的全文,既适用于下文,也适用于上文。本发明中涉及引用文献时,相关术语、名词、短语在引用文献中的定义也一并被引用,但是,与本发明中的定义相冲突时,以本发明中的定义为准。在引用文献中的定义与本发明中的定义发生冲突时,并不影响所引用的成分、物质、组合物、材料、体系、配方、种类、方法、设备等以引用文献中确定的内容为准。The following is an explanation or description of the meaning of some relevant "nouns" and "terms" used in the present invention, so as to better understand the present invention. The corresponding explanation or description applies to the full text of the present invention, both below and above. When the present invention involves references, the definitions of relevant terms, nouns, and phrases in the references are also quoted, but when they conflict with the definitions in the present invention, the definitions in the present invention shall prevail. When the definitions in the references conflict with the definitions in the present invention, it does not affect the cited components, substances, compositions, materials, systems, formulations, types, methods, equipment, etc., which shall be subject to the contents determined in the references.
本发明中,“优选”、“较佳”、“更优选”、“更佳”、“最优选”、“进一步优选”等优选实施方式,不构成对发明的涵盖范围及保护范围的任何意义上的限制,并非用于限定本发明的范围和实施方式,仅用于提供一些实施方式作为举例。In the present invention, preferred embodiments such as “preferred”, “better”, “more preferred”, “better”, “most preferred” and “further preferred” do not constitute any limitation on the scope of coverage and protection scope of the invention, and are not used to limit the scope and embodiments of the present invention, but are only used to provide some embodiments as examples.
本发明的描述中,对于“优选之一”、“优选方式之一”、“优选实施方式之一”、“优选例之一”、“优选例”、“在一优选的实施方式中”、“一些优选例中”、“一些优选方式中”、“优选为”、“优选”、“优选地”、“更优选”、“更优地”、“进一步优选”、“最优选”等优选方式,以及“实施方式之一”、“方式之一”、“示例”、“具体示例”、“举例如”、“作为举例”、“例如”、“比如”、“如”等示意的列举方式,同样不构成对发明的涵盖范围及保护范围的任何意义上的限制,且各方式所描述的具体特征包含于本发明的至少一个具体实施方式中。本发明中,各方式所描述的具体特征可以在任何的一个或者多个具体实施方式中以合适的方式结合。本发明中,各优选方式对应的技术特征或技术方案也可以通过任意合适的方式结合。In the description of the present invention, preferred modes such as "one of the preferred embodiments", "one of the preferred modes", "one of the preferred embodiments", "one of the preferred examples", "preferred examples", "in a preferred embodiment", "some preferred examples", "some preferred modes", "preferably", "preferably", "preferably", "more preferably", "more preferably", "further preferably", "most preferably", and illustrative enumeration modes such as "one of the embodiments", "one of the modes", "example", "specific example", "for example", "as an example", "for example", "such as", "such as", etc., do not constitute any limitation on the scope of coverage and protection scope of the invention, and the specific features described in each mode are included in at least one specific embodiment of the present invention. In the present invention, the specific features described in each mode may be combined in a suitable manner in any one or more specific embodiments. In the present invention, the technical features or technical solutions corresponding to each preferred mode may also be combined in any suitable manner.
本发明中,“其任意组合”,在数量上表示“大于1”,在涵盖范围上表示以下情形构成的组:“任选其中一个,或者任选其中至少两个构成的组”。In the present invention, "any combination thereof" means "greater than 1" in terms of quantity, and means a group consisting of the following situations in terms of coverage: "select any one of them, or select a group consisting of at least two of them".
本发明中,“一个或多个”、“一种或多种”等“一或多”的描述,与“至少一个”、“至少一种”、“其组合”、“或其组合”、“及其组合”、“或其任意组合”、“及其任意组合”等具有相同含义,可以互换使用,表示数量上等于“1”或“大于1”。In the present invention, the descriptions of "one or more", "one or more" and "one or more" have the same meaning as "at least one", "at least one", "a combination thereof", "or a combination thereof", "and combinations thereof", "or any combination thereof", "and any combination thereof", etc. and can be used interchangeably to indicate that the quantity is equal to "1" or "greater than 1".
本发明中,采用“或/和”、“和/或”表示“任选其一或者任选其组合”,也表示至少其一。In the present invention, "or/and" and "and/or" are used to represent "optionally one of them or optionally a combination of them", and also represent at least one of them.
术语“约”可以是指在本领域普通技术人员确定的特定值或组成的可接受误差范围内的值或组成,其将部分地取决于如何测量或测定值或组成。例如,如本文所用,表述“约100”包括99和101和之间的全部值(例如,99.1、99.2、99.3、99.4等)。The term "about" can refer to a value or composition that is within an acceptable error range for a particular value or composition as determined by one of ordinary skill in the art, which will depend in part on how the value or composition is measured or determined. For example, as used herein, the expression "about 100" includes all values between 99 and 101 (e.g., 99.1, 99.2, 99.3, 99.4, etc.).
序列同一性(或同源性)通过沿着预定的比较窗(其可以是参考核苷酸序列或蛋白的长度的50%、60%、70%、80%、90%、95%或100%)比较两个对齐的序列,并且确定出现相同的残基的位置的数目来确定。通常地,这表示为百分比。核苷酸序列的序列同一性的测量是本领域技术人员熟知的方法。Sequence identity (or homology) is determined by comparing two aligned sequences along a predetermined comparison window (which can be 50%, 60%, 70%, 80%, 90%, 95% or 100% of the length of the reference nucleotide sequence or protein) and determining the number of positions at which identical residues occur. Typically, this is expressed as a percentage. The measurement of sequence identity of nucleotide sequences is a method well known to those skilled in the art.
本发明所述的“通常”、“常规”、“一般”、“经常”、“往往”等方式描述的现有技术手段,也都被引用作为本发明内容的参考,如无特别说明,可视为本发明的部分技术特征的优选方式之一,且需要注意的是,不构成对发明的涵盖范围及保护范围的任何意义上的限制。The prior art means described in the present invention in such ways as “usually”, “conventional”, “general”, “frequently” and “often” are also cited as references to the contents of the present invention. Unless otherwise specified, they can be regarded as one of the preferred ways of some technical features of the present invention. It should be noted that they do not constitute any limitation on the scope of coverage and protection scope of the invention.
在本发明提及的所有文献及这些文献直接引用或者间接引用的文献,都在本申请中被引用作为参考,就如同每一篇文献被单独引用作为参考那样。All documents mentioned in the present invention and documents directly or indirectly cited by these documents are cited in this application as references, just as if each document was cited individually as a reference.
应理解,在本发明范围内,本发明的上述各技术特征和在下文(包括但不限于实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案,只要能够用于实施本发明的即可。限于篇幅,不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described below (including but not limited to the embodiments) can be combined with each other to form a new or preferred technical solution, as long as it can be used to implement the present invention. Due to space limitations, they will not be described one by one.
体外蛋白合成反应,是指在体外无细胞合成体系中合成蛋白的反应,至少包括翻译过程。包括但不限于IVT反应(体外翻译反应)、IVTT反应(体外转录翻译反应)、IVDTT反应(体外复制转录翻译反应)。本发明中,优选IVTT反应。IVTT反应,对应IVTT体系,是在体外将DNA转录翻译为蛋白质(Protein)的过程,因此,我们还将这类的体外蛋白合成体系称为D2P体系、D-to-P体系、D_to_P体系、DNA-to-Protein体系;相应的体外蛋白合成方法,还称为D2P方法、D-to-P方法、D_to_P方法、DNA-to-Protein方法。In vitro protein synthesis reaction refers to the reaction of synthesizing protein in an in vitro cell-free synthesis system, including at least the translation process. Including but not limited to IVT reaction (in vitro translation reaction), IVTT reaction (in vitro transcription and translation reaction), IVDTT reaction (in vitro replication transcription and translation reaction). In the present invention, IVTT reaction is preferred. IVTT reaction, corresponding to IVTT system, is the process of transcribing and translating DNA into protein (Protein) in vitro. Therefore, we also refer to this type of in vitro protein synthesis system as D2P system, D-to-P system, D_to_P system, DNA-to-Protein system; the corresponding in vitro protein synthesis method is also called D2P method, D-to-P method, D_to_P method, DNA-to-Protein method.
“无细胞体系”,是指进行体外蛋白合成时,并非通过完整细胞分泌表达的方式。需要说明的是,本发明的体外无细胞蛋白合成体系中,也允许添加细胞组分以促进反应,但所添加的细胞不以分泌表达外源目标蛋白为主要目的。此外,在本发明指导下构建的无完整细胞的D2P体系中,有意地添加少量完整细胞(例如,其提供的蛋白含量与细胞提取物提供的蛋白含量相比,不超过30wt%),这样的“规避”方式,也囊括在本发明的保护范围之内。"Cell-free system" refers to a method of in vitro protein synthesis that is not secretory expression through intact cells. It should be noted that in the in vitro cell-free protein synthesis system of the present invention, it is also allowed to add cell components to promote the reaction, but the added cells are not primarily intended to secrete and express exogenous target proteins. In addition, in the intact cell-free D2P system constructed under the guidance of the present invention, a small amount of intact cells are intentionally added (for example, the protein content provided by them is not more than 30wt% compared to the protein content provided by the cell extract). Such a "circumvention" method is also included in the scope of protection of the present invention.
目的蛋白:本发明的体外蛋白合成体系的目的表达产物,并非由宿主细胞分泌合成,而是基于外源核酸模板进行体外合成,也可称为外源蛋白。所述外源蛋白可以为蛋白、融合蛋白、含蛋白分子或融合蛋白分子的混合物;还广义地包括多肽。基于编码目标蛋白的核酸模板进行体外蛋白合成反应后所得产物,可以为单一物质,也可以为两种或两种以上物质的组合。“外源蛋白”、“目的蛋白”、“目标蛋白”、“目标翻译产物”,具有相同含义,可译为“objective protein”、“interested protein”、“objective translated product”、“interested protein product”等方式,本发明中可互换使用。Target protein: The target expression product of the in vitro protein synthesis system of the present invention is not synthesized by secretion from the host cell, but is synthesized in vitro based on an exogenous nucleic acid template, and can also be called an exogenous protein. The exogenous protein can be a protein, a fusion protein, a mixture of protein molecules or fusion protein molecules; it also broadly includes polypeptides. The product obtained after an in vitro protein synthesis reaction based on a nucleic acid template encoding the target protein can be a single substance or a combination of two or more substances. "Exogenous protein", "target protein", "target protein", "target translation product" have the same meaning and can be translated as "objective protein", "interested protein", "objective translated product", "interested protein product", etc., and can be used interchangeably in the present invention.
D2P,DNA-to-Protein,从DNA模板到蛋白质产物。比如,D2P技术、D2P体系、D2P方法、D2P试剂盒等等。D2P, DNA-to-Protein, from DNA template to protein product. For example, D2P technology, D2P system, D2P method, D2P kit, etc.
“本发明的表达系统”、“本发明的体外表达系统”、“体外无细胞表达系统”、“体外无细胞表达体系”可互换使用,均指本发明的体外蛋白表达体系,也可采用其它描述方式,如:蛋白质体外合成系统、体外蛋白合成体系、无细胞系统、无细胞体系、无细胞蛋白合成体系、无细胞体外蛋白合成体系、体外无细胞蛋白合成体系、体外无细胞合成体系、CFS体系(cell-free system)、CFPS体系(cell-free protein synthesis system)等描述方式。根据反应机理,可包括体外翻译体系(可简记为IVT体系,一种mR2P体系)、体外转录翻译体系(可简记为IVTT体系,一种D2P体系)、体外复制转录翻译体系(可简记为IVDTT体系,一种D2P体系)等。本发明中,优选IVTT体系。我们还将体外蛋白合成系统称为“蛋白质合成工厂”(“Protein Factory”或“proteinfactory”或“Proteinfactory”)。本发明提供的体外蛋白合成系统,对其组分是采用开放式的描述方式的。本发明的无细胞蛋白合成体系是以外源DNA、mRNA或者其组合作为蛋白质合成的核酸模板,通过人工控制补加蛋白质合成所需的底物和转录、翻译相关蛋白因子等物质,实现目标蛋白的体外合成。"The expression system of the present invention", "the in vitro expression system of the present invention", "the in vitro cell-free expression system", "the in vitro cell-free expression system" can be used interchangeably, all referring to the in vitro protein expression system of the present invention, and other descriptions can also be used, such as: protein in vitro synthesis system, in vitro protein synthesis system, cell-free system, cell-free system, cell-free protein synthesis system, cell-free in vitro protein synthesis system, in vitro cell-free protein synthesis system, in vitro cell-free synthesis system, CFS system (cell-free system), CFPS system (cell-free protein synthesis system) and other descriptions. According to the reaction mechanism, it can include an in vitro translation system (which can be abbreviated as IVT system, a mR2P system), an in vitro transcription translation system (which can be abbreviated as IVTT system, a D2P system), an in vitro replication transcription translation system (which can be abbreviated as IVDTT system, a D2P system), etc. In the present invention, the IVTT system is preferred. We also call the in vitro protein synthesis system a "protein synthesis factory" ("Protein Factory" or "proteinfactory" or "Proteinfactory"). The in vitro protein synthesis system provided by the present invention adopts an open description method for its components. The cell-free protein synthesis system of the present invention uses exogenous DNA, mRNA or a combination thereof as a nucleic acid template for protein synthesis, and achieves in vitro synthesis of the target protein by artificially controlling the addition of substrates and transcription, translation-related protein factors and other substances required for protein synthesis.
本发明中,“蛋白”与“蛋白质”具有相同含义,均译为protein,可以互换使用。In the present invention, "protein" and "protein" have the same meaning, are both translated as protein, and can be used interchangeably.
本发明中,“系统”和“体系”,均译为system,可以互换使用。In the present invention, "system" and "system" are both translated as system and can be used interchangeably.
本发明中,“蛋白合成量”、“蛋白表达量”与“蛋白表达产量”具有相同含义,可互换使用。In the present invention, "protein synthesis amount", "protein expression amount" and "protein expression yield" have the same meaning and can be used interchangeably.
本发明中,细胞提取物、细胞提取液、细胞裂解物、细胞破碎物、细胞溶解产物的含义相同,可以互换使用,英文可采用cell extract、cell lysate等描述方式。In the present invention, cell extract, cell extract, cell lysate, cell disruption and cell lysis product have the same meaning and can be used interchangeably. In English, cell extract, cell lysate and the like can be used as descriptions.
本发明中,能量体系、能量系统、能量供应体系具有同等含义,可互换使用。能量再生体系、能量再生系统具有同等含义,可互换使用。能量再生系统是能量系统的优选实施方式或者组成部分。In the present invention, energy system, energy system, and energy supply system have the same meaning and can be used interchangeably. Energy regeneration system and energy regeneration system have the same meaning and can be used interchangeably. The energy regeneration system is a preferred embodiment or component of the energy system.
进一步的,本发明提供了一种无细胞蛋白质合成体系,至少包括细胞提取物或细胞裂解液。Furthermore, the present invention provides a cell-free protein synthesis system, which at least includes a cell extract or a cell lysate.
进一步优选的,所述的无细胞蛋白质合成体系还包括选自下组的一种或多种组分:用于合成RNA的底物,用于合成蛋白的底物,聚乙二醇或其类似物,镁离子,钾离子,缓冲剂,RNA聚合酶,能量再生系统,二硫苏糖醇,任选的水性溶剂。Further preferably, the cell-free protein synthesis system further comprises one or more components selected from the following group: a substrate for synthesizing RNA, a substrate for synthesizing protein, polyethylene glycol or its analogues, magnesium ions, potassium ions, a buffer, RNA polymerase, an energy regeneration system, dithiothreitol, and an optional aqueous solvent.
进一步优选的,所述的合成RNA的底物包括:核苷单磷酸、核苷三磷酸之一或其组合。Further preferably, the substrate for synthesizing RNA includes: nucleoside monophosphate, nucleoside triphosphate or a combination thereof.
进一步优选的,所述的合成蛋白质的底物包括:20种天然氨基酸以及非天然氨基酸。Further preferably, the substrate for synthesizing protein includes: 20 natural amino acids and unnatural amino acids.
进一步优选的,所述镁离子来源于镁离子源,所述镁离子源选自下组:醋酸镁、谷氨酸镁之一或其组合。Further preferably, the magnesium ions are derived from a magnesium ion source, and the magnesium ion source is selected from the following group: magnesium acetate, magnesium glutamate, or a combination thereof.
进一步优选的,所述钾离子来源于钾离子源,所述钾离子源选自下组:醋酸钾、谷氨酸钾之一或其组合。Further preferably, the potassium ions are derived from a potassium ion source, and the potassium ion source is selected from the following group: potassium acetate, potassium glutamate, or a combination thereof.
进一步优选的,所述能量再生系统选自下组:磷酸肌酸/磷酸肌酸酶系统、糖酵解途径及其中间产物能量系统之一或其组合。Further preferably, the energy regeneration system is selected from the following group: one or a combination of creatine phosphate/creatine phosphate enzyme system, glycolysis pathway and its intermediate energy system.
进一步优选的,所述能量再生系统包括葡萄糖/磷酸盐体系,所述磷酸盐选自下组:磷酸三钾、磷酸三铵、磷酸三钠、磷酸氢二钾、磷酸氢二铵、磷酸氢二钠、磷酸二氢钾、磷酸二氢铵、磷酸二氢钠之一或其组合。Further preferably, the energy regeneration system comprises a glucose/phosphate system, and the phosphate is selected from the following group: tripotassium phosphate, triammonium phosphate, trisodium phosphate, dipotassium hydrogen phosphate, diammonium hydrogen phosphate, disodium hydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate, sodium dihydrogen phosphate, or a combination thereof.
进一步优选的,所述缓冲剂选自下组:4-羟乙基哌嗪乙磺酸、三羟甲基氨基甲烷之一或其组合。Further preferably, the buffer is selected from the following group: 4-hydroxyethylpiperazineethanesulfonic acid, tris(hydroxymethyl)aminomethane, or a combination thereof.
进一步优选的,所述体外蛋白质合成体系含有聚乙二醇(PEG)或其类似物。聚乙二醇或其类似物的浓度没有特别限制,通常,聚乙二醇或其类似物的浓度(w/v)为0.1-8%,较佳地,0.5-4%,更佳地,1-2%,以所述蛋白合成体系的总重量计。代表性的PEG选自下组:PEG3000、PEG3350、PEG6000、PEG8000之一或其组合。Further preferably, the in vitro protein synthesis system contains polyethylene glycol (PEG) or its analogs. The concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the protein synthesis system. Representative PEG is selected from the following group: one of PEG3000, PEG3350, PEG6000, PEG8000 or a combination thereof.
进一步优选的,所述聚乙二醇包括分子量(Da)为200-10000的聚乙二醇,如PEG200、400、1500、2000、4000、6000、8000、10000等,较佳地,分子量为3000-10000的聚乙二醇。Further preferably, the polyethylene glycol includes polyethylene glycol with a molecular weight (Da) of 200-10000, such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc., preferably, polyethylene glycol with a molecular weight of 3000-10000.
在本发明中,RNA聚合酶没有特别限制,可以选自一种或多种RNA聚合酶,典型的RNA聚合酶为T7 RNA聚合酶。In the present invention, the RNA polymerase is not particularly limited and can be selected from one or more RNA polymerases. A typical RNA polymerase is T7 RNA polymerase.
可选的方案为,本发明提供的体外蛋白合成体系包括:细胞提取物,4-羟乙基哌嗪乙磺酸,醋酸钾,醋酸镁,腺嘌呤核苷三磷酸(ATP),鸟嘌呤核苷三磷酸(GTP),胞嘧啶核苷三磷酸(CTP),胸腺嘧啶核苷三磷酸(TTP),氨基酸混合物,磷酸肌酸,二硫苏糖醇(DTT),磷酸肌酸激酶,RNA聚合酶。An optional scheme is that the in vitro protein synthesis system provided by the present invention includes: cell extract, 4-hydroxyethylpiperazineethanesulfonic acid, potassium acetate, magnesium acetate, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytosine triphosphate (CTP), thymidine triphosphate (TTP), amino acid mixture, creatine phosphate, dithiothreitol (DTT), creatine phosphate kinase, and RNA polymerase.
在本发明中,所述的细胞提取物不含完整的细胞,典型的细胞提取物包括用于蛋白翻译的核糖体、氨酰tRNA合成酶、蛋白质合成需要的起始因子和延伸因子以及终止释放因子。此外,细胞提取物中还含有一些源自细胞的细胞质中的其他蛋白,尤其是可溶性蛋白。In the present invention, the cell extract does not contain intact cells, and a typical cell extract includes ribosomes, aminoacyl-tRNA synthetases, initiation factors and elongation factors required for protein synthesis, and termination release factors for protein translation. In addition, the cell extract also contains some other proteins derived from the cytoplasm of the cell, especially soluble proteins.
在本发明中,所述细胞提取物在体外无细胞蛋白合成体系中的比例不受特别限制,通常所述细胞提取物在体外无细胞蛋白合成体系中所占体系为20-70%,较佳地,30-60%,更佳地,40-50%。In the present invention, the proportion of the cell extract in the in vitro cell-free protein synthesis system is not particularly limited. Usually, the cell extract accounts for 20-70% of the in vitro cell-free protein synthesis system, preferably 30-60%, and more preferably 40-50%.
在本发明中,所述的细胞提取物所含蛋白含量为20-100mg/mL,较佳为50-100mg/mL。所述的测定蛋白含量方法为考马斯亮蓝测定方法。In the present invention, the protein content of the cell extract is 20-100 mg/mL, preferably 50-100 mg/mL. The method for determining the protein content is the Coomassie Brilliant Blue determination method.
本发明还提供了一种载体或载体组合,所述载体含有本发明的核酸构建物。优选地,所述载体选自:细菌质粒、噬菌体、酵母质粒、动物细胞载体、穿梭载体;所述的载体为转座子载体。用于制备重组载体的方法是本领域普通技术人员所熟知的。只要其能够在宿主体内复制和稳定,任何质粒和载体都是可以被采用的。The present invention also provides a vector or vector combination, wherein the vector contains the nucleic acid construct of the present invention. Preferably, the vector is selected from: bacterial plasmid, bacteriophage, yeast plasmid, animal cell vector, shuttle vector; the vector is a transposon vector. Methods for preparing recombinant vectors are well known to those of ordinary skill in the art. As long as they can replicate and be stable in the host, any plasmid and vector can be used.
本领域普通技术人员可以使用熟知的方法构建含有本发明所述的启动子和/或目的基因序列的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。Those skilled in the art can use well-known methods to construct expression vectors containing the promoter and/or target gene sequence of the present invention, including in vitro recombinant DNA technology, DNA synthesis technology, in vivo recombination technology, etc.
模板DNA(template DNA)Template DNA
模板DNA为编码任意待合成目标蛋白的核苷酸序列,其可以为原始序列,也可以为人工合成序列或人工改造序列,利用该模板DNA能够合成相应的RNA和/或蛋白质。The template DNA is a nucleotide sequence encoding any target protein to be synthesized, which may be an original sequence, an artificially synthesized sequence, or an artificially modified sequence. The template DNA may be used to synthesize the corresponding RNA and/or protein.
在本发明中,所述的细胞提取物的制备方法不受限制,一种优选的制备方法In the present invention, the preparation method of the cell extract is not limited. A preferred preparation method is
包括以下步骤:The following steps are involved:
(i)提供细胞;(i) providing cells;
(ii)对细胞进行洗涤处理,获得经洗涤的细胞;(ii) washing the cells to obtain washed cells;
(iii)对经洗涤的细胞进行破细胞处理,从而获得细胞粗提物;(iii) disrupting the washed cells to obtain a crude cell extract;
(iv)对所述细胞粗提物进行固液分离,获得液体部分,即为细胞提取物。(iv) subjecting the crude cell extract to solid-liquid separation to obtain a liquid portion, namely the cell extract.
在本发明中,所述的固液分离方式不受特别限制,一种优选的方式为离心。In the present invention, the solid-liquid separation method is not particularly limited, and a preferred method is centrifugation.
在一优选实施方式中,所述离心在液态下进行。In a preferred embodiment, the centrifugation is performed in a liquid state.
在本发明中,所述离心条件不受特别限制,一种优选的离心条件为5000-100000g,较佳地,8000-30000g。In the present invention, the centrifugation conditions are not particularly limited, and a preferred centrifugation condition is 5000-100000 g, preferably 8000-30000 g.
在本发明中,所述离心时间不受特别限制,一种优选的离心时间为0.5min-2h,较佳地,20min-50min。In the present invention, the centrifugation time is not particularly limited, and a preferred centrifugation time is 0.5 min-2 h, preferably 20 min-50 min.
在本发明中,所述离心的温度不受特别限制,优选的,所述离心在1-10℃下进行,较佳地,在2-6℃下进行。In the present invention, the centrifugation temperature is not particularly limited. Preferably, the centrifugation is performed at 1-10°C, more preferably, at 2-6°C.
在本发明中,所述的洗涤处理方式不受特别限制,一种优选的洗涤处理方式为采用洗涤液在pH为7-8(较佳地,7.4)下进行处理,所述洗涤液没有特别限制,典型的所述洗涤液选自下组:4-羟乙基哌嗪乙磺酸钾、醋酸钾、醋酸镁之一或其组合。In the present invention, the washing treatment method is not particularly limited. A preferred washing treatment method is to use a washing liquid at a pH of 7-8 (preferably 7.4). The washing liquid is not particularly limited. Typically, the washing liquid is selected from the following group: potassium 4-hydroxyethylpiperazineethanesulfonate, potassium acetate, magnesium acetate, or a combination thereof.
在本发明中,所述破细胞处理的方式不受特别限制,一种优选的所述的破细胞处理包括高压破碎、冻融(如液氮低温)破碎。In the present invention, the cell disruption treatment method is not particularly limited. A preferred cell disruption treatment includes high-pressure disruption and freeze-thaw (such as liquid nitrogen cryogenic) disruption.
所述体外无细胞蛋白合成体系中的核苷三磷酸混合物为腺嘌呤核苷三磷酸、鸟嘌呤核苷三磷酸、胞嘧啶核苷三磷酸和尿嘧啶核苷三磷酸。在本发明中,各种单核苷酸的浓度没有特别限制,通常每种单核苷酸的浓度为0.5-5mM,较佳地为1.0-2.0mM。The nucleoside triphosphate mixture in the in vitro cell-free protein synthesis system is adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate. In the present invention, the concentration of various mononucleotides is not particularly limited, and usually the concentration of each mononucleotide is 0.5-5mM, preferably 1.0-2.0mM.
所述体外无细胞蛋白合成体系中的氨基酸混合物可包括天然或非天然氨基酸,可包括D型或L型氨基酸。代表性的氨基酸包括(但并不限于)20种天然氨基酸:甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、蛋氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸和组氨酸。每种氨基酸的浓度通常为0.01-0.5mM,较佳地0.02-0.2mM,如0.05、0.06、0.07、0.08mM。The amino acid mixture in the in vitro cell-free protein synthesis system may include natural or non-natural amino acids, and may include D-type or L-type amino acids. Representative amino acids include (but are not limited to) 20 natural amino acids: glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine. The concentration of each amino acid is generally 0.01-0.5 mM, preferably 0.02-0.2 mM, such as 0.05, 0.06, 0.07, 0.08 mM.
在优选例中,所述体外无细胞蛋白合成体系还含有聚乙二醇或其类似物。聚乙二醇或其类似物的浓度没有特别限制,通常,聚乙二醇或其类似物的浓度(w/v)为0.1-8%,较佳地,0.5-4%,更佳地,1-2%,以所述生物合成体系的总重量计。代表性的PEG例子包括(但并不限于):PEG3000,PEG8000,PEG6000和PEG3350。应理解,本发明的体系还可包括其他各种分子量的聚乙二醇(如PEG200、400、1500、2000、4000、6000、8000、10000等)。In a preferred embodiment, the in vitro cell-free protein synthesis system further contains polyethylene glycol or its analogs. The concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the biosynthesis system. Representative examples of PEG include (but are not limited to): PEG3000, PEG8000, PEG6000 and PEG3350. It should be understood that the system of the present invention may also include polyethylene glycols of various other molecular weights (such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc.).
在优选例中,所述体外无细胞蛋白合成体系还含有蔗糖。蔗糖的浓度没有特别限制,通常,蔗糖的浓度为0.03-40wt%,较佳地,0.08-10wt%,更佳地,0.1-5wt%,以所述蛋白合成体系的总重量计。In a preferred embodiment, the in vitro cell-free protein synthesis system further contains sucrose. The concentration of sucrose is not particularly limited, and generally, the concentration of sucrose is 0.03-40wt%, preferably 0.08-10wt%, and more preferably 0.1-5wt%, based on the total weight of the protein synthesis system.
一种特别优选的体外无细胞蛋白合成体系,除了酵母细胞提取物之外,还含有以下组分:22mM pH为7.4的4-羟乙基哌嗪乙磺酸,30-150mM醋酸钾,1.0-5.0mM醋酸镁,1.5-4mM核苷三磷酸混合物,0.08-0.24mM的氨基酸混合物,25mM磷酸肌酸,1.7mM二硫苏糖醇,0.27mg/mL磷酸肌酸激酶,1%-4%聚乙二醇,0.5%-2%蔗糖,0.027-0.054mg/mL T7RNA聚合酶。A particularly preferred in vitro cell-free protein synthesis system, in addition to yeast cell extract, also contains the following components: 22mM 4-hydroxyethylpiperazineethanesulfonic acid with a pH of 7.4, 30-150mM potassium acetate, 1.0-5.0mM magnesium acetate, 1.5-4mM nucleoside triphosphate mixture, 0.08-0.24mM amino acid mixture, 25mM creatine phosphate, 1.7mM dithiothreitol, 0.27mg/mL creatine phosphate kinase, 1%-4% polyethylene glycol, 0.5%-2% sucrose, and 0.027-0.054mg/mL T7 RNA polymerase.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明。The present invention will be further described below in conjunction with specific examples. It should be understood that these examples are only used to illustrate the present invention.
而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sam brook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。除非另外说明,否则百分比和份数是重量百分比和重量份数。本发明以乳酸克鲁维酵母(Kluyveromyces lactis,简称K.lactis或kl)为实施例,但同样的设计和分析、实验方法也适用于其他酵母、动物细胞等真核细胞,以及原核细胞。It is not intended to limit the scope of the present invention. The experimental methods in the following examples where specific conditions are not specified are generally carried out under conventional conditions, such as the conditions described in Sam Brook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the conditions recommended by the manufacturer. Unless otherwise stated, percentages and parts are weight percentages and weight parts. The present invention takes Kluyveromyces lactis (K. lactis or kl for short) as an example, but the same design, analysis and experimental methods are also applicable to other eukaryotic cells such as yeast and animal cells, as well as prokaryotic cells.
本发明以乳酸克鲁维酵母(Kluyveromyces lactis,K.lactis)为实施例,但同样的设计和分析、实验方法也适用于其他酵母等低等真核细胞以及高等动物细胞。本发明中的基因改造方法为CRISPR-Cas9技术,但并不限于此,可以为已知的、现有的任意基因改造方法。The present invention uses Kluyveromyces lactis (K. lactis) as an example, but the same design, analysis and experimental methods are also applicable to other yeasts and other lower eukaryotic cells and higher animal cells. The gene modification method in the present invention is CRISPR-Cas9 technology, but it is not limited to this, and can be any known and existing gene modification method.
体外蛋白合成反应混合体系,也描述为体外蛋白合成反应混合物、反应混合体系、反应混合物,指包括体外蛋白合成体系、编码目标蛋白的核酸模板的混合体系;可以是均质的,也可以是非均质的,允许是溶液、乳浊液、悬浊液等液态体系。The in vitro protein synthesis reaction mixture system, also described as an in vitro protein synthesis reaction mixture, a reaction mixture system, or a reaction mixture, refers to a mixed system including an in vitro protein synthesis system and a nucleic acid template encoding a target protein; it may be homogeneous or heterogeneous, and may be a liquid system such as a solution, an emulsion, or a suspension.
本发明所述的Protein Factory中各组分的终浓度分别为:80%(v/v)乳酸克鲁维酵母提取物、15mM葡萄糖、320mM麦芽糊精(以葡糖糖单体计量摩尔浓度)、24mM磷酸三钾、1.8mM核苷三磷酸混合物(腺嘌呤核苷三磷酸、鸟嘌呤核苷三磷酸、胞嘧啶核苷三磷酸和尿嘧啶核苷三磷酸的混合物,每种核苷三磷酸的终浓度均为1.8mM)、0.7mM氨基酸混合物(甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、蛋氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸和组氨酸,每种氨基酸的终浓度均为0.7mM)、L-天门冬氨酸镁、80mM醋酸钾、2%(w/v)聚乙二醇8000、9.78mMpH8.0 Tris·HCl缓冲液、6%(w/v)海藻糖。乳酸克鲁维酵母细胞提取物的制备过程采用常规技术手段,参考CN109593656A记载的方法制备。制备步骤概括而言,包括:提供经发酵培养的乳酸克鲁维酵母细胞的适量原料,用液氮将细胞速冻,将细胞打碎,离心收集上清液,即可得到细胞提取物。所得乳酸克鲁维酵母细胞提取物中的蛋白浓度为20~40mg/mL。在以下实施例中,选择以(即prock,N-Ε-炔丙氧基羰基-L-赖氨酸盐酸盐)作为非天然氨基酸(简称ncaa)的代表,但是并不限定本申请的ncaa仅仅指的就是prock。Protein of the present invention The final concentrations of the components in Factory are: 80% (v/v) Kluyveromyces lactis extract, 15 mM glucose, 320 mM maltodextrin (measured in molar concentration of glucose monomers), 24 mM tripotassium phosphate, 1.8 mM nucleoside triphosphate mixture (a mixture of adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate, with the final concentration of each nucleoside triphosphate being 1.8 mM), 0.7 mM amino acid mixture (glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine, with the final concentration of each amino acid being 0.7 mM), magnesium L-aspartate, 80 mM potassium acetate, 2% (w/v) polyethylene glycol 8000, 9.78 mM pH 8.0 Tris·HCl buffer, 6% (w/v) trehalose. The preparation process of Kluyveromyces lactis cell extract adopts conventional technical means, and is prepared by referring to the method described in CN109593656A. In summary, the preparation steps include: providing an appropriate amount of raw materials of Kluyveromyces lactis cells that have been fermented and cultured, freezing the cells with liquid nitrogen, breaking the cells, and collecting the supernatant by centrifugation to obtain a cell extract. The protein concentration in the obtained Kluyveromyces lactis cell extract is 20-40 mg/mL. In the following examples, (i.e., prock, N-E-propargyloxycarbonyl-L-lysine hydrochloride) is a representative of non-natural amino acids (referred to as NCAA), but the NCAA in this application is not limited to only referring to prock.
实施例1:Mapylrs蛋白三种构建方式:Example 1: Three methods for constructing Mapylrs proteins:
第一种,Long N-terminal mapylrs:The first type, Long N-terminal mapylrs:
Mapylrs蛋白N-末端包含31个氨基酸构成的前导肽,其中含有6*His亲和纯化标签,T7 tag以及thrombin酶切位点(图1):The N-terminus of the Mapylrs protein contains a 31-amino acid leader peptide, which contains a 6*His affinity purification tag, a T7 tag, and a thrombin restriction site (Figure 1):
Long N-terminal mapylrs氨基酸序列:
Long N-terminal mapylrs amino acid sequence:
理论分子量:34349.98DaTheoretical molecular weight: 34349.98Da
第二种,N-his mapylrs:The second type, N-his mapylrs:
Mapylrs蛋白N-末端只有6*His亲和纯化标签,以及柔性接口(图2):
The Mapylrs protein has only a 6*His affinity purification tag at the N-terminus, as well as a flexible interface (Figure 2):
N-his mapylrs理论分子量:32091.35DaTheoretical molecular weight of N-his MAPYRS: 32091.35Da
第三种,C-his mapylrs:The third type, C-his mapylrs:
Mapylrs蛋白N-末端比报道序列多一个Gly,C-末端有亲和纯化标签6*His(图3):
The N-terminus of the Mapylrs protein has one more Gly than the reported sequence, and the C-terminus has an affinity purification tag 6*His (Figure 3):
C-his mapylrs理论分子量:31685.98C-His MAPYLRs theoretical molecular weight: 31685.98
第四种,与Long N-terminal mapylrs相比,去除了his标签:
The fourth type, compared to the Long N-terminal mapylrs, removes the his tag:
实施例2:构建表达质粒Example 2: Construction of expression plasmid
Long-N-terminal mapylrs表达质粒:Long-N-terminal MAPYLRs expression plasmid:
委托生工生物技术公司合成mapylrs全基因(WP_015505008),其中密码子进行了针对大肠杆菌表达优化。合成后的基因序列为:We commissioned Sangon Biotech to synthesize the full mapylrs gene (WP_015505008), in which the codons were optimized for E. coli expression. The synthesized gene sequence is:
Nde I
N
Xho IXho
通过NdeI/Xho I酶切位点插入pET28a载体。Insert into pET28a vector through NdeI/Xho I restriction site.
N-his maplys以及C-his maplys表达载体则通过PCR方法在pET28a-long-terminal mapylrs的基础上进行修改。所使用的引物分别是:The N-his maplys and C-his maplys expression vectors were modified based on pET28a-long-terminal mapylrs by PCR. The primers used were:
N-his mapylrs
N-his mapylrs
C-his mapylrs:C-his mapylrs:
C-his vector R:
C-his vector R:
扩增产物经过DnpI酶切,连接转化DH5α感受态细胞,提取质粒并测序。The amplified product was digested with DnpI, ligated and transformed into DH5α competent cells, and the plasmid was extracted and sequenced.
实施例3Example 3
1)挑取细菌单克隆,接种100ml LB(含有100mg/L卡那霉素),37℃过夜。1) Pick a single bacterial colony and inoculate 100 ml LB (containing 100 mg/L kanamycin) at 37°C overnight.
2)第二天按1:100接种到新鲜LB培养基中(含有100mg/L卡那霉素),37℃培养到OD600≈0.6,加入终浓度为0.1mM的IPTG,16℃诱导表达20hr。2) The next day, the cells were inoculated into fresh LB medium (containing 100 mg/L kanamycin) at a ratio of 1:100, cultured at 37°C until OD600≈0.6, and IPTG was added at a final concentration of 0.1 mM. Expression was induced at 16°C for 20 hr.
3)4℃,5000rpm离心20min收集菌体,按照20g湿菌体加入100ml裂解缓冲液(25mM Tris-HCl,500mM NaCl,25mM imidazole,5mMβ-巯基乙醇,1mM PMSF,0.1%Triton X-100)重悬菌体。利用高压均质机裂解细胞。3) Collect the cells by centrifugation at 5000 rpm for 20 min at 4°C, and resuspend 20 g of wet cells in 100 ml of lysis buffer (25 mM Tris-HCl, 500 mM NaCl, 25 mM imidazole, 5 mM β-mercaptoethanol, 1 mM PMSF, 0.1% Triton X-100). Lyse the cells using a high-pressure homogenizer.
4)4℃,20000rpm 20min两次离心,重复除去细菌碎片,上清用HisTrap亲和层析柱分离纯化目的蛋白。缓冲液A:25mM TrisHCl pH7.8,500mM NaCl,25mM imidazole;缓冲液B:25mM TrisHCl pH7.8,150mM NaCl,500mM imidazole;4) Centrifuge twice at 4°C, 20,000 rpm for 20 min, remove bacterial debris repeatedly, and separate and purify the target protein using a HisTrap affinity chromatography column. Buffer A: 25 mM TrisHCl pH 7.8, 500 mM NaCl, 25 mM imidazole; Buffer B: 25 mM TrisHCl pH 7.8, 150 mM NaCl, 500 mM imidazole;
5)细胞裂解上清流过亲和层析柱后,用10cv的缓冲液A重复冲洗亲和柱,再用10cv缓冲液B从0%~100%梯度洗脱目的蛋白。5) After the cell lysis supernatant flows through the affinity chromatography column, the affinity column is repeatedly washed with 10 cv of buffer A, and then the target protein is eluted with 10 cv of buffer B in a gradient from 0% to 100%.
6)收集并合并含有目的蛋白的组分,利用超滤离心管浓缩样品;6) Collect and combine the fractions containing the target protein and concentrate the sample using an ultrafiltration centrifuge tube;
7)浓缩后的样品对透析缓冲液:50%甘油,25mM Hepes pH7.5充分透析后测定蛋白浓度,并分装,-80℃至少可以保存1年。经图4可知,前三种形式的重组mapylrs经过诱导表达后裂解上清。具有更长的N-terminal的mapylrs蛋白表达量更高,并且蛋白的可溶性也更高。蛋白浓度最高可以达到120mg/mL(3.4mM)。7) After the concentrated sample is dialyzed against the dialysis buffer: 50% glycerol, 25mM Hepes pH7.5, the protein concentration is determined and the sample is aliquoted. It can be stored at -80°C for at least 1 year. As shown in Figure 4, the first three forms of recombinant mapylrs are lysed after induction of expression. The mapylrs protein with a longer N-terminal has a higher expression level and a higher solubility. The highest protein concentration can reach 120mg/mL (3.4mM).
实施例4比较三种构建方式纯化后蛋白的活性Example 4 Comparison of the activity of proteins purified by three construction methods
用nanodrop测定三种蛋白的浓度,用不含DNAase Rnase的ddH2O,将三种蛋白的浓度调整一致。The concentrations of the three proteins were measured using nanodrop, and then adjusted to the same level using ddH2O that did not contain DNAase RNase.
含非天然氨基酸表达体系建立:Establishment of expression system containing unnatural amino acids:
Protein Factory 100ulProtein Factory 100ul
Prock(非天然氨基酸)500mM 1ulProck (unnatural amino acid) 500mM 1ul
matRNACUA pyl体外转录产物(未纯化)10ulmatRNA CUA pyl in vitro transcription product (unpurified) 10ul
GFP-TAG-RFP双荧光报告基因PCR产物3ulGFP-TAG-RFP dual fluorescence reporter gene PCR product 3ul
不同形式的Mapylrs纯化蛋白终浓度5uMThe final concentration of purified proteins of different forms of Mapylrs was 5uM
Ex485nm/Em535检测GFP荧光强度,Ex535nm/Em595nm检测RFP荧光强度,根据RFP的荧光强度以及RFP/GFP的比值,判断非天然氨基酸的导入效率(具体参见图5)Ex485nm/Em535 detects the GFP fluorescence intensity, Ex535nm/Em595nm detects the RFP fluorescence intensity, and the introduction efficiency of the unnatural amino acid is determined based on the RFP fluorescence intensity and the RFP/GFP ratio (see Figure 5 for details).
由图5可知,根据RFP/GFP可见,三种形式的Mapylrs均能实现非天然氨基酸的高效导入,均具有很高的活性。其中,N-端标签通过稳定蛋白结构(Long-N-terminal mapylrs)使之具有更好的活性。As shown in Figure 5, according to RFP/GFP, all three forms of Mapylrs can achieve efficient introduction of unnatural amino acids and have high activity. Among them, the N-terminal tag stabilizes the protein structure (Long-N-terminal mapylrs) to make it more active.
实施例5检测原始Mapylrs(即未重组的Mapylrs)与N-端标签mapylrs纯化产物对于非天然氨基酸导入的活性影响。Example 5 detected the effects of the original Mapylrs (ie, non-recombinant Mapylrs) and the purified products of N-terminal tagged Mapylrs on the activity of the introduction of non-natural amino acids.
将原始Mapylrs与Long-N-terminal mapylrs、N-his maplys进行活性对比,测活方法同实施例4,采用双荧光蛋白表达法。The activities of original Mapylrs, Long-N-terminal mapylrs and N-his maplys were compared. The activity measurement method was the same as in Example 4, using the dual fluorescent protein expression method.
由图6和图7可知,与原始Mapylrs(图中为no tag)相比Long-N-terminal mapylrs、N-his maplys对导入非天然氨基酸的活性均有提升,说明经过本发明的改造,得到的重组的Mapylrs相对于原始Mapylrs在非天然氨基酸的导入方面具有明显的进步。As can be seen from Figures 6 and 7, compared with the original Mapylrs (no tag in the figure), Long-N-terminal mapylrs and N-his maplys have improved activities for the introduction of non-natural amino acids, indicating that after the modification of the present invention, the recombinant Mapylrs obtained have obvious progress in the introduction of non-natural amino acids compared with the original Mapylrs.
为了克服现有非天然氨基酸插入系统中需外源手动加入MaPylRS蛋白质,或需要转化/转染等导入含有其表达结构的质粒的缺陷,本发明还进一步公开了通过基因编辑技术将MaPylRS蛋白整合到细胞基因组内,创造了一个能够稳定适量表达MaPylRS蛋白的菌株,进而形成了一个无需外源加入MaPylRS的简便高效的非天然氨基酸插入系统。In order to overcome the defects of the existing non-natural amino acid insertion system that requires the exogenous manual addition of MaPylRS protein, or the need to introduce a plasmid containing its expression structure by transformation/transfection, the present invention further discloses the integration of MaPylRS protein into the cell genome through gene editing technology, creating a strain that can stably express MaPylRS protein in an appropriate amount, thereby forming a simple and efficient non-natural amino acid insertion system without the need for exogenous addition of MaPylRS.
下面仅以Long-N-terminal maplyrs去除了his标签为例(也即采用第四种结构),对MaPylRS蛋白整合到细胞基因组进行验证,这并不会对本发明其它的MaPylRS进行限制。In the following, only the Long-N-terminal maplyrs with the his tag removed is taken as an example (i.e., the fourth structure is used) to verify the integration of MaPylRS protein into the cell genome, which does not limit other MaPylRS of the present invention.
实施例6通过CRISPR-Cas9技术将MaPylRS表达盒(即第四种结构的表达盒,下同)插入在KlLys1-5附近Example 6 Inserting the MaPylRS expression cassette (i.e., the expression cassette of the fourth structure, the same below) near KlLys1-5 by CRISPR-Cas9 technology
(1)KlLys1-5序列检索及CRISPR gRNA序列确定(1) KlLys1-5 sequence retrieval and CRISPR gRNA sequence determination
为了不影响乳酸克鲁维酵母其他基因正常表达,本发明将MaPylRS表达结构插入到乳酸克鲁维酵母tDNA KlLys1-5附近,表达之后即与非天然氨基酸结合执行定点插入。In order not to affect the normal expression of other genes of Kluyveromyces lactis, the present invention inserts the MaPylRS expression structure near the tDNA KlLys1-5 of Kluyveromyces lactis, and after expression, it combines with non-natural amino acids to perform site-specific insertion.
i.http://gtrnadb.ucsc.edu/GtRNAdb2/genomes/eukaryota/Kluy_lact_NRRL_Y_1140/Kluy_lact_NRRL_Y_1140-gene-list.html中检索得到tRNA-Lys-CTT-1-5,得到K.lactis酵母中Lys1-5基因序列。在本发明中将此序列命名为KlLys1-5(位于染色体E的856274-856346)。i. tRNA-Lys-CTT-1-5 was retrieved from http://gtrnadb.ucsc.edu/GtRNAdb2/genomes/eukaryota/Kluy_lact_NRRL_Y_1140/Kluy_lact_NRRL_Y_1140-gene-list.html, and the Lys1-5 gene sequence in K. lactis yeast was obtained. In the present invention, this sequence is named K1Lys1-5 (located at 856274-856346 of chromosome E).
ii.在KlLys1-5上下游500bp范围内搜索PAM序列(NGG),最终选择位于基因下游的PAM(染色体E的856876...856878位点),并确定KlLys1-5 gRNA序列(GTTCCCATTGATCCCATATC(SEQ ID NO:11),位于染色体E的856856...856875位点)。ii. Search for the PAM sequence (NGG) within 500bp upstream and downstream of KlLys1-5, and finally select the PAM located downstream of the gene (sites 856876...856878 of chromosome E), and determine the KlLys1-5 gRNA sequence (GTTCCCATTGATCCCATATC (SEQ ID NO: 11), located at sites 856856...856875 of chromosome E).
(2)KlLys1-5 CRISPR-Cas9质粒构建(2) Construction of KlLys1-5 CRISPR-Cas9 plasmid
根据设计的gRNA序列,设计载体构建所需的两条24nt引物。gRNA-F1:AATCGTTCCCATTGATCCCATATC(SEQ ID NO:12);gRNA-R1:AAACGATATGGGATCAATGGGAAC(SEQ ID NO:13)将拿到引物稀释至10μM,向PCR管中分别加入gRNA-F1和gRNA-R2各10μL,混合后离心至管底,按以下程序进行退火:According to the designed gRNA sequence, design two 24nt primers required for vector construction. gRNA-F1: AATCGTTCCCATTGATCCCATATC (SEQ ID NO: 12); gRNA-R1: AAACGATATGGGATCAATGGGAAC (SEQ ID NO: 13) Dilute the primers to 10μM, add 10μL of gRNA-F1 and gRNA-R2 to the PCR tube, mix and centrifuge to the bottom of the tube, and anneal according to the following procedure:
95℃,3min;72℃,30s;65℃,2min;60℃,2min;55℃,2min;50℃,2min;16℃,2min之后进行连接反应95℃, 3min; 72℃, 30s; 65℃, 2min; 60℃, 2min; 55℃, 2min; 50℃, 2min; 16℃, 2min, then carry out the connection reaction
A.反应体系:10×Buffer 1μL,质粒20-50ng,退火产物1μL,酶0.2μL,加水至10μL。A. Reaction system: 10×Buffer 1μL, plasmid 20-50ng, annealing product 1μL, enzyme 0.2μL, add water to 10μL.
B.反应程序:16℃,60min。B. Reaction procedure: 16°C, 60 min.
取商品化E.coli DH5α感受态细胞50μL,加入所有连接产物后混匀,按说明书要求完成转化过程。在含50mg/L卡纳霉素的LB平板上筛选,过夜培养。挑取5个单克隆在LB液体培养基中振荡培养,送测序确认阳性后,提取质粒保存,命名为pKM-CAS1.0-KlLys1-5(图8)。Take 50 μL of commercial E. coli DH5α competent cells, add all the ligation products and mix well, and complete the transformation process according to the instructions. Screen on LB plates containing 50 mg/L kanamycin and culture overnight. Pick 5 single clones and shake culture in LB liquid culture medium. After sequencing to confirm the positive, extract the plasmid and save it, named pKM-CAS1.0-KlLys1-5 (Figure 8).
(3)供体DNA构建及扩增(3) Donor DNA construction and amplification
首先构建供体Donor,即KlLys1-5位点的同源重组序列和MaPylRS表达结构。MaPylRS的启动子选用KlPGK1 promoter,终止子为ScCYC1 terminator。供体DNA构建及转化方法如下:First, the donor Donor, i.e. the homologous recombination sequence of KlLys1-5 sites and the MaPylRS expression structure, was constructed. The promoter of MaPylRS was KlPGK1 promoter, and the terminator was ScCYC1 terminator. The donor DNA construction and transformation method were as follows:
iii.基因合成含有MaPylRS表达盒的质粒(命名为pKM-MaPylRS,参见图9),并以其为模板,以引物PF1:AATGTTCCCATTGATCCCATATCCTTCGAGCGTCCCAAAACC(SEQ ID NO:14)和引物PR1:TTCAGTTCAAAAACGCCCCGTTCCTCATCACTAGAAG(SEQ ID NO:15)进行PCR扩增得到MaPylRS表达盒片段。iii. Gene synthesize a plasmid containing the MaPylRS expression cassette (named pKM-MaPylRS, see Figure 9), and use it as a template to perform PCR amplification with primers PF1: AATGTTCCATTGATCCCATATCCTTCGAGCGTCCCAAAACC (SEQ ID NO: 14) and primer PR1: TTCAGTTCAAAAACGCCCCGTTCCTCATCACTAGAAG (SEQ ID NO: 15) to obtain the MaPylRS expression cassette fragment.
iv.以乳酸克鲁维酵母游离质粒为模板,以引物PF2:GTTATTAATGTCGTGTGCCATAGGT(SEQ ID NO:16)和引物PR2:AGGTTTTGGGACGCTCGAAGGATATGGGATCAATGGGAA(SEQ ID NO:17)进行PCR扩增得到KlLys1-5位点的同源臂1片段;以乳酸克鲁维酵母游离质粒为模板,以引物PF3:TTCTAGTGATGAGGAACGGGGCGTTTTTGAACTGAATTTCG(SEQ ID NO:18)和引物PR3:CAGCATAGCATTTGAGTATTGTG(SEQ ID NO:19)进行PCR扩增得到KlLys1-5位点的同源臂2片段。iv. Using the lactic acid Kluyveromyces free plasmid as a template, primers PF2: GTTATTAATGTCGTGTGCCATAGGT (SEQ ID NO: 16) and primer PR2: AGGTTTTGGGACGCTCGAAGGATATGGGATCAATGGGAA (SEQ ID NO: 17) were used for PCR amplification to obtain the homology arm 1 fragment of the KlLys1-5 site; using the lactic acid Kluyveromyces free plasmid as a template, primers PF3: TTCTAGTGATGAGGAACGGGGCGTTTTTGAACTGAATTTCG (SEQ ID NO: 18) and primer PR3: CAGCATAGCATTTGAGTATTGTG (SEQ ID NO: 19) were used for PCR amplification to obtain the homology arm 2 fragment of the KlLys1-5 site.
v.将上述得到的3个PCR产物片段混合、稀释100倍为模板,以引物PF4:ATAGGTCAATTAATAATATGCCAGCAAT(SEQ ID NO:20)和引物PR4:GGGGAGCATAGCATTCAAAAACTTC(SEQ ID NO:21)再次进行PCR扩增;即可通过重叠延伸PCR得到将上述3个片段连接起来,成为线性供体DNA Donor,测序确认后,保存在-20℃冰箱。v. Mix the three PCR product fragments obtained above, dilute them 100 times and use them as templates, and perform PCR amplification again with primer PF4: ATAGGTCAATTAATAATATGCCAGCAAT (SEQ ID NO: 20) and primer PR4: GGGGAGCATAGCATTCAAAAACTTC (SEQ ID NO: 21); the three fragments mentioned above can be connected by overlapping extension PCR to form a linear donor DNA Donor. After sequencing confirmation, store it in a -20℃ refrigerator.
(4)高效乳酸克鲁维酵母感受态制备及转化(4) Preparation and transformation of efficient Kluyveromyces lactis competent cells
酵母感受态制备Preparation of competent yeast
将乳酸克鲁维酵母菌液在YPD固体培养基上划线并挑取单克隆,于25mL 2×YPD液体培养基中振荡培养过夜,取2mL菌液于50mL液体2×YPD培养基中继续振荡培养2-8h。20℃条件下3000g离心5min收集酵母细胞,加入500μL无菌水重悬,同样条件下离心收集细胞。配制感受态细胞溶液(5%v/v甘油,10%v/v DMSO)并将酵母细胞溶解于500μL该溶液中。分装50μL至1.5mL离心管中,-80℃保存。Streak the Kluyveromyces lactis liquid on YPD solid medium and pick a single clone, shake culture overnight in 25mL 2×YPD liquid medium, take 2mL of the liquid and continue shaking culture in 50mL liquid 2×YPD medium for 2-8h. Collect yeast cells by centrifugation at 3000g for 5min at 20℃, add 500μL sterile water to resuspend, and collect cells by centrifugation under the same conditions. Prepare competent cell solution (5% v/v glycerol, 10% v/v DMSO) and dissolve yeast cells in 500μL of the solution. Aliquot 50μL into 1.5mL centrifuge tubes and store at -80℃.
酵母DNA转化Yeast DNA transformation
将感受态细胞置于冰上融化30s,加入pKM-CAS1.0-KlLys1-5质粒200ng,加入供体DNA 2000ng。电击1.5kV,5mS之后立即加入1mL YPD液体培养基,培养2-3h,吸取200μL涂布于固体YPD(200μg/mL G418)培养基,培养2-3天至单菌落出现。Thaw the competent cells on ice for 30 seconds, add 200ng of pKM-CAS1.0-KlLys1-5 plasmid and 2000ng of donor DNA. Immediately add 1mL YPD liquid medium after electric shock at 1.5kV for 5mS, culture for 2-3h, and apply 200μL on solid YPD (200μg/mL G418) medium, and culture for 2-3 days until single colonies appear.
(5)基因编辑阳性鉴定(5) Positive identification of gene editing
在乳酸克鲁维酵母转化后的平板上挑取50-60个单克隆,每个单克隆置于5μL酵母裂解液中(Takara Mighty Prep Reagent for DNA),以菌体裂解液为模板,分别以引物Ma R1(MaPylRS序列内引物):ATCTCTTACTTGAACGGTGCTA(SEQ ID NO:22);1-5F1(KlLys1-5供体DNA 5’外侧引物):GGTTATCCATTCAGGCAATGAAG(SEQ ID NO:23)和引物Ma F1(MaPylRS序列内引物):CCATGTGAGAACCTCTTGG(SEQ ID NO:24);1-5R1(KlLys1-5供体DNA 5’外侧引物):CAGCATAGCATTTGAGTATTGTG(SEQ ID NO:25)进行PCR扩增,对KlLys1-5位点的CRISPR插入进行检测,有阳性条带表明MaPylRS序列插入靶位点成功。50-60 single clones were picked from the plate after transformation of Kluyveromyces lactis, and each single clone was placed in 5 μL yeast lysis buffer (Takara Mighty Prep Reagent for DNA). The bacterial lysate was used as a template and the primers Ma R1 (primer within the MaPylRS sequence): ATCTCTTACTTGAACGGTGCTA (SEQ ID NO: 22); 1-5F1 (primer outside the 5' of KlLys1-5 donor DNA): GGTTATCCATTCAGGCAA PCR amplification was performed using primers Ma F1 (primer within the MaPylRS sequence): CCATGTGAGAACCTCTTGG (SEQ ID NO: 24); 1-5R1 (primer outside the 5’ of KlLys1-5 donor DNA): CAGCATAGCATTTGAGTATTGTG (SEQ ID NO: 25) to detect CRISPR insertion at the KlLys1-5 site. The presence of a positive band indicated that the MaPylRS sequence was successfully inserted into the target site.
实施例7通过CRISPR-Cas9技术将MaPylRS表达盒插入在KlglpA附近Example 7 Insertion of the MaPylRS expression cassette near KlglpA by CRISPR-Cas9 technology
(1)KlglpA序列检索及CRISPR gRNA序列确定(1) KlglpA sequence retrieval and CRISPR gRNA sequence determination
为了不影响乳酸克鲁维酵母其他基因正常表达,本发明将MaPylRS表达结构插入到乳酸克鲁维酵母tDNA KlglpA附近,表达之后即与非天然氨基酸结合执行定点插入。In order not to affect the normal expression of other genes of Kluyveromyces lactis, the present invention inserts the MaPylRS expression structure near the tDNA KlglpA of Kluyveromyces lactis, and after expression, it combines with non-natural amino acids to perform site-specific insertion.
i.https://www.genome.jp/kegg/kegg2.html中检索glycerol-3-phosphate dehydrogenase,得到K.lactis酵母中glpA基因序列。在本发明中将此序列命名为KlglpA(位于染色体A的33084-35012)。i. Search for glycerol-3-phosphate dehydrogenase in https://www.genome.jp/kegg/kegg2.html and obtain the glpA gene sequence in K. lactis yeast. In the present invention, this sequence is named KlglpA (located at 33084-35012 of chromosome A).
ii.在KlglpA上游1000-2000bp范围内搜索PAM序列(NGG),最终选择位于基因下游的PAM(染色体A的31956...31958位点),并确定KlglpA gRNA序列(GAAGTAACTCTAGCCATCGG(SEQ ID NO:26),位于染色体A的31936...31955位点)。ii. Search for the PAM sequence (NGG) in the range of 1000-2000bp upstream of KlglpA, and finally select the PAM located downstream of the gene (sites 31956...31958 of chromosome A), and determine the KlglpA gRNA sequence (GAAGTAACTCTAGCCATCGG (SEQ ID NO: 26), located at sites 31936...31955 of chromosome A).
(2)KlglpA CRISPR-Cas9质粒构建(2) Construction of KlglpA CRISPR-Cas9 plasmid
根据设计的gRNA序列,设计载体构建所需的两条24nt引物。gRNA-F2:AATCGAAGTAACTCTAGCCATCGG(SEQ ID NO:27);gRNA-R2:AAACCCGATGGCTAGAGTTACTTC(SEQ ID NO:28)将拿到引物稀释至10μM,向PCR管中分别加入gRNA-F2和gRNA-R2各10μL,混合后离心至管底,按以下程序进行退火:According to the designed gRNA sequence, design two 24nt primers required for vector construction. gRNA-F2: AATCGAAGTAACTCTAGCCATCGG (SEQ ID NO: 27); gRNA-R2: AAACCCGATGGCTAGAGTTACTTC (SEQ ID NO: 28) Dilute the primers to 10μM, add 10μL of gRNA-F2 and gRNA-R2 to the PCR tube, centrifuge to the bottom of the tube after mixing, and anneal according to the following procedure:
95℃,3min;72℃,30s;65℃,2min;60℃,2min;55℃,2min;50℃,2min;16℃,2min95℃, 3min; 72℃, 30s; 65℃, 2min; 60℃, 2min; 55℃, 2min; 50℃, 2min; 16℃, 2min
之后进行连接反应Then the ligation reaction
A.反应体系:10×Buffer 1μL,质粒20-50ng,退火产物1μL,酶0.2μL,加水至10μL。A. Reaction system: 10×Buffer 1μL, plasmid 20-50ng, annealing product 1μL, enzyme 0.2μL, add water to 10μL.
B.反应程序:16℃,60min。B. Reaction procedure: 16°C, 60 min.
取商品化E.coli DH5α感受态细胞50μL,加入所有连接产物后混匀,按说明书要求完成转化过程。在含50mg/L卡纳霉素的LB平板上筛选,过夜培养。挑取5个单克隆在LB液体培养基中振荡培养,送测序确认阳性后,提取质粒保存,命名为pKM-CAS1.0-KlglpA(图10)。Take 50 μL of commercial E. coli DH5α competent cells, add all the ligation products and mix well, and complete the transformation process according to the instructions. Screen on LB plates containing 50 mg/L kanamycin and culture overnight. Pick 5 single clones and shake culture in LB liquid culture medium. After sequencing to confirm the positive, extract the plasmid and save it, named pKM-CAS1.0-KlglpA (Figure 10).
(3)供体DNA构建及扩增(3) Donor DNA construction and amplification
首先构建供体Donor,即KlglpA位点的同源重组序列和MaPylRS表达结构。MaPylRS的启动子选用KlPGK1 promoter,终止子为ScCYC1 terminator。供体DNA构建及转化方法如下First, the donor Donor, i.e. the homologous recombination sequence of the KlglpA site and the MaPylRS expression structure, was constructed. The promoter of MaPylRS was KlPGK1 promoter and the terminator was ScCYC1 terminator. The donor DNA construction and transformation method are as follows
iii.基因合成含有MaPylRS表达盒的质粒,并以其为模板,以引物PF5:CCATCAGTTACGGTAGATTCTCCAGTGCCTACGTTCCTCATCACTAGAAG(SEQ ID NO:29)和引物PR5:TGTTTTGCGCTTGGTTTTCTTTGTGGAGAAATTTCTTCGAGCGTCCCAAA(SEQ ID NO:30)进行PCR扩增得到MaPylRS表达盒片段。iii. Genetically synthesize a plasmid containing the MaPylRS expression cassette, and use it as a template to perform PCR amplification with primer PF5: CCATCAGTTACGGTAGATTCTCCAGTGCCTACGTTCCTCATCACTAGAAG (SEQ ID NO: 29) and primer PR5: TGTTTTGCGCTTGGTTTTCTTTGTGGAGAAATTTCTTCGAGCGTCCCAAA (SEQ ID NO: 30) to obtain the MaPylRS expression cassette fragment.
iv.以乳酸克鲁维酵母游离质粒为模板,以引物PF6:AAATTAAGGCAAACATACAGG(SEQ ID NO:31)和引物PR6:CAACAGTTCGGCTTCTAGTGATGAGGAACGTAGGCACTGGAGAATCTACC(SEQ ID NO:32)进行PCR扩增得到KlglpA位点的同源臂1片段;以乳酸克鲁维酵母游离质粒为模板,以引物PF7:GCTTGAGAAGGTTTTGGGACGCTCGAAGAAATTTCTCCACAAAGAAAACC(SEQ ID NO:33)和引物PR7:GACCTTTTATTTTGTCACCG(SEQ ID NO:34)进行PCR扩增得到KlglpA位点的同源臂2片段。iv. Using the lactic acid Kluyveromyces free plasmid as a template, primers PF6:AAATTAAGGCAAACATACAGG (SEQ ID NO: 31) and primer PR6:CAACAGTTCGGCTTCTAGTGATGAGGAACGTAGGCACTGGAGAATCTACC (SEQ ID NO: 32) were used for PCR amplification to obtain the homology arm 1 fragment of the KlglpA site; using the lactic acid Kluyveromyces free plasmid as a template, primers PF7:GCTTGAGAAGGTTTTGGGACGCTCGAAGAAATTTCTCCACAAAGAAAACC (SEQ ID NO: 33) and primer PR7:GACCTTTTATTTTGTCACCG (SEQ ID NO: 34) were used for PCR amplification to obtain the homology arm 2 fragment of the KlglpA site.
v.将上述得到的3个PCR产物片段混合、稀释100倍为模板,以引物PF8:ATATCGGATGACATGCAGCAA(SEQ ID NO:35)和引物PR8:TTGTGTACCAAAACTTTCACGG(SEQ ID NO:36)再次进行PCR扩增;即可通过重叠延伸PCR得到将上述3个片段连接起来,成为线性供体DNA Donor,测序确认后,保存在-20摄氏度冰箱。v. Mix the three PCR product fragments obtained above, dilute them 100 times and use them as templates, and perform PCR amplification again with primer PF8: ATATCGGATGACATGCAGCAA (SEQ ID NO: 35) and primer PR8: TTGTGTACCAAAACTTTCACGG (SEQ ID NO: 36); then the three fragments can be connected by overlapping extension PCR to form a linear donor DNA Donor. After sequencing confirmation, store it in a -20 degrees Celsius refrigerator.
(4)高效乳酸克鲁维酵母感受态制备及转化(4) Preparation and transformation of efficient Kluyveromyces lactis competent cells
酵母感受态制备Preparation of competent yeast
将乳酸克鲁维酵母菌液在YPD固体培养基上划线并挑取单克隆,于25mL 2×YPD液体培养基中振荡培养过夜,取2mL菌液于50mL液体2×YPD培养基中继续振荡培养2-8h。20℃条件下3000g离心5min收集酵母细胞,加入500μL无菌水重悬,同样条件下离心收集细胞。配制感受态细胞溶液(5%v/v甘油,10%v/v DMSO)并将酵母细胞溶解于500μL该溶液中。分装50μL至1.5mL离心管中,-80℃保存。Streak the Kluyveromyces lactis liquid on YPD solid medium and pick a single clone, shake culture overnight in 25mL 2×YPD liquid medium, take 2mL of the liquid and continue shaking culture in 50mL liquid 2×YPD medium for 2-8h. Collect yeast cells by centrifugation at 3000g for 5min at 20℃, add 500μL sterile water to resuspend, and collect cells by centrifugation under the same conditions. Prepare competent cell solution (5% v/v glycerol, 10% v/v DMSO) and dissolve yeast cells in 500μL of the solution. Aliquot 50μL into 1.5mL centrifuge tubes and store at -80℃.
酵母DNA转化Yeast DNA transformation
将感受态细胞置于冰上融化30s,加入pKM-CAS1.0-KlglpA质粒200ng,加入供体DNA 2000ng。电击1.5kV,5mS之后立即加入1mL YPD液体培养基,培养2-3h,吸取200μL涂布于固体YPD(200μg/mL G418)培养基,培养2-3天至单菌落出现。Thaw the competent cells on ice for 30 seconds, add 200ng of pKM-CAS1.0-KlglpA plasmid and 2000ng of donor DNA. Immediately add 1mL YPD liquid medium after electric shock at 1.5kV for 5mS, culture for 2-3h, and apply 200μL on solid YPD (200μg/mL G418) medium, and culture for 2-3 days until single colonies appear.
(5)基因编辑阳性鉴定(5) Positive identification of gene editing
在乳酸克鲁维酵母转化后的平板上挑取50-60个单克隆,每个单克隆置于5μL酵母裂解液中(Takara Mighty Prep Reagent for DNA),以菌体裂解液为模板,分别以引物Ma R2(MaPylRS序列内引物):CCATGTGAGAACCTCTTGG(SEQ ID NO:37);glpA F1(KlglpA供体DNA 5’外侧引物):AAATTAAGGCAAACATACAGG(SEQ ID NO:38)和引物Ma F2(MaPylRS序列内引物):ATCTCTTACTTGAACGGTGCTA(SEQ ID NO:39);glpA R1(KlglpA供体DNA 5’外侧引物):GACCTTTTATTTTGTCACCG(SEQ ID NO:40)进行PCR扩增,对KlglpA位点的CRISPR插入进行检测,有阳性条带表明MaPylRS序列插入靶位点成功。50-60 single clones were picked from the plate after transformation of Kluyveromyces lactis, and each single clone was placed in 5 μL yeast lysis buffer (Takara Mighty Prep Reagent for DNA). The bacterial lysate was used as a template and the primers Ma R2 (primer within the MaPylRS sequence): CCATGTGAGAACCTCTTGG (SEQ ID NO: 37); glpA F1 (primer outside the 5' of KlglpA donor DNA): AAATTAAGGCAAACATA CAGG (SEQ ID NO: 38) and primer Ma F2 (primer within the MaPylRS sequence): ATCTCTTACTTGAACGGTGCTA (SEQ ID NO: 39); glpA R1 (primer outside the 5' of KlglpA donor DNA): GACCTTTTATTTTGTCACCG (SEQ ID NO: 40) were used for PCR amplification to detect the CRISPR insertion at the KlglpA site. The presence of a positive band indicated that the MaPylRS sequence was successfully inserted into the target site.
实施例8通过CRISPR-Cas9技术将KlUPF1敲除并替换为MaPylRS表达盒Example 8 KlUPF1 was knocked out and replaced with MaPylRS expression cassette by CRISPR-Cas9 technology
(1)KlUPF1序列检索及CRISPR gRNA序列确定(1) KlUPF1 sequence retrieval and CRISPR gRNA sequence determination
根据文献,酵母UPF1蛋白被证明参与了mRNA的降解,删除该基因可以减缓未成熟mRNA转录本的降解速度。因此,本发明将KlUPF1基因完全敲除,并替换为MaPylRS序列,使非天然氨基酸转录本集聚,从而提高非天然氨基酸插入效率。According to the literature, yeast UPF1 protein has been shown to be involved in the degradation of mRNA, and deleting this gene can slow down the degradation rate of immature mRNA transcripts. Therefore, the present invention completely knocks out the K1UPF1 gene and replaces it with the MaPylRS sequence to aggregate the unnatural amino acid transcripts, thereby improving the efficiency of unnatural amino acid insertion.
i.在http://www.yeastgenome.org/中检索“UPF1”,得到酿酒酵母中ScUPF1基因序列。在NCBI数据库中以UPF1基因进行BLAST比对分析,确定乳酸克鲁维酵母中UPF1同源基因序列KlUPF1(位于染色体B的567908...570817)。i. Search for "UPF1" in http://www.yeastgenome.org/ to obtain the ScUPF1 gene sequence in Saccharomyces cerevisiae. Perform BLAST comparison analysis on the UPF1 gene in the NCBI database to determine the UPF1 homologous gene sequence KlUPF1 in Kluyveromyces lactis (located at 567908...570817 on chromosome B).
ii.在KlUPF1基因两端搜索PAM序列(NGG),并确定gRNA序列。gRNA选择的原则为:GC含量适中,本发明的标准为GC含量为40%-60%;避免poly T结构的存在。最终,本发明确定的KlUPF1 gRNA1序列为TTGGCAAACGCATCGTCATA(SEQ ID NO:41)和gRNA1序列为CTTAAGGAAGTACAATGGAG(SEQ ID NO:42)。ii. Search for PAM sequences (NGG) at both ends of the KlUPF1 gene and determine the gRNA sequence. The principles for gRNA selection are: moderate GC content, the standard of the present invention is 40%-60% GC content; avoid the presence of poly T structure. Finally, the KlUPF1 gRNA1 sequence determined by the present invention is TTGGCAAACGCATCGTCATA (SEQ ID NO: 41) and the gRNA1 sequence is CTTAAGGAAGTACAATGGAG (SEQ ID NO: 42).
(2)KlUPF1 CRISPR-Cas9质粒构建(2) Construction of KlUPF1 CRISPR-Cas9 plasmid
根据设计的gRNA1和gRNA2序列,设计载体构建所需的两条24nt引物。gRNA-F3:AATCTTGGCAAACGCATCGTCATA(SEQ ID NO:43);gRNA-R3:AAACTATGACGATGCGTTTGCCAA(SEQ ID NO:44);gRNA-F4:AATCCTTAAGGAAGTACAATGGAG(SEQ ID NO:45);gRNA-R4:AAACCTCCATTGTACTTCCTTAAG(SEQ ID NO:46)将拿到引物稀释至10μM,向PCR管中分别加入gRNA-F和gRNA-R各10μL,混合后离心至管底,按以下程序进行退火:According to the designed gRNA1 and gRNA2 sequences, design two 24nt primers required for vector construction. gRNA-F3: AATCTTGGCAAACGCATCGTCATA (SEQ ID NO: 43); gRNA-R3: AAACTATGACGATGCGTTTGCCAA (SEQ ID NO: 44); gRNA-F4: AATCCTTTAAGGAAGTACAATGGAG (SEQ ID NO: 45); gRNA-R4: AAACCTCCATTGTACTTCCTTAAG (SEQ ID NO: 46) Dilute the primers to 10μM, add 10μL of gRNA-F and gRNA-R to the PCR tube, centrifuge to the bottom of the tube after mixing, and anneal according to the following procedure:
95℃,3min;72℃,30s;65℃,2min;60℃,2min;55℃,2min;50℃,2min;16℃,2min95℃, 3min; 72℃, 30s; 65℃, 2min; 60℃, 2min; 55℃, 2min; 50℃, 2min; 16℃, 2min
之后进行连接反应。Then the ligation reaction is carried out.
A.反应体系:10×Buffer 1μL,质粒20-50ng,退火产物1μL,酶0.2μL,加水至10μL。A. Reaction system: 10×Buffer 1μL, plasmid 20-50ng, annealing product 1μL, enzyme 0.2μL, add water to 10μL.
B.反应程序:16℃,60min。B. Reaction procedure: 16°C, 60 min.
取商品化E.coli DH5α感受态细胞50μL,加入所有连接产物后混匀,按说明书要求完成转化过程。在含50mg/L卡纳霉素的LB平板上筛选,过夜培养。挑取5个单克隆在LB液体培养基中振荡培养,送测序确认阳性后,提取质粒保存,命名为pKM-CAS1.0-KlUPF1(图11)。Take 50 μL of commercial E. coli DH5α competent cells, add all the ligation products and mix well, and complete the transformation process according to the instructions. Screen on LB plates containing 50 mg/L kanamycin and culture overnight. Pick 5 single clones and shake culture in LB liquid culture medium. After sequencing to confirm the positive, extract the plasmid and save it, named pKM-CAS1.0-KlUPF1 (Figure 11).
(3)供体DNA构建及扩增(3) Donor DNA construction and amplification
本发明首先构建供体Donor,UPF1编码序列置换为MaPylRS的编码序列。即插入的MaPylRS沿用UPF1的启动子和终止子。供体DNA构建及转化方法如下The present invention first constructs a donor Donor, and replaces the UPF1 coding sequence with the MaPylRS coding sequence. That is, the inserted MaPylRS uses the promoter and terminator of UPF1. The donor DNA construction and transformation method are as follows:
iii.基因合成含有MaPylRS表达盒的质粒,并以其为模板,以引物PF9:AGTACAATTAGAATCAAGTTTCCTTATGGGTTCTTCTTCTTCTGG(SEQ ID NO:47)和引物PR9:TAATATTATTTAATTAATGGATTGATACGCGTTCATGTTTAGTTGATCTTAGCACCGTTC(SEQ ID NO:48)进行PCR扩增得到MaPylRS的编码序列。iii. Gene synthesize a plasmid containing the MaPylRS expression cassette, and use it as a template to perform PCR amplification with primer PF9: AGTACAATTAGAATCAAGTTTCCTTATGGGTTCTTCTTCTTCTGG (SEQ ID NO: 47) and primer PR9: TAATATTATTTAATTAATGGATTGATACGCGTTCATGTTTAGTTGATCTTAGCACCGTTC (SEQ ID NO: 48) to obtain the coding sequence of MaPylRS.
iv.以乳酸克鲁维酵母游离质粒为模板,以引物PF10:CAATGGATACAGTTTCTCGCTA(SEQ ID NO:49)和引物PR10:ACCAGAAGAAGAAGAACCCATAAGGAAACTTGATTCTAATTGT(SEQ ID NO:50)进行PCR扩增得到KlUPF1位点的同源臂1片段;以乳酸克鲁维酵母游离质粒为模板,以引物PF11:TTGAACGGTGCTAAGATCAACTAAACATGAACGCGTATCAATC(SEQ ID NO:51)和引物PR11:CTTCGAGACTTCCAATGATCTC(SEQ ID NO:52)进行PCR扩增得到KlUPF1位点的同源臂2片段。iv. Using Kluyveromyces lactis free plasmid as template, primer PF10:CAATGGATACAGTTTCTCGCTA (SEQ ID NO: 49) and primer PR10:ACCAGAAGAAGAAGAACCCATAAGGAAACTTGATTCTAATTGT (SEQ ID NO: 50) were used for PCR amplification to obtain the homology arm 1 fragment of the KlUPF1 site; using Kluyveromyces lactis free plasmid as template, primer PF11:TTGAACGGTGCTAAGATCAACTAAACATGAACGCGTATCAATC (SEQ ID NO: 51) and primer PR11:CTTCGAGACTTCCAATGATCTC (SEQ ID NO: 52) were used for PCR amplification to obtain the homology arm 2 fragment of the KlUPF1 site.
v.将上述得到的3个PCR产物片段混合、稀释100倍为模板,以引物PF12:GATCGTCCATTAGCTTATCTACAAATGCC(SEQ ID NO:53)和引物PR12:GTGAGAATGCCAGACGAT(SEQ ID NO:54)再次进行PCR扩增;即可通过重叠延伸PCR得到将上述3个片段连接起来,成为线性供体DNA Donor,测序确认后,保存在-20摄氏度冰箱。v. Mix the three PCR product fragments obtained above, dilute them 100 times and use them as templates, and perform PCR amplification again with primer PF12: GATCGTCCATTAGCTTATCTACAAATGCC (SEQ ID NO: 53) and primer PR12: GTGAGAATGCCAGACGAT (SEQ ID NO: 54); then the three fragments can be connected by overlapping extension PCR to form a linear donor DNA Donor. After sequencing confirmation, store it in a -20 degrees Celsius refrigerator.
(4)乳酸克鲁维酵母转化及阳性鉴定(4) Kluyveromyces lactis transformation and positive identification
酵母感受态制备Preparation of competent yeast
将乳酸克鲁维酵母菌液在YPD固体培养基上划线并挑取单克隆,于25mL 2×YPD液体培养基中振荡培养过夜,取2mL菌液于50mL液体2×YPD培养基中继续振荡培养2-8h。20℃条件下3000g离心5min收集酵母细胞,加入500μL无菌水重悬,同样条件下离心收集细胞。配制感受态细胞溶液(5%v/v甘油,10%v/v DMSO)并将酵母细胞溶解于500μL该溶液中。分装50μL至1.5mL离心管中,-80℃保存。Streak the Kluyveromyces lactis liquid on YPD solid medium and pick a single clone, shake culture overnight in 25mL 2×YPD liquid medium, take 2mL of the liquid and continue shaking culture in 50mL liquid 2×YPD medium for 2-8h. Collect yeast cells by centrifugation at 3000g for 5min at 20℃, add 500μL sterile water to resuspend, and collect cells by centrifugation under the same conditions. Prepare competent cell solution (5% v/v glycerol, 10% v/v DMSO) and dissolve yeast cells in 500μL of the solution. Aliquot 50μL into 1.5mL centrifuge tubes and store at -80℃.
酵母DNA转化Yeast DNA transformation
将感受态细胞置于冰上融化30s,加入pKM-CAS1.0-KlUPF1质粒200ng,加入供体DNA 2000ng。电击1.5kV,5mS之后立即加入1mL YPD液体培养基,培养2-3h,吸取200μL涂布于固体YPD(200μg/mL G418)培养基,培养2-3天至单菌落出现。Thaw the competent cells on ice for 30 seconds, add 200ng of pKM-CAS1.0-KlUPF1 plasmid and 2000ng of donor DNA. Immediately add 1mL YPD liquid medium after electric shock at 1.5kV for 5mS, culture for 2-3h, and apply 200μL on solid YPD (200μg/mL G418) medium, and culture for 2-3 days until single colonies appear.
(5)基因编辑阳性鉴定(5) Positive identification of gene editing
在乳酸克鲁维酵母转化后的平板上挑取50-60个单克隆,每个单克隆置于5μL酵母裂解液中(Takara Mighty Prep Reagent for DNA),以菌体裂解液为模板,分别以引物Ma R3(MaPylRS序列内引物):GTCTCTAGAAGCCAAGTCTTC(SEQ ID NO:55);UPF1 F1(KlUPF1供体DNA 5’外侧引物):GAACTGCCACGGGCT(SEQ ID NO:56)和引物Ma F3(MaPylRS序列内引物):GCTGCTCACGACGTTCA(SEQ ID NO:57);UPF1 R1(KlUPF1供体DNA 5’外侧引物):GCACTGTAATCAGGCAACT(SEQ ID NO:58)进行PCR扩增,对KlUPF1位点的CRISPR插入进行检测,有阳性条带表明MaPylRS序列插入靶位点成功。50-60 single clones were picked from the plate after transformation of Kluyveromyces lactis, and each single clone was placed in 5 μL yeast lysis buffer (Takara Mighty Prep Reagent for DNA). The bacterial lysate was used as a template and the primers Ma R3 (primer within the MaPylRS sequence): GTCTCTAGAAGCCAAGTCTTC (SEQ ID NO: 55); UPF1 F1 (primer outside the 5' of KlUPF1 donor DNA): GAACTGCCAC PCR amplification was performed using primers Ma F3 (primer within the MaPylRS sequence): GCTGCTCACGACGTTCA (SEQ ID NO: 57); UPF1 R1 (primer outside the 5’ of KlUPF1 donor DNA): GCACTGTAATCAGGCAACT (SEQ ID NO: 58) to detect CRISPR insertion at the KlUPF1 site. The presence of a positive band indicated that the MaPylRS sequence was successfully inserted into the target site.
实施例9活性测定Example 9 Activity Assay
将基因改造后的乳酸克鲁维酵母菌株制备成体外蛋白质合成系统(IVTT),并加入含有绿色荧光蛋白GFP和红色荧光蛋白RFP(green fluorescent protein,GFP;redfluorescent protein,RFP)双报告基因的质粒以测定改造菌株的对特定蛋白特定位点的非天然氨基酸插入能力。GFP用于检测蛋白质的整体表达情况,而RFP则用于检测在特定位置,即GFP的终止密码子TAG处是否成功插入了非天然氨基酸。如果出现了红色荧光信号,说明翻译相关蛋白读通了终止密码子TAG,意味着非天然氨基酸被成功插入,而缺乏红色荧光信号则表示翻译相关蛋白停止在了终止密码子TAG,非天然氨基酸插入失败。The genetically modified Kluyveromyces lactis strain was prepared into an in vitro protein synthesis system (IVTT), and a plasmid containing dual reporter genes of green fluorescent protein GFP and red fluorescent protein RFP (green fluorescent protein, GFP; red fluorescent protein, RFP) was added to determine the ability of the modified strain to insert non-natural amino acids into specific sites of specific proteins. GFP is used to detect the overall expression of the protein, while RFP is used to detect whether the non-natural amino acid is successfully inserted at a specific position, that is, the stop codon TAG of GFP. If a red fluorescent signal appears, it means that the translation-related protein reads through the stop codon TAG, which means that the non-natural amino acid is successfully inserted, while the lack of a red fluorescent signal means that the translation-related protein stops at the stop codon TAG and the insertion of the non-natural amino acid fails.
以实施例6得到的乳酸克鲁维酵母菌株为例,并对MaPylRS表达盒中的启动子进行调整,得到不同的乳酸克鲁维酵母菌株,制备成Protein Factory。Taking the Kluyveromyces lactis strain obtained in Example 6 as an example, the promoter in the MaPylRS expression cassette was adjusted to obtain different Kluyveromyces lactis strains and prepare them into a Protein Factory.
含非天然氨基酸表达体系建立:Establishment of expression system containing unnatural amino acids:
Protein Factory 100ulProtein Factory 100ul
Prock(非天然氨基酸)500mM 1ulProck (unnatural amino acid) 500mM 1ul
matRNACUA pyl体外转录产物(未纯化) 10ulmatRNA CUA pyl in vitro transcription product (unpurified) 10ul
GFP-TAG-RFP双荧光报告基因PCR产物 3ulGFP-TAG-RFP dual fluorescence reporter gene PCR product 3ul
通过检测RFP荧光强度,根据RFP的荧光强度以及RFP/GFP的比值,判断非天然氨基酸的导入效率(具体参见图12~图14)By detecting the fluorescence intensity of RFP, the efficiency of the introduction of unnatural amino acids can be determined based on the fluorescence intensity of RFP and the ratio of RFP/GFP (see Figures 12 to 14 for details).
图12~图14中,DW14-1和DW14-2为两组平行实验,启动子为TIF11;DW14-3的启动子为TEF1;DW14-4和DW14-5为两组平行实验,启动子为ADH1;DW14-6和DW14-7为两组平行实验,启动子为GAP1;DW14-8和DW14-9为两组平行实验,启动子为HXK4;DW14-10的启动子为PGK1。In Figures 12 to 14, DW14-1 and DW14-2 are two sets of parallel experiments, and the promoter is TIF11; the promoter of DW14-3 is TEF1; DW14-4 and DW14-5 are two sets of parallel experiments, and the promoter is ADH1; DW14-6 and DW14-7 are two sets of parallel experiments, and the promoter is GAP1; DW14-8 and DW14-9 are two sets of parallel experiments, and the promoter is HXK4; the promoter of DW14-10 is PGK1.
由图12可知,改造后的菌株均实现了对终止密码子TAG的读通,即证明本发明改造菌株均能够实现非天然氨基酸的导入。由图14可知,本发明改造的菌株均有较高非天然氨基酸的导入效率,特别是DW14-4~DW14-7以及DW14-10,具有很高的RFP/GFP值,说明均有非常高的非天然氨基酸的导入效率。As shown in Figure 12, the modified strains all achieved the read-through of the stop codon TAG, which proves that the modified strains of the present invention can achieve the introduction of non-natural amino acids. As shown in Figure 14, the modified strains of the present invention all have a high non-natural amino acid introduction efficiency, especially DW14-4 to DW14-7 and DW14-10, which have a very high RFP/GFP value, indicating that they all have a very high non-natural amino acid introduction efficiency.
实施例10Example 10
对实施例6改造后的MaPylRS和原始MaPylRS(实施例5)整合到酵母中之后得到的反应体系对ncaa导入的活性影响进行比较,具体测定条件参见实施例9,具体结果参见图15-图17,其中sl-3代表的是整合原始MaPylRS的酵母的反应体系,sl-9代表的是整合本申请改造的MaPylRS的酵母的反应体系,体系中均添加了正交tRNA和Prock,并各自采用不添加ncaa(即non)的体系作为对照,由图15-图17可知,将野生型和改造的MaPylRS整合到酵母细胞中得到的反应体系均能实现ncaa导入,但是经过本发明改造后的MaPylRS整合到酵母细胞之后得到的反应体系能明显提高ncaa的导入效率。The reaction systems obtained after the modified MaPylRS in Example 6 and the original MaPylRS (Example 5) were integrated into yeast to compare the activity effects of ncaa introduction. For specific measurement conditions, see Example 9, and for specific results, see Figures 15 to 17, where sl-3 represents the reaction system of yeast integrated with the original MaPylRS, and sl-9 represents the reaction system of yeast integrated with the modified MaPylRS of the present application. Orthogonal tRNA and Prock were added to the systems, and a system without adding ncaa (i.e., non) was used as a control. It can be seen from Figures 15 to 17 that the reaction systems obtained by integrating the wild-type and modified MaPylRS into yeast cells can both achieve ncaa introduction, but the reaction system obtained after the modified MaPylRS of the present invention is integrated into yeast cells can significantly improve the efficiency of ncaa introduction.
本发明采用的序列见如下表1The sequences used in the present invention are shown in Table 1 below
表1
Table 1
以上述依据本申请的理想实施例为启示,通过上述的说明内容,相关工作人员完全可以在不偏离本项申请技术思想的范围内,进行多样的变更以及修改。本项申请的技术性范围并不局限于说明书上的内容,必须要根据权利要求范围来确定其技术性范围Based on the above ideal embodiments of this application, the relevant staff can make various changes and modifications without deviating from the technical concept of this application through the above description. The technical scope of this application is not limited to the content in the specification, and its technical scope must be determined according to the scope of the claims.
Claims (25)
A1-A2-A3-A4(I):A recombinant aminoacyl-tRNA synthetase having the structure described in Formula I:
A1-A2-A3-A4(I):
Z5-Z1-Z2-Z3-Z4,Preferably, the nucleic acid structure comprises the structure described in Formula III:
Z5-Z1-Z2-Z3-Z4,
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202311806634.1A CN120210144A (en) | 2023-12-25 | 2023-12-25 | A modified aminoacyl-tRNA synthetase, nucleic acid construct and genetically engineered strain |
| CN202311806634.1 | 2023-12-25 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025140227A1 true WO2025140227A1 (en) | 2025-07-03 |
Family
ID=96097812
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/142051 Pending WO2025140227A1 (en) | 2023-12-25 | 2024-12-25 | Modified aminoacyl-trna synthetase, nucleic acid construct and genetically engineered strain |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN120210144A (en) |
| WO (1) | WO2025140227A1 (en) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2016066995A1 (en) * | 2014-10-27 | 2016-05-06 | Medical Research Council | Incorporation of unnatural amino acids into proteins |
| CN108949801A (en) * | 2017-11-24 | 2018-12-07 | 康码(上海)生物科技有限公司 | A method of by knocking out nucleic acid enzyme system to regulate and control external biological synthesizing activity |
| CN112739823A (en) * | 2018-08-31 | 2021-04-30 | 国立研究开发法人理化学研究所 | Pyrrolysinyl-tRNA synthetases |
| WO2021165410A1 (en) * | 2020-02-21 | 2021-08-26 | European Molecular Biology Laboratory | Archaeal pyrrolysyl trna synthetases for orthogonal use |
| CN117165620A (en) * | 2017-08-31 | 2023-12-05 | 康码(上海)生物科技有限公司 | Nucleic acid construct for endogenously expressing RNA polymerase in cells |
-
2023
- 2023-12-25 CN CN202311806634.1A patent/CN120210144A/en active Pending
-
2024
- 2024-12-25 WO PCT/CN2024/142051 patent/WO2025140227A1/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2016066995A1 (en) * | 2014-10-27 | 2016-05-06 | Medical Research Council | Incorporation of unnatural amino acids into proteins |
| CN117165620A (en) * | 2017-08-31 | 2023-12-05 | 康码(上海)生物科技有限公司 | Nucleic acid construct for endogenously expressing RNA polymerase in cells |
| CN108949801A (en) * | 2017-11-24 | 2018-12-07 | 康码(上海)生物科技有限公司 | A method of by knocking out nucleic acid enzyme system to regulate and control external biological synthesizing activity |
| CN112739823A (en) * | 2018-08-31 | 2021-04-30 | 国立研究开发法人理化学研究所 | Pyrrolysinyl-tRNA synthetases |
| WO2021165410A1 (en) * | 2020-02-21 | 2021-08-26 | European Molecular Biology Laboratory | Archaeal pyrrolysyl trna synthetases for orthogonal use |
Non-Patent Citations (1)
| Title |
|---|
| RUAN JIANBING, MEI YANZHEN: "Applications of Poly-histidine Affinity Tags in Protein Drugs Development", BIOTECHNOLOGY BULLETIN, no. 6, 26 June 2012 (2012-06-26), pages 49 - 53, XP093330212, DOI: 10.13560/j.cnki.biotech.bull.1985.2012.06.004 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN120210144A (en) | 2025-06-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111778169B (en) | Method for improving in vitro protein synthesis efficiency | |
| JP7093417B2 (en) | How to regulate in vitro biosynthetic activity by knocking out the nuclease system | |
| CN109423496B (en) | A nucleic acid construct for endogenously expressing RNA polymerase in cells | |
| CN113481226B (en) | Signal peptide related sequence and application thereof in protein synthesis | |
| CN110408636B (en) | DNA sequence with multiple labels connected in series and application thereof in protein expression and purification system | |
| CN110408635B (en) | Application of nucleic acid construct containing streptavidin element in protein expression and purification | |
| JP2020528760A (en) | Use in the preparation of novel fusion proteins and the improvement of their protein synthesis | |
| CN110551745A (en) | Multiple histidine sequence tag and application thereof in protein expression and purification | |
| CN110938649A (en) | Protein synthesis system for improving expression quantity of foreign protein and application method thereof | |
| KR100975153B1 (en) | Method for producing water-soluble and active recombinant protein using PDI as a fusion partner | |
| US7109015B2 (en) | Removal of N-terminal methionine from proteins by engineered methionine aminopeptidase | |
| WO2025140227A1 (en) | Modified aminoacyl-trna synthetase, nucleic acid construct and genetically engineered strain | |
| WO2025092824A1 (en) | Genetically engineered strain based on trna modification, preparation method therefor, and use thereof | |
| CN120400113A (en) | A method for reducing impurity proteins in target protein synthesis | |
| CN111118065A (en) | Gene modification method of eukaryote, corresponding gene engineering cell and application thereof | |
| CN120060319A (en) | Method for removing release factor and application thereof | |
| CN120400114A (en) | A method for reducing the impact of interfering factors on target proteins | |
| CN119432629A (en) | A genetically engineered strain and its preparation method and its application in cell-free protein synthesis | |
| CN119372171A (en) | A recombinant DNA polymerase and its application in cell-free protein synthesis | |
| CN120380161A (en) | In-vitro cell-free protein synthesis system, method and application for inserting unnatural amino acid | |
| CN119220424A (en) | Genetically engineered strains for improving in vitro cell-free protein synthesis activity and their application in D2P | |
| CN120060320A (en) | Nucleic acid construct based on flocculation gene and application thereof | |
| CN119709819A (en) | Genetically engineered strain and application thereof in D2P |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24911151 Country of ref document: EP Kind code of ref document: A1 |