[go: up one dir, main page]

WO2025092824A1 - Genetically engineered strain based on trna modification, preparation method therefor, and use thereof - Google Patents

Genetically engineered strain based on trna modification, preparation method therefor, and use thereof Download PDF

Info

Publication number
WO2025092824A1
WO2025092824A1 PCT/CN2024/128530 CN2024128530W WO2025092824A1 WO 2025092824 A1 WO2025092824 A1 WO 2025092824A1 CN 2024128530 W CN2024128530 W CN 2024128530W WO 2025092824 A1 WO2025092824 A1 WO 2025092824A1
Authority
WO
WIPO (PCT)
Prior art keywords
pichia
cell
trna
nucleic acid
acid construct
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/128530
Other languages
French (fr)
Chinese (zh)
Inventor
郭敏
高鹏飞
高霈
代田纯
于雪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kangma Healthcode Shanghai Biotech Co Ltd
Original Assignee
Kangma Healthcode Shanghai Biotech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kangma Healthcode Shanghai Biotech Co Ltd filed Critical Kangma Healthcode Shanghai Biotech Co Ltd
Publication of WO2025092824A1 publication Critical patent/WO2025092824A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/645Fungi ; Processes using fungi

Definitions

  • the present invention relates to the field of gene engineering technology, and in particular to a gene engineering strain and a preparation method and application thereof.
  • CRISPR Clustered Regularly Interspaced Short Palindromic Repeats
  • the core components of CRISPR technology include CRISPR sequences and Cas proteins.
  • the CRISPR sequence is a DNA sequence that contains a series of repeated and spaced sequences that come from foreign viruses or plasmids that bacteria or archaea have encountered.
  • Cas protein is the core protein in the CRISPR system, which can recognize and cut these foreign DNA sequences.
  • CRISPR technology is faster and more accurate, and can achieve large-scale gene editing in a short period of time.
  • Amber codon tRNA is a special tRNA that corresponds to the amber codon in the genetic code.
  • the amber codon is a stop codon, and when it is encountered, the protein synthesis process will stop.
  • the specific implementation method is to pair a specific unnatural amino acid with an amber codon so that it can be inserted into the protein sequence. In order to achieve this process, it is first necessary to design and synthesize an unnatural amino acid that can pair with an amber codon so that it can be recognized by the amber codon tRNA and inserted into the protein.
  • amber codon tRNA makes the amber codon no longer a termination signal for protein synthesis, but a signal for inserting unnatural amino acids.
  • amber codon tRNA and the corresponding unnatural amino acid are present in yeast cells, the process of synthesizing proteins will not stop at the amber codon, but will insert unnatural amino acids, thereby achieving protein modification.
  • non-natural amino acids can change the structure and properties of some drug molecules, thereby changing the activity, stability, pharmacokinetics and other characteristics of the drug.
  • By synthesizing non-natural amino acids more active and selective drug molecules can be designed and synthesized, improving the efficacy of drugs and reducing side effects.
  • the introduction of non-natural amino acids can change the solubility, lipid solubility, drug binding and other characteristics of drugs, thereby improving the absorption and distribution characteristics of drugs. Improve the bioavailability and efficacy of drugs in the body.
  • the introduction of unnatural amino acids can increase the stability of drug molecules, reduce drug degradation and failure, and extend the shelf life of drugs.
  • amber codon tRNA gene may have sequence instability or structural problems, which may prevent it from being correctly recognized and transcribed by yeast cells.
  • the amber codon tRNA gene may be incompatible with the transcription and translation mechanisms of yeast cells, resulting in failure to express normally.
  • other metabolic pathways and regulatory mechanisms within yeast cells may also affect the expression of amber codon tRNA genes.
  • yeast cells may have certain negative regulatory mechanisms that inhibit the expression of amber codon tRNA genes.
  • the expression of amber codon tRNA genes may be affected by intracellular environmental factors, such as nutritional conditions or temperature.
  • a technology for expressing amber codon tRNA in yeast is provided, which can achieve the purpose of synthesizing proteins containing unnatural amino acids in yeast cells.
  • the present invention utilizes CRISPR technology to integrate inhibitory tRNA into cells, so that the modified strain can efficiently and stably express inhibitory tRNA, thereby realizing the ability to express proteins containing non-natural amino acids (ncaa) outside the cell.
  • the first aspect of the present invention provides a nucleic acid construct, wherein the structure of the nucleic acid construct is formula (I): Z1-Z2-Z3,
  • Z1-Z3 are elements used to constitute the structure respectively;
  • Each "-" is independently a bond or a nucleotide linking sequence
  • Z1 is the promoter sequence
  • Z2 is none or a connection sequence
  • Z3 is the gene sequence of inhibitory tRNA.
  • the promoter is selected from tDNA, SNR52, SNR6, Pol III, U6, U3, SCR1, RPR1 or H1.
  • the promoter is preferably tDNA or SNR52.
  • the inhibitory tRNA is selected from amber codon tRNA, ochre codon tRNA or opal codon tRNA; preferably amber codon tRNA; further preferably tRNA TAG Pyl .
  • the structure of formula (I) is selected from Z1-Z2-tDNA TAG Pyl , and the tDNA TAG Pyl is selected from wild type or mutant type.
  • the structure of formula (I) is selected from tDNA-Z2-tDNA TAG Pyl , and the tDNA TAG Pyl is selected from wild type or mutant type.
  • the tDNA is selected from tDNA Arg , tDNA Ala , tDNA Asp , tDNA Cys , tDNA Gln , tDNA Leu , tDNA Pro , tDNA Tyr , tDNA Val , tDNA Ser , tDNA Gly , tDNA His , tDNA Ile , tDNA Lys , tDNA Met , tDNA Phe , tDNA Thr or tDNA Glu .
  • the promoter is selected from tDNA UCU Arg .
  • the structure of formula (I) is selected from tDNA Arg -Z2-tDNA TAG Pyl ; further preferably, the structure of formula (I) is selected from tDNA UCU Arg -Z2-tDNA TAG Pyl .
  • Z2 is selected from a connecting sequence, and the connecting sequence is selected from CTTTGTTTCT, i.e. SEQ ID NO: 30.
  • the Z3 is a single copy or multiple copy gene sequence of an inhibitory tRNA, and further preferably, the number of the multiple copies is less than or equal to 6.
  • sequence number of the structure of formula (I) is SEQ ID NO: 25 to 29.
  • the second aspect of the present invention provides a genetically engineered strain, wherein the genome of the strain is integrated with a nucleic acid construct comprising at least one inhibitory tRNA gene sequence.
  • nucleic acid construct is the nucleic acid construct described in the first aspect of the present invention.
  • strain is derived from bacteria, mammalian cells, human cells, plant cells, yeast cells, insect cells or any combination thereof.
  • the cell extract is more preferably selected from any of the following sources: Escherichia coli, Kluyveromyces lactis, wheat germ cells, Spodoptera frugiperda insect cells, rabbit reticulocytes, CHO cells, COS cells, VERO cells, BHK cells, human fibrosarcoma HT1080 cells, or a combination thereof.
  • the cell extract is derived from yeast cells.
  • the yeast cell is selected from Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membraneaefaciens, Pichia minuta, Ogataeaminuta, Pichia lindneri, Pichia opuntiae, Pichia thermotolerans ), Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces molasses, Saccharomyces sp., Hansenula polymorpha, Candida utilis, Kluyveromyces, or a combination thereof.
  • the Kluyveromyces further includes: Kluyveromyces lactis (Kluyveromyces, K. lactis), Kluyveromyces marxianus (Kluyveromyces marxianus),
  • the yeast cell is selected from the group consisting of Kluyveromyces dobzhanskii, Kluyveromyces aestuarii, Kluyveromyces nonfermentans, Kluyveromyces wickerhamii, Kluyveromyces thermotolerans, Kluyveromyces fragilis, Kluyveromyces hubeiensis, Kluyveromyces polysporus, Kluyveromyces siamensis and Kluyveromyces yarrowii, or a combination thereof; preferably, the yeast cell is a Kluyveromyces cell, more preferably a Kluyveromyces lactis cell.
  • the nucleic acid construct is integrated into the genome of the strain via an insertion site, and the insertion site is UPF1 or endogenous tDNA.
  • the third aspect of the present invention provides a method for preparing a genetically engineered strain according to the second aspect of the present invention, wherein the nucleic acid construct is inserted into the genome of the strain via the UPF1 site.
  • nucleic acid construct is inserted into the genome of the strain via any position in the UPF1 site.
  • the fourth aspect of the present invention provides a method for preparing a genetically engineered strain provided in the second aspect of the present invention, wherein the nucleic acid construct is inserted into the genome of the strain via 1 to 1000 bp at the 5' end and/or 3' end of the endogenous tDNA sequence; preferably 100 to 800 bp; further preferably 200 to 500 bp; and most preferably 300 bp.
  • nucleic acid construct is inserted into the 3' segment of the endogenous tDNA sequence.
  • the fifth aspect of the present invention provides a method for synthesizing proteins incorporating non-natural amino acids, using the genetically engineered strain described in the fifth aspect of the present invention to provide inhibitory tRNA.
  • the sixth aspect of the present invention provides a cell-free synthesis system for synthesizing proteins incorporated with non-natural amino acids, wherein the system at least comprises a cell extract derived from the genetically engineered strain provided by the second aspect of the present invention.
  • system also includes aminoacyl-tRNA synthetase and unnatural amino acids.
  • aminoacyl tRNA synthetase is selected from natural or mutant Pyl-tRNA synthetase (PylRS), Leu-tRNA synthetase (LeuRS), Tyr-tRNA synthetase (TyrRS), Phe-tRNA synthetase (PheRS) or TrP-tRNA synthetase (TrpRS), and is more preferably PylRS.
  • aminoacyl tRNA synthetase is selected from natural or mutant MaPylRS, MmPylRS, MbPylRS, EcTyrRS, MjTyrRS, EcLeuRS, ScPheRS, ScTrpRS, BsTrpRS
  • non-natural amino acids refer to amino acids other than the 20 natural amino acids.
  • the structural formula of the non-natural amino acid is a compound of formula (2) or a salt thereof.
  • n is selected from a natural number of 1-20
  • R1 is selected from substituted or unsubstituted C5-C60 aryl or heteroaryl, substituted or unsubstituted C1-C20 alkyl, substituted or unsubstituted C2-C20 alkenyl or substituted or unsubstituted C2-C20 alkynyl
  • A is selected from O or -CH2- .
  • n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20.
  • n is a natural number selected from 1-10.
  • n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10.
  • n is a natural number selected from 1-6.
  • n is selected from 1, 2, 3, 4, 5 or 6.
  • the R 1 is selected from substituted or unsubstituted C5-C30 aryl or heteroaryl.
  • the R 1 is selected from substituted or unsubstituted phenyl.
  • the R 1 is selected from substituted or unsubstituted C2-C20 alkenyl.
  • the R 1 is selected from substituted or unsubstituted C2-C10 alkenyl.
  • the R 1 is selected from substituted or unsubstituted C2-C6 alkenyl.
  • the R 1 is selected from substituted or unsubstituted C2-C20 alkynyl.
  • the R 1 is selected from substituted or unsubstituted C2-C10 alkynyl.
  • the R 1 is selected from substituted or unsubstituted C2-C6 alkynyl.
  • the A is selected from O.
  • the A is selected from -CH 2 -.
  • the substituents are common substituents in the art, such as aryl, heteroaryl, alkyl, cycloalkyl, aryloxy, heteroaryloxy, alkyloxy, cycloalkyloxy, hydroxyl, thiol, ester, carboxyl, cyano, halogen, nitro, sulfonic acid, azido, alkenyl, alkynyl, phosphate, etc.
  • the structural formula of the non-natural amino acid is selected from one or a combination of the following, or a salt form thereof:
  • the seventh aspect of the present invention provides an application of the genetically engineered strain provided in the second aspect of the present invention or the cell-free synthesis system provided in the sixth aspect of the present invention in the cell-free synthesis of proteins incorporated with non-natural amino acids.
  • the eighth aspect of the present invention provides a kit, which includes the cell-free synthesis system provided by the sixth aspect of the present invention.
  • the ninth aspect of the present invention provides a method for cell-free synthesis of proteins incorporated with non-natural amino acids, comprising the following steps: step (1), providing the cell-free synthesis system provided in the sixth aspect of the present invention or the kit provided in the eighth aspect of the present invention; step (2), adding a DNA molecule encoding an exogenous protein to the synthesis system or the kit described in step (1), and obtaining the protein by reaction in the presence of aminoacyl-tRNA synthetase and non-natural amino acids.
  • the exogenous protein is selected from: luciferin, luciferase (such as firefly luciferase), fluorescent protein (such as green fluorescent protein, yellow fluorescent protein, red fluorescent protein), aminoacyl-tRNA synthetase, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, variable region of antibody, luciferase mutation, ⁇ -amylase, enterobactin A, hepatitis C virus E2 glycoprotein, insulin precursor, interferon ⁇ A, cytokine, interferon ⁇ 2b, interleukin-1 ⁇ , lysozyme, serum albumin, single-chain antibody fragment (scFV), thyroxine transporter, tyrosinase, xylanase or a combination thereof.
  • luciferin such as firefly luciferase
  • fluorescent protein such as green fluorescent protein, yellow fluorescent protein, red fluorescent protein
  • the exogenous protein includes a wild-type protein, a mutant protein or a recombinant protein.
  • the inhibitory tRNA gene is directly recombined into the yeast cell genome, which solves the problem that the inhibitory tRNA gene cannot be expressed normally when it is directly inserted into the yeast cell genome.
  • the designed inhibitory tRNA structure can normally transcribe inhibitory tRNA and can normally transport non-natural amino acids to specific stop codons (such as TAG), thereby achieving the purpose of preparing non-natural amino acid proteins.
  • the recombinant strain has a stable structure and can be produced on an industrial scale, thereby further realizing the industrial production of non-natural amino acid proteins.
  • FIG. 1 shows the structure of the insert sequence 3002 of the present invention.
  • FIG. 2 shows the sequencing results of Example 1.
  • FIG. 3 shows the sequencing results of Example 4.
  • Figures 4 to 7 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed 1 to 10, among which Figures 4 and 6 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed by insertion near the endogenous tDNA; Figures 5 and 7 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed by insertion at the UPF1 site.
  • wt and upf1-wt have the same insertion sequence
  • mut1 to 4 have the same insertion sequence as upf1-mut1 to upf1-4, respectively.
  • preferred embodiments such as “preferred”, “better”, “more preferred”, “better”, “most preferred” and “further preferred” do not constitute any limitation on the scope of coverage and protection scope of the invention, and are not used to limit the scope and embodiments of the present invention, but are only used to provide some embodiments as examples.
  • preferred modes such as “one of the preferred embodiments”, “one of the preferred modes”, “one of the preferred embodiments”, “one of the preferred examples”, “preferred examples”, “in a preferred embodiment”, “some preferred examples”, “some preferred modes”, “preferably”, “preferably”, “preferably”, “more preferably”, “more preferably”, “further preferably”, “most preferably”, and illustrative enumeration modes such as “one of the embodiments", “one of the modes”, “example”, “specific example”, “for example”, “as an example”, “for example”, “such as”, “such as”, etc., do not constitute any limitation on the scope of coverage and protection scope of the invention, and the specific features described in each mode are included in at least one specific embodiment of the present invention.
  • the specific features described in each mode may be combined in a suitable manner in any one or more specific embodiments.
  • the technical features or technical solutions corresponding to each preferred mode may also be combined in any suitable manner.
  • any combination thereof means “greater than 1" in terms of quantity, and means a group consisting of the following situations in terms of coverage: “select any one of them, or select a group consisting of at least two of them”.
  • the term “about” can refer to a value or composition that is within an acceptable error range for a particular value or composition as determined by one of ordinary skill in the art, which will depend in part on how the value or composition is measured or determined.
  • the expression “about 100” includes all values between 99 and 101 (e.g., 99.1, 99.2, 99.3, 99.4, etc.).
  • Sequence identity is determined by comparing two aligned sequences along a predetermined comparison window (which can be 50%, 60%, 70%, 80%, 90%, 95% or 100% of the length of the reference nucleotide sequence or protein) and determining the number of positions at which identical residues occur. Typically, this is expressed as a percentage.
  • a predetermined comparison window which can be 50%, 60%, 70%, 80%, 90%, 95% or 100% of the length of the reference nucleotide sequence or protein
  • In vitro protein synthesis reaction refers to the reaction of synthesizing protein in an in vitro cell-free synthesis system, including at least the translation process. Including but not limited to IVT reaction (in vitro translation reaction), IVTT reaction (in vitro transcription and translation reaction), IVDTT reaction (in vitro replication transcription and translation reaction). In the present invention, IVTT reaction is preferred.
  • IVTT reaction corresponding to IVTT system, is the process of transcribing and translating DNA into protein (Protein) in vitro.
  • D2P system D-to-P system
  • D_to_P system DNA-to-Protein system
  • the corresponding in vitro protein synthesis method is also called D2P method, D-to-P method, D_to_P method, DNA-to-Protein method.
  • Cell-free system refers to a method of in vitro protein synthesis that is not secretory expression through intact cells. It should be noted that in the in vitro cell-free protein synthesis system of the present invention, it is also allowed to add cell components to promote the reaction, but the added cells are not primarily intended to secrete and express exogenous target proteins. In addition, in the intact cell-free D2P system constructed under the guidance of the present invention, a small amount of intact cells are intentionally added (for example, the protein content provided by them is not more than 30wt% compared to the protein content provided by the cell extract). Such a “circumvention" method is also included in the scope of protection of the present invention.
  • Exogenous protein The target expression product of the in vitro protein synthesis system of the present invention is not synthesized by secretion from the host cell, but is synthesized in vitro based on an exogenous nucleic acid template, and may also be referred to as the target protein.
  • the exogenous protein may be a protein, a fusion protein, a mixture of protein molecules or fusion protein molecules; it also broadly includes polypeptides.
  • the product obtained after an in vitro protein synthesis reaction based on a nucleic acid template encoding the target protein may be a single substance or a combination of two or more substances.
  • Exogenous protein "target protein”, “target protein”, “target translation product” have the same meaning and can be translated as “objective protein”, “interested protein”, “objective translated product”, “interested protein product”, etc., and may be used interchangeably in the present invention.
  • D2P DNA-to-Protein, from DNA template to protein product.
  • D2P technology D2P system, D2P method, D2P kit, and the like.
  • the expression system of the present invention can be used interchangeably, all referring to the in vitro protein expression system of the present invention, and other descriptions can also be used, such as: protein in vitro synthesis system, in vitro protein synthesis system, cell-free system, cell-free system, cell-free protein synthesis system, cell-free in vitro protein synthesis system, in vitro cell-free protein synthesis system, in vitro cell-free synthesis system, CFS system (cell-free system), CFPS system (cell-free protein synthesis system) and other descriptions.
  • the reaction mechanism can include an in vitro translation system (which can be abbreviated as IVT system, a mR2P system), an in vitro transcription translation system (which can be abbreviated as IVTT system, a D2P system), an in vitro replication transcription translation system (which can be abbreviated as IVDTT system, a D2P system), etc.
  • the IVTT system is preferred.
  • the in vitro protein synthesis system a "protein synthesis factory" ("Protein Factory” or "proteinfactory” or “Proteinfactory”).
  • the in vitro protein synthesis system provided by the present invention adopts an open description method for its components.
  • the cell-free protein synthesis system of the present invention uses exogenous DNA, mRNA or a combination thereof as a nucleic acid template for protein synthesis, and achieves in vitro synthesis of the target protein by artificially controlling the addition of substrates and transcription, translation-related protein factors and other substances required for protein synthesis.
  • protein and “protein” have the same meaning, are both translated as protein, and can be used interchangeably.
  • system and “system” are both translated as system and can be used interchangeably.
  • protein synthesis amount protein expression amount
  • protein expression yield protein expression yield
  • cell extract, cell extract, cell lysate, cell disruption and cell lysis product have the same meaning and can be used interchangeably.
  • cell extract, cell lysate and the like can be used as descriptions.
  • energy system energy system, energy system, and energy supply system have the same meaning and can be used interchangeably.
  • Energy regeneration system and energy regeneration system have the same meaning and can be used interchangeably.
  • the energy regeneration system is a preferred embodiment or component of the energy system.
  • the present invention provides a cell-free protein synthesis system, which at least includes a cell extract or a cell lysate.
  • the cell-free protein synthesis system further comprises one or more components selected from the following group: a substrate for synthesizing RNA, a substrate for synthesizing protein, polyethylene glycol or its analogues, magnesium ions, potassium ions, a buffer, RNA polymerase, an energy regeneration system, dithiothreitol, and an optional aqueous solvent.
  • the substrate for synthesizing RNA includes: nucleoside monophosphate, nucleoside triphosphate or a combination thereof.
  • the substrates for synthesizing proteins include: 20 natural amino acids and unnatural amino acids. Base acid.
  • the magnesium ions are derived from a magnesium ion source, and the magnesium ion source is selected from the following group: magnesium acetate, magnesium glutamate, or a combination thereof.
  • the potassium ions are derived from a potassium ion source, and the potassium ion source is selected from the following group: potassium acetate, potassium glutamate, or a combination thereof.
  • the energy regeneration system is selected from the following group: one or a combination of creatine phosphate/creatine phosphate enzyme system, glycolysis pathway and its intermediate energy system.
  • the energy regeneration system comprises a glucose/phosphate system
  • the phosphate is selected from the following group: tripotassium phosphate, triammonium phosphate, trisodium phosphate, dipotassium hydrogen phosphate, diammonium hydrogen phosphate, disodium hydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate, sodium dihydrogen phosphate, or a combination thereof.
  • the buffer is selected from the following group: 4-hydroxyethylpiperazineethanesulfonic acid, tris(hydroxymethyl)aminomethane, or a combination thereof.
  • the in vitro protein synthesis system contains polyethylene glycol (PEG) or its analogs.
  • concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the protein synthesis system.
  • Representative PEG is selected from the following group: one of PEG3000, PEG3350, PEG6000, PEG8000 or a combination thereof.
  • the polyethylene glycol includes polyethylene glycol with a molecular weight (Da) of 200-10000, such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc., preferably, polyethylene glycol with a molecular weight of 3000-10000.
  • Da molecular weight
  • the RNA polymerase is not particularly limited and can be selected from one or more RNA polymerases.
  • a typical RNA polymerase is T7 RNA polymerase.
  • the in vitro protein synthesis system includes: cell extract, 4-hydroxyethylpiperazineethanesulfonic acid, potassium acetate, magnesium acetate, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytosine triphosphate (CTP), thymidine triphosphate (TTP), amino acid mixture, creatine phosphate, dithiothreitol (DTT), creatine phosphate kinase, and RNA polymerase.
  • cell extract 4-hydroxyethylpiperazineethanesulfonic acid
  • potassium acetate magnesium acetate
  • ATP adenosine triphosphate
  • GTP guanosine triphosphate
  • CTP cytosine triphosphate
  • TTP thymidine triphosphate
  • amino acid mixture amino acid mixture
  • creatine phosphate dithiothreitol (DTT)
  • DTT dithiothreitol
  • DTT creatine phosphat
  • the cell extract does not contain intact cells, and a typical cell extract includes ribosomes, aminoacyl-tRNA synthetases, initiation factors and elongation factors required for protein synthesis, and termination release factors for protein translation.
  • the cell extract also contains some other proteins derived from the cytoplasm of the cell, especially soluble proteins.
  • the proportion of the cell extract in the in vitro cell-free protein synthesis system is not particularly limited.
  • the cell extract accounts for 20-70% of the in vitro cell-free protein synthesis system.
  • the protein content of the cell extract is 20-100 mg/mL, preferably 50-100 mg/mL.
  • the method for determining the protein content is the Coomassie Brilliant Blue determination method.
  • the present invention also provides a vector or vector combination, wherein the vector contains the nucleic acid construct of the present invention.
  • the vector is selected from: bacterial plasmid, bacteriophage, yeast plasmid, animal cell vector, shuttle vector; the vector is a transposon vector.
  • Methods for preparing recombinant vectors are well known to those of ordinary skill in the art. As long as they can replicate and be stable in the host, any plasmid and vector can be used.
  • the template DNA is a nucleotide sequence encoding any target protein to be synthesized, which may be an original sequence, an artificially synthesized sequence, or an artificially modified sequence.
  • the template DNA may be used to synthesize the corresponding RNA and/or protein.
  • the preparation method of the cell extract is not limited.
  • a preferred preparation method is
  • the solid-liquid separation method is not particularly limited, and a preferred method is centrifugation.
  • the centrifugation is performed in a liquid state.
  • the centrifugation conditions are not particularly limited, and a preferred centrifugation condition is 5000-100000 g, preferably 8000-30000 g.
  • the centrifugation time is not particularly limited, and a preferred centrifugation time is 0.5 min-2 h, preferably 20 min-50 min.
  • the centrifugation temperature is not particularly limited.
  • the centrifugation is performed at 1-10°C, more preferably, at 2-6°C.
  • the washing treatment method is not particularly limited.
  • a preferred washing treatment method is to use a washing liquid at a pH of 7-8 (preferably 7.4).
  • the washing liquid is not particularly limited.
  • the washing liquid is selected from the following group: potassium 4-hydroxyethylpiperazineethanesulfonate, potassium acetate, magnesium acetate, or a combination thereof.
  • the cell disruption treatment method is not particularly limited.
  • a preferred cell disruption treatment includes high-pressure disruption and freeze-thaw (such as liquid nitrogen cryogenic) disruption.
  • the nucleoside triphosphate mixture in the in vitro cell-free protein synthesis system is adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate.
  • concentration of various mononucleotides is not particularly limited, and usually the concentration of each mononucleotide is 0.5-5mM, preferably 1.0-2.0mM.
  • the amino acid mixture in the in vitro cell-free protein synthesis system may include natural or non-natural amino acids, and may include D-type or L-type amino acids.
  • Representative amino acids include (but are not limited to) 20 natural amino acids: glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine.
  • the concentration of each amino acid is generally 0.01-0.5 mM, preferably 0.02-0.2 mM, such as 0.05, 0.06, 0.07, 0.08 mM.
  • the in vitro cell-free protein synthesis system further contains polyethylene glycol or its analogs.
  • concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the biosynthesis system.
  • Representative examples of PEG include (but are not limited to): PEG3000, PEG8000, PEG6000 and PEG3350. It should be understood that the system of the present invention may also include polyethylene glycols of various other molecular weights (such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc.).
  • the in vitro cell-free protein synthesis system further contains sucrose.
  • concentration of sucrose is not particularly limited, and generally, the concentration of sucrose is 0.03-40wt%, preferably 0.08-10wt%, and more preferably 0.1-5wt%, based on the total weight of the protein synthesis system.
  • a particularly preferred in vitro cell-free protein synthesis system in addition to yeast cell extract, also contains the following components: 22mM 4-hydroxyethylpiperazineethanesulfonic acid with a pH of 7.4, 30-150mM potassium acetate, 1.0-5.0mM magnesium acetate, 1.5-4mM nucleoside triphosphate mixture, 0.08-0.24mM amino acid mixture, 25mM creatine phosphate, 1.7mM dithiothreitol, 0.27mg/mL creatine phosphate kinase, 1%-4% polyethylene glycol, 0.5%-2% sucrose, and 0.027-0.054mg/mL T7 RNA polymerase.
  • the present invention is not intended to limit the scope of the present invention.
  • the experimental methods in the following examples where specific conditions are not specified are generally carried out under conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or under conditions recommended by the manufacturer. Unless otherwise specified, percentages and parts are by weight.
  • the present invention is only an example of Kluyveromyces lactis, K. lactis or kl for short, but the same design, analysis and experimental methods are also applicable to other eukaryotic cells such as yeast and animal cells, as well as prokaryotic cells.
  • the present invention uses Kluyveromyces lactis (K. lactis) as an example, but the same design, analysis and experimental methods are also applicable to other yeasts and other lower eukaryotic cells and higher animal cells.
  • the gene modification method in the present invention is CRISPR-Cas9 technology, but it is not limited to this, and can be any known and existing gene modification method.
  • the in vitro protein synthesis reaction mixture system also described as an in vitro protein synthesis reaction mixture, a reaction mixture system, or a reaction mixture, refers to a mixed system including an in vitro protein synthesis system and a nucleic acid template encoding a target protein; it may be homogeneous or heterogeneous, and may be a liquid system such as a solution, an emulsion, or a suspension.
  • Protein of the present invention The final concentrations of the components in Factory are: 80% (v/v) Kluyveromyces lactis extract, 15 mM glucose, 320 mM maltodextrin (measured in molar concentration of glucose monomers), 24 mM tripotassium phosphate, 1.8 mM nucleoside triphosphate mixture (a mixture of adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate, with the final concentration of each nucleoside triphosphate being 1.8 mM), 0.7 mM amino acid mixture (glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine, with the final concentration
  • the lactic acid Kluyveromyces extract includes endogenously expressed T7 RNA polymerase.
  • the lactic acid Kluyveromyces extract is modified in the following manner: using a modified strain based on the lactic acid Kluyveromyces strain ATCC8585; using the method described in CN109423496A, the coding gene of T7 RNA polymerase is integrated into the genome of lactic acid Kluyveromyces to obtain a modified strain so that it can endogenously express T7 RNA polymerase; using the modified strain to culture cell raw materials, and then preparing cell extracts.
  • the preparation process of lactic acid Kluyveromyces cell extract adopts conventional technical means, and is prepared with reference to the method described in CN109593656A.
  • the preparation steps include: providing an appropriate amount of raw materials of lactic acid Kluyveromyces cells that have been fermented and cultured, quick-freezing the cells with liquid nitrogen, breaking the cells, and collecting the supernatant by centrifugation to obtain a cell extract.
  • the protein concentration in the obtained lactic acid Kluyveromyces cell extract is 20-40 mg/mL.
  • the (i.e., prock, N-E-propargyloxycarbonyl-L-lysine hydrochloride) is a representative of non-natural amino acids (abbreviated as NCAA), but the NCAA of the present invention is not limited to only prock.
  • the present invention provides a method for screening tDNA Pyl mutants, which is as follows:
  • tDNA UCU Arg -tDNA TAG Pyl (3002) structure (see Figure 1)
  • five insertion fragments containing different tDNA TAG Pyl sequences (1 wild type and 4 mutants) were designed and inserted into the following two sites in the genome of Kluyveromyces lactis (a total of 10 structures).
  • tDNA UCU Arg is only used as a representative of a promoter, but it is not a limitation of the promoter described in the present invention.
  • the insertion position was selected in the middle of the UPF1 gene.
  • Primers 1 and 2 were designed near the insertion position for PCR amplification of the left homology arm; primers 3 and 4 were used for PCR amplification of the right homology arm.
  • the primer sequences are as follows:
  • CTTTTAGCAG-3’ (SEQ ID No: 3)
  • GAGTTA-3 (SEQ ID No: 4)
  • Primer 4 5’-CTTTGAGCCCACTCCATTG-3’ (SEQ ID No: 5)
  • the primer sequences are as follows:
  • Endogenous tRNA gene 5’-AGATGAGAAGAGACTGCGAG-3’ (SEQ ID No: 8);
  • Primers 7 and 8 were designed about 1000 bp to the left of the insertion position for PCR amplification of the left homology arm; primers 9 and 10 were designed about 1000 bp to the right of the insertion position for PCR amplification of the right homology arm.
  • the columns are as follows:
  • primer sequences are as follows:
  • step (2) The left and right homology arm PCR products obtained in step (2) and the insert fragment PCR product obtained in step (3) were connected by overlap PCR, and a total of five gene editing vectors with five different tDNA TAG Pyl sequences were obtained.
  • step (6) The left and right homology arm PCR products obtained in step (4) and the insert fragment PCR product obtained in step (5) were connected by overlap PCR, and a total of five gene editing vectors with five different tDNA TAG Pyl sequences were obtained.
  • step (6) and step (7) A total of 10 gene editing vectors obtained in step (6) and step (7) are transformed into lactic acid Kluyveromyces strains respectively.
  • the IVTT technology was used to translate the GFP-TAG-RFP structure in large quantities.
  • the expression level of RFP was used to reflect the read-through rate of the termination codon TAG, and the control group without the addition of non-natural amino acids was used to reflect the fidelity of the amber codon tRNA in transporting amino acids.
  • connection primers are:
  • Primer 13 CATTACATCCAGACTCAGAGCTAGGAGATACTACTTTGGAATGC (SEQ ID No: 16)
  • Primer 14 AGTCTCTCTGACCAGCATATTGTGTATAACTACTGTTGTTGG (SEQ ID No: 17)
  • the PCR products of the left and right homology arms and the insert fragment were used as templates for overlap PCR amplification.
  • the PCR amplification reaction system was: 3 ⁇ L of PCR product mixture; 0.5 ⁇ L of primer 13; 0.5 ⁇ L of primer 14; 10 ⁇ L of 2xMix; 6 ⁇ L of ddH2O.
  • the PCR amplification reaction program was: 95°C for 3 min; 95°C for 30 s, 60°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.
  • step (3) Adjust the parameters of the electroporator to 2KV, place the tube in the electroporation cup in step (2) and perform an electric shock treatment. Then add about 700ul of YPD medium to the electroporation cup, gently pipette and mix, and transfer to a centrifuge tube. Incubate at 200 rpm and 30°C for about 30 minutes.
  • the tRNA gene contained in the gene editing vector can transcribe a special tRNA for recognizing the termination codon TAG, which does not exist in the wild-type lactic acid Kluyveromyces strain. Therefore, two specific primers were designed on the gene sequence, and two primers were designed on the nearby lactic acid Kluyveromyces genome. The four primers formed two pairs.
  • the primer sequences are as follows:
  • Primer16 TTCTGATTAGAAGTCAGACGCG (SEQ ID No: 19)
  • Primer18 TGATGATCAAGAACCACTCG (SEQ ID No: 21)
  • the reaction system was: 1 ⁇ L of nucleic acid of monoclonal strain; 0.5 ⁇ L of Primer 15; 0.5 ⁇ L of Primer 17; 10 ⁇ L of 2xMix; 8 ⁇ L of ddH2O. And 1 ⁇ L of nucleic acid of monoclonal strain; 0.5 ⁇ L of Primer 16; 0.5 ⁇ L of Primer 18; 10 ⁇ L of 2xMix; 8 ⁇ L of ddH2O.
  • the response program was: 95°C for 3 min; 95°C for 30 s, 57°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.
  • the monoclonal strains initially identified as positive after identification were stored, and their nucleic acids were amplified by PCR using primers designed on two genomes.
  • the resulting PCR products were entrusted to Qingke Biotech Co., Ltd. for sequencing.
  • the primers are as follows:
  • Primer20 CTTTGTTGAACATCATCCCGGTGAGAATGCC(SEQ ID No: 23)
  • the reaction system was: monoclonal strain nucleic acid 1 ⁇ L; primer 5 0.5 ⁇ L; primer 6 0.5 ⁇ L; 2xMix 10 ⁇ L; ddH2O 8 ⁇ Ll.
  • the response program was: 95°C for 3 min; 95°C for 30 s, 65°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.
  • the sequencing result i.e., the inserted sequence (SEQ ID No: 25), was compared with the theoretical sequence (see FIG3 ). After comparison, if the two insertion sequences are consistent, the tested strain is determined to be a positive strain that has successfully undergone gene editing.
  • Example 5 According to a similar operation method, the strains of transformation 1 to transformation 3 and transformation 5 to transformation 10 were obtained respectively (the insertion sequences of transformation 1 to transformation 3 and transformation 5 are SEQ ID No: 26 to 29 respectively, and the insertion sequences corresponding to transformation 6 to transformation 10 are the same as those of transformation 1 to transformation 5, and the only difference is that the insertion sites of transformation 1 to transformation 5 and transformation 6 to transformation 10 are different)
  • the strain to be tested which was frozen in a -80°C refrigerator, was streaked on a YPD plate, and a single colony was picked and inoculated into a 250mL Erlenmeyer flask containing 100mL of seed culture medium, and cultured in a 30°C shaker at 200rpm for 24h; the seed liquid was transferred to a 1000mL Erlenmeyer flask containing 400mL of fermentation culture medium, and cultured in a 30°C shaker at 200rpm until the harvest period, the cells were collected by high-speed centrifugation, and the cell lysate was prepared under liquid nitrogen protection.
  • Example 7 Verification of the expression efficiency of in vitro synthesis of target proteins with introduction of unnatural amino acids
  • Ex485nm/Em535 was used to detect the fluorescence intensity of EGFP, and Ex535nm/Em595nm was used to detect the fluorescence intensity of RFP.
  • the efficiency of the introduction of unnatural amino acids was determined based on the ratio of RFP/GFP and the fluorescence intensity of RFP (see Figures 4-7).
  • the modified strains can express normally and show strong RFP fluorescence values, indicating that the modified strains have successfully achieved the purpose of introducing non-natural amino acids into proteins and have high activity, especially the strain of modification 1 (wt in Figure 4) has a very high read-through rate.
  • the modified strains have good fidelity in the process of introducing non-natural amino acids.
  • the strain of modification 1 (wt in Figure 6) has extremely high fidelity.
  • strains obtained by genetic modification of the present invention have a high read-through rate and fidelity for in vitro expression of non-natural amino acids, have good activity, and have extremely high reference value for the expanded application of the field of in vitro introduction of non-natural amino acids.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Mycology (AREA)
  • Cell Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

Provided are a genetically engineered strain based on tRNA modification, a preparation method therefor, and a use thereof. Specifically, an inhibitory tRNA coding sequence is directly recombined into a yeast cell genome, and the problem that an inhibitory tRNA gene cannot be normally expressed when directly inserted into a yeast cell genome is solved, thereby achieving the purpose of preparing an unnatural amino acid protein.

Description

一种基于tRNA改造的基因工程菌株及其制备方法和应用A genetically engineered strain based on tRNA modification and its preparation method and application 技术领域Technical Field

本发明涉及基因工程技术领域,具体地涉及一种基因工程菌株及其制备方法和应用。The present invention relates to the field of gene engineering technology, and in particular to a gene engineering strain and a preparation method and application thereof.

背景技术Background Art

CRISPR(Clustered Regularly Interspaced Short Palindromic Repeats)技术是一种革命性的基因编辑工具,被广泛应用于生物学和医学研究领域。它基于细菌和古菌天然的免疫系统,能够精确地编辑、添加或删除基因序列。CRISPR技术的核心组成部分包括CRISPR序列和Cas蛋白。CRISPR序列是一段DNA序列,其中包含了一系列重复和间隔的序列,这些序列来自于细菌或古菌曾经遇到过的外来病毒或质粒。Cas蛋白是CRISPR系统中的核心蛋白,它能够识别和切割这些外来DNA序列。相对于传统的基因编辑方法,CRISPR技术更加快速和精准,能够在短时间内实现大规模的基因编辑。CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) technology is a revolutionary gene editing tool that is widely used in the fields of biology and medical research. It is based on the natural immune system of bacteria and archaea, and can accurately edit, add or delete gene sequences. The core components of CRISPR technology include CRISPR sequences and Cas proteins. The CRISPR sequence is a DNA sequence that contains a series of repeated and spaced sequences that come from foreign viruses or plasmids that bacteria or archaea have encountered. Cas protein is the core protein in the CRISPR system, which can recognize and cut these foreign DNA sequences. Compared with traditional gene editing methods, CRISPR technology is faster and more accurate, and can achieve large-scale gene editing in a short period of time.

琥珀密码子tRNA是一种特殊的tRNA,它与遗传密码中的琥珀密码子相对应。琥珀密码子是一种终止密码子,当遇到该密码子时,蛋白质合成过程会终止。然而,通过将琥珀密码子tRNA引入到酵母细胞中,可以实现非天然氨基酸的插入。具体实现方法是将特定的非天然氨基酸与琥珀密码子配对,从而使其插入到蛋白质序列中。为了实现这一过程,首先需要设计和合成一种能够与琥珀密码子配对的非天然氨基酸,使其能够被琥珀密码子tRNA识别从而插入到蛋白质中。琥珀密码子tRNA的引入使得琥珀密码子不再是蛋白质合成的终止信号,而成为插入非天然氨基酸的信号。当酵母细胞中存在琥珀密码子tRNA和相应的非天然氨基酸时,合成蛋白质的过程不会在琥珀密码子处停止,而是会插入非天然氨基酸,从而实现对蛋白质的改造。Amber codon tRNA is a special tRNA that corresponds to the amber codon in the genetic code. The amber codon is a stop codon, and when it is encountered, the protein synthesis process will stop. However, by introducing amber codon tRNA into yeast cells, the insertion of unnatural amino acids can be achieved. The specific implementation method is to pair a specific unnatural amino acid with an amber codon so that it can be inserted into the protein sequence. In order to achieve this process, it is first necessary to design and synthesize an unnatural amino acid that can pair with an amber codon so that it can be recognized by the amber codon tRNA and inserted into the protein. The introduction of amber codon tRNA makes the amber codon no longer a termination signal for protein synthesis, but a signal for inserting unnatural amino acids. When amber codon tRNA and the corresponding unnatural amino acid are present in yeast cells, the process of synthesizing proteins will not stop at the amber codon, but will insert unnatural amino acids, thereby achieving protein modification.

非天然氨基酸的引入可以改变部分药物分子的结构和性质,从而改变药物的活性、稳定性和药代动力学等特性。通过合成非天然氨基酸,可以设计和合成更具活性和选择性的药物分子,提高药物的疗效和减少副作用。其次,非天然氨基酸的引入可以改变药物的溶解度、脂溶性、药物结合性等特性,从而改善药物的吸收和分布特性, 提高药物在体内的生物利用度和疗效。此外,非天然氨基酸的引入可以增加药物分子的稳定性,减少药物的降解和失效,延长药物的有效期。The introduction of non-natural amino acids can change the structure and properties of some drug molecules, thereby changing the activity, stability, pharmacokinetics and other characteristics of the drug. By synthesizing non-natural amino acids, more active and selective drug molecules can be designed and synthesized, improving the efficacy of drugs and reducing side effects. Secondly, the introduction of non-natural amino acids can change the solubility, lipid solubility, drug binding and other characteristics of drugs, thereby improving the absorption and distribution characteristics of drugs. Improve the bioavailability and efficacy of drugs in the body. In addition, the introduction of unnatural amino acids can increase the stability of drug molecules, reduce drug degradation and failure, and extend the shelf life of drugs.

然而,插入的琥珀密码子tRNA基因可能存在序列不稳定性或结构问题,导致其无法被酵母细胞正确识别和转录。其次,琥珀密码子tRNA基因可能与酵母细胞的转录和翻译机制不兼容,导致无法正常表达。此外,酵母细胞内的其他代谢途径和调控机制也可能对琥珀密码子tRNA基因的表达产生影响。例如,酵母细胞可能存在某些负调控机制,抑制了琥珀密码子tRNA基因的表达。或者,琥珀密码子tRNA基因的表达可能受到细胞内环境因素的影响,如营养条件或温度等。However, the inserted amber codon tRNA gene may have sequence instability or structural problems, which may prevent it from being correctly recognized and transcribed by yeast cells. Secondly, the amber codon tRNA gene may be incompatible with the transcription and translation mechanisms of yeast cells, resulting in failure to express normally. In addition, other metabolic pathways and regulatory mechanisms within yeast cells may also affect the expression of amber codon tRNA genes. For example, yeast cells may have certain negative regulatory mechanisms that inhibit the expression of amber codon tRNA genes. Alternatively, the expression of amber codon tRNA genes may be affected by intracellular environmental factors, such as nutritional conditions or temperature.

因此提供一种在酵母中表达琥珀密码子tRNA的技术,可以实现在酵母细胞中合成含有非天然氨基酸蛋白质的目的。Therefore, a technology for expressing amber codon tRNA in yeast is provided, which can achieve the purpose of synthesizing proteins containing unnatural amino acids in yeast cells.

发明内容Summary of the invention

本发明通过利用CRISPR技术将抑制性tRNA整合到细胞中,使改造菌株可以高效稳定表达抑制性tRNA,从而实现细胞外表达含有非天然氨基酸(ncaa)蛋白的能力。The present invention utilizes CRISPR technology to integrate inhibitory tRNA into cells, so that the modified strain can efficiently and stably express inhibitory tRNA, thereby realizing the ability to express proteins containing non-natural amino acids (ncaa) outside the cell.

本发明第一方面提供一种核酸构建物,所述核酸构建物的结构为式(I):Z1-Z2-Z3,The first aspect of the present invention provides a nucleic acid construct, wherein the structure of the nucleic acid construct is formula (I): Z1-Z2-Z3,

式中,Z1-Z3分别为用于构成所述构建物的元件;In the formula, Z1-Z3 are elements used to constitute the structure respectively;

各“-”独立地为键或核苷酸连接序列;Each "-" is independently a bond or a nucleotide linking sequence;

Z1为启动子序列;Z1 is the promoter sequence;

Z2为无或连接序列;Z2 is none or a connection sequence;

Z3为抑制性tRNA的基因序列。Z3 is the gene sequence of inhibitory tRNA.

进一步的,所述的启动子选自tDNA、SNR52、SNR6、Pol III、U6、U3、SCR1、RPR1或H1。Furthermore, the promoter is selected from tDNA, SNR52, SNR6, Pol III, U6, U3, SCR1, RPR1 or H1.

进一步的,所述的启动子优选为tDNA或SNR52。Furthermore, the promoter is preferably tDNA or SNR52.

进一步的,所述的抑制性tRNA选自琥珀密码子tRNA、赭石密码子tRNA或蛋白石密码子tRNA;优选琥珀密码子tRNA;进一步优选tRNATAG PylFurthermore, the inhibitory tRNA is selected from amber codon tRNA, ochre codon tRNA or opal codon tRNA; preferably amber codon tRNA; further preferably tRNA TAG Pyl .

进一步的,所述式(I)结构选自Z1-Z2-tDNATAG Pyl,所述tDNATAG Pyl选自野生型或突变型。Furthermore, the structure of formula (I) is selected from Z1-Z2-tDNA TAG Pyl , and the tDNA TAG Pyl is selected from wild type or mutant type.

进一步的,所述式(I)结构选自tDNA-Z2-tDNATAG Pyl,所述tDNATAG Pyl选自野生型或突变型。Furthermore, the structure of formula (I) is selected from tDNA-Z2-tDNA TAG Pyl , and the tDNA TAG Pyl is selected from wild type or mutant type.

进一步的,所述的tDNA选自tDNAArg、tDNAAla、tDNAAsp、tDNACys、tDNAGln、 tDNALeu、tDNAPro、tDNATyr、tDNAVal、tDNASer、tDNAGly、tDNAHis、tDNAIle、tDNALys、tDNAMet、tDNAPhe、tDNAThr或tDNAGluFurthermore, the tDNA is selected from tDNA Arg , tDNA Ala , tDNA Asp , tDNA Cys , tDNA Gln , tDNA Leu , tDNA Pro , tDNA Tyr , tDNA Val , tDNA Ser , tDNA Gly , tDNA His , tDNA Ile , tDNA Lys , tDNA Met , tDNA Phe , tDNA Thr or tDNA Glu .

进一步优选的,所述的启动子选自tDNAUCU ArgFurther preferably, the promoter is selected from tDNA UCU Arg .

进一步优选的,所述式(I)结构选自tDNAArg-Z2-tDNATAG Pyl;更进一步优选的,所述式(I)结构选自tDNAUCU Arg-Z2-tDNATAG PylFurther preferably, the structure of formula (I) is selected from tDNA Arg -Z2-tDNA TAG Pyl ; further preferably, the structure of formula (I) is selected from tDNA UCU Arg -Z2-tDNA TAG Pyl .

进一步优选的,所述Z2选自连接序列,所述连接序列选自CTTTGTTTCT,即SEQ ID NO:30。Further preferably, Z2 is selected from a connecting sequence, and the connecting sequence is selected from CTTTGTTTCT, i.e. SEQ ID NO: 30.

进一步的,所述Z3为抑制性tRNA的单拷贝或多拷贝基因序列,进一步优选的,所述多拷贝的数量小于等于6。Furthermore, the Z3 is a single copy or multiple copy gene sequence of an inhibitory tRNA, and further preferably, the number of the multiple copies is less than or equal to 6.

进一步的,所述式(I)结构的序列号为SEQ ID NO:25~29。Furthermore, the sequence number of the structure of formula (I) is SEQ ID NO: 25 to 29.

本发明第二方面提供一种基因工程菌株,所述菌株的基因组中整合有至少包含一种抑制性tRNA基因序列的核酸构建物The second aspect of the present invention provides a genetically engineered strain, wherein the genome of the strain is integrated with a nucleic acid construct comprising at least one inhibitory tRNA gene sequence.

进一步的,所述核酸构建物为本发明第一方面所述的核酸构建物。Furthermore, the nucleic acid construct is the nucleic acid construct described in the first aspect of the present invention.

进一步的,所述的菌株来源于细菌、哺乳动物细胞、人体细胞、植物细胞、酵母细胞、昆虫细胞之一或其任意组合。Furthermore, the strain is derived from bacteria, mammalian cells, human cells, plant cells, yeast cells, insect cells or any combination thereof.

进一步的,所述细胞提取物,更优选选自以下任一种来源:大肠杆菌、乳酸克鲁维酵母、麦胚细胞、Spodoptera frugiperda昆虫细胞、兔网织红细胞、CHO细胞、COS细胞、VERO细胞、BHK细胞、人纤维肉瘤HT1080细胞、或者其组合。Furthermore, the cell extract is more preferably selected from any of the following sources: Escherichia coli, Kluyveromyces lactis, wheat germ cells, Spodoptera frugiperda insect cells, rabbit reticulocytes, CHO cells, COS cells, VERO cells, BHK cells, human fibrosarcoma HT1080 cells, or a combination thereof.

进一步优选的,所述的细胞提取物来源于酵母细胞。Further preferably, the cell extract is derived from yeast cells.

进一步的,所述酵母细胞选自毕赤酵母、芬兰毕赤酵母(Pichia finlandica)、喜海藻糖毕赤酵母(Pichia trehalophila)、科克拉马毕赤酵母(Pichia koclamae)、膜醭毕赤酵母(Pichia membranaefaciens)、微小毕赤酵母(Pichia minuta)、甲醇诱导型酵母(Ogataeaminuta)、林氏毕赤酵母(Pichia lindneri)、仙人掌毕赤酵母(Pichia opuntiae)、耐热毕赤酵母(Pichia thermotolerans)、柳毕赤酵母(Pichia salictaria)、松栎毕赤酵母(Pichia guercuum)、皮杰普毕赤酵母(Pichia pijperi)、树干毕赤酵母(Pichiastiptis)、甲醇毕赤酵母(Pichia methanolica)、毕赤酵母菌(Pichia sp.)、酿酒酵母(Saccharomyces cerevisiae)、啤酒酵母、甘蔗糖蜜酵母、酵母菌(Saccharomyces sp.)、多形汉逊酵母(Hansenulapolymorpha)、产朊假丝酵母、克鲁维酵母之一或其组合。Furthermore, the yeast cell is selected from Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membraneaefaciens, Pichia minuta, Ogataeaminuta, Pichia lindneri, Pichia opuntiae, Pichia thermotolerans ), Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces molasses, Saccharomyces sp., Hansenula polymorpha, Candida utilis, Kluyveromyces, or a combination thereof.

更进一步优选地,所述的克鲁维酵母进一步包括:乳酸克鲁维酵母(Kluyveromyces,K.lactis)、马克斯克鲁维酵母(Kluyveromyces marxianus)、、多布克 鲁维酵母(Kluyveromyces dobzhanskii)、海泥克鲁维酵母(Kluyveromyces aestuarii)、非发酵克鲁维酵母(Kluyveromyces nonfermentans)、威克海姆克鲁维酵母(Kluyveromyces wickerhamii)、耐热克鲁维酵母(Kluyveromyces thermotolerans)、脆壁克鲁维酵母(Kluyveromyces fragilis)、湖北克鲁维酵母(Kluyveromyces hubeiensis)、多孢克鲁维酵母(Kluyveromyces polysporus)、暹罗克鲁维酵母(Kluyveromyces siamensis)、亚罗克鲁维酵母(Kluyveromyces yarrowii)之一或其组合;较佳地,所述的酵母细胞为克鲁维酵母细胞,更佳地为乳酸克鲁维酵母细胞。More preferably, the Kluyveromyces further includes: Kluyveromyces lactis (Kluyveromyces, K. lactis), Kluyveromyces marxianus (Kluyveromyces marxianus), The yeast cell is selected from the group consisting of Kluyveromyces dobzhanskii, Kluyveromyces aestuarii, Kluyveromyces nonfermentans, Kluyveromyces wickerhamii, Kluyveromyces thermotolerans, Kluyveromyces fragilis, Kluyveromyces hubeiensis, Kluyveromyces polysporus, Kluyveromyces siamensis and Kluyveromyces yarrowii, or a combination thereof; preferably, the yeast cell is a Kluyveromyces cell, more preferably a Kluyveromyces lactis cell.

进一步优选的,所述核酸构建物经插入位点整合到菌株的基因组中,所述的插入位点为UPF1或内源tDNA。Further preferably, the nucleic acid construct is integrated into the genome of the strain via an insertion site, and the insertion site is UPF1 or endogenous tDNA.

本发明第三方面提供本发明第二方面所述的一种基因工程菌株的制备方法,将所述的核酸构建物经UPF1位点插入到所述菌株的基因组中。The third aspect of the present invention provides a method for preparing a genetically engineered strain according to the second aspect of the present invention, wherein the nucleic acid construct is inserted into the genome of the strain via the UPF1 site.

进一步的,所述核酸构建物经UPF1位点中的任一位置插入到所述菌株的基因组中。Furthermore, the nucleic acid construct is inserted into the genome of the strain via any position in the UPF1 site.

本发明第四方面提供本发明第二方面所提供的一种基因工程菌株的制备方法,将所述的核酸构建物经内源tDNA序列5’端和/或3’端1~1000bp处插入到所述菌株的基因组中;优选100~800bp;进一步优选200~500bp;最优选300bp。The fourth aspect of the present invention provides a method for preparing a genetically engineered strain provided in the second aspect of the present invention, wherein the nucleic acid construct is inserted into the genome of the strain via 1 to 1000 bp at the 5' end and/or 3' end of the endogenous tDNA sequence; preferably 100 to 800 bp; further preferably 200 to 500 bp; and most preferably 300 bp.

进一步的,所述的核酸构建物经内源tDNA序列的3’段插入。Furthermore, the nucleic acid construct is inserted into the 3' segment of the endogenous tDNA sequence.

本发明第五方面提供一种用于合成非天然氨基酸掺入的蛋白质的方法,采用本发明第五方面所述的基因工程菌株提供抑制性tRNA。The fifth aspect of the present invention provides a method for synthesizing proteins incorporating non-natural amino acids, using the genetically engineered strain described in the fifth aspect of the present invention to provide inhibitory tRNA.

本发明第六方面提供一种用于合成非天然氨基酸掺入的蛋白质的无细胞合成体系,所述体系至少包括细胞提取物,所述细胞提取物来自于本发明第二方面所提供的基因工程菌株。The sixth aspect of the present invention provides a cell-free synthesis system for synthesizing proteins incorporated with non-natural amino acids, wherein the system at least comprises a cell extract derived from the genetically engineered strain provided by the second aspect of the present invention.

进一步的,所述体系还包括氨酰tRNA合成酶和非天然氨基酸。Furthermore, the system also includes aminoacyl-tRNA synthetase and unnatural amino acids.

进一步的,所述的氨酰tRNA合成酶选自天然或突变的Pyl-tRNA合成酶(PylRS)、Leu-tRNA合成酶(LeuRS)、Tyr-tRNA合成酶(TyrRS)、Phe-tRNA合成酶(PheRS)或TrP-tRNA合成酶(TrpRS)。进一步优选为PylRS。Furthermore, the aminoacyl tRNA synthetase is selected from natural or mutant Pyl-tRNA synthetase (PylRS), Leu-tRNA synthetase (LeuRS), Tyr-tRNA synthetase (TyrRS), Phe-tRNA synthetase (PheRS) or TrP-tRNA synthetase (TrpRS), and is more preferably PylRS.

进一步的,所述的氨酰tRNA合成酶选自天然或突变的MaPylRS、MmPylRS、MbPylRS、EcTyrRS、MjTyrRS、EcLeuRS、ScPheRS、ScTrpRS、BsTrpRSFurthermore, the aminoacyl tRNA synthetase is selected from natural or mutant MaPylRS, MmPylRS, MbPylRS, EcTyrRS, MjTyrRS, EcLeuRS, ScPheRS, ScTrpRS, BsTrpRS

进一步的,所述的非天然氨基酸指的是20种天然氨基酸之外的氨基酸种类。Furthermore, the non-natural amino acids refer to amino acids other than the 20 natural amino acids.

进一步的,所述的非天然氨基酸的结构式为式(2)化合物或其盐形式。 Furthermore, the structural formula of the non-natural amino acid is a compound of formula (2) or a salt thereof.

其中n选自1-20的自然数,R1选自取代或未取代的C5-C60的芳基或杂芳基、取代或未取代的C1-C20的烷基、取代或未取代的C2-C20的烯基或取代或未取代的C2-C20的炔基,A选自O或-CH2-。 wherein n is selected from a natural number of 1-20, R1 is selected from substituted or unsubstituted C5-C60 aryl or heteroaryl, substituted or unsubstituted C1-C20 alkyl, substituted or unsubstituted C2-C20 alkenyl or substituted or unsubstituted C2-C20 alkynyl, and A is selected from O or -CH2- .

在另一优选例中,n选自1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20.

在另一优选例中,n选自1-10的自然数。In another preferred embodiment, n is a natural number selected from 1-10.

在另一优选例中,n选自1、2、3、4、5、6、7、8、9或10。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10.

在另一优选例中,n选自1-6的自然数。In another preferred embodiment, n is a natural number selected from 1-6.

在另一优选例中,n选自1、2、3、4、5或6。In another preferred embodiment, n is selected from 1, 2, 3, 4, 5 or 6.

在另一优选例中,所述的R1选自取代或未取代的C5-C30的芳基或杂芳基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C5-C30 aryl or heteroaryl.

在另一优选例中,所述的R1选自取代或未取代的苯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted phenyl.

在另一优选例中,所述的R1选自取代或未取代的C2-C20的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C20 alkenyl.

在另一优选例中,所述的R1选自取代或未取代的C2-C10的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C10 alkenyl.

在另一优选例中,所述的R1选自取代或未取代的C2-C6的烯基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C6 alkenyl.

在另一优选例中,所述的R1选自或取代或未取代的C2-C20的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C20 alkynyl.

在另一优选例中,所述的R1选自或取代或未取代的C2-C10的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C10 alkynyl.

在另一优选例中,所述的R1选自或取代或未取代的C2-C6的炔基。In another preferred embodiment, the R 1 is selected from substituted or unsubstituted C2-C6 alkynyl.

在另一优选例中,所述的A选自O。In another preferred embodiment, the A is selected from O.

在另一优选例中,所述的A选自-CH2-。In another preferred embodiment, the A is selected from -CH 2 -.

在另一优选例中,所述的取代基为本领域常见的取代基团,如芳基、杂芳基、烷基、环烷基、芳基氧基、杂芳基氧基、烷基氧基、环烷基氧基、羟基、巯基、酯基、羧基、氰基、卤素、硝基、磺酸基、叠氮基、烯基、炔基、磷酸基等。In another preferred embodiment, the substituents are common substituents in the art, such as aryl, heteroaryl, alkyl, cycloalkyl, aryloxy, heteroaryloxy, alkyloxy, cycloalkyloxy, hydroxyl, thiol, ester, carboxyl, cyano, halogen, nitro, sulfonic acid, azido, alkenyl, alkynyl, phosphate, etc.

在另一优选例中,所述的非天然氨基酸的结构式选自如下的之一或组合,或其盐形式:

In another preferred embodiment, the structural formula of the non-natural amino acid is selected from one or a combination of the following, or a salt form thereof:

本发明第七方面提供一种本发明第二方面提供的基因工程菌株或本发明第六方面提供的无细胞合成体系在无细胞合成非天然氨基酸掺入的蛋白质中的应用。The seventh aspect of the present invention provides an application of the genetically engineered strain provided in the second aspect of the present invention or the cell-free synthesis system provided in the sixth aspect of the present invention in the cell-free synthesis of proteins incorporated with non-natural amino acids.

本发明第八方面提供一种试剂盒,所述的试剂盒包括本发明第六方面提供的无细胞合成体系。The eighth aspect of the present invention provides a kit, which includes the cell-free synthesis system provided by the sixth aspect of the present invention.

本发明第九方面提供一种无细胞合成非天然氨基酸掺入的蛋白质的方法,包括如下步骤:步骤(1),提供本发明第六方面提供的无细胞合成体系或本发明第八方面提供的试剂盒;步骤(2),在步骤(1)所述的合成体系或试剂盒中加入编码外源蛋白的DNA分子,在氨酰tRNA合成酶和非天然氨基酸存在的条件下,经反应得到所述的蛋白质。The ninth aspect of the present invention provides a method for cell-free synthesis of proteins incorporated with non-natural amino acids, comprising the following steps: step (1), providing the cell-free synthesis system provided in the sixth aspect of the present invention or the kit provided in the eighth aspect of the present invention; step (2), adding a DNA molecule encoding an exogenous protein to the synthesis system or the kit described in step (1), and obtaining the protein by reaction in the presence of aminoacyl-tRNA synthetase and non-natural amino acids.

进一步的,所述的外源蛋白所述的外源蛋白选自:荧光素蛋白、荧光素酶(如萤火虫荧光素酶)、荧光蛋白(如绿色荧光蛋白、黄色荧光蛋白、红色荧光蛋白)、氨酰tRNA合成酶、甘油醛-3-磷酸脱氢酶、过氧化氢酶、肌动蛋白、抗体的可变区域、萤光素酶突变、α-淀粉酶、肠道菌素A、丙型肝炎病毒E2糖蛋白、胰岛素前体、干扰素αA、细胞因子,干扰素α2b、白细胞介素-1β、溶菌酶素、血清白蛋白、单链抗体段(scFV)、甲状腺素运载蛋白、酪氨酸酶、木聚糖酶或其组合。Furthermore, the exogenous protein is selected from: luciferin, luciferase (such as firefly luciferase), fluorescent protein (such as green fluorescent protein, yellow fluorescent protein, red fluorescent protein), aminoacyl-tRNA synthetase, glyceraldehyde-3-phosphate dehydrogenase, catalase, actin, variable region of antibody, luciferase mutation, α-amylase, enterobactin A, hepatitis C virus E2 glycoprotein, insulin precursor, interferon αA, cytokine, interferon α2b, interleukin-1β, lysozyme, serum albumin, single-chain antibody fragment (scFV), thyroxine transporter, tyrosinase, xylanase or a combination thereof.

进一步的,所述的外源蛋白包括野生型蛋白、突变型蛋白或重组型蛋白。Furthermore, the exogenous protein includes a wild-type protein, a mutant protein or a recombinant protein.

本发明的优势在于:The advantages of the present invention are:

(1)将抑制性tRNA基因直接重组到酵母细胞基因组中,解决了直接在酵母细胞基因组中插入抑制性tRNA基因导致该基因无法正常表达的问题。(1) The inhibitory tRNA gene is directly recombined into the yeast cell genome, which solves the problem that the inhibitory tRNA gene cannot be expressed normally when it is directly inserted into the yeast cell genome.

(2)设计的抑制性tRNA结构能够正常转录出抑制性tRNA,且可以正常转运非天然氨基酸到特定的终止密码子(如TAG)位置,从而实现了制备非天然氨基酸蛋白质的目的。(2) The designed inhibitory tRNA structure can normally transcribe inhibitory tRNA and can normally transport non-natural amino acids to specific stop codons (such as TAG), thereby achieving the purpose of preparing non-natural amino acid proteins.

(3)通过在特定位置插入非天然氨基酸可以实现特殊功能蛋白质的生产,在化 工、制药等领域有着非常大的发展潜力。(3) The production of proteins with special functions can be achieved by inserting unnatural amino acids at specific positions. The fields of engineering and pharmaceuticals have great development potential.

(4)重组后的菌株结构稳定,可以进行工业化生产,从而可以进一步实现非天然氨基酸蛋白质的工业化生产。(4) The recombinant strain has a stable structure and can be produced on an industrial scale, thereby further realizing the industrial production of non-natural amino acid proteins.

应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described below (such as embodiments) can be combined with each other to form a new or preferred technical solution. Due to space limitations, they will not be described one by one here.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1表示的是本发明的插入片段序列3002结构。FIG. 1 shows the structure of the insert sequence 3002 of the present invention.

图2表示的是实施例1的测序结果。FIG. 2 shows the sequencing results of Example 1.

图3表示的是实施例4的测序结果。FIG. 3 shows the sequencing results of Example 4.

图4~图7表示的是改造1~改造10的菌株对非天然氨基酸插入效率的活性验证,其中图4和图6表示的是经内源tDNA附近位点插入改造的菌株对非天然氨基酸插入效率的活性验证;图5和图7表示的是经UPF1位点插入改造的菌株对非天然氨基酸插入效率的活性验证。在图4~图7中,wt与upf1-wt对应的插入序列相同,mut1~4分别与upf1-mut1~upf1-4对应的插入序列相同。Figures 4 to 7 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed 1 to 10, among which Figures 4 and 6 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed by insertion near the endogenous tDNA; Figures 5 and 7 show the activity verification of the efficiency of non-natural amino acid insertion of strains transformed by insertion at the UPF1 site. In Figures 4 to 7, wt and upf1-wt have the same insertion sequence, and mut1 to 4 have the same insertion sequence as upf1-mut1 to upf1-4, respectively.

具体实施方式DETAILED DESCRIPTION

需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。It should be noted that, in the absence of conflict, the embodiments and features in the embodiments of the present application may be combined with each other.

下面结合具体实施方式和实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,优先按照、参考上文所述的具体实施方式指引的条件,然后可按照常规条件,或按照制造厂商所建议的条件。The present invention is further described below in conjunction with specific embodiments and examples. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods in the following examples that do not specify specific conditions are preferably followed or referred to the conditions indicated in the specific embodiments described above, and then can be followed according to conventional conditions or according to the conditions recommended by the manufacturer.

除非另外说明,否则本发明中提及的百分比和份数是重量百分比和重量份数。Unless otherwise specified, the percentages and parts mentioned in the present invention are percentages and parts by weight.

如无特别说明,则本发明实施例中所用的材料和试剂均为市售产品。Unless otherwise specified, the materials and reagents used in the examples of the present invention are all commercially available products.

本申请中的温度单位如无特殊说明,均为摄氏度(℃)。Unless otherwise specified, the temperature units in this application are all degrees Celsius (°C).

名词和术语Nouns and terms

以下是针对本发明所采用的部分相关“名词”、“术语”的含义进行的解释或说明,以便于更好地理解本发明。相应的解释或说明适用于本发明的全文,既适用于下文,也适用于上文。本发明中涉及引用文献时,相关术语、名词、短语在引用文献中 的定义也一并被引用,但是,与本发明中的定义相冲突时,以本发明中的定义为准。在引用文献中的定义与本发明中的定义发生冲突时,并不影响所引用的成分、物质、组合物、材料、体系、配方、种类、方法、设备等以引用文献中确定的内容为准。The following is an explanation or description of the meaning of some relevant "nouns" and "terms" used in the present invention, so as to facilitate a better understanding of the present invention. The corresponding explanation or description applies to the entire text of the present invention, both below and above. When the present invention involves references, the relevant terms, nouns, and phrases in the references are The definitions in the cited documents are also included, but in case of conflict with the definitions in the present invention, the definitions in the present invention shall prevail. In case of conflict between the definitions in the cited documents and the definitions in the present invention, it does not affect the cited components, substances, compositions, materials, systems, formulations, types, methods, equipment, etc., which shall prevail.

本发明中,“优选”、“较佳”、“更优选”、“更佳”、“最优选”、“进一步优选”等优选实施方式,不构成对发明的涵盖范围及保护范围的任何意义上的限制,并非用于限定本发明的范围和实施方式,仅用于提供一些实施方式作为举例。In the present invention, preferred embodiments such as “preferred”, “better”, “more preferred”, “better”, “most preferred” and “further preferred” do not constitute any limitation on the scope of coverage and protection scope of the invention, and are not used to limit the scope and embodiments of the present invention, but are only used to provide some embodiments as examples.

本发明的描述中,对于“优选之一”、“优选方式之一”、“优选实施方式之一”、“优选例之一”、“优选例”、“在一优选的实施方式中”、“一些优选例中”、“一些优选方式中”、“优选为”、“优选”、“优选地”、“更优选”、“更优地”、“进一步优选”、“最优选”等优选方式,以及“实施方式之一”、“方式之一”、“示例”、“具体示例”、“举例如”、“作为举例”、“例如”、“比如”、“如”等示意的列举方式,同样不构成对发明的涵盖范围及保护范围的任何意义上的限制,且各方式所描述的具体特征包含于本发明的至少一个具体实施方式中。本发明中,各方式所描述的具体特征可以在任何的一个或者多个具体实施方式中以合适的方式结合。本发明中,各优选方式对应的技术特征或技术方案也可以通过任意合适的方式结合。In the description of the present invention, preferred modes such as "one of the preferred embodiments", "one of the preferred modes", "one of the preferred embodiments", "one of the preferred examples", "preferred examples", "in a preferred embodiment", "some preferred examples", "some preferred modes", "preferably", "preferably", "preferably", "more preferably", "more preferably", "further preferably", "most preferably", and illustrative enumeration modes such as "one of the embodiments", "one of the modes", "example", "specific example", "for example", "as an example", "for example", "such as", "such as", etc., do not constitute any limitation on the scope of coverage and protection scope of the invention, and the specific features described in each mode are included in at least one specific embodiment of the present invention. In the present invention, the specific features described in each mode may be combined in a suitable manner in any one or more specific embodiments. In the present invention, the technical features or technical solutions corresponding to each preferred mode may also be combined in any suitable manner.

本发明中,“其任意组合”,在数量上表示“大于1”,在涵盖范围上表示以下情形构成的组:“任选其中一个,或者任选其中至少两个构成的组”。In the present invention, "any combination thereof" means "greater than 1" in terms of quantity, and means a group consisting of the following situations in terms of coverage: "select any one of them, or select a group consisting of at least two of them".

本发明中,“一个或多个”、“一种或多种”等“一或多”的描述,与“至少一个”、“至少一种”、“其组合”、“或其组合”、“及其组合”、“或其任意组合”、“及其任意组合”等具有相同含义,可以互换使用,表示数量上等于“1”或“大于1”。In the present invention, the descriptions of "one or more", "one or more" and "one or more" have the same meaning as "at least one", "at least one", "a combination thereof", "or a combination thereof", "and combinations thereof", "or any combination thereof", "and any combination thereof", etc. and can be used interchangeably to indicate that the quantity is equal to "1" or "greater than 1".

本发明中,采用“或/和”、“和/或”表示“任选其一或者任选其组合”,也表示至少其一。In the present invention, "or/and" and "and/or" are used to represent "optionally one of them or optionally a combination of them", and also represent at least one of them.

术语“约”可以是指在本领域普通技术人员确定的特定值或组成的可接受误差范围内的值或组成,其将部分地取决于如何测量或测定值或组成。例如,如本文所用,表述“约100”包括99和101和之间的全部值(例如,99.1、99.2、99.3、99.4等)。The term "about" can refer to a value or composition that is within an acceptable error range for a particular value or composition as determined by one of ordinary skill in the art, which will depend in part on how the value or composition is measured or determined. For example, as used herein, the expression "about 100" includes all values between 99 and 101 (e.g., 99.1, 99.2, 99.3, 99.4, etc.).

序列同一性(或同源性)通过沿着预定的比较窗(其可以是参考核苷酸序列或蛋白的长度的50%、60%、70%、80%、90%、95%或100%)比较两个对齐的序列,并且确定出现相同的残基的位置的数目来确定。通常地,这表示为百分比。核苷酸序列的序列同一性的测量是本领域技术人员熟知的方法。 Sequence identity (or homology) is determined by comparing two aligned sequences along a predetermined comparison window (which can be 50%, 60%, 70%, 80%, 90%, 95% or 100% of the length of the reference nucleotide sequence or protein) and determining the number of positions at which identical residues occur. Typically, this is expressed as a percentage. The measurement of sequence identity of nucleotide sequences is a method well known to those skilled in the art.

本发明所述的“通常”、“常规”、“一般”、“经常”、“往往”等方式描述的现有技术手段,也都被引用作为本发明内容的参考,如无特别说明,可视为本发明的部分技术特征的优选方式之一,且需要注意的是,不构成对发明的涵盖范围及保护范围的任何意义上的限制。The prior art means described in the present invention in such ways as “usually”, “conventional”, “general”, “frequently” and “often” are also cited as references to the contents of the present invention. Unless otherwise specified, they can be regarded as one of the preferred ways of some technical features of the present invention. It should be noted that they do not constitute any limitation on the scope of coverage and protection scope of the invention.

在本发明提及的所有文献及这些文献直接引用或者间接引用的文献,都在本申请中被引用作为参考,就如同每一篇文献被单独引用作为参考那样。All documents mentioned in the present invention and documents directly or indirectly cited by these documents are cited in this application as references, just as if each document was cited individually as a reference.

应理解,在本发明范围内,本发明的上述各技术特征和在下文(包括但不限于实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案,只要能够用于实施本发明的即可。限于篇幅,不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described below (including but not limited to the embodiments) can be combined with each other to form a new or preferred technical solution, as long as it can be used to implement the present invention. Due to space limitations, they will not be described one by one.

体外蛋白合成反应,是指在体外无细胞合成体系中合成蛋白的反应,至少包括翻译过程。包括但不限于IVT反应(体外翻译反应)、IVTT反应(体外转录翻译反应)、IVDTT反应(体外复制转录翻译反应)。本发明中,优选IVTT反应。IVTT反应,对应IVTT体系,是在体外将DNA转录翻译为蛋白质(Protein)的过程,因此,我们还将这类的体外蛋白合成体系称为D2P体系、D-to-P体系、D_to_P体系、DNA-to-Protein体系;相应的体外蛋白合成方法,还称为D2P方法、D-to-P方法、D_to_P方法、DNA-to-Protein方法。In vitro protein synthesis reaction refers to the reaction of synthesizing protein in an in vitro cell-free synthesis system, including at least the translation process. Including but not limited to IVT reaction (in vitro translation reaction), IVTT reaction (in vitro transcription and translation reaction), IVDTT reaction (in vitro replication transcription and translation reaction). In the present invention, IVTT reaction is preferred. IVTT reaction, corresponding to IVTT system, is the process of transcribing and translating DNA into protein (Protein) in vitro. Therefore, we also refer to this type of in vitro protein synthesis system as D2P system, D-to-P system, D_to_P system, DNA-to-Protein system; the corresponding in vitro protein synthesis method is also called D2P method, D-to-P method, D_to_P method, DNA-to-Protein method.

“无细胞体系”,是指进行体外蛋白合成时,并非通过完整细胞分泌表达的方式。需要说明的是,本发明的体外无细胞蛋白合成体系中,也允许添加细胞组分以促进反应,但所添加的细胞不以分泌表达外源目标蛋白为主要目的。此外,在本发明指导下构建的无完整细胞的D2P体系中,有意地添加少量完整细胞(例如,其提供的蛋白含量与细胞提取物提供的蛋白含量相比,不超过30wt%),这样的“规避”方式,也囊括在本发明的保护范围之内。"Cell-free system" refers to a method of in vitro protein synthesis that is not secretory expression through intact cells. It should be noted that in the in vitro cell-free protein synthesis system of the present invention, it is also allowed to add cell components to promote the reaction, but the added cells are not primarily intended to secrete and express exogenous target proteins. In addition, in the intact cell-free D2P system constructed under the guidance of the present invention, a small amount of intact cells are intentionally added (for example, the protein content provided by them is not more than 30wt% compared to the protein content provided by the cell extract). Such a "circumvention" method is also included in the scope of protection of the present invention.

外源蛋白:本发明的体外蛋白合成体系的目的表达产物,并非由宿主细胞分泌合成,而是基于外源核酸模板进行体外合成,也可称为目的蛋白。所述外源蛋白可以为蛋白、融合蛋白、含蛋白分子或融合蛋白分子的混合物;还广义地包括多肽。基于编码目标蛋白的核酸模板进行体外蛋白合成反应后所得产物,可以为单一物质,也可以为两种或两种以上物质的组合。“外源蛋白”、“目的蛋白”、“目标蛋白”、“目标翻译产物”,具有相同含义,可译为“objective protein”、“interested protein”、“objective translated product”、“interested protein product”等方式,本发明中可互换使用。Exogenous protein: The target expression product of the in vitro protein synthesis system of the present invention is not synthesized by secretion from the host cell, but is synthesized in vitro based on an exogenous nucleic acid template, and may also be referred to as the target protein. The exogenous protein may be a protein, a fusion protein, a mixture of protein molecules or fusion protein molecules; it also broadly includes polypeptides. The product obtained after an in vitro protein synthesis reaction based on a nucleic acid template encoding the target protein may be a single substance or a combination of two or more substances. "Exogenous protein", "target protein", "target protein", "target translation product" have the same meaning and can be translated as "objective protein", "interested protein", "objective translated product", "interested protein product", etc., and may be used interchangeably in the present invention.

D2P,DNA-to-Protein,从DNA模板到蛋白质产物。比如,D2P技术、D2P体系、 D2P方法、D2P试剂盒等等。D2P, DNA-to-Protein, from DNA template to protein product. For example, D2P technology, D2P system, D2P method, D2P kit, and the like.

“本发明的表达系统”、“本发明的体外表达系统”、“体外无细胞表达系统”、“体外无细胞表达体系”可互换使用,均指本发明的体外蛋白表达体系,也可采用其它描述方式,如:蛋白质体外合成系统、体外蛋白合成体系、无细胞系统、无细胞体系、无细胞蛋白合成体系、无细胞体外蛋白合成体系、体外无细胞蛋白合成体系、体外无细胞合成体系、CFS体系(cell-free system)、CFPS体系(cell-free protein synthesis system)等描述方式。根据反应机理,可包括体外翻译体系(可简记为IVT体系,一种mR2P体系)、体外转录翻译体系(可简记为IVTT体系,一种D2P体系)、体外复制转录翻译体系(可简记为IVDTT体系,一种D2P体系)等。本发明中,优选IVTT体系。我们还将体外蛋白合成系统称为“蛋白质合成工厂”(“Protein Factory”或“proteinfactory”或“Proteinfactory”)。本发明提供的体外蛋白合成系统,对其组分是采用开放式的描述方式的。本发明的无细胞蛋白合成体系是以外源DNA、mRNA或者其组合作为蛋白质合成的核酸模板,通过人工控制补加蛋白质合成所需的底物和转录、翻译相关蛋白因子等物质,实现目标蛋白的体外合成。"The expression system of the present invention", "the in vitro expression system of the present invention", "the in vitro cell-free expression system", "the in vitro cell-free expression system" can be used interchangeably, all referring to the in vitro protein expression system of the present invention, and other descriptions can also be used, such as: protein in vitro synthesis system, in vitro protein synthesis system, cell-free system, cell-free system, cell-free protein synthesis system, cell-free in vitro protein synthesis system, in vitro cell-free protein synthesis system, in vitro cell-free synthesis system, CFS system (cell-free system), CFPS system (cell-free protein synthesis system) and other descriptions. According to the reaction mechanism, it can include an in vitro translation system (which can be abbreviated as IVT system, a mR2P system), an in vitro transcription translation system (which can be abbreviated as IVTT system, a D2P system), an in vitro replication transcription translation system (which can be abbreviated as IVDTT system, a D2P system), etc. In the present invention, the IVTT system is preferred. We also call the in vitro protein synthesis system a "protein synthesis factory" ("Protein Factory" or "proteinfactory" or "Proteinfactory"). The in vitro protein synthesis system provided by the present invention adopts an open description method for its components. The cell-free protein synthesis system of the present invention uses exogenous DNA, mRNA or a combination thereof as a nucleic acid template for protein synthesis, and achieves in vitro synthesis of the target protein by artificially controlling the addition of substrates and transcription, translation-related protein factors and other substances required for protein synthesis.

本发明中,“蛋白”与“蛋白质”具有相同含义,均译为protein,可以互换使用。In the present invention, "protein" and "protein" have the same meaning, are both translated as protein, and can be used interchangeably.

本发明中,“系统”和“体系”,均译为system,可以互换使用。In the present invention, "system" and "system" are both translated as system and can be used interchangeably.

本发明中,“蛋白合成量”、“蛋白表达量”与“蛋白表达产量”具有相同含义,可互换使用。In the present invention, "protein synthesis amount", "protein expression amount" and "protein expression yield" have the same meaning and can be used interchangeably.

本发明中,细胞提取物、细胞提取液、细胞裂解物、细胞破碎物、细胞溶解产物的含义相同,可以互换使用,英文可采用cell extract、cell lysate等描述方式。In the present invention, cell extract, cell extract, cell lysate, cell disruption and cell lysis product have the same meaning and can be used interchangeably. In English, cell extract, cell lysate and the like can be used as descriptions.

本发明中,能量体系、能量系统、能量供应体系具有同等含义,可互换使用。能量再生体系、能量再生系统具有同等含义,可互换使用。能量再生系统是能量系统的优选实施方式或者组成部分。In the present invention, energy system, energy system, and energy supply system have the same meaning and can be used interchangeably. Energy regeneration system and energy regeneration system have the same meaning and can be used interchangeably. The energy regeneration system is a preferred embodiment or component of the energy system.

进一步的,本发明提供了一种无细胞蛋白质合成体系,至少包括细胞提取物或细胞裂解液。Furthermore, the present invention provides a cell-free protein synthesis system, which at least includes a cell extract or a cell lysate.

进一步优选的,所述的无细胞蛋白质合成体系还包括选自下组的一种或多种组分:用于合成RNA的底物,用于合成蛋白的底物,聚乙二醇或其类似物,镁离子,钾离子,缓冲剂,RNA聚合酶,能量再生系统,二硫苏糖醇,任选的水性溶剂。Further preferably, the cell-free protein synthesis system further comprises one or more components selected from the following group: a substrate for synthesizing RNA, a substrate for synthesizing protein, polyethylene glycol or its analogues, magnesium ions, potassium ions, a buffer, RNA polymerase, an energy regeneration system, dithiothreitol, and an optional aqueous solvent.

进一步优选的,所述的合成RNA的底物包括:核苷单磷酸、核苷三磷酸之一或其组合。Further preferably, the substrate for synthesizing RNA includes: nucleoside monophosphate, nucleoside triphosphate or a combination thereof.

进一步优选的,所述的合成蛋白质的底物包括:20种天然氨基酸以及非天然氨 基酸。Further preferably, the substrates for synthesizing proteins include: 20 natural amino acids and unnatural amino acids. Base acid.

进一步优选的,所述镁离子来源于镁离子源,所述镁离子源选自下组:醋酸镁、谷氨酸镁之一或其组合。Further preferably, the magnesium ions are derived from a magnesium ion source, and the magnesium ion source is selected from the following group: magnesium acetate, magnesium glutamate, or a combination thereof.

进一步优选的,所述钾离子来源于钾离子源,所述钾离子源选自下组:醋酸钾、谷氨酸钾之一或其组合。Further preferably, the potassium ions are derived from a potassium ion source, and the potassium ion source is selected from the following group: potassium acetate, potassium glutamate, or a combination thereof.

进一步优选的,所述能量再生系统选自下组:磷酸肌酸/磷酸肌酸酶系统、糖酵解途径及其中间产物能量系统之一或其组合。Further preferably, the energy regeneration system is selected from the following group: one or a combination of creatine phosphate/creatine phosphate enzyme system, glycolysis pathway and its intermediate energy system.

进一步优选的,所述能量再生系统包括葡萄糖/磷酸盐体系,所述磷酸盐选自下组:磷酸三钾、磷酸三铵、磷酸三钠、磷酸氢二钾、磷酸氢二铵、磷酸氢二钠、磷酸二氢钾、磷酸二氢铵、磷酸二氢钠之一或其组合。Further preferably, the energy regeneration system comprises a glucose/phosphate system, and the phosphate is selected from the following group: tripotassium phosphate, triammonium phosphate, trisodium phosphate, dipotassium hydrogen phosphate, diammonium hydrogen phosphate, disodium hydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate, sodium dihydrogen phosphate, or a combination thereof.

进一步优选的,所述缓冲剂选自下组:4-羟乙基哌嗪乙磺酸、三羟甲基氨基甲烷之一或其组合。Further preferably, the buffer is selected from the following group: 4-hydroxyethylpiperazineethanesulfonic acid, tris(hydroxymethyl)aminomethane, or a combination thereof.

进一步优选的,所述体外蛋白质合成体系含有聚乙二醇(PEG)或其类似物。聚乙二醇或其类似物的浓度没有特别限制,通常,聚乙二醇或其类似物的浓度(w/v)为0.1-8%,较佳地,0.5-4%,更佳地,1-2%,以所述蛋白合成体系的总重量计。代表性的PEG选自下组:PEG3000、PEG3350、PEG6000、PEG8000之一或其组合。Further preferably, the in vitro protein synthesis system contains polyethylene glycol (PEG) or its analogs. The concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the protein synthesis system. Representative PEG is selected from the following group: one of PEG3000, PEG3350, PEG6000, PEG8000 or a combination thereof.

进一步优选的,所述聚乙二醇包括分子量(Da)为200-10000的聚乙二醇,如PEG200、400、1500、2000、4000、6000、8000、10000等,较佳地,分子量为3000-10000的聚乙二醇。Further preferably, the polyethylene glycol includes polyethylene glycol with a molecular weight (Da) of 200-10000, such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc., preferably, polyethylene glycol with a molecular weight of 3000-10000.

在本发明中,RNA聚合酶没有特别限制,可以选自一种或多种RNA聚合酶,典型的RNA聚合酶为T7 RNA聚合酶。In the present invention, the RNA polymerase is not particularly limited and can be selected from one or more RNA polymerases. A typical RNA polymerase is T7 RNA polymerase.

可选的方案为,本发明提供的体外蛋白合成体系包括:细胞提取物,4-羟乙基哌嗪乙磺酸,醋酸钾,醋酸镁,腺嘌呤核苷三磷酸(ATP),鸟嘌呤核苷三磷酸(GTP),胞嘧啶核苷三磷酸(CTP),胸腺嘧啶核苷三磷酸(TTP),氨基酸混合物,磷酸肌酸,二硫苏糖醇(DTT),磷酸肌酸激酶,RNA聚合酶。An optional scheme is that the in vitro protein synthesis system provided by the present invention includes: cell extract, 4-hydroxyethylpiperazineethanesulfonic acid, potassium acetate, magnesium acetate, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytosine triphosphate (CTP), thymidine triphosphate (TTP), amino acid mixture, creatine phosphate, dithiothreitol (DTT), creatine phosphate kinase, and RNA polymerase.

在本发明中,所述的细胞提取物不含完整的细胞,典型的细胞提取物包括用于蛋白翻译的核糖体、氨酰tRNA合成酶、蛋白质合成需要的起始因子和延伸因子以及终止释放因子。此外,细胞提取物中还含有一些源自细胞的细胞质中的其他蛋白,尤其是可溶性蛋白。In the present invention, the cell extract does not contain intact cells, and a typical cell extract includes ribosomes, aminoacyl-tRNA synthetases, initiation factors and elongation factors required for protein synthesis, and termination release factors for protein translation. In addition, the cell extract also contains some other proteins derived from the cytoplasm of the cell, especially soluble proteins.

在本发明中,所述细胞提取物在体外无细胞蛋白合成体系中的比例不受特别限制,通常所述细胞提取物在体外无细胞蛋白合成体系中所占体系为20-70%,较佳地, 30-60%,更佳地,40-50%。In the present invention, the proportion of the cell extract in the in vitro cell-free protein synthesis system is not particularly limited. Usually, the cell extract accounts for 20-70% of the in vitro cell-free protein synthesis system. Preferably, 30-60%, more preferably 40-50%.

在本发明中,所述的细胞提取物所含蛋白含量为20-100mg/mL,较佳为50-100mg/mL。所述的测定蛋白含量方法为考马斯亮蓝测定方法。In the present invention, the protein content of the cell extract is 20-100 mg/mL, preferably 50-100 mg/mL. The method for determining the protein content is the Coomassie Brilliant Blue determination method.

本发明还提供了一种载体或载体组合,所述载体含有本发明的核酸构建物。优选地,所述载体选自:细菌质粒、噬菌体、酵母质粒、动物细胞载体、穿梭载体;所述的载体为转座子载体。用于制备重组载体的方法是本领域普通技术人员所熟知的。只要其能够在宿主体内复制和稳定,任何质粒和载体都是可以被采用的。The present invention also provides a vector or vector combination, wherein the vector contains the nucleic acid construct of the present invention. Preferably, the vector is selected from: bacterial plasmid, bacteriophage, yeast plasmid, animal cell vector, shuttle vector; the vector is a transposon vector. Methods for preparing recombinant vectors are well known to those of ordinary skill in the art. As long as they can replicate and be stable in the host, any plasmid and vector can be used.

本领域普通技术人员可以使用熟知的方法构建含有本发明所述的启动子和/或目的基因序列的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。Those skilled in the art can use well-known methods to construct expression vectors containing the promoter and/or target gene sequence of the present invention, including in vitro recombinant DNA technology, DNA synthesis technology, in vivo recombination technology, etc.

模板DNA(template DNA)Template DNA

模板DNA为编码任意待合成目标蛋白的核苷酸序列,其可以为原始序列,也可以为人工合成序列或人工改造序列,利用该模板DNA能够合成相应的RNA和/或蛋白质。The template DNA is a nucleotide sequence encoding any target protein to be synthesized, which may be an original sequence, an artificially synthesized sequence, or an artificially modified sequence. The template DNA may be used to synthesize the corresponding RNA and/or protein.

在本发明中,所述的细胞提取物的制备方法不受限制,一种优选的制备方法In the present invention, the preparation method of the cell extract is not limited. A preferred preparation method is

包括以下步骤:The following steps are involved:

(i)提供细胞;(i) providing cells;

(ii)对细胞进行洗涤处理,获得经洗涤的细胞;(ii) washing the cells to obtain washed cells;

(iii)对经洗涤的细胞进行破细胞处理,从而获得细胞粗提物;(iii) disrupting the washed cells to obtain a crude cell extract;

(iv)对所述细胞粗提物进行固液分离,获得液体部分,即为细胞提取物。(iv) subjecting the crude cell extract to solid-liquid separation to obtain a liquid portion, namely the cell extract.

在本发明中,所述的固液分离方式不受特别限制,一种优选的方式为离心。In the present invention, the solid-liquid separation method is not particularly limited, and a preferred method is centrifugation.

在一优选实施方式中,所述离心在液态下进行。In a preferred embodiment, the centrifugation is performed in a liquid state.

在本发明中,所述离心条件不受特别限制,一种优选的离心条件为5000-100000g,较佳地,8000-30000g。In the present invention, the centrifugation conditions are not particularly limited, and a preferred centrifugation condition is 5000-100000 g, preferably 8000-30000 g.

在本发明中,所述离心时间不受特别限制,一种优选的离心时间为0.5min-2h,较佳地,20min-50min。In the present invention, the centrifugation time is not particularly limited, and a preferred centrifugation time is 0.5 min-2 h, preferably 20 min-50 min.

在本发明中,所述离心的温度不受特别限制,优选的,所述离心在1-10℃下进行,较佳地,在2-6℃下进行。In the present invention, the centrifugation temperature is not particularly limited. Preferably, the centrifugation is performed at 1-10°C, more preferably, at 2-6°C.

在本发明中,所述的洗涤处理方式不受特别限制,一种优选的洗涤处理方式为采用洗涤液在pH为7-8(较佳地,7.4)下进行处理,所述洗涤液没有特别限制,典型的所述洗涤液选自下组:4-羟乙基哌嗪乙磺酸钾、醋酸钾、醋酸镁之一或其组合。 In the present invention, the washing treatment method is not particularly limited. A preferred washing treatment method is to use a washing liquid at a pH of 7-8 (preferably 7.4). The washing liquid is not particularly limited. Typically, the washing liquid is selected from the following group: potassium 4-hydroxyethylpiperazineethanesulfonate, potassium acetate, magnesium acetate, or a combination thereof.

在本发明中,所述破细胞处理的方式不受特别限制,一种优选的所述的破细胞处理包括高压破碎、冻融(如液氮低温)破碎。In the present invention, the cell disruption treatment method is not particularly limited. A preferred cell disruption treatment includes high-pressure disruption and freeze-thaw (such as liquid nitrogen cryogenic) disruption.

所述体外无细胞蛋白合成体系中的核苷三磷酸混合物为腺嘌呤核苷三磷酸、鸟嘌呤核苷三磷酸、胞嘧啶核苷三磷酸和尿嘧啶核苷三磷酸。在本发明中,各种单核苷酸的浓度没有特别限制,通常每种单核苷酸的浓度为0.5-5mM,较佳地为1.0-2.0mM。The nucleoside triphosphate mixture in the in vitro cell-free protein synthesis system is adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate. In the present invention, the concentration of various mononucleotides is not particularly limited, and usually the concentration of each mononucleotide is 0.5-5mM, preferably 1.0-2.0mM.

所述体外无细胞蛋白合成体系中的氨基酸混合物可包括天然或非天然氨基酸,可包括D型或L型氨基酸。代表性的氨基酸包括(但并不限于)20种天然氨基酸:甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、蛋氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸和组氨酸。每种氨基酸的浓度通常为0.01-0.5mM,较佳地0.02-0.2mM,如0.05、0.06、0.07、0.08mM。The amino acid mixture in the in vitro cell-free protein synthesis system may include natural or non-natural amino acids, and may include D-type or L-type amino acids. Representative amino acids include (but are not limited to) 20 natural amino acids: glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine. The concentration of each amino acid is generally 0.01-0.5 mM, preferably 0.02-0.2 mM, such as 0.05, 0.06, 0.07, 0.08 mM.

在优选例中,所述体外无细胞蛋白合成体系还含有聚乙二醇或其类似物。聚乙二醇或其类似物的浓度没有特别限制,通常,聚乙二醇或其类似物的浓度(w/v)为0.1-8%,较佳地,0.5-4%,更佳地,1-2%,以所述生物合成体系的总重量计。代表性的PEG例子包括(但并不限于):PEG3000,PEG8000,PEG6000和PEG3350。应理解,本发明的体系还可包括其他各种分子量的聚乙二醇(如PEG200、400、1500、2000、4000、6000、8000、10000等)。In a preferred embodiment, the in vitro cell-free protein synthesis system further contains polyethylene glycol or its analogs. The concentration of polyethylene glycol or its analogs is not particularly limited. Generally, the concentration (w/v) of polyethylene glycol or its analogs is 0.1-8%, preferably 0.5-4%, more preferably 1-2%, based on the total weight of the biosynthesis system. Representative examples of PEG include (but are not limited to): PEG3000, PEG8000, PEG6000 and PEG3350. It should be understood that the system of the present invention may also include polyethylene glycols of various other molecular weights (such as PEG200, 400, 1500, 2000, 4000, 6000, 8000, 10000, etc.).

在优选例中,所述体外无细胞蛋白合成体系还含有蔗糖。蔗糖的浓度没有特别限制,通常,蔗糖的浓度为0.03-40wt%,较佳地,0.08-10wt%,更佳地,0.1-5wt%,以所述蛋白合成体系的总重量计。In a preferred embodiment, the in vitro cell-free protein synthesis system further contains sucrose. The concentration of sucrose is not particularly limited, and generally, the concentration of sucrose is 0.03-40wt%, preferably 0.08-10wt%, and more preferably 0.1-5wt%, based on the total weight of the protein synthesis system.

一种特别优选的体外无细胞蛋白合成体系,除了酵母细胞提取物之外,还含有以下组分:22mM pH为7.4的4-羟乙基哌嗪乙磺酸,30-150mM醋酸钾,1.0-5.0mM醋酸镁,1.5-4mM核苷三磷酸混合物,0.08-0.24mM的氨基酸混合物,25mM磷酸肌酸,1.7mM二硫苏糖醇,0.27mg/mL磷酸肌酸激酶,1%-4%聚乙二醇,0.5%-2%蔗糖,0.027-0.054mg/mL T7RNA聚合酶。A particularly preferred in vitro cell-free protein synthesis system, in addition to yeast cell extract, also contains the following components: 22mM 4-hydroxyethylpiperazineethanesulfonic acid with a pH of 7.4, 30-150mM potassium acetate, 1.0-5.0mM magnesium acetate, 1.5-4mM nucleoside triphosphate mixture, 0.08-0.24mM amino acid mixture, 25mM creatine phosphate, 1.7mM dithiothreitol, 0.27mg/mL creatine phosphate kinase, 1%-4% polyethylene glycol, 0.5%-2% sucrose, and 0.027-0.054mg/mL T7 RNA polymerase.

下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明。The present invention will be further described below in conjunction with specific examples. It should be understood that these examples are only used to illustrate the present invention.

而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件,例如Sam brook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。除非另外说明,否则百分比和份数是重量百分比和重量份数。本发明以乳酸克鲁维酵母 (Kluyveromyces lactis,简称K.lactis或kl)为实施例,但同样的设计和分析、实验方法也适用于其他酵母、动物细胞等真核细胞,以及原核细胞。The present invention is not intended to limit the scope of the present invention. The experimental methods in the following examples where specific conditions are not specified are generally carried out under conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or under conditions recommended by the manufacturer. Unless otherwise specified, percentages and parts are by weight. The present invention is only an example of Kluyveromyces lactis, K. lactis or kl for short, but the same design, analysis and experimental methods are also applicable to other eukaryotic cells such as yeast and animal cells, as well as prokaryotic cells.

本发明以乳酸克鲁维酵母(Kluyveromyces lactis,K.lactis)为实施例,但同样的设计和分析、实验方法也适用于其他酵母等低等真核细胞以及高等动物细胞。本发明中的基因改造方法为CRISPR-Cas9技术,但并不限于此,可以为已知的、现有的任意基因改造方法。The present invention uses Kluyveromyces lactis (K. lactis) as an example, but the same design, analysis and experimental methods are also applicable to other yeasts and other lower eukaryotic cells and higher animal cells. The gene modification method in the present invention is CRISPR-Cas9 technology, but it is not limited to this, and can be any known and existing gene modification method.

体外蛋白合成反应混合体系,也描述为体外蛋白合成反应混合物、反应混合体系、反应混合物,指包括体外蛋白合成体系、编码目标蛋白的核酸模板的混合体系;可以是均质的,也可以是非均质的,允许是溶液、乳浊液、悬浊液等液态体系。The in vitro protein synthesis reaction mixture system, also described as an in vitro protein synthesis reaction mixture, a reaction mixture system, or a reaction mixture, refers to a mixed system including an in vitro protein synthesis system and a nucleic acid template encoding a target protein; it may be homogeneous or heterogeneous, and may be a liquid system such as a solution, an emulsion, or a suspension.

本发明所述的Protein Factory中各组分的终浓度分别为:80%(v/v)乳酸克鲁维酵母提取物、15mM葡萄糖、320mM麦芽糊精(以葡糖糖单体计量摩尔浓度)、24mM磷酸三钾、1.8mM核苷三磷酸混合物(腺嘌呤核苷三磷酸、鸟嘌呤核苷三磷酸、胞嘧啶核苷三磷酸和尿嘧啶核苷三磷酸的混合物,每种核苷三磷酸的终浓度均为1.8mM)、0.7mM氨基酸混合物(甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、蛋氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸和组氨酸,每种氨基酸的终浓度均为0.7mM)、L-天门冬氨酸镁、80mM醋酸钾、2%(w/v)聚乙二醇8000、9.78mMpH8.0 Tris·HCl缓冲液、6%(w/v)海藻糖。其中,所述乳酸克鲁维酵母提取物中包括内源性表达的T7RNA聚合酶。所述乳酸克鲁维酵母提取物经过以下方式改造:采用基于乳酸克鲁维酵母菌株ATCC8585的改造菌株;采用CN109423496A所记载的方法,将T7 RNA聚合酶的编码基因整合到乳酸克鲁维酵母的基因组中,获得改造菌株,使其可以内源性表达T7 RNA聚合酶;以改造菌株培养出细胞原料,然后制备细胞提取物。乳酸克鲁维酵母细胞提取物的制备过程采用常规技术手段,参考CN109593656A记载的方法制备。制备步骤概括而言,包括:提供经发酵培养的乳酸克鲁维酵母细胞的适量原料,用液氮将细胞速冻,将细胞打碎,离心收集上清液,即可得到细胞提取物。所得乳酸克鲁维酵母细胞提取物中的蛋白浓度为20~40mg/mL。在以下实施例中,选择以(即prock,N-Ε-炔丙氧基羰基-L-赖氨酸盐酸盐)作为非天然氨基酸(简称ncaa)的代表,但是并不限定本发明的ncaa仅仅指的就是prock。Protein of the present invention The final concentrations of the components in Factory are: 80% (v/v) Kluyveromyces lactis extract, 15 mM glucose, 320 mM maltodextrin (measured in molar concentration of glucose monomers), 24 mM tripotassium phosphate, 1.8 mM nucleoside triphosphate mixture (a mixture of adenosine triphosphate, guanosine triphosphate, cytosine triphosphate and uridine triphosphate, with the final concentration of each nucleoside triphosphate being 1.8 mM), 0.7 mM amino acid mixture (glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, tryptophan, serine, tyrosine, cysteine, methionine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine, with the final concentration of each amino acid being 0.7 mM), magnesium L-aspartate, 80 mM potassium acetate, 2% (w/v) polyethylene glycol 8000, 9.78 mM pH 8.0 Tris·HCl buffer, 6% (w/v) trehalose. Wherein, the lactic acid Kluyveromyces extract includes endogenously expressed T7 RNA polymerase. The lactic acid Kluyveromyces extract is modified in the following manner: using a modified strain based on the lactic acid Kluyveromyces strain ATCC8585; using the method described in CN109423496A, the coding gene of T7 RNA polymerase is integrated into the genome of lactic acid Kluyveromyces to obtain a modified strain so that it can endogenously express T7 RNA polymerase; using the modified strain to culture cell raw materials, and then preparing cell extracts. The preparation process of lactic acid Kluyveromyces cell extract adopts conventional technical means, and is prepared with reference to the method described in CN109593656A. In summary, the preparation steps include: providing an appropriate amount of raw materials of lactic acid Kluyveromyces cells that have been fermented and cultured, quick-freezing the cells with liquid nitrogen, breaking the cells, and collecting the supernatant by centrifugation to obtain a cell extract. The protein concentration in the obtained lactic acid Kluyveromyces cell extract is 20-40 mg/mL. In the following examples, the (i.e., prock, N-E-propargyloxycarbonyl-L-lysine hydrochloride) is a representative of non-natural amino acids (abbreviated as NCAA), but the NCAA of the present invention is not limited to only prock.

优选地,本发明提供一种tDNAPyl突变体筛选的方法,具体如下: Preferably, the present invention provides a method for screening tDNA Pyl mutants, which is as follows:

(1)以tDNAUCU Arg-tDNATAG Pyl(3002)结构为基础(见图1),设计5种含有不同tDNATAG Pyl序列(1种野生型,4种突变体)的插入片段分别插入到乳酸克鲁维酵母的基因组中的如下两个位点(共10种结构),需要说明的是,此实施例中仅以tDNAUCU Arg作为一个启动子的代表,但是并不是对本发明所述启动子的限定。(1) Based on the tDNA UCU Arg -tDNA TAG Pyl (3002) structure (see Figure 1), five insertion fragments containing different tDNA TAG Pyl sequences (1 wild type and 4 mutants) were designed and inserted into the following two sites in the genome of Kluyveromyces lactis (a total of 10 structures). It should be noted that in this example, tDNA UCU Arg is only used as a representative of a promoter, but it is not a limitation of the promoter described in the present invention.

(2)针对UPF1基因设计基于CRISPR/Cas9的sgRNA作用位点,所述sgRNA作用位点的核苷酸序列为:(2) Designing a CRISPR/Cas9-based sgRNA action site for the UPF1 gene, the nucleotide sequence of the sgRNA action site is:

UPF1:5’-CATTGCATAACTTGGTCAGC-3’(SEQ ID No:1);UPF1:5’-CATTGCATAACTTGGTCAGC-3’(SEQ ID No: 1);

插入位置选择在UPF1基因中部在插入位置附近设计引物1和引物2用于PCR扩增出左同源臂;引物3和引物4用于PCR扩增出右同源臂。所述引物序列如下:The insertion position was selected in the middle of the UPF1 gene. Primers 1 and 2 were designed near the insertion position for PCR amplification of the left homology arm; primers 3 and 4 were used for PCR amplification of the right homology arm. The primer sequences are as follows:

引物1:Primer 1:

5’-CTGTTTCTTTGAGAGACGATGAC-3’(SEQ ID No:2)5’-CTGTTTCTTTGAGAGACGATGAC-3’(SEQ ID No: 2)

引物2:Primer 2:

5’-CAACAGTTCGGCTTCTAGTGATGAGGAACGCCACATCTTCTCTA5’-CAACAGTTCGGCTTCTAGTGATGAGGAACGCCACATCTTCTCTA

CTTTTAGCAG-3’(SEQ ID No:3)CTTTTAGCAG-3’(SEQ ID No: 3)

引物3:Primer 3:

5’-CCTTGCTTGAGAAGGTTTTGGGACGCTCGAAGAGAGCTCAGTGTCC5’-CCTTGCTTGAGAAGGTTTTGGGACGCTCGAAGAGAGCTCAGTGTCC

GAGTTA-3’(SEQ ID No:4)GAGTTA-3’ (SEQ ID No: 4)

引物4:5’-CTTTGAGCCCACTCCATTG-3’(SEQ ID No:5)Primer 4: 5’-CTTTGAGCCCACTCCATTG-3’ (SEQ ID No: 5)

(3)以3002结构阳性菌株为模版,引物5和引物6为引物,PCR扩增带有部分同源臂序列的插入片段;(3) Using the 3002 structure-positive strain as a template and primers 5 and 6 as primers, PCR amplified the insert fragment with partial homology arm sequences;

所述引物序列如下:The primer sequences are as follows:

引物5:Primer 5:

5’-AGTACGATTAACTGCTAAAAGTAGAGAAGATGTGGCGTTCCTC5’-AGTACGATTAACTGCTAAAAGTAGAGAAGATGTGGCGTTCCTC

ATCACTAGAAGC-3’(SEQ ID No:6)ATCACTAGAAGC-3’(SEQ ID No: 6)

引物6:Primer 6:

5’-TGACCAAGTTATGCAATGCTAACTCGGACACTGAGCTCTCTTCGA5’-TGACCAAGTTATGCAATGCTAACTCGGACACTGAGCTTCTTCGA

GCGTCCCAAAAC-3’(SEQ ID No:7)GCGTCCCAAAAC-3’(SEQ ID No: 7)

(4)针对内源tRNA基因设计基于CRISPR/Cas9的sgRNA作用位点,所述sgRNA作用位点的核苷酸序列为:(4) Designing a CRISPR/Cas9-based sgRNA action site for the endogenous tRNA gene, the nucleotide sequence of the sgRNA action site is:

内源tRNA基因:5’-AGATGAGAAGAGACTGCGAG-3’(SEQ ID No:8);Endogenous tRNA gene: 5’-AGATGAGAAGAGACTGCGAG-3’ (SEQ ID No: 8);

在插入位置左侧1000bp左右处设计引物7和引物8用于PCR扩增出左同源臂;在插入位置右侧1000bp左右处设计引物9和引物10用于PCR扩增出右同源臂。所述引物序 列如下:Primers 7 and 8 were designed about 1000 bp to the left of the insertion position for PCR amplification of the left homology arm; primers 9 and 10 were designed about 1000 bp to the right of the insertion position for PCR amplification of the right homology arm. The columns are as follows:

引物7:Primer 7:

5’-GTCAGTTTCATTTAGCCTCA-3’(SEQ ID No:9)5’-GTCAGTTTCATTAGCCTCA-3’(SEQ ID No: 9)

引物8:Primer 8:

5’-CTCCGTCAAAGTTCCTGATTCAGTCTCTTCTCATCTCCACTAG-3’(SEQ ID No:10)5’-CTCCGTCAAAGTTCCTGATTCAGTCTCTTCTCATCTCCACTAG-3’(SEQ ID No: 10)

引物9:Primer 9:

5’-GTAGCAGGTACAGGAGAAAGAGTAAATAATGCGAGCGGAATTTGAGC-3’(SEQ ID No:11)5’-GTAGCAGGTACAGGAGAAAGAGTAAATAATGCGAGCGGAATTTGAGC-3’(SEQ ID No: 11)

引物10:Primer 10:

5’-GCCTGTCAAGTTATAACCAG-3’(SEQ ID No:12)5’-GCCTGTCAAGTTATAACCAG-3’(SEQ ID No: 12)

(5)以3002结构阳性菌株为模版,引物11和引物12为引物,PCR扩增目的片段;所述引物序列如下:(5) Using the 3002 structure-positive strain as a template and primers 11 and 12 as primers, PCR amplified the target fragment; the primer sequences are as follows:

引物11:Primer 11:

5’-GAGATTACTAGTGGAGATGAGAAGAGACTGAATCAGGAACTTTGACGGAG-3’(SEQ ID No:13)5’-GAGATTACTAGTGGAGATGAGAAGAGACTGAATCAGGAACTTTGACGGAG-3’(SEQ ID No: 13)

引物12:Primer 12:

5’-CCAGCTCAAATTCCGCTCGCATTATTTACTCTTTCTCCTGTACCTGC-3’(SEQ ID No:14)5’-CCAGCTCAAATTCCGCTCGCATTATTTACTCTTTCTCCTGTACCTGC-3’(SEQ ID No: 14)

(5)将步骤(2)获得的左右同源臂PCR产物和步骤(3)获得的插入片段PCR产物进行overlap PCR连接,5种不同的tDNATAG Pyl序列共得到基因编辑载体5种。(5) The left and right homology arm PCR products obtained in step (2) and the insert fragment PCR product obtained in step (3) were connected by overlap PCR, and a total of five gene editing vectors with five different tDNA TAG Pyl sequences were obtained.

(6)将步骤(4)获得的左右同源臂PCR产物和步骤(5)获得的插入片段PCR产物进行overlap PCR连接,5种不同的tDNATAG Pyl序列共得到基因编辑载体5种。(6) The left and right homology arm PCR products obtained in step (4) and the insert fragment PCR product obtained in step (5) were connected by overlap PCR, and a total of five gene editing vectors with five different tDNA TAG Pyl sequences were obtained.

(7)将步骤(6)和步骤(7)中得到的共计10种基因编辑载体分别转化到乳酸克鲁维酵母菌株中。(7) A total of 10 gene editing vectors obtained in step (6) and step (7) are transformed into lactic acid Kluyveromyces strains respectively.

(8)IVTT技术大量翻译GFP-TAG-RFP结构,通过RFP的表达量来反应终止密码子TAG的读通率,通过不加入非天然氨基酸的对照组来反应琥珀密码子tRNA转运氨基酸的忠实性。(8) The IVTT technology was used to translate the GFP-TAG-RFP structure in large quantities. The expression level of RFP was used to reflect the read-through rate of the termination codon TAG, and the control group without the addition of non-natural amino acids was used to reflect the fidelity of the amber codon tRNA in transporting amino acids.

实施例1:基因编辑载体的构建(改造4)Example 1: Construction of gene editing vector (transformation 4)

委托擎科生物有限公司合成含有tRNA序列质粒的大肠杆菌(订单号:SH0052101),序列号为:SEQ ID No:15。 We commissioned Qingke Biotechnology Co., Ltd. to synthesize Escherichia coli containing a tRNA sequence plasmid (order number: SH0052101), the sequence number is: SEQ ID No: 15.

(2)设计引物扩增出内源tRNA基因插入目标基因后的左右同源臂,引物设计时保证不同片段之间携带40bp左右重复序列。最后用连接引物进行overlap pcr连接,所述连接引物为:(2) Design primers to amplify the left and right homologous arms of the endogenous tRNA gene after it is inserted into the target gene. When designing primers, ensure that there are about 40bp of repeated sequences between different fragments. Finally, use the connection primers for overlap PCR connection. The connection primers are:

引物13:CATTACATCCAGACTCAGAGCTAGGAGATACTACTTTGGAATGC(SEQ ID No:16)Primer 13: CATTACATCCAGACTCAGAGCTAGGAGATACTACTTTGGAATGC (SEQ ID No: 16)

引物14:AGTCTCTCTGACCAGCATATTGTGTATAACTACTGTTGTTGG(SEQ ID No:17)Primer 14: AGTCTCTCTGACCAGCATATTGTGTATAACTACTGTTGTTGG (SEQ ID No: 17)

(3)将左右同源臂和插入片段的PCR产物作为模版进行overlap PCR扩增,PCR扩增反应体系为:pcr产物混合物3μL;引物13 0.5μL;引物14 0.5μL;2xMix 10μL;ddH2O 6μL。PCR扩增反应程序为:95℃3min;95℃30s,60℃1min,72℃3min,35个循环;72℃5min;20℃1min。(3) The PCR products of the left and right homology arms and the insert fragment were used as templates for overlap PCR amplification. The PCR amplification reaction system was: 3 μL of PCR product mixture; 0.5 μL of primer 13; 0.5 μL of primer 14; 10 μL of 2xMix; 6 μL of ddH2O. The PCR amplification reaction program was: 95°C for 3 min; 95°C for 30 s, 60°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.

Overlap PCR结束后电泳观察连接片段大小,筛选大小正确的PCR产物并委托擎科生物有限公司进行测序处理(测序结果如图2),测序对比后序列正确即可作为基因编辑载体使用。After the Overlap PCR is completed, observe the size of the connected fragments by electrophoresis, select the PCR products with the correct size and entrust Qingke Biotechnology Co., Ltd. to perform sequencing (the sequencing results are shown in Figure 2). After sequencing comparison, if the sequence is correct, it can be used as a gene editing vector.

实施例2:乳酸克鲁维酵母感受态的制备Example 2: Preparation of Kluyveromyces lactis competent cells

(1)将保存在-80℃冰箱中的乳酸克鲁维酵母菌株甘油菌取出置于冰中,融解后在YPD平板上划线,然后放置在30℃环境中培养至长出明显菌落。(1) Take out the glycerol culture of Kluyveromyces lactis strain stored in a -80°C refrigerator and place it on ice. After thawing, streak it on a YPD plate and then culture it at 30°C until obvious colonies grow.

(2)挑取平板上的单个菌落接种到100ml液体YPD中,容器为250ml锥形瓶。30℃,200rpm条件下培养18h。(2) Pick a single colony from the plate and inoculate it into 100 ml of liquid YPD in a 250 ml conical flask. Cultivate at 30°C and 200 rpm for 18 h.

(3)转移培养液到50ml离心管中,每管最多40ml。4℃,5000rpm离心5min,倒掉上清液后加入等同上清液体积的1M山梨醇,轻柔吸打混匀后4℃,5000rpm离心5min,倒掉上清液,重复2-3次后加入3-5ml 1M山梨醇轻柔吸打混匀,100ul/管分装到小离心管中,迅速转移至-80℃冰箱中保存。(3) Transfer the culture medium to a 50 ml centrifuge tube, with a maximum of 40 ml per tube. Centrifuge at 4°C, 5000 rpm for 5 min, discard the supernatant, add 1 M sorbitol in an amount equal to the supernatant volume, gently pipette to mix, centrifuge at 4°C, 5000 rpm for 5 min, discard the supernatant, repeat 2-3 times, add 3-5 ml 1 M sorbitol, gently pipette to mix, dispense 100 ul/tube into small centrifuge tubes, and quickly transfer to a -80°C refrigerator for storage.

实施例3:基因编辑载体转入乳酸克鲁维酵母菌株感受态中Example 3: Gene editing vectors transferred into competent Kluyveromyces lactis strains

(1)提前制备好sgRNA、基因编辑载体和感受态细胞等实验材料,在冰上溶解后备用。(1) Prepare experimental materials such as sgRNA, gene editing vectors, and competent cells in advance, and dissolve them on ice for later use.

(2)在感受态细胞中分别加入2ng左右的基因编辑载体和0.2ng左右的sgRNA,轻柔吸打混匀后转移到电转杯中。(2) Add about 2 ng of gene editing vector and about 0.2 ng of sgRNA to the competent cells, mix gently by pipetting, and transfer to the electroporation cup.

(3)电转仪参数调节到电压2KV,放入步骤(2)中的电转杯后进行电击处理,然后在电转杯中加入700ul左右的YPD培养基,轻柔吸打混匀后转移到离心管中,在 200rpm 30℃环境中培养30min左右。(3) Adjust the parameters of the electroporator to 2KV, place the tube in the electroporation cup in step (2) and perform an electric shock treatment. Then add about 700ul of YPD medium to the electroporation cup, gently pipette and mix, and transfer to a centrifuge tube. Incubate at 200 rpm and 30°C for about 30 minutes.

(4)摇培结束后将离心管中液体均匀涂抹在YPD琼脂糖平板上,30℃环境中过夜培养。(4) After the shaking culture is completed, evenly spread the liquid in the centrifuge tube on the YPD agarose plate and culture it overnight at 30°C.

实施例4:发生基因编辑的乳酸克鲁维酵母菌株的鉴定Example 4: Identification of gene-edited Kluyveromyces lactis strains

基因编辑载体中所含的tRNA基因可以转录出用于识别终止密码子TAG的特殊tRNA,该基因在野生型乳酸克鲁维酵母菌株中不存在。因此在该基因序列上设计了两个特异性引物,再在附近的乳酸克鲁维酵母基因组上设计两个引物,四个引物组成两对。所述引物序列如下:The tRNA gene contained in the gene editing vector can transcribe a special tRNA for recognizing the termination codon TAG, which does not exist in the wild-type lactic acid Kluyveromyces strain. Therefore, two specific primers were designed on the gene sequence, and two primers were designed on the nearby lactic acid Kluyveromyces genome. The four primers formed two pairs. The primer sequences are as follows:

Primer15:GACAAATAGTAGCTCGCGTG(SEQ ID No:18)Primer15:GACAAATAGTAGCTCGCGTG(SEQ ID No: 18)

Primer16:TTCTGATTAGAAGTCAGACGCG(SEQ ID No:19)Primer16: TTCTGATTAGAAGTCAGACGCG (SEQ ID No: 19)

Primer17:TCCAATGATCTCCCACAGG(SEQ ID No:20)Primer17:TCCAATGATCTCCCACAGG(SEQ ID No: 20)

Primer18:TGATGATCAAGAACCACTCG(SEQ ID No:21)Primer18: TGATGATCAAGAACCACTCG (SEQ ID No: 21)

反应体系为:单克隆菌株核酸1μL;Primer 15 0.5μL;Primer 17 0.5μL;2xMix10μL;ddH2O 8μLl。以及单克隆菌株核酸1μL;Primer 16 0.5μL;Primer 18 0.5μL;2xMix 10μL;ddH2O 8μL。The reaction system was: 1 μL of nucleic acid of monoclonal strain; 0.5 μL of Primer 15; 0.5 μL of Primer 17; 10 μL of 2xMix; 8 μL of ddH2O. And 1 μL of nucleic acid of monoclonal strain; 0.5 μL of Primer 16; 0.5 μL of Primer 18; 10 μL of 2xMix; 8 μL of ddH2O.

应程序为:95℃3min;95℃30s,57℃1min,72℃3min,35个循环;72℃5min;20℃1min。The response program was: 95°C for 3 min; 95°C for 30 s, 57°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.

挑取转化平板上的单克隆菌株,提取核酸后使用上述两对引物进行PCR扩增。PCR产物在1.5%琼脂糖凝胶上电泳,阳性菌株使用两对引物均可得到条带,阴性菌株则没有条带。Pick the monoclonal strain on the transformation plate, extract the nucleic acid and use the above two pairs of primers to perform PCR amplification. The PCR product is electrophoresed on 1.5% agarose gel. The positive strain can obtain bands using both pairs of primers, while the negative strain has no bands.

将鉴定后初步认定为阳性的单克隆菌株保存起来,其核酸再使用两个基因组上设计的引物进行PCR扩增,得到的PCR产物委托擎科生物有限公司进行测序处理。所述引物如下:The monoclonal strains initially identified as positive after identification were stored, and their nucleic acids were amplified by PCR using primers designed on two genomes. The resulting PCR products were entrusted to Qingke Biotech Co., Ltd. for sequencing. The primers are as follows:

Primer19:GTCCGCTTCTTACAATGGATACAGTTTCTCGC(SEQ ID No:22)Primer19:GTCCGCTTCTTACAATGGATACAGTTTCTCGC (SEQ ID No: 22)

Primer20:CTTTGTTGAACATCATCCCGGTGAGAATGCC(SEQ ID No:23)Primer20:CTTTGTTGAACATCATCCCGGTGAGAATGCC(SEQ ID No: 23)

测序引物:TGACTCTGACTCATTTCTTGCG(SEQ ID No:24)Sequencing primer: TGACTCTGACTCATTTCTTGCG (SEQ ID No: 24)

反应体系为:单克隆菌株核酸1μL;引物5 0.5μL;引物6 0.5μL;2xMix 10μL;ddH2O 8μLl。The reaction system was: monoclonal strain nucleic acid 1 μL; primer 5 0.5 μL; primer 6 0.5 μL; 2xMix 10 μL; ddH2O 8 μLl.

应程序为:95℃3min;95℃30s,65℃1min,72℃3min,35个循环;72℃5min;20℃1min。The response program was: 95°C for 3 min; 95°C for 30 s, 65°C for 1 min, 72°C for 3 min, 35 cycles; 72°C for 5 min; 20°C for 1 min.

将测序结果即插入序列(SEQ ID No:25)和理论序列进行对比(参见图3), 对比后两者插入序列一致,则认定所测试的菌株是成功进行了基因编辑的阳性菌株。The sequencing result, i.e., the inserted sequence (SEQ ID No: 25), was compared with the theoretical sequence (see FIG3 ). After comparison, if the two insertion sequences are consistent, the tested strain is determined to be a positive strain that has successfully undergone gene editing.

实施例5:按照类似的操作方法,分别得到改造1~改造3、改造5~改造10菌株(改造1~改造3、改造5的插入序列分别为SEQ ID No:26~29,改造6~改造10对应的插入序列与改造1~改造5相同,区别仅在于改造1~改造5与改造6~改造10的插入位点不同)Example 5: According to a similar operation method, the strains of transformation 1 to transformation 3 and transformation 5 to transformation 10 were obtained respectively (the insertion sequences of transformation 1 to transformation 3 and transformation 5 are SEQ ID No: 26 to 29 respectively, and the insertion sequences corresponding to transformation 6 to transformation 10 are the same as those of transformation 1 to transformation 5, and the only difference is that the insertion sites of transformation 1 to transformation 5 and transformation 6 to transformation 10 are different)

实施例6:裂解液制备Example 6: Lysate Preparation

将-80℃冰箱冻存的待测菌株在YPD平板划线后挑取单菌落接种至含100mL种子培养基的250mL三角瓶中,于30℃摇床中在200rpm转速下振荡培养24h;量取种子液转接至含400mL发酵培养基的1000mL三角瓶中,于30℃摇床中在200rpm转速下振荡培养至收获期,采用高速离心方法收集细胞,并在液氮保护下制备细胞裂解液。The strain to be tested, which was frozen in a -80°C refrigerator, was streaked on a YPD plate, and a single colony was picked and inoculated into a 250mL Erlenmeyer flask containing 100mL of seed culture medium, and cultured in a 30°C shaker at 200rpm for 24h; the seed liquid was transferred to a 1000mL Erlenmeyer flask containing 400mL of fermentation culture medium, and cultured in a 30°C shaker at 200rpm until the harvest period, the cells were collected by high-speed centrifugation, and the cell lysate was prepared under liquid nitrogen protection.

实施例7:验证体外合成导入非天然氨基酸的目的蛋白的表达效率Example 7: Verification of the expression efficiency of in vitro synthesis of target proteins with introduction of unnatural amino acids

(1)配置Protein Factory(1) Configure Protein Factory

(2)采用如下无细胞体外翻译条件:(2) The following cell-free in vitro translation conditions were used:

Protein Factory 100μlProtein Factory 100μl

Prock(非天然氨基酸)500mM 1μlProck (unnatural amino acid) 500mM 1μl

GFP-TAG-RFP双荧光报告基因PCR产物 3μlGFP-TAG-RFP dual fluorescence reporter gene PCR product 3μl

Mapylrs终浓度 5μMMapylrs final concentration 5 μM

Ex485nm/Em535检测EGFP荧光强度,Ex535nm/Em595nm检测RFP荧光强度,根据RFP/GFP的比值以及RFP的荧光强度,判断非天然氨基酸的导入效率(参见图4-7)。Ex485nm/Em535 was used to detect the fluorescence intensity of EGFP, and Ex535nm/Em595nm was used to detect the fluorescence intensity of RFP. The efficiency of the introduction of unnatural amino acids was determined based on the ratio of RFP/GFP and the fluorescence intensity of RFP (see Figures 4-7).

由图4-5可知,经过改造的所有菌株均能正常表达,且均显示了较强的RFP荧光值,说明经过改造后的菌株已经成功实现非天然氨基酸导入蛋白的目的,且具有较高的活性,特别是改造1的菌株(图4中的wt)具有很高的读通率。由图6-7可知,经过改造后的菌株在非天然氨基酸导入过程中具有好的忠实性。尤其是改造1的菌株(图6中的wt)对具有极高的忠实性性。由此可见,经过本发明基因改造得到的菌株,对于体外表达导入非天然氨基酸具有较高的读通率和忠实性,具有很好的活性,对于体外导入非天然氨基酸领域的拓展应用具有极高的参考价值。 As shown in Figures 4-5, all the modified strains can express normally and show strong RFP fluorescence values, indicating that the modified strains have successfully achieved the purpose of introducing non-natural amino acids into proteins and have high activity, especially the strain of modification 1 (wt in Figure 4) has a very high read-through rate. As shown in Figures 6-7, the modified strains have good fidelity in the process of introducing non-natural amino acids. In particular, the strain of modification 1 (wt in Figure 6) has extremely high fidelity. It can be seen that the strains obtained by genetic modification of the present invention have a high read-through rate and fidelity for in vitro expression of non-natural amino acids, have good activity, and have extremely high reference value for the expanded application of the field of in vitro introduction of non-natural amino acids.

本发明采用的序列见如下表1The sequences used in the present invention are shown in Table 1 below

表1

Table 1

以上述依据本申请的理想实施例为启示,通过上述的说明内容,相关工作人员完全可以在不偏离本项申请技术思想的范围内,进行多样的变更以及修改。本项申请的技术性范围并不局限于说明书上的内容,必须要根据权利要求范围来确定其技术性范围 Based on the above ideal embodiments of this application, the relevant staff can make various changes and modifications without deviating from the technical concept of this application through the above description. The technical scope of this application is not limited to the content in the specification, and its technical scope must be determined according to the scope of the claims.

Claims (20)

一种核酸构建物,其特征在于:A nucleic acid construct, characterized in that: 所述核酸构建物的结构式为(I):Z1-Z2-Z3,The structural formula of the nucleic acid construct is (I): Z1-Z2-Z3, 式中,Z1-Z3分别为用于构成所述构建物的元件;In the formula, Z1-Z3 are elements used to constitute the structure respectively; 各“-”独立地为键或核苷酸连接序列;Each "-" is independently a bond or a nucleotide linking sequence; Z1为启动子序列;Z1 is the promoter sequence; Z2为无或连接序列;Z2 is none or a connection sequence; Z3为抑制性tRNA的基因序列。Z3 is the gene sequence of inhibitory tRNA. 根据权利要求1所述的核酸构建物,其特征在于:The nucleic acid construct according to claim 1, characterized in that: 其中,所述所述的启动子选自tDNA、SNR52、SNR6、Pol III、U6、U3、SCR1、RPR1或H1。Wherein, the promoter is selected from tDNA, SNR52, SNR6, Pol III, U6, U3, SCR1, RPR1 or H1. 根据权利要求1-3任一项所述的核酸构建物,其特征在于:所述的抑制性tRNA选自琥珀密码子tRNA、赭石密码子tRNA或蛋白石密码子tRNA;优选琥珀密码子tRNA;进一步优选tRNATAG PylThe nucleic acid construct according to any one of claims 1 to 3, characterized in that: the inhibitory tRNA is selected from amber codon tRNA, ochre codon tRNA or opal codon tRNA; preferably amber codon tRNA; further preferably tRNA TAG Pyl . 根据权利要求2-4任一项所述的核酸构建物,所述式(I)结构选自Z1-Z2-tDNATAG Pyl,所述tDNATAG Pyl选自野生型或突变型。According to any one of claims 2 to 4, the structure of formula (I) is selected from Z1-Z2-tDNA TAG Pyl , and the tDNA TAG Pyl is selected from wild type or mutant type. 根据权利要求2-5任一项所述的核酸构建物,所述式(I)结构选自tDNA-Z2-tDNATAG PylThe nucleic acid construct according to any one of claims 2 to 5, wherein the structure of formula (I) is selected from tDNA-Z2-tDNA TAG Pyl . 根据权利要求2-6任一项所述的核酸构建物,所述Z2选自连接序列,所述连接序列选自CTTTGTTTCT。According to the nucleic acid construct according to any one of claims 2 to 6, Z2 is selected from a linker sequence, and the linker sequence is selected from CTTTGTTTCT. 根据权利要求1-6任意一项所述的核酸构建物,其特征在于所述Z3为抑制性tRNA的单拷贝或多拷贝基因序列,优选地,所述多拷贝的数量小于等于6。The nucleic acid construct according to any one of claims 1 to 6, characterized in that Z3 is a single copy or multiple copy gene sequence of an inhibitory tRNA, preferably, the number of the multiple copies is less than or equal to 6. 一种基因工程菌株,其特征在于:所述菌株的基因组中整合有至少包含一种抑制性tRNA基因序列的核酸构建物。A genetically engineered strain, characterized in that a nucleic acid construct containing at least one inhibitory tRNA gene sequence is integrated into the genome of the strain. 根据权利要求8所述的基因工程菌株,其特征在于:所述核酸构建物为权利要求1-7任意一项所述的核酸构建物。The genetically engineered strain according to claim 8, characterized in that the nucleic acid construct is the nucleic acid construct according to any one of claims 1-7. 根据权利要求8或9所述的一种基因工程菌株,其特征在于:所述的菌株来源于细菌、哺乳动物细胞、人体细胞、植物细胞、酵母细胞、昆虫细胞之一或其任意组合。 A genetically engineered strain according to claim 8 or 9, characterized in that the strain is derived from bacteria, mammalian cells, human cells, plant cells, yeast cells, insect cells or any combination thereof. 根据权利要求8-10任一项所述的一种基因工程菌株,其特征在于:所述的酵母细胞选自毕赤酵母(Pichia pastoris)、芬兰毕赤酵母(Pichia finlandica)、喜海藻糖毕赤酵母(Pichia trehalophila)、科克拉马毕赤酵母(Pichia koclamae)、膜醭毕赤酵母(Pichia membranaefaciens)、微小毕赤酵母(Pichia minuta)、甲醇诱导型酵母(Ogataeaminuta)、林氏毕赤酵母(Pichia lindneri)、仙人掌毕赤酵母(Pichia opuntiae)、耐热毕赤酵母(Pichia thermotolerans)、柳毕赤酵母(Pichia salictaria)、松栎毕赤酵母(Pichia g uercuum)、皮杰普毕赤酵母(Pichia pijperi)、树干毕赤酵母(Pichiastiptis)、甲醇毕赤酵母(Pichia methanolica)、毕赤酵母菌(Pichia sp.)、酿酒酵母(Saccharomyces cerevisiae)、啤酒酵母、甘蔗糖蜜酵母、酵母菌(Saccharomyces sp.)、多形汉逊酵母(Hansenulapolymorpha)、产朊假丝酵母、克鲁维酵母之一或其组合。A genetically engineered strain according to any one of claims 8 to 10, characterized in that the yeast cell is selected from Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membraneaefaciens, Pichia minuta, Ogataeaminuta, Pichia lindneri, Pichia opuntiae, Pichia thermotolerant Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces cerevisiae, Saccharomyces molasses, Saccharomyces sp., Hansenula polymorpha, Candida utilis, Kluyveromyces, or a combination thereof. 根据权利要求所述的8-11任一项所述的基因工程菌株,其特征在于,所述核酸构建物经插入位点整合到菌株的基因组中,所述的插入位点为UPF1或内源tDNA。The genetically engineered strain according to any one of claims 8 to 11, characterized in that the nucleic acid construct is integrated into the genome of the strain via an insertion site, and the insertion site is UPF1 or endogenous tDNA. 根据权利要求8-12任一项所述的一种基因工程菌株的制备方法,其特征在于:将所述的核酸构建物经UPF1位点插入到所述菌株的基因组中。The method for preparing a genetically engineered strain according to any one of claims 8 to 12, characterized in that the nucleic acid construct is inserted into the genome of the strain via the UPF1 site. 根据权利要求8-12任一项所述的一种基因工程菌株的制备方法,其特征在于:将所述的核酸构建物经内源tDNA序列5’端和/或3’端1~1000bp处插入到所述菌株的基因组中;优选100~800bp;进一步优选200~500bp。The method for preparing a genetically engineered strain according to any one of claims 8 to 12 is characterized in that: the nucleic acid construct is inserted into the genome of the strain via 1 to 1000 bp at the 5' end and/or 3' end of the endogenous tDNA sequence; preferably 100 to 800 bp; further preferably 200 to 500 bp. 一种用于合成非天然氨基酸掺入的蛋白质的方法,其特征在于:采用权利要求8-12任一项所述的基因工程菌株提供抑制性tRNA。A method for synthesizing proteins incorporated with non-natural amino acids, characterized in that the genetically engineered strain described in any one of claims 8 to 12 is used to provide inhibitory tRNA. 一种用于合成非天然氨基酸掺入的蛋白质的无细胞合成体系,其特征在于:所述体系至少包括细胞提取物,所述细胞提取物来自于权利要求8-12任一项基因工程菌株。A cell-free synthesis system for synthesizing proteins incorporated with non-natural amino acids, characterized in that the system at least comprises a cell extract, which is derived from the genetically engineered strain of any one of claims 8-12. 根据权利要求16所述的无细胞合成体系,其特征在于:所述体系还包括氨酰tRNA合成酶和非天然氨基酸。The cell-free synthesis system according to claim 16, characterized in that the system also includes aminoacyl tRNA synthetase and unnatural amino acids. 权利要求8-12所述的基因工程菌株或权利要求16或17所述的无细胞合成体系在无细胞合成非天然氨基酸掺入的蛋白质中的应用。Use of the genetically engineered strains of claims 8 to 12 or the cell-free synthesis system of claim 16 or 17 in the cell-free synthesis of proteins incorporating non-natural amino acids. 一种试剂盒,其特征在于,所述的试剂盒包括权利要求16或17所述的无细胞蛋白质合成体系的组分。A kit, characterized in that the kit comprises the components of the cell-free protein synthesis system according to claim 16 or 17. 一种无细胞合成非天然氨基酸掺入的蛋白质的方法,其特征在于:包括如下步骤:步骤(1),提供权利要求16或17所述的无细胞蛋白质合成体系或权利要求19所述的试剂盒;步骤(2),在步骤(1)所述的合成体系或试剂盒中加入编码外 源蛋白的DNA分子,在氨酰tRNA合成酶和非天然氨基酸存在的条件下,经反应得到所述的蛋白质。 A method for cell-free synthesis of proteins incorporating non-natural amino acids, characterized in that it comprises the following steps: step (1), providing the cell-free protein synthesis system according to claim 16 or 17 or the kit according to claim 19; step (2), adding a coding exon to the synthesis system or kit described in step (1) The protein is obtained by reacting a DNA molecule of a source protein in the presence of aminoacyl tRNA synthetase and unnatural amino acids.
PCT/CN2024/128530 2023-10-31 2024-10-30 Genetically engineered strain based on trna modification, preparation method therefor, and use thereof Pending WO2025092824A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202311442372.5 2023-10-31
CN202311442372.5A CN119913187A (en) 2023-10-31 2023-10-31 A genetically engineered strain based on tRNA modification and its preparation method and application

Publications (1)

Publication Number Publication Date
WO2025092824A1 true WO2025092824A1 (en) 2025-05-08

Family

ID=95504957

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/128530 Pending WO2025092824A1 (en) 2023-10-31 2024-10-30 Genetically engineered strain based on trna modification, preparation method therefor, and use thereof

Country Status (2)

Country Link
CN (1) CN119913187A (en)
WO (1) WO2025092824A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090155844A1 (en) * 2006-02-22 2009-06-18 Riken METHOD OF SYNTHESIZING A SUPPRESSOR tRNA, DNA CONSTRUCT AND USE THEREOF FOR PRODUCING A NON-NATURAL AMINO ACID-INCORPORATED PROTEIN
US20110014650A1 (en) * 2007-12-11 2011-01-20 The Scripps Research Institute In vivo unnatural amino acid expression in the methylotrophic yeast pichia pastoris
US20160115487A1 (en) * 2013-05-29 2016-04-28 Bradley C. Bundy Cell-free synthetic incorporation of non-natural amino acids into proteins
WO2019041635A1 (en) * 2017-08-31 2019-03-07 康码(上海)生物科技有限公司 Nucleic acid construct for endogenous expression rna polymerase in cells
WO2023036210A1 (en) * 2021-09-08 2023-03-16 北京大学 Preparation method for protein containing unnatural amino acids

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090155844A1 (en) * 2006-02-22 2009-06-18 Riken METHOD OF SYNTHESIZING A SUPPRESSOR tRNA, DNA CONSTRUCT AND USE THEREOF FOR PRODUCING A NON-NATURAL AMINO ACID-INCORPORATED PROTEIN
US20110014650A1 (en) * 2007-12-11 2011-01-20 The Scripps Research Institute In vivo unnatural amino acid expression in the methylotrophic yeast pichia pastoris
US20160115487A1 (en) * 2013-05-29 2016-04-28 Bradley C. Bundy Cell-free synthetic incorporation of non-natural amino acids into proteins
WO2019041635A1 (en) * 2017-08-31 2019-03-07 康码(上海)生物科技有限公司 Nucleic acid construct for endogenous expression rna polymerase in cells
WO2023036210A1 (en) * 2021-09-08 2023-03-16 北京大学 Preparation method for protein containing unnatural amino acids

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
GAO WEI, BU NING, LU YUAN: "Recent advances in cell-free unnatural protein synthesis", CHINESE JOURNAL OF BIOTECHNOLOGY, ZHONGGUO KEXUEYUAN WEISHENGWU YANJIUSUO, CHINESE ACADEMY OF SCIENCES, INSTITUTE OF MICROBIOLOGY, CN, vol. 34, no. 9, 14 July 2018 (2018-07-14), CN , pages 1371 - 1385, XP093310747, ISSN: 1000-3061, DOI: 10.13345/j.cjb.180203 *
SEKI KOSUKE, GALINDO JOEY L., KARIM ASHTY S., JEWETT MICHAEL C.: "A Cell-Free Gene Expression Platform for Discovering and Characterizing Stop Codon Suppressing tRNAs", ACS CHEMICAL BIOLOGY, AMERICAN CHEMICAL SOCIETY, vol. 18, no. 6, 16 June 2023 (2023-06-16), pages 1324 - 1334, XP093310746, ISSN: 1554-8929, DOI: 10.1021/acschembio.3c00051 *
WANG L. ET AL: "A new functional suppressor tRNA/aminoacyl-tRNA synthetase apir for in vivo incorporation of unnatural amino acids into proteins", JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, AMERICAN CHEMICAL SOCIETY, vol. 122, 4 May 2000 (2000-05-04), pages 5010 - 5011, XP002390005, DOI: 10.1021/ja000595y *

Also Published As

Publication number Publication date
CN119913187A (en) 2025-05-02

Similar Documents

Publication Publication Date Title
CN111405912B (en) Polynucleotides, compositions and methods for genome editing
CN110093284B (en) Method for improving protein synthesis efficiency in cell
JP7093417B2 (en) How to regulate in vitro biosynthetic activity by knocking out the nuclease system
CN109423496B (en) A nucleic acid construct for endogenously expressing RNA polymerase in cells
CN110408636B (en) DNA sequence with multiple labels connected in series and application thereof in protein expression and purification system
WO2018161374A1 (en) Protein synthesis system for protein synthesis in vitro, kit and preparation method thereof
JP7028986B2 (en) Tandem DNA element that can increase protein synthesis efficiency
CN110551745A (en) Multiple histidine sequence tag and application thereof in protein expression and purification
CN110938649A (en) Protein synthesis system for improving expression quantity of foreign protein and application method thereof
AU2022348582A1 (en) Composition of transfer rnas and use in production of proteins containing non-standard amino acids
CN113215005A (en) In-vitro cell-free protein synthesis system (D2P system), kit and application thereof
CN112342248B (en) Method for changing in-vitro protein synthesis capacity by gene knockout and application thereof
WO2025092824A1 (en) Genetically engineered strain based on trna modification, preparation method therefor, and use thereof
CN119529023A (en) A signal peptide for enhancing protein expression in a cell-free system and its application in D2P
WO2025140227A1 (en) Modified aminoacyl-trna synthetase, nucleic acid construct and genetically engineered strain
WO2024051855A1 (en) Nucleic acid construct and use thereof in ivtt system
WO2020135623A1 (en) Method for changing in-vitro protein synthesis capability by edc3 gene knockout and application thereof
CN119372171A (en) A recombinant DNA polymerase and its application in cell-free protein synthesis
US20250051402A1 (en) Method for regulating in vitro biosynthesis activity by knocking-out of nuclease system
CN120060319A (en) Method for removing release factor and application thereof
CN119432629A (en) A genetically engineered strain and its preparation method and its application in cell-free protein synthesis
CN111118065A (en) Gene modification method of eukaryote, corresponding gene engineering cell and application thereof
CN120400114A (en) A method for reducing the impact of interfering factors on target proteins
CN120400113A (en) A method for reducing impurity proteins in target protein synthesis
WO2024141087A1 (en) In-vitro cell-free protein synthesis system for inserting non-natural amino acid, and method and use

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24884824

Country of ref document: EP

Kind code of ref document: A1