[go: up one dir, main page]

CN113801235A - Insulin lispro derivative and application thereof - Google Patents

Insulin lispro derivative and application thereof Download PDF

Info

Publication number
CN113801235A
CN113801235A CN202010531593.XA CN202010531593A CN113801235A CN 113801235 A CN113801235 A CN 113801235A CN 202010531593 A CN202010531593 A CN 202010531593A CN 113801235 A CN113801235 A CN 113801235A
Authority
CN
China
Prior art keywords
insulin lispro
seq
fusion protein
boc
chain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010531593.XA
Other languages
Chinese (zh)
Inventor
刘慧玲
陈卫
李克朗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ningbo Kunpeng Biotech Co Ltd
Original Assignee
Ningbo Kunpeng Biotech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ningbo Kunpeng Biotech Co Ltd filed Critical Ningbo Kunpeng Biotech Co Ltd
Priority to CN202010531593.XA priority Critical patent/CN113801235A/en
Publication of CN113801235A publication Critical patent/CN113801235A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/62Insulins
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/08Drugs for disorders of the metabolism for glucose homeostasis
    • A61P3/10Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • C12P21/06Preparation of peptides or proteins produced by the hydrolysis of a peptide bond, e.g. hydrolysate products
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/02Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/50Fusion polypeptide containing protease site
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/60Fusion polypeptide containing spectroscopic/fluorescent detection, e.g. green fluorescent protein [GFP]

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Diabetes (AREA)
  • General Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Microbiology (AREA)
  • Endocrinology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Emergency Medicine (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Public Health (AREA)
  • Hematology (AREA)
  • Obesity (AREA)
  • Veterinary Medicine (AREA)
  • Toxicology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The invention provides a lispro insulin derivative and a preparation method thereof. Specifically, the invention provides a fusion protein comprising a green fluorescent protein folding unit and insulin lispro or an active fragment thereof. The expression level of the fusion protein is obviously improved, and the insulin lispro protein in the fusion protein is correctly folded and has biological activity. Moreover, the folding unit of the green fluorescent protein in the fusion protein can be digested into small fragments by protease, has large molecular weight difference compared with the target protein, and is easy to separate. The invention also provides a method for preparing insulin lispro by using the fusion protein and a preparation intermediate.

Description

Insulin lispro derivative and application thereof
Technical Field
The invention relates to the technical field of biology, and particularly relates to a insulin lispro derivative and application thereof.
Background
Diabetes is a major disease threatening human health worldwide. In China, the prevalence rate of diabetes is on a rapid rising trend along with the change of life styles and the accelerated aging process of people. Acute and chronic complications of diabetes, especially chronic complications, accumulate a plurality of organs, are disabled, have high fatality rate, seriously affect physical and psychological health of patients and bring heavy burden to individuals, families and society.
Insulin lispro is the first analogue of fast-acting human insulin, belongs to the third generation of insulin, and is the sequential transposition of proline and lysine at positions 28 and 29 in the B chain of human insulin. The interchange changes the space structure of the tail end of the B chain, reduces the non-polar contact between insulin monomers in a dimer and the interaction between beta sheets, changes the self-polymerization characteristic of insulin, is easy to dissociate and can be quickly decomposed after injection, thereby accelerating the absorption after subcutaneous injection and being beneficial to controlling the hyperglycemia which is quickly raised after a meal. The secretion mode of human insulin can be well simulated, the pharmacokinetic characteristic of the human insulin is about half of that of the conventional human insulin, the acting time is 10-15 minutes, the peak reaching time is 1-1.5 hours, and the action duration is shortened to 4-6 hours, so that a patient is not easily superposed with the action of insulin before meals or at night while obtaining good blood sugar control, and the incidence rate of hypoglycemia at night is remarkably reduced. In addition, compared with human insulin, it has the advantages of high safety, good patient compliance, etc.
At present, the preparation process of insulin lispro in the prior art is complex, the yield is not high, and further popularization and application of the insulin lispro are affected. Therefore, the development of a preparation and purification method of insulin lispro with simple process, environmental friendliness, high yield and high yield is urgently needed in the field.
Disclosure of Invention
The invention aims to provide a insulin lispro derivative and application thereof.
In a first aspect of the invention, there is provided a recombinant insulin lispro fusion protein having the structure of formula I:
A-FP-TEV-R-G (I)
in the formula (I), the compound is shown in the specification,
"-" represents a peptide bond;
a is a null or leader peptide,
FP is a folding unit of green fluorescent protein,
TEV is an enzyme cutting site, preferably a TEV enzyme cutting site;
r is arginine or lysine for enzyme digestion;
g is insulin lispro or an active fragment thereof;
wherein said green fluorescent protein fold units comprise 2-6, preferably 2-3 β -sheet units selected from the group consisting of:
beta-sheet unit Amino acid sequence
u1 VPILVELDGDVNG(SEQ ID NO:11)
u2 HKFSVRGEGEGDAT(SEQ ID NO:12)
u3 KLTLKFICTT(SEQ ID NO:13)
u4 YVQERTISFKD(SEQ ID NO:14)
u5 TYKTRAEVKFEGD(SEQ ID NO:15)
u6 TLVNRIELKGIDF(SEQ ID NO:16)
u7 HNVYITADKQ(SEQ ID NO:17)
u8 GIKANFKIRHNVED(SEQ ID NO:18)
u9 VQLADHYQQNTPIG(SEQ ID NO:19)
u10 HYLSTQSVLSKD(SEQ ID NO:20)
u11 HMVLLEFVTAAGI(SEQ ID NO:21)。
In another preferred embodiment, the green fluorescent protein folding unit is u8-u9, u9-u10-u11 or u10-u 11.
In another preferred embodiment, G is Boc-modified lispro precursor having the structure of formula II:
GB-X-GA (II)
in the formula (I), the compound is shown in the specification,
GB is 28 th Boc modified insulin lispro B chain, the amino acid sequence is shown as 1 st to 30 th of SEQ ID NO. 5,
x is a connecting peptide, preferably, the amino acid sequence of X is R, RR, RRR, or as shown in SEQ ID NO. 6-9 (RRGSKR, RRAAKR, RRYPGDVKR or RREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKR);
GA is insulin lispro A chain, and the amino acid sequence is shown in 32-52 of SEQ ID NO. 5.
In another preferred embodiment, the R is used for enzyme digestion by trypsin and carboxypeptidase.
In another preferred embodiment, G is Boc modified insulin lispro with the sequence shown in SEQ ID NO. 5.
In another preferred embodiment, intrachain disulfide bonds are present between GB-X-GA.
In another preferred embodiment, the recombinant insulin lispro fusion protein has the sequence as shown in SEQ ID NO 1, 22 and 23.
In another preferred embodiment, interchain disulfide bonds are formed between the 7 th position of the B chain and the 7 th position of the A chain (A7-B7), and the 19 th position of the B chain and the 20 th position of the A chain (A20-B19).
In another preferred embodiment, an intrachain disulfide bond is formed between the 6 th position and the 11 th position of the A chain (A6-A11) of the insulin lispro A chain.
In another preferred embodiment, position 28 of said GB is N epsilon- (tert-butoxycarbonyl) -lysine.
In a second aspect of the invention, there is provided a double-stranded insulin lispro fusion protein having the structure shown in formula III:
A-FP-TEV-R-D (III)
in the formula (I), the compound is shown in the specification,
"-" represents a peptide bond;
a is a null or leader peptide, preferably a leader peptide having the sequence shown in SEQ ID NO 2,
FP is a folding unit of green fluorescent protein,
TEV is an enzyme cutting site, preferably a TEV enzyme cutting site (the sequence is ENLYFQG, SEQ ID NO: 4);
r is arginine or lysine for enzyme digestion;
d is Boc modified double-chain insulin lispro, and the main chain has a structure shown in the following formula IV;
Figure BDA0002535439000000031
in the formula (I), the compound is shown in the specification,
"║" represents a disulfide bond;
GA is insulin lispro A chain, the amino acid sequence is shown as 32-52 of SEQ ID NO. 5,
x is nothing or a connecting peptide;
GB is a 28 th Boc modified insulin lispro B chain, and the amino acid sequence is shown as 1 st to 30 th positions of SEQ ID NO. 5;
wherein said green fluorescent protein fold units comprise 2-6, preferably 2-3 β -sheet units selected from the group consisting of:
beta-sheet unit Amino acid sequence
u1 VPILVELDGDVNG(SEQ ID NO:11)
u2 HKFSVRGEGEGDAT(SEQ ID NO:12)
u3 KLTLKFICTT(SEQ ID NO:13)
u4 YVQERTISFKD(SEQ ID NO:14)
u5 TYKTRAEVKFEGD(SEQ ID NO:15)
u6 TLVNRIELKGIDF(SEQ ID NO:16)
u7 HNVYITADKQ(SEQ ID NO:17)
u8 GIKANFKIRHNVED(SEQ ID NO:18)
u9 VQLADHYQQNTPIG(SEQ ID NO:19)
u10 HYLSTQSVLSKD(SEQ ID NO:20)
u11 HMVLLEFVTAAGI(SEQ ID NO:21)。
In another preferred embodiment, the green fluorescent protein folding unit is u8-u9, u9-u10-u11, or u10-u 11.
In another preferred embodiment, the amino acid sequence of the green fluorescent protein folding unit is shown in SEQ ID NO 3, 24 and 25.
In another preferred embodiment, the C-terminus of the B chain of insulin lispro is linked to the N-terminus of the a chain of insulin lispro via a linker peptide.
In another preferred embodiment, X is a linker peptide, preferably X has the amino acid sequence R, RR, RRR, or as shown in SEQ ID NO:6-9 (RRGSKR, RRAAKR, RRYPGDVKR or RRE AEDLQVGQVELGGGPGAGSLQPLALEGSLQKR).
In a third aspect of the invention, there is provided a Boc-modified insulin lispro precursor having the structure shown in formula II:
GB-X-GA (II)
in the formula (I), the compound is shown in the specification,
GB is 28 th Boc modified insulin lispro B chain, the amino acid sequence is shown as 1 st to 30 th of SEQ ID NO. 5,
x is a connecting peptide, preferably, the amino acid sequence of X is R, RR, RRR, or as shown in SEQ ID NO. 6-9 (RRGSKR, RRAAKR, RRYPGDVKR or RREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKR);
GA is insulin lispro A chain, and the amino acid sequence is shown in 32-52 of SEQ ID NO. 5.
In another preferred embodiment, the protected lysine is N epsilon- (tert-butyloxycarbonyl) -lysine.
In a fourth aspect of the invention, there is provided a Boc-modified lispro insulin having the structure shown in formula IV:
Figure BDA0002535439000000051
in the formula (I), the compound is shown in the specification,
"║" represents a disulfide bond;
GA is insulin lispro A chain, the amino acid sequence is shown as 32-52 of SEQ ID NO. 5,
GB is insulin lispro B chain, the amino acid sequence is shown in 1 st to 30 th positions of SEQ ID NO. 5, and the lysine at 28 th position of the B chain is N epsilon- (tert-butyloxycarbonyl) -lysine.
In a fifth aspect of the present invention, there is provided a method of preparing insulin lispro, the method comprising the steps of:
(i) fermenting by using recombinant bacteria to prepare recombinant insulin lispro fusion protein (first protein) fermentation liquor;
(ii) carrying out enzyme digestion on the recombinant insulin lispro fusion protein (first protein) so as to obtain a mixed solution I containing Boc modified insulin lispro (second protein);
(iii) carrying out deprotection treatment on the Boc modified insulin lispro (second protein) to obtain a mixed solution II containing the deprotected insulin lispro (third protein);
(iv) and purifying the mixed solution II to obtain the insulin lispro.
In another preferred embodiment, in step (ii), the temperature of the enzyme cleavage is 32-42 ℃, preferably 36-38 ℃.
In another preferred embodiment, the purity of the prepared insulin lispro is higher than 99%.
In another preferred embodiment, the prepared insulin lispro has insulin lispro activity.
In another preferred embodiment, the protected lysine is lysine with a protecting group.
In another preferred embodiment, the protected lysine is N epsilon- (tert-butyloxycarbonyl) -lysine.
In another preferred embodiment, in step (i), the recombinant bacterium is used for the fermentative production of recombinant insulin lispro fusion protein.
In another preferred embodiment, the recombinant bacterium comprises or integrates an expression cassette for expressing the recombinant insulin lispro fusion protein.
In another preferred example, in step (i), the recombinant insulin lispro fusion protein inclusion bodies are isolated from the fermentation broth of the recombinant bacteria.
In another preferred embodiment, in step (i), the method further comprises the step of denaturing and renaturing the inclusion bodies to obtain the recombinant insulin lispro fusion protein (first protein) with correct protein folding.
In another preferred embodiment, the recombinant insulin lispro fusion protein with correct protein folding comprises intrachain disulfide bonds between the a chain and the B chain of insulin lispro.
In another preferred embodiment, the recombinant insulin lispro fusion protein is as described in the first aspect of the invention.
In another preferred embodiment, in step (ii), the cleavage is performed with trypsin.
In another preferred embodiment, in step (ii), the trypsin is recombinant porcine trypsin.
In another preferred embodiment, in step (ii), the mass ratio of trypsin to recombinant insulin lispro is 1:1000-40000, preferably 1: 3000-30000.
In another preferred embodiment, in step (ii), the enzyme is cleaved for 10-30h, preferably 14-20 h.
In another preferred embodiment, in step (ii), the pH of the enzyme cleavage is 7.0-9.5, preferably 8.0-9.0.
In another preferred example, in the step (iii), an acidic solvent such as hydrochloric acid is added to the reaction system to carry out the deprotection treatment.
In another preferred embodiment, in step (iii), the acid is added to a pH of 1.0 to 2.5 under stirring.
In another preferred embodiment, in step (iii), the temperature of the deprotection reaction is 4 to 37 ℃, preferably 18 to 25 ℃.
In another preferred embodiment, in step (iii), the deprotection reaction time is 0.5 to 8 hours, preferably 0.5 to 3 hours.
In another preferred embodiment, said Boc-lispro insulin is N epsilon- (tert-butyloxycarbonyl) -lysine lispro insulin.
In a sixth aspect of the invention there is provided an insulin lispro formulation prepared using the method of the fifth aspect of the invention.
In another preferred embodiment, the purity of the insulin lispro contained in the insulin lispro preparation is higher than 99%.
In another preferred embodiment, the insulin lispro included in the insulin lispro formulation has native insulin lispro activity.
In a seventh aspect of the invention, there is provided an isolated polynucleotide encoding a recombinant insulin lispro fusion protein according to the first aspect of the invention, an insulin lispro backbone fusion protein according to the second aspect of the invention, a Boc-modified insulin lispro precursor according to the third aspect of the invention, or a Boc-modified insulin lispro backbone according to the fourth aspect of the invention.
In an eighth aspect of the invention, there is provided a vector comprising a polynucleotide according to the seventh aspect of the invention.
In another preferred embodiment, the carrier is selected from the group consisting of: DNA, RNA, plasmids, lentiviral vectors, adenoviral vectors, retroviral vectors, transposons, or combinations thereof.
In a ninth aspect of the invention, there is provided a host cell comprising the vector of the eighth aspect of the invention, or a polynucleotide of the seventh aspect of the invention integrated into a chromosome, or expressing the recombinant insulin lispro fusion protein of the first aspect of the invention, the insulin lispro backbone fusion protein of the second aspect of the invention, the Boc-modified insulin lispro precursor of the third aspect of the invention, or the Boc-modified insulin lispro backbone of the fourth aspect of the invention.
In another preferred embodiment, the host cell is Escherichia coli, Bacillus subtilis, a yeast cell, an insect cell, a mammalian cell, or a combination thereof.
In a tenth aspect of the invention, there is provided a formulation or pharmaceutical composition comprising a recombinant insulin lispro fusion protein according to the first aspect of the invention, an insulin lispro backbone fusion protein according to the second aspect of the invention, a Boc-modified insulin lispro precursor according to the third aspect of the invention, or a Boc-modified insulin lispro backbone according to the fourth aspect of the invention, and a pharmaceutically acceptable carrier.
It is to be understood that within the scope of the present invention, the above-described features of the present invention and those specifically described below (e.g., in the examples) may be combined with each other to form new or preferred embodiments. Not to be reiterated herein, but to the extent of space.
Drawings
FIG. 1 shows a map of plasmid pBAD-FP-TEV-R-G.
FIG. 2 shows a map of the plasmid pEvol-pylRs-pylT.
FIG. 3 shows an SDS-PAGE electrophoresis of insulin lispro fusion proteins after inclusion body renaturation.
FIG. 4 shows the HPLC profile of insulin lispro after the third chromatography.
Detailed Description
The inventor of the invention has extensively and deeply researched and found a insulin lispro derivative and a preparation method thereof. Specifically, the invention provides a fusion protein comprising a green fluorescent protein folding unit and insulin lispro or an active fragment thereof. The expression level of the fusion protein is obviously improved, and the insulin lispro protein in the fusion protein is correctly folded and has biological activity. Moreover, the folding unit of the green fluorescent protein in the fusion protein can be digested into small fragments by protease, has large molecular weight difference compared with the target protein, and is easy to separate. The invention also provides a method for preparing insulin lispro by using the fusion protein and a preparation intermediate.
Term(s) for
In order that the disclosure may be more readily understood, certain terms are first defined. As used in this application, each of the following terms shall have the meaning given below, unless explicitly specified otherwise herein. Other definitions are set forth throughout the application.
The term "about" can refer to a value or composition that is within an acceptable error range for the particular value or composition as determined by one of ordinary skill in the art, which will depend in part on how the value or composition is measured or determined.
Insulin lispro
The insulin products are the first major drug variety in the diabetes market, occupy about 53% of the market share, and mainly comprise third-generation recombinant insulin. Insulin lispro belongs to third-generation recombinant insulin, is quick-acting insulin (or called as prandial insulin), takes effect 10-15 minutes after subcutaneous injection, reaches a peak value within 1-1.5 hours, and has the action duration of 4-6 hours.
Construction of insulin lispro expression plasmid
Synthesizing the FP-TEV-R-G target gene, wherein the two ends of the gene have recognition sites of restriction enzymes Nco I and Xho I, and the gene A for encoding insulin lispro is contained. The sequence is subjected to codon optimization, and can realize high-level expression of functional protein in escherichia coli. After expression, the expression vector "pBAD/His A" (Kana) was digested with restriction enzymes Nco I and Xho IR) "and comprisesThe plasmid with the target gene of 'FP-TEV-R-G' is cut by enzyme, the cut products are separated by agarose electrophoresis, then extracted by an agarose gel DNA recovery kit, and finally two DNA fragments are connected by T4 DNA ligase. The ligation product was chemically transformed into E.coli Top10 cells, and the transformed cells were cultured overnight on LB agar medium (10g/L yeast peptone, 5g/L yeast extract powder, 10g/L NaCl, 1.5% agar) containing 50. mu.g/mL kanamycin. 3 viable colonies were picked, cultured overnight in 5mL of liquid LB medium (10g/L yeast peptone, 5g/L yeast extract powder, 10g/L NaCl) containing 50. mu.g/mL kanamycin, and plasmid extraction was performed using a plasmid miniprep kit. The extracted plasmid was then sequenced using sequencing oligonucleotide primer 5'-ATGCCATAGCATTTTTATCC-3' to confirm correct insertion. The resulting plasmid was named "pBAD-FP-TEV-R-G".
Fusion proteins
By using the green fluorescent protein folding unit, two fusion proteins are constructed, namely the recombinant insulin lispro fusion protein comprising a single-chain insulin lispro precursor according to the first aspect of the invention and the double-chain insulin lispro fusion protein comprising a double-chain insulin lispro according to the second aspect of the invention. In fact, the scope of protection of two fusion proteins of the invention may partially overlap, for example, insulin lispro in a double-stranded form contained in the fusion protein, where the C-terminus of the B-chain may also be linked to the N-terminus of the a-chain by a linker peptide, or may be considered as a single chain containing intrachain disulfide bonds.
The green fluorescent protein fold unit FP comprised in the fusion protein of the invention comprises 2 to 6, preferably 2 to 3 β -sheet units selected from the group consisting of:
amino acid sequence
u1 VPILVELDGDVNG(SEQ ID NO:11)
u2 HKFSVRGEGEGDAT(SEQ ID NO:12)
u3 KLTLKFICTT(SEQ ID NO:13)
u4 YVQERTISFKD(SEQ ID NO:14)
u5 TYKTRAEVKFEGD(SEQ ID NO:15)
u6 TLVNRIELKGIDF(SEQ ID NO:16)
u7 HNVYITADKQ(SEQ ID NO:17)
u8 GIKANFKIRHNVED(SEQ ID NO:18)
u9 VQLADHYQQNTPIG(SEQ ID NO:19)
u10 HYLSTQSVLSKD(SEQ ID NO:20)
u11 HMVLLEFVTAAGI(SEQ ID NO:21)。
In another preferred embodiment, the green fluorescent protein folding unit FP can be selected from: u8, u9, u2-u3, u4-u5, u8-u9, u1-u2-u3, u2-u3-u4, u3-u4-u5, u5-u6-u7, u8-u9-u10, u9-u 9-u 9-u 9, u9-u 9-u 9-u 9, u 9-36u 9, u 9-9, u 36u 9-36u 9, u 36u 9-9, u 9-36u 9-9, u 9-36u 9-9, u 9-9, u 9-36u 9-9, u 9-36u 9-9, u-36u-9, u 36u 9, u 9-36u 9, u 36u 9-9, u 9-36u 9, u-9, u 9-9, u-9, u 9-36u-9, u-9, u 9-9, u 9-9, u 36u-9, u 9-36u 9, u 36u-9, u-36u-9, u-9-, u1-I-u5, u2-I-u4, u3-I-u8, u5-I-u6, or u10-I-u 11.
In another preferred embodiment, the green fluorescent protein folding unit is u8-u9, u9-u10-u11, or u10-u 11.
In another preferred embodiment, the fusion protein of the present invention has a structure represented by formula I:
FP-TEV-R-G (I)
"-" represents a peptide bond;
FP is a green fluorescent protein folding unit;
TEV is an enzyme cutting site, preferably a TEV enzyme cutting site (recognition sequence is ENLYFQG, SEQ ID NO: 4);
r is an enzyme cutting site, preferably a trypsin enzyme cutting site (recognition site is R or K);
a is Boc modified insulin lispro precursor, and has a structure shown in formula II:
(B1F~B28Boc-K~B29P~B30T)-X-(A1G~A21N) (Ⅱ)
in the formula (I), the compound is shown in the specification,
x is a connecting peptide, preferably, X has an amino acid sequence of R, RR, RRR, or as shown in SEQ ID NO. 6-9 (RRGSKR, RRAAKR, RRYPGDVKR or
RREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKR)。
In another preferred embodiment, the sequence of the recombinant insulin lispro precursor of the invention is shown in SEQ ID NO:22, and the sequence is:
VQLADHYQQNTPIGHYLSTQSVLSKDHMVLLEFVTAAGIENLYFQGRFVNQHLCGSHLVEALYLVCGERGFFYTK(Boc) PTRGIVEQCCTSICSLYQLENYCN wherein K (Boc) is Boc-modified lysine.
The term "fusion protein" as used herein also includes variants having the above-described activities. These variants include (but are not limited to): deletion, insertion and/or substitution of 1 to 3 (usually 1 to 2, more preferably 1) amino acids, and addition or deletion of one or several (usually up to 3, preferably up to 2, more preferably up to 1) amino acids at the C-terminal and/or N-terminal. For example, in the art, substitutions with amino acids of similar or similar properties will not generally alter the function of the protein. Also, for example, the addition or deletion of one or several amino acids at the C-terminus and/or N-terminus does not generally alter the structure and function of the protein. In addition, the term also includes monomeric and multimeric forms of the polypeptides of the invention. The term also includes linear as well as non-linear polypeptides (e.g., cyclic peptides).
The invention also includes active fragments, derivatives and analogs of the above fusion proteins. As used herein, the terms "fragment," "derivative," and "analog" refer to a polypeptide that substantially retains the function or activity of a fusion protein of the invention. The polypeptide fragment, derivative or analogue of the present invention may be (i) a polypeptide in which one or more conserved or non-conserved amino acid residues (preferably conserved amino acid residues) are substituted, or (ii) a polypeptide having a substituent group in one or more amino acid residues, or (iii) a polypeptide in which a polypeptide is fused with another compound (such as a compound for increasing the half-life of the polypeptide, e.g., polyethylene glycol), or (iv) a polypeptide in which an additional amino acid sequence is fused with the polypeptide sequence (a fusion protein in which a tag sequence such as a leader sequence, a secretory sequence or 6His is fused). Such fragments, derivatives and analogs are within the purview of those skilled in the art in view of the teachings herein.
A preferred class of reactive derivatives refers to polypeptides formed by the replacement of up to 3, preferably up to 2, more preferably up to 1 amino acid with an amino acid of similar or analogous nature compared to the amino acid sequence of the present invention. These conservative variants are preferably produced by amino acid substitutions according to Table A.
TABLE A
Initial residue(s) Representative substitutions Preferred substitutions
Ala(A) Val;Leu;Ile Val
Arg(R) Lys;Gln;Asn Lys
Asn(N) Gln;His;Lys;Arg Gln
Asp(D) Glu Glu
Cys(C) Ser Ser
Gln(Q) Asn Asn
Glu(E) Asp Asp
Gly(G) Pro;Ala Ala
His(H) Asn;Gln;Lys;Arg Arg
Ile(I) Leu;Val;Met;Ala;Phe Leu
Leu(L) Ile;Val;Met;Ala;Phe Ile
Lys(K) Arg;Gln;Asn Arg
Met(M) Leu;Phe;Ile Leu
Phe(F) Leu;Val;Ile;Ala;Tyr Leu
Pro(P) Ala Ala
Ser(S) Thr Thr
Thr(T) Ser Ser
Trp(W) Tyr;Phe Tyr
Tyr(Y) Trp;Phe;Thr;Ser Phe
Val(V) Ile;Leu;Met;Phe;Ala Leu
The invention also provides analogs of the fusion proteins of the invention. These analogs may differ from the polypeptides of the invention by amino acid sequence differences, by modifications that do not affect the sequence, or by both. Analogs also include analogs having residues other than the natural L-amino acids (e.g., D-amino acids), as well as analogs having non-naturally occurring or synthetic amino acids (e.g., beta, gamma-amino acids). It is to be understood that the polypeptides of the present invention are not limited to the representative polypeptides exemplified above.
In addition, modifications may be made to the fusion proteins of the invention. Modified (generally without altering primary structure) forms include: chemically derivatized forms of the polypeptide, such as acetylation or carboxylation, in vivo or in vitro. Modifications also include glycosylation, such as those resulting from glycosylation modifications in the synthesis and processing of the polypeptide or in further processing steps. Such modification may be accomplished by exposing the polypeptide to an enzyme that performs glycosylation, such as a mammalian glycosylase or deglycosylase. Modified forms also include sequences having phosphorylated amino acid residues (e.g., phosphotyrosine, phosphoserine, phosphothreonine). Also included are polypeptides modified to increase their resistance to proteolysis or to optimize solubility.
The term "polynucleotide encoding a fusion protein of the present invention" may include a polynucleotide encoding a fusion protein of the present invention, and may also include polynucleotides that additionally include coding and/or non-coding sequences.
The invention also relates to variants of the above polynucleotides which encode fragments, analogs and derivatives of the polypeptides or fusion proteins having the same amino acid sequence as the present invention. These nucleotide variants include substitution variants, deletion variants and insertion variants. As is known in the art, an allelic variant is a substitution of a polynucleotide, which may be a substitution, deletion, or insertion of one or more nucleotides, without substantially altering the function of the fusion protein encoded thereby.
The present invention also relates to polynucleotides which hybridize to the sequences described above and which have at least 50%, preferably at least 70%, and more preferably at least 80% identity between the two sequences. The present invention particularly relates to polynucleotides hybridizable under stringent conditions (or stringent conditions) with the polynucleotides of the present invention. In the present invention, "stringent conditions" mean: (1) hybridization and elution at lower ionic strength and higher temperature, such as 0.2 XSSC, 0.1% SDS, 60 ℃; or (2) adding denaturant during hybridization, such as 50% (v/v) formamide, 0.1% calf serum/0.1% Ficoll, 42 deg.C, etc.; or (3) hybridization occurs only when the identity between two sequences is at least 90% or more, preferably 95% or more.
The fusion proteins and polynucleotides of the invention are preferably provided in isolated form, and more preferably, purified to homogeneity.
The full-length sequence of the polynucleotide of the present invention can be obtained by PCR amplification, recombination, or artificial synthesis. For PCR amplification, primers can be designed based on the nucleotide sequences disclosed herein, particularly open reading frame sequences, and the sequences can be amplified using commercially available cDNA libraries or cDNA libraries prepared by conventional methods known to those skilled in the art as templates. When the sequence is long, two or more PCR amplifications are often required, and then the amplified fragments are spliced together in the correct order.
Once the sequence of interest has been obtained, it can be obtained in large quantities by recombinant methods. This is usually done by cloning it into a vector, transferring it into a cell, and isolating the relevant sequence from the propagated host cell by conventional methods.
In addition, the sequence can be synthesized by artificial synthesis, especially when the fragment length is short. Generally, fragments with long sequences are obtained by first synthesizing a plurality of small fragments and then ligating them.
At present, DNA sequences encoding the proteins of the present invention (or fragments or derivatives thereof) have been obtained completely by chemical synthesis. The DNA sequence may then be introduced into various existing DNA molecules (or vectors, for example) and cells known in the art.
Methods for amplifying DNA/RNA using PCR techniques are preferably used to obtain the polynucleotides of the invention. Particularly, when it is difficult to obtain a full-length cDNA from a library, it is preferable to use the RACE method (RACE-cDNA terminal rapid amplification method), and primers used for PCR can be appropriately selected based on the sequence information of the present invention disclosed herein and synthesized by a conventional method. The amplified DNA/RNA fragments can be isolated and purified by conventional methods, such as by gel electrophoresis.
Expression vector
The invention also relates to vectors comprising the polynucleotides of the invention, as well as genetically engineered host cells transformed with the vectors of the invention or the coding sequences of the fusion proteins of the invention, and methods for producing the polypeptides of the invention by recombinant techniques.
The polynucleotide sequences of the present invention may be used to express or produce recombinant fusion proteins by conventional recombinant DNA techniques. Generally, the following steps are performed:
(1) transforming or transducing a suitable host cell with a polynucleotide (or variant) of the invention encoding a fusion protein of the invention, or with a recombinant expression vector comprising the polynucleotide;
(2) a host cell cultured in a suitable medium;
(3) isolating and purifying the protein from the culture medium or the cells.
In the present invention, the polynucleotide sequence encoding the fusion protein may be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to a bacterial plasmid, bacteriophage, yeast plasmid, plant cell virus, mammalian cell virus such as adenovirus, retrovirus, or other vectors well known in the art. Any plasmid or vector may be used as long as it can replicate and is stable in the host. An important feature of expression vectors is that they generally contain an origin of replication, a promoter, a marker gene and translation control elements.
Methods well known to those skilled in the art can be used to construct expression vectors containing the DNA sequences encoding the fusion proteins of the present invention and appropriate transcription/translation control signals. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. The DNA sequence may be operably linked to a suitable promoter in an expression vector to direct mRNA synthesis. Representative examples of such promoters are: lac or trp promoter of E.coli; a lambda phage PL promoter; eukaryotic promoters include CMV immediate early promoter, HSV thymidine kinase promoter, early and late SV40 promoter, LTRs of retrovirus, and other known promoters capable of controlling gene expression in prokaryotic or eukaryotic cells or viruses. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
Furthermore, the expression vector preferably comprises one or more selectable marker genes to provide phenotypic traits for selection of transformed host cells, such as dihydrofolate reductase, neomycin resistance and Green Fluorescent Protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for E.coli.
Vectors comprising the appropriate DNA sequences described above, together with appropriate promoter or control sequences, may be used to transform appropriate host cells to enable expression of the protein.
The host cell may be a prokaryotic cell, such as a bacterial cell; or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as mammalian cells. Representative examples are: escherichia coli, streptomyces; bacterial cells of salmonella typhimurium; fungal cells such as yeast, plant cells (e.g., ginseng cells).
When the polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription will be enhanced if an enhancer sequence is inserted into the vector. Enhancers are cis-acting elements of DNA, usually about 10 to 300 base pairs, that act on a promoter to increase transcription of a gene. Examples include the SV40 enhancer at the late side of the replication origin at 100 to 270 bp, the polyoma enhancer at the late side of the replication origin, and adenovirus enhancers.
It will be clear to one of ordinary skill in the art how to select appropriate vectors, promoters, enhancers and host cells.
Transformation of a host cell with recombinant DNA can be carried out using conventional techniques well known to those skilled in the art. When the host is prokaryotic, e.g., E.coli, competent cells capable of DNA uptake can be harvested after exponential growth phase using CaCl2Methods, the steps used are well known in the art. Another method is to use MgCl2. If desired, transformation can also be carried out by electroporation. When the host is a eukaryote, the following DNA transfection methods may be used: calcium phosphate coprecipitation, conventional mechanical methods such as microinjection, electroporation, liposome encapsulation, etc.
The obtained transformant can be cultured by a conventional method to express the polypeptide encoded by the gene of the present invention. The medium used in the culture may be selected from various conventional media depending on the host cell used. The culturing is performed under conditions suitable for growth of the host cell. After the host cells have been grown to an appropriate cell density, the selected promoter is induced by suitable means (e.g., temperature shift or chemical induction) and the cells are cultured for an additional period of time.
The recombinant polypeptide in the above method may be expressed intracellularly or on the cell membrane, or secreted extracellularly. If necessary, the recombinant protein can be isolated and purified by various separation methods using its physical, chemical and other properties. These methods are well known to those skilled in the art. Examples of such methods include, but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (such as salt precipitation), centrifugation, cell lysis by osmosis, sonication, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, High Performance Liquid Chromatography (HPLC), and other various liquid chromatography techniques, and combinations thereof.
The main advantages of the invention include:
(1) the method does not need to remove excessive inorganic salt in the supernatant of the fermentation liquor by adopting methods such as dilution, ultrafiltration and liquid exchange, and the like, so that the obtained inclusion body has higher purity and less pigment, and separated substances and purification cost are reduced for subsequent purification. And the cation chromatography in the method is used for separating the insulin lispro, and the one-step yield is over 80 percent.
(2) Due to B28Protection of Boc lysine, no recognition of B during trypsin digestion28The lysine does not produce some by-products, can improve the enzyme digestion yield, reduces the impurities of the insulin analogues of the lysine, and provides convenience for subsequent purification and separation.
(3) In the enzyme digestion process, the enzyme digestion yield is improved by optimizing the proportion of trypsin, controlling the enzyme digestion temperature and adding an enzyme digestion auxiliary agent.
(4) And in the deprotection step, the Boc-lispro insulin is converted into lispro insulin without being carried out in an organic system, so that the process steps are reduced, the environmental pollution is low, and the cost is lower.
(5) The invention adopts two-step ion exchange chromatography and one-step reverse phase chromatography for separation and purification, replaces the conventional four-step chromatography, reduces the production period, reduces the use of organic solvent and saves the cost.
(6) The fusion protein of the invention contains insulin lispro with high specific gravity (fusion ratio is increased), the green fluorescent protein in the fusion protein contains arginine or lysine, can be digested into small fragments by protease, has large molecular weight difference compared with the target protein, and is easy to separate.
The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Experimental procedures without specific conditions noted in the following examples, generally followed by conventional conditions, such as Sambrook et al, molecular cloning: the conditions described in the Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,1989), or according to the manufacturer's recommendations. Unless otherwise indicated, percentages and parts are percentages and parts by weight.
Example 1 construction and expression of insulin lispro expression Strain
The construction of insulin lispro expression plasmid is described in the examples in patent application No. 201910210102.9. The DNA fragment containing the fusion protein FP-TEV-R-G was cloned into the NcoI-XhoI site downstream of the araBAD promoter of the expression vector plasmid pBAD/His A (purchased from NTCC, kanamycin resistance) to obtain plasmid pBAD-FP-TEV-R-G. The plasmid map is shown in FIG. 1.
The DNA sequence of pylRs was then cloned into the SpeI-SalI site downstream of the araBAD promoter of the expression vector plasmid pEvol-pBpF (available from NTCC for chloramphenicol resistance), while the DNA sequence of the tRNA (pylTcua) of lysyl-tRNA synthetase was PCR inserted downstream of the proK promoter. This plasmid was designated pEvol-pylRs-pylT. The plasmid map is shown in FIG. 2.
The constructed plasmid pBAD-FP-TEV-R-G and pEvol-pylRs-pylT are jointly transformed into an escherichia coli TOP10 strain, and a recombinant escherichia coli strain expressing the insulin lispro fusion protein FP-TEV-R-G is obtained through screening. Wherein the sequence of FP is u8-u9, and the amino acid sequence of fusion protein is shown as SEQ ID NO. 1.
GIKANFKIRHNVEDVQLADHYQQNTPIGENLYFQGRFVNQHLCGSHLVEALYLVCGERGFFYTK(Boc) PTRGIVEQCCTSICSLYQLENYCN (SEQ ID NO:1) Using the same procedure, the following expression plasmids were constructed:
pBAD-FP (u9-u10-u11) -TEV-R-G, amino acid sequence of fusion protein
VQLADHYQQNTPIGHYLSTQSVLSKDHMVLLEFVTAAGIENLYFQGRFVNQHLCGSHLVEALYLVCGERGFFYTK(Boc)PTRGIVEQCCTSICSLYQLENYCN(SEQ ID NO:22)。
pBAD-FP (u10-u11) -TEV-R-G, amino acid sequence of fusion protein
HYLSTQSVLSKDHMVLLEFVTAAGIENLYFQGRFVNQHLCGSHLVEALYLVCGERGFFYTK(Boc)PTRGIVEQCCTSICSLYQLENYCN(SEQ ID NO:23)。
pBAD-FP (gIII) -R-G, amino acid sequence of fusion protein
KKLLFAIPLVVPFYSHSTMELEICSWYHMGIRSFLEQKLISEEDLNSAVDRFVNQHLCGSHLVEALYLVCGERGFFYTK(Boc) PTRGIVEQCCTSICSLYQLENYCN (SEQ ID NO: 10). Wherein FP (gIII) represents a gIII signal peptide derived from green fluorescent protein.
Constructing corresponding expression strains by using a conventional method in the field, and carrying out electrophoresis detection on the expression quantity of the insulin lispro fusion protein in the fermentation liquor.
Figure BDA0002535439000000161
Preparing seed liquid culture medium, inoculating, performing two-stage culture to obtain second-stage seed liquid, and culturing for 20 hr with OD600About 180, obtaining about 3L fermentation liquor after fermentation is finished, and obtaining about 130g/L wet thalli by centrifugation. And (3) after the fermentation liquor is centrifuged, adding a crushing buffer solution, performing bacteria crushing twice by using a high-pressure homogenizer, adding Tween 80 and EDTA-2Na with a certain concentration after centrifugation, washing once, centrifuging, and collecting precipitates to obtain the inclusion body. About 40g of wet inclusion bodies per liter of fermentation broth can finally be obtained.
Example 2 solubilization and renaturation of Inclusion bodies
Adding 8mol/L urea solution into the obtained inclusion body, adjusting the pH value to 9.0-10.0 by sodium hydroxide, stirring for 1-3h at room temperature, controlling the protein concentration to be 10-20 mg/mL, supplementing beta-mercaptoethanol until the final concentration is 15-20m mol/L, and continuing stirring for 0.5-1.0 h.
And (3) dropwise adding the inclusion body dissolving solution into a renaturation buffer solution, diluting and renaturing by 5-10 times, maintaining the pH value of the renaturation solution to be 9.0-10.0, and stirring and renaturing for 10-20 hours.
After renaturation is carried out for 20h, HPLC detects the fusion content of insulin lispro with correct renaturation folding, and the renaturation rate reaches over 75 percent. FIG. 3 shows an SDS-PAGE electrophoresis of insulin lispro fusion proteins after inclusion body renaturation.
EXAMPLE 3 cleavage of fusion proteins
Adding dilute hydrochloric acid into the renaturation solution to adjust the pH value to 8.0-9.5, adding recombinant trypsin according to the ratio of 1:3000, adding carboxypeptidase B according to the ratio of 1:15000, and obtaining Boc-insulin after enzyme digestion, wherein the enzyme digestion temperature is 18-25 ℃, and the enzyme digestion time is 14-20 h.
And after enzyme digestion is carried out for 16h, detecting the Boc-lispro insulin content in the enzyme digestion solution by using HPLC, and finishing enzyme digestion when the concentration difference of the Boc-lispro insulin detected for two continuous hours is less than 3%. Finally, the concentration of Boc-lispro insulin in the enzyme digestion solution is 0.4-0.6g/L, and the enzyme digestion rate is more than 80%.
Example 4 first chromatography
According to the difference of isoelectric points of proteins, anion exchange packing is selected to carry out crude extraction on Boc-lispro insulin. Balancing 3-5 column volumes of the chromatographic column by using 5-20mmol/L sodium carbonate and pH8.0-9.0 buffer solution, fully combining the Boc-lispro insulin with an anion filler, wherein the loading capacity of the loading solution is lower than 45g/L, linearly eluting 15 column volumes by using 100-500mmol/L sodium chloride after the loading is finished, and collecting the eluted protein solution, wherein the yield of the Boc-lispro insulin is more than 90 percent, and the purity is more than 70 percent.
Example 5 deprotection
Adding 2-3% hydrochloric acid into the Boc lispro insulin solution, stirring at room temperature for reaction for 3-5h, adding NaOH to adjust the pH value of the protein solution to be more than 2.5 after the reaction is finished, terminating deprotection reaction, and finally obtaining lispro insulin, wherein the deprotection yield is more than 85% and the purity is more than 75% by HPLC (high performance liquid chromatography).
EXAMPLE 6 second chromatography
According to the difference of the charges of the substances, the insulin lispro is purified by adopting an anion exchange chromatography technology to remove part of impurities. Balancing 3-5 column volumes of an ion column by using 20mmol/L glycine and pH9.0 buffer solution, combining insulin lispro protein solution with a cationic filler, controlling the loading capacity of insulin lispro not to exceed 10mg/mL, isocratically eluting by using a sodium chloride solution containing 0.27mol/L, and collecting an insulin lispro sample. The purity of insulin lispro in the collected liquid was 97.83%, and the yield was 87.96%.
EXAMPLE 7 third chromatography
According to the difference of the hydrophobicity of the substances, the insulin lispro is finely purified by adopting a reverse phase chromatographic column technology. Diluting the insulin lispro solution obtained by the secondary chromatography by more than 4 times with pure water, and combining with C8 reversed phase filler. Controlling the loading capacity of insulin lispro to be not higher than 5mg/mL, and isocratically eluting 10CV with 200mmol/L sodium acetate and 26% acetonitrile. The results show that: the yield of insulin lispro is 93.8%, and the purity is 99.70%. FIG. 4 shows the HPLC profile of insulin lispro after the third chromatography.
All documents referred to herein are incorporated by reference into this application as if each were individually incorporated by reference. Furthermore, it should be understood that various changes and modifications of the present invention can be made by those skilled in the art after reading the above teachings of the present invention, and these equivalents also fall within the scope of the present invention as defined by the appended claims.
Sequence listing
<110> Ningbo spread Biotechnology Ltd
<120> insulin lispro derivative and application thereof
<130> P2020-0840
<160> 25
<170> SIPOSequenceListing 1.0
<210> 1
<211> 88
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 1
Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Val Gln
1 5 10 15
Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Glu Asn Leu Tyr
20 25 30
Phe Gln Gly Arg Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val
35 40 45
Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys
50 55 60
Pro Thr Arg Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu
65 70 75 80
Tyr Gln Leu Glu Asn Tyr Cys Asn
85
<210> 2
<211> 12
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 2
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val
1 5 10
<210> 3
<211> 28
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 3
Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Val Gln
1 5 10 15
Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly
20 25
<210> 4
<211> 7
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 4
Glu Asn Leu Tyr Phe Gln Gly
1 5
<210> 5
<211> 52
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 5
Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr
1 5 10 15
Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys Pro Thr Arg Gly
20 25 30
Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu
35 40 45
Asn Tyr Cys Asn
50
<210> 6
<211> 6
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 6
Arg Arg Gly Ser Lys Arg
1 5
<210> 7
<211> 6
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 7
Arg Arg Ala Ala Lys Arg
1 5
<210> 8
<211> 9
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 8
Arg Arg Tyr Pro Gly Asp Val Lys Arg
1 5
<210> 9
<211> 35
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 9
Arg Arg Glu Ala Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly
1 5 10 15
Gly Pro Gly Ala Gly Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu
20 25 30
Gln Lys Arg
35
<210> 10
<211> 103
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 10
Lys Lys Leu Leu Phe Ala Ile Pro Leu Val Val Pro Phe Tyr Ser His
1 5 10 15
Ser Thr Met Glu Leu Glu Ile Cys Ser Trp Tyr His Met Gly Ile Arg
20 25 30
Ser Phe Leu Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Ser Ala
35 40 45
Val Asp Arg Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu
50 55 60
Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys Pro
65 70 75 80
Thr Arg Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr
85 90 95
Gln Leu Glu Asn Tyr Cys Asn
100
<210> 11
<211> 13
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 11
Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly
1 5 10
<210> 12
<211> 14
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 12
His Lys Phe Ser Val Arg Gly Glu Gly Glu Gly Asp Ala Thr
1 5 10
<210> 13
<211> 10
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 13
Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr
1 5 10
<210> 14
<211> 11
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 14
Tyr Val Gln Glu Arg Thr Ile Ser Phe Lys Asp
1 5 10
<210> 15
<211> 13
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 15
Thr Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp
1 5 10
<210> 16
<211> 13
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 16
Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe
1 5 10
<210> 17
<211> 10
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 17
His Asn Val Tyr Ile Thr Ala Asp Lys Gln
1 5 10
<210> 18
<211> 14
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 18
Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp
1 5 10
<210> 19
<211> 14
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 19
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly
1 5 10
<210> 20
<211> 12
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 20
His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp
1 5 10
<210> 21
<211> 13
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 21
His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile
1 5 10
<210> 22
<211> 99
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 22
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly His Tyr
1 5 10 15
Leu Ser Thr Gln Ser Val Leu Ser Lys Asp His Met Val Leu Leu Glu
20 25 30
Phe Val Thr Ala Ala Gly Ile Glu Asn Leu Tyr Phe Gln Gly Arg Phe
35 40 45
Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu
50 55 60
Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys Pro Thr Arg Gly Ile
65 70 75 80
Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn
85 90 95
Tyr Cys Asn
<210> 23
<211> 85
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 23
His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp His Met Val Leu
1 5 10 15
Leu Glu Phe Val Thr Ala Ala Gly Ile Glu Asn Leu Tyr Phe Gln Gly
20 25 30
Arg Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu
35 40 45
Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys Pro Thr Arg
50 55 60
Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu
65 70 75 80
Glu Asn Tyr Cys Asn
85
<210> 24
<211> 39
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 24
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly His Tyr
1 5 10 15
Leu Ser Thr Gln Ser Val Leu Ser Lys Asp His Met Val Leu Leu Glu
20 25 30
Phe Val Thr Ala Ala Gly Ile
35
<210> 25
<211> 25
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 25
His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp His Met Val Leu
1 5 10 15
Leu Glu Phe Val Thr Ala Ala Gly Ile
20 25

Claims (10)

1. A recombinant insulin lispro fusion protein having the structure shown in formula I:
A-FP-TEV-R-G (I)
in the formula (I), the compound is shown in the specification,
"-" represents a peptide bond;
a is a null or leader peptide,
FP is a folding unit of green fluorescent protein,
TEV is an enzyme cutting site, preferably a TEV enzyme cutting site;
r is arginine or lysine for enzyme digestion;
g is insulin lispro or an active fragment thereof;
wherein said green fluorescent protein fold units comprise 2-6 β -sheet units selected from the group consisting of:
beta-sheet unit Amino acid sequence u1 VPILVELDGDVNG(SEQ ID NO:11) u2 HKFSVRGEGEGDAT(SEQ ID NO:12) u3 KLTLKFICTT(SEQ ID NO:13) u4 YVQERTISFKD(SEQ ID NO:14) u5 TYKTRAEVKFEGD(SEQ ID NO:15) u6 TLVNRIELKGIDF(SEQ ID NO:16) u7 HNVYITADKQ(SEQ ID NO:17) u8 GIKANFKIRHNVED(SEQ ID NO:18) u9 VQLADHYQQNTPIG(SEQ ID NO:19) u10 HYLSTQSVLSKD(SEQ ID NO:20) u11 HMVLLEFVTAAGI(SEQ ID NO:21)。
2. The fusion protein of claim 1, wherein the green fluorescent protein folding unit is u8-u9, u9-u10-u11, or u10-u 11.
3. The fusion protein of claim 1, wherein G is Boc-modified lispro precursor having the structure of formula II:
GB-X-GA (II)
in the formula (I), the compound is shown in the specification,
GB is 28 th Boc modified insulin lispro B chain, the amino acid sequence is shown as 1 st to 30 th of SEQ ID NO. 5,
x is a connecting peptide;
GA is insulin lispro A chain, and the amino acid sequence is shown in 32-52 of SEQ ID NO. 5.
4. The fusion protein of claim 1, wherein the recombinant insulin lispro fusion protein has the sequence set forth in SEQ ID NOs 1, 22, and 23.
5. A double-chain insulin lispro fusion protein is characterized by having a structure shown in formula III:
A-FP-TEV-R-D (III)
in the formula (I), the compound is shown in the specification,
"-" represents a peptide bond;
a is a null or leader peptide,
FP is a folding unit of green fluorescent protein,
TEV is an enzyme cutting site, preferably a TEV enzyme cutting site;
r is arginine or lysine for enzyme digestion;
d is Boc modified double-chain insulin lispro with the structure shown in the following formula IV;
Figure FDA0002535438990000021
in the formula (I), the compound is shown in the specification,
"|" represents a disulfide bond;
GA is insulin lispro A chain, the amino acid sequence is shown as 32-52 of SEQ ID NO. 5,
x is nothing or a connecting peptide;
GB is a 28 th Boc modified insulin lispro B chain, and the amino acid sequence is shown as 1 st to 30 th positions of SEQ ID NO. 5;
wherein said green fluorescent protein fold units comprise 2-6 β -sheet units selected from the group consisting of:
Figure FDA0002535438990000022
Figure FDA0002535438990000031
6. a Boc-modified insulin lispro precursor having the structure of formula II:
GB-X-GA (II)
in the formula (I), the compound is shown in the specification,
GB is 28 th Boc modified insulin lispro B chain, the amino acid sequence is shown as 1 st to 30 th of SEQ ID NO. 5,
x is connecting peptide, and the amino acid sequence of X is R, RR, RRR, or as shown in SEQ ID NO. 6-9;
GA is insulin lispro A chain, and the amino acid sequence is shown in 32-52 of SEQ ID NO. 5.
7. A Boc-modified double-stranded insulin lispro having the structure of formula IV:
Figure FDA0002535438990000032
in the formula (I), the compound is shown in the specification,
"|" represents a disulfide bond;
GA is insulin lispro A chain, the amino acid sequence is shown as 32-52 of SEQ ID NO. 5,
GB is insulin lispro B chain, the amino acid sequence is shown in 1 st to 30 th positions of SEQ ID NO. 5, and the lysine at 28 th position of the B chain is N epsilon- (tert-butyloxycarbonyl) -lysine.
8. An isolated polynucleotide encoding the recombinant insulin lispro fusion protein of claim 1, the double-stranded insulin lispro fusion protein of claim 5, the Boc-modified insulin lispro precursor of claim 6, or the Boc-modified double-stranded insulin of claim 7.
9. A vector comprising the polynucleotide of claim 8.
10. A host cell comprising the vector of claim 9, or having integrated into its chromosome the exogenous polynucleotide of claim 8, or expressing the recombinant insulin lispro fusion protein of claim 1, the double-stranded insulin lispro fusion protein of claim 5, the Boc-modified insulin lispro precursor of claim 6, or the Boc-modified double-stranded insulin lispro of claim 7.
CN202010531593.XA 2020-06-11 2020-06-11 Insulin lispro derivative and application thereof Withdrawn CN113801235A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010531593.XA CN113801235A (en) 2020-06-11 2020-06-11 Insulin lispro derivative and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010531593.XA CN113801235A (en) 2020-06-11 2020-06-11 Insulin lispro derivative and application thereof

Publications (1)

Publication Number Publication Date
CN113801235A true CN113801235A (en) 2021-12-17

Family

ID=78943751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010531593.XA Withdrawn CN113801235A (en) 2020-06-11 2020-06-11 Insulin lispro derivative and application thereof

Country Status (1)

Country Link
CN (1) CN113801235A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114805544A (en) * 2022-06-23 2022-07-29 北京惠之衡生物科技有限公司 Insulin lispro precursor, recombinant genetic engineering bacterium thereof and construction method thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010023069A1 (en) * 1999-12-29 2001-09-20 Kjeldsen Thomas Borglum Method for making insulin precursors and insulin precursor analogs
CN104619726A (en) * 2012-03-23 2015-05-13 苏州鲲鹏生物技术有限公司 Fusion protein composed of superfolded green fluorescent protein and use thereof
CN105535942A (en) * 2016-01-28 2016-05-04 通化东宝药业股份有限公司 Insulin lispro-protamine sulfate preparation and preparation method thereof
EP3260548A1 (en) * 2016-06-21 2017-12-27 Enzypep B.V. Enzymatic coupling of (oligo)peptides to the b-chain of an insulin receptor ligand
CN108383902A (en) * 2012-09-26 2018-08-10 印第安纳大学研究及科技有限公司 Insulin analog dimer

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010023069A1 (en) * 1999-12-29 2001-09-20 Kjeldsen Thomas Borglum Method for making insulin precursors and insulin precursor analogs
CN104619726A (en) * 2012-03-23 2015-05-13 苏州鲲鹏生物技术有限公司 Fusion protein composed of superfolded green fluorescent protein and use thereof
CN108383902A (en) * 2012-09-26 2018-08-10 印第安纳大学研究及科技有限公司 Insulin analog dimer
CN105535942A (en) * 2016-01-28 2016-05-04 通化东宝药业股份有限公司 Insulin lispro-protamine sulfate preparation and preparation method thereof
EP3260548A1 (en) * 2016-06-21 2017-12-27 Enzypep B.V. Enzymatic coupling of (oligo)peptides to the b-chain of an insulin receptor ligand

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114805544A (en) * 2022-06-23 2022-07-29 北京惠之衡生物科技有限公司 Insulin lispro precursor, recombinant genetic engineering bacterium thereof and construction method thereof

Similar Documents

Publication Publication Date Title
AU2020242724B2 (en) Aminoacyl-tRNA synthetase for efficiently introducing lysine derivative in protein
CN113773398B (en) De-glu insulin derivative and application thereof
CN113614113B (en) Fusion protein containing fluorescent protein fragment and use thereof
CN113135990A (en) Liraglutide derivatives and preparation method thereof
CN115667318B (en) A semaglutide derivative and its preparation method and application
CN113801234B (en) A kind of semaglutide derivative and its application
CN113801233B (en) A kind of preparation method of semaglutide
CN113801235A (en) Insulin lispro derivative and application thereof
CN113801236A (en) Preparation method of insulin lispro
CN113773400B (en) A kind of insulin aspart derivative and its application
CN113773399B (en) Insulin glargine derivative and application thereof
CN113773392B (en) Preparation method of insulin glargine
CN113773397B (en) A kind of preparation method of insulin degludec
CN113773391B (en) Preparation method of insulin aspart
CN114057886B (en) A kind of semaglutide derivative and preparation method thereof
CN113773396A (en) Insulin detemir derivative and application thereof
CN113773395A (en) Preparation method of insulin detemir
CN114075295A (en) Efficient renaturation liquid of Boc-human insulin fusion protein inclusion body and renaturation method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20211217