WO1993002198A1 - Recombinant secretable fusion proteins - Google Patents
Recombinant secretable fusion proteins Download PDFInfo
- Publication number
- WO1993002198A1 WO1993002198A1 PCT/FI1991/000225 FI9100225W WO9302198A1 WO 1993002198 A1 WO1993002198 A1 WO 1993002198A1 FI 9100225 W FI9100225 W FI 9100225W WO 9302198 A1 WO9302198 A1 WO 9302198A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- host
- protein
- fusion protein
- transformed
- expression vector
- Prior art date
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 59
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 59
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 119
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 100
- 108020001580 protein domains Proteins 0.000 claims abstract description 35
- 239000013604 expression vector Substances 0.000 claims abstract description 29
- 239000013598 vector Substances 0.000 claims abstract description 20
- 101800001707 Spacer peptide Proteins 0.000 claims abstract description 14
- 238000004519 manufacturing process Methods 0.000 claims abstract description 13
- 230000004071 biological effect Effects 0.000 claims abstract description 9
- 108091005763 multidomain proteins Proteins 0.000 claims abstract description 7
- 230000001131 transforming effect Effects 0.000 claims abstract description 7
- 238000012258 culturing Methods 0.000 claims abstract description 3
- 238000000034 method Methods 0.000 claims description 41
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 37
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 36
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 31
- 108020004511 Recombinant DNA Proteins 0.000 claims description 29
- 241000588724 Escherichia coli Species 0.000 claims description 28
- 239000012634 fragment Substances 0.000 claims description 22
- 108060003951 Immunoglobulin Proteins 0.000 claims description 17
- 102000018358 immunoglobulin Human genes 0.000 claims description 17
- 108020004414 DNA Proteins 0.000 claims description 16
- 239000001963 growth medium Substances 0.000 claims description 16
- 108010059892 Cellulase Proteins 0.000 claims description 15
- 229940106157 cellulase Drugs 0.000 claims description 15
- 241000233866 Fungi Species 0.000 claims description 12
- 108010031186 Glycoside Hydrolases Proteins 0.000 claims description 12
- 102000005744 Glycoside Hydrolases Human genes 0.000 claims description 12
- 125000006850 spacer group Chemical group 0.000 claims description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 10
- 229940088598 enzyme Drugs 0.000 claims description 10
- 241000894006 Bacteria Species 0.000 claims description 9
- 102000004190 Enzymes Human genes 0.000 claims description 9
- 108090000790 Enzymes Proteins 0.000 claims description 9
- 210000004027 cell Anatomy 0.000 claims description 9
- 102100022624 Glucoamylase Human genes 0.000 claims description 8
- 230000001580 bacterial effect Effects 0.000 claims description 8
- 210000001322 periplasm Anatomy 0.000 claims description 8
- 239000000427 antigen Substances 0.000 claims description 7
- 108091007433 antigens Proteins 0.000 claims description 7
- 102000036639 antigens Human genes 0.000 claims description 7
- 108700007698 Genetic Terminator Regions Proteins 0.000 claims description 6
- 239000003550 marker Substances 0.000 claims description 6
- 230000003248 secreting effect Effects 0.000 claims description 6
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 claims description 5
- 241000701959 Escherichia virus Lambda Species 0.000 claims description 5
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 claims description 5
- 241000499912 Trichoderma reesei Species 0.000 claims description 5
- 230000002538 fungal effect Effects 0.000 claims description 5
- 241000588701 Pectobacterium carotovorum Species 0.000 claims description 4
- 108020005091 Replication Origin Proteins 0.000 claims description 4
- 108010087558 pectate lyase Proteins 0.000 claims description 4
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical group CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 claims description 3
- 108010051457 Acid Phosphatase Chemical group 0.000 claims description 3
- 102000013563 Acid Phosphatase Human genes 0.000 claims description 3
- 244000063299 Bacillus subtilis Species 0.000 claims description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 3
- 108010079246 OMPA outer membrane proteins Proteins 0.000 claims description 3
- 108010003533 Viral Envelope Proteins Proteins 0.000 claims description 3
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical group C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 claims description 3
- 108010051210 beta-Fructofuranosidase Chemical group 0.000 claims description 3
- 229940088597 hormone Drugs 0.000 claims description 3
- 239000005556 hormone Substances 0.000 claims description 3
- 239000001573 invertase Chemical group 0.000 claims description 3
- 235000011073 invertase Nutrition 0.000 claims description 3
- 241000191967 Staphylococcus aureus Species 0.000 claims description 2
- 241000194017 Streptococcus Species 0.000 claims description 2
- 238000013518 transcription Methods 0.000 claims description 2
- 230000035897 transcription Effects 0.000 claims description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims 12
- 241001513093 Aspergillus awamori Species 0.000 claims 2
- 241000206602 Eukaryota Species 0.000 claims 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims 2
- 241000228212 Aspergillus Species 0.000 claims 1
- 241000351920 Aspergillus nidulans Species 0.000 claims 1
- 241000191940 Staphylococcus Species 0.000 claims 1
- 239000013599 cloning vector Substances 0.000 claims 1
- 238000003259 recombinant expression Methods 0.000 claims 1
- 230000010076 replication Effects 0.000 claims 1
- 238000010276 construction Methods 0.000 description 25
- 239000013612 plasmid Substances 0.000 description 20
- 206010039509 Scab Diseases 0.000 description 15
- 230000028327 secretion Effects 0.000 description 15
- 102000004196 processed proteins & peptides Human genes 0.000 description 13
- 238000010367 cloning Methods 0.000 description 11
- 239000013615 primer Substances 0.000 description 11
- 108010084185 Cellulases Proteins 0.000 description 8
- 102000005575 Cellulases Human genes 0.000 description 8
- 239000002299 complementary DNA Substances 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- KGRVJHAUYBGFFP-UHFFFAOYSA-N 2,2'-Methylenebis(4-methyl-6-tert-butylphenol) Chemical compound CC(C)(C)C1=CC(C)=CC(CC=2C(=C(C=C(C)C=2)C(C)(C)C)O)=C1O KGRVJHAUYBGFFP-UHFFFAOYSA-N 0.000 description 7
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 7
- 230000004927 fusion Effects 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 125000003275 alpha amino acid group Chemical group 0.000 description 6
- 150000001413 amino acids Chemical class 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 230000006698 induction Effects 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 4
- 108020004635 Complementary DNA Proteins 0.000 description 4
- 101710098246 Exoglucanase 2 Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 229910001629 magnesium chloride Inorganic materials 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- SJHPCNCNNSSLPL-CSKARUKUSA-N (4e)-4-(ethoxymethylidene)-2-phenyl-1,3-oxazol-5-one Chemical compound O1C(=O)C(=C/OCC)\N=C1C1=CC=CC=C1 SJHPCNCNNSSLPL-CSKARUKUSA-N 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 108010017826 DNA Polymerase I Proteins 0.000 description 3
- 102000004594 DNA Polymerase I Human genes 0.000 description 3
- 108050008938 Glucoamylases Proteins 0.000 description 3
- 102100034349 Integrase Human genes 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 102000034356 gene-regulatory proteins Human genes 0.000 description 3
- 108091006104 gene-regulatory proteins Proteins 0.000 description 3
- 229940072221 immunoglobulins Drugs 0.000 description 3
- 238000005304 joining Methods 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 238000012856 packing Methods 0.000 description 3
- 238000004153 renaturation Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000005063 solubilization Methods 0.000 description 3
- 230000007928 solubilization Effects 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 2
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 2
- 101710132601 Capsid protein Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 101710091045 Envelope protein Proteins 0.000 description 2
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 2
- 101710098247 Exoglucanase 1 Proteins 0.000 description 2
- 108010058643 Fungal Proteins Proteins 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 2
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 2
- 102000002568 Multienzyme Complexes Human genes 0.000 description 2
- 108010093369 Multienzyme Complexes Proteins 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 101710188315 Protein X Proteins 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000001588 bifunctional effect Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- -1 e.g. Proteins 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000012723 sample buffer Substances 0.000 description 2
- 238000001179 sorption measurement Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- XZKIHKMTEMTJQX-UHFFFAOYSA-N 4-Nitrophenyl Phosphate Chemical compound OP(O)(=O)OC1=CC=C([N+]([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 101100136076 Aspergillus oryzae (strain ATCC 42149 / RIB 40) pel1 gene Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 229910002535 CuZn Inorganic materials 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101710121417 Envelope glycoprotein Proteins 0.000 description 1
- 241001288713 Escherichia coli MC1061 Species 0.000 description 1
- 108700005088 Fungal Genes Proteins 0.000 description 1
- 108010001496 Galectin 2 Proteins 0.000 description 1
- 102100021735 Galectin-2 Human genes 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108091036060 Linker DNA Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 102000019197 Superoxide Dismutase Human genes 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000002441 X-ray diffraction Methods 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 235000020054 awamori Nutrition 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 101150114858 cbh2 gene Proteins 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 102000029724 enzyme binding proteins Human genes 0.000 description 1
- 108091009282 enzyme binding proteins Proteins 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000302 molecular modelling Methods 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 101150040383 pel2 gene Proteins 0.000 description 1
- 101150050446 pelB gene Proteins 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013595 supernatant sample Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
Definitions
- the present invention relates to a method for producing secretable, biologically active single chain antibodies (scAbs) and other secretable fusion proteins having at least two distinct functional proteins or protein domains linked to form a single polypeptide chain by a flexible linker peptide that is compatible with secretion.
- scAbs secretable, biologically active single chain antibodies
- rDNA reco binant DNA
- proteins are the key elements in a wide variety of biotechnological applications today and their importance is expected to increase significantly in the future. Genetic and protein engineering are used widely to study and improve properties of proteins.
- the key techniques include site-directed and random mutagenesis of the isolated genes or cDNAs coding for the desired proteins and the design and construction of fusion proteins using genetic engineering. Increased understanding of protein properties and function will make possible the design and development of completely novel proteins with new activities.
- Fusion proteins have been constructed by linking together two previously independent proteins or protein domains with varying degrees of success. Most commonly, an enzymatically detectable marker protein or an affinity tail facilitating protein purification has been linked to the N- or C-terminus of another protein. In most cases, the fusions have been accomplished by trial and error and only very rarely have attempts been made to design linkers or spacers for the optimization of the fusion point.
- Immunoglobulins are large, complicated molecules consisting of two identical light chains and two identical heavy chains which pack together forming the complete immunoglobulin. Each of the heavy chains is composed of 4-5 domains and each of the light chains of 2 domains. Immunoglobulins are unique in their capacity to recognize and bind specifically a wide variety of molecular structures. This capability can be used in many applications for the detection or purification of compounds, for targeting of drugs or marker proteins to specific tissues and for other applications for which specific molecular recognition and binding is important.
- Fab fragments have been produced proteolytically from intact immunoglobulin or genetically using truncated immunoglobulin genes expressed in bacteria (Better et al. , Science ,
- Single chain antibodies in which the antigen binding domains of an immunoglobulin have been linked to a single polypeptide chain using short flexible linkers or spacers have been constructed and produced in bacteria (Bird et al., Science , 242:423- 426, 1988; Ladner et al., PCT application WO 88/01649, 1988; Huston et al., Proc . Natl . Acad . Sci . USA, 85:5879-5883, 1988; Chaudhary et al. , Nature , 339:394- 397, 1989) . In all of the published cases, the single chain antibodies were produced inside of E.
- linker sequences used in constructing novel fusion proteins.
- Three examples concern the construction of single chain antibodies using two different types of linker sequences.
- the linker is a thrice-repeated sequence of Ser-Gly-Gly-Gly-Gly, (Chaudhary et al., 1989, supra; Huston et al, 1988, supra) and in the third example an 18 residues long linker peptide was designed by computer aided modelling (Bird et al., 1988, supra; Ladner et al.. 1988, supra) .
- polymers of human CuZn superoxide dismutase were constructed by linking the subunits together using the hinge peptide of an IgAl (Hallewell et al., J. Biol . Chem . , 264:5260-5258, 1989) .
- IgAl Haallewell et al., J. Biol . Chem . , 264:5260-5258, 1989.
- no signal sequence was included and the fusion proteins were produced inside E. coli or yeast cells and were not secreted.
- coli pyruvate dehydrogenase multienzyme complex consist mainly of Ala and Pro residues with occasional Gin, Glu, Lys, Arg, Ser or Thr residues.
- the proteins used as sources for the linker peptides are intracellular and there are no published reports of the analysis or design of secretable linker peptides. Yet in industrial applications, secretion of the product is in most cases desired because the recovery of the product is easier and thus cheaper from the culture media than from inside of the producer organism. Perhaps more importantly, the proportion of correctly folded and thus biologically active proteins is generally very low (less than 10%) after solubilization and subsequent renaturation of the proteins produced intracellularly while the major fraction of secreted proteins are as active as the wild type protein.
- Carbohydrate degrading enzymes e.g., cellulases and glucoamylases are secreted proteins many of which consist of at least two domains; a large catalytic core domain and a smaller substrate recognition domain (van Tilebeurgh et al., FEBS Lett . , 204:223-227, 1986; Teeri, Doctoral Thesis, Technical Research Centre of Finland, Publications 38, Espoo, 1987; Teeri et al., Gene , 51:43-52, 1987; Knowles et al., Tibtech , 5:255-261, 1987; Tomme et al. , Eur . J . Biochem .
- the substrate recognition domains are generally linked to the catalytically active core proteins by linker peptides of about 30-40 amino acids.
- hinge In their natural environment the function of the hinge is apparently to act as a flexible spacer separating the two functional domains by a distance of over 100 angstroms. Low angle X-ray analysis of intact cellulases indicates that the hinge region has an almost fully extended conformation (Sch uck et al., Biotechnol . Lett . , 9:397-402, 1986). This hinge peptide sequence is not included in any of the above mentioned compilations of potential linker regions for the construction of novel fusion proteins.
- Rational design of novel fusion proteins with two or more domains will require the availability of relatively short and flexible spacer or linker peptides for joining the two proteins or domains without interfering with their function or with their secretion. This will be particularly important for the design and development of fusion proteins in which the association and interaction of two domains is required for activity such * single chain antibodies and which are designed fo; ecretion from the host cells.
- One object of the present -invention is to provide a method for producing single chain functional proteins that can be secreted in active form into the periplasmic space of bacteria such as E. coli or into culture media.
- Another object of the invention is to produce secretable single chain antibodies. Another object of the invention is to produce multi-functional secretable single chain proteins.
- Another object of the invention is to provide isolated DNA sequences for flexible spacer peptides which are readily adaptable to recombinant methodology and easily used to construct rDNA sequences for single chain proteins whose linked functional domains can interact or assemble to form an active conformation of the protein.
- Fig. 1 shows the schematic presentation of the CBH I hinge cloning.
- Fig. 2 shows the sequences of the original CBH
- FIG. 3A shows the construction of the plasmid pMLLl.
- Fig. 3B shows the construction of the plasmid pMLL2.
- Fig. 4 shows the nucleotide and the amino acid sequence of the V H . linker-V L insert of the plasmid pML5.
- Fig. 5 shows the construction of plasmid pML5 used for expression of the single-chain antibody binding oxazolone with CBH I hinge linker.
- Fig. 6 shows the result of the western blot analysis after overnight induction of the pML5 in E. coli RV308 and MC1061 strains.
- the difference between the samples si and s2 and also between the samples sll and s22 is only the dilution factor.
- Fig. 7 shows the cloning procedure for the CBHII signal sequence.
- Fig. 8 shows the construction of plasmid pDSl containing the CBHII signal sequence preceding the anti-Ox scAb.
- Fig. 9 shows the construction of the plasmid pDS2 in which the scAb construction of Fig. 8 is linked to the yeast expression vector pMA91.
- an rDNA molecule coding for a secretable single chain fusion protein said fusion protein comprising a plurality of functional proteins or protein domains joined by spacer peptides; wherein each said spacer peptide comprises a flexible hinge region of a naturally secreted multidomain protein or an analogue thereof; wherein each adjacent pair of functional proteins or protein domains is joined by a spacer peptide which is not naturally continuous with either functional protein or protein domain to which it is joined; and wherein at least two ' of said functional proteins or protein domains each independently exhibit physicochemical or biological activity or interact to form an assembly that exhibits physicochemical or biological activity; (b) inserting said rDNA molecule into an expression vector capable of transforming a host organism, wherein said vector further comprises DNA sequences coding for a promoter sequence, a signal sequence, a terminator sequence, a replication origin and at least one selection marker for selecting transformed host cells, and recovering the resultant vector;
- Single chain secretable functional fusion proteins are provided using the foregoing method, as are rDNA molecules, vectors and transformed host organisms suitable for use therein.
- single chain antibodies can be produced and secreted in active antigen-binding form using the method of the invention.
- immunoglobulin As used herein, the terms, "immunoglobulin", “heavy chain”, “light chain” and “Fab” are used in the same way as in Eur. Pat . Appln . No . 0125023, of Cabilly et al., published 11/14/84, corresponding to U.S . Serial No . 483,457, filed 4/8/83.
- Fv is used to describe those regions of the immunoglobulin molecule which are responsible for antigen or hapten binding. Usually these consist of approximately the first 100 amino acids of the N-termini of the light and the heavy chain of the immunoglobulin molecule.
- Domain is used to describe an independently folding part of a protein which may or may not function independently. General structural definitions for domain borders in natural proteins are given in Argos, 1988 (Argos, Protein Engineering,
- Single chain is used to define a molecule in which two or more proteins or protein domains of the same or different origin are joined together to form a continuous amino acid chain synthesized from a single mRNA molecule (transcript) .
- Linker or "linker peptide” is used to describe an amino acid sequence that extends between adjacent domains in a natural or engineered protein.
- Linkers useful in the present invention are flexible so that they can be inserted between a variety of proteins or protein domains and used to link these together without interfering with their function. It will be appreciated by the skilled artisan that recombinant DNA techniques can be used to position any such linker peptide sequence between any two domains and that, in particular instances, the linker sequences may be augmented or truncated by a few amino acids in order to facilitate cloning procedures.
- Hybrid region is a term used to describe a class of flexible linker peptide sequences that link functional domains in a protein chain, e.g., the peptide sequences that link the substrate binding domain to the catalytic domain in a glycosidase enzyme such as a cellulase or a glucoamylase.
- a general method is presented herein for constructing fusion proteins in which two or more functional proteins or protein domains from the same or different origin are joined together by a flexible linker peptide to form a single chain fusion protein which can be secreted from bacteria, yeasts and fungi and other eukaryotic hosts. Specifically a method is introduced for constructing single chain antibodies which can be secreted to the culture medium of the above mentioned hosts or to the periplasmic space of bacteria such as E. coli.
- Linker peptides from naturally secreted multidomain proteins e.g., glycosidases such as cellulases, glucoamylases, and other secreted proteins are used as spacers between the proteins or protein domains to be joined.
- the hinge peptide may be heterologous to the functional proteins or protein domains to which it is joined or homologous to one or both, but will not be the continuation of the natural amino acid sequence for the functional protein/domain into the sequence for the hinge region of the same naturally secreted protein.
- Naturally secreted multidomain proteins are understood to embrace any secretable protein having two or more structurally and/or functionally distinct domains.
- the domains can be biologically active, either individually, e.g., a substrate binding region of an enzyme, or by association of more than one domain, e.g., immunoglobulin (Ig) heavy and light chain variable regions.
- the activity can be physicochemical, e.g., conferring certain properties upon the protein such as solubility or adsorption onto a solid support or into a membrane or fiber matrix.
- hinge regions are suitable for linking together proteins or protein domains even when close packing of the two parts is required as in the case of scAbs.
- the constructions are made using recombinant- DNA technology and genes and/or cDNAs of the relevant enzymes and other proteins.
- the DNA sequences coding for the desired fusion protein are inserted in an appropriate expression vector which subsequently transforms the host organism for expression and secretion.
- the functional portions of the constructs can be selected from among sequences taken from antibodies, enzymes, hormones, viral envelope glycoproteins or the like.
- linkers used in these constructs have as their common characteristic that they facilitate, or at least do not interfere with, secretory processes that transport transcribed proteins to the bacterial periplasmic space or entirely out of eukaryotic cells and into the surrounding medium.
- Illustrative linkers are the highly conserved regions of glycosidases that contain mainly proline and serine/threonine residues.
- the hinge regions of most bacterial cellulases and, e.g., IgA consist of Thr and/or Ser and Pro residues exclusively while in the hinges of fungal cellulases and glucoamylases, occasional Ala, Cy ⁇ , Gly, Gin or charged residues are permitted in addition to the Ser, Thr and Pro residues.
- linker sequences share the properties of flexibility and substantial absence of secondary structure such as pleated sheet or helical segments.
- the Thr and Ser residues tend to be O-glycosylated even in the bacterial cellulase hinge regions while Asn residues, which are sites for N-glycosylation, are usually not found in these hinges. This suggests that the O- glycosylation might facilitate the secretion or otherwise be important for the function of these hinges.
- glycosylation of the hinge appears not to be an absolute requirement for the secretion as scAb produced in E. coli is secreted.
- hinge regions can be used to construct many other types of fusion proteins such as enzyme-enzyme fusions (bifunctional enzymes) in which two or more enzymatic activities are required in a single polypeptide chain or enzyme-binding protein fusions in which the enzyme is bound to a carrier or support by a specialized binding domain in such a way that its biological activity is not destroyed.
- enzyme-enzyme fusions bifunctional enzymes
- enzyme-binding protein fusions in which the enzyme is bound to a carrier or support by a specialized binding domain in such a way that its biological activity is not destroyed.
- Such techniques are much wanted in biotechnology to facilitate protein purification or immobilization.
- Another example is a fusion protein in which the domains or proteins linked together interact or pack together, and this interaction is required for their correct folding, stability or activity.
- glycosylated envelope proteins of animal viruses In addition to scAbs, an example of such a fusion protein of practical importance is the production of the glycosylated envelope proteins of animal viruses to be used in diagnostics or as vaccines. These envelope glycoproteins contain important antigenic epitopes towards which neutralizing antibodies are elicited (Webster et al., Nature , 296:115-121, 1982; Bruck et al.. Virology, 122:352-362, 1982;- Ho-Terry et al. , Arch. Virol . , 90:145-152, 1986). For instance the two envelope proteins of rubella virus must form a heterodimer in order to be secreted efficiently.
- a binding domain for cellulose can be linked to either terminus of a desired scAb, thereby facilitating its immobilization on cellulose.
- a construct could, e.g., contain: one of the hypervariable domains of the antibody - hinge I - the other hypervariable domain of the antibody - hinge II - the cellulose binding domain.
- the first hinge allows the packing of the two antibody hypervariable domains onto each other while the second hinge (the same or different from the first one) acts as a spacer to separate the binding domain from the scAb.
- fusion proteins consisting of even more than three domains.
- bifunctional single chain antibodies could be constructed by linking together two domains of one antibody and two domains of another antibody and subsequently joining these together by yet another hinge.
- Such a construct thus contains four immunoglobulin domains linked to a single poly ⁇ peptide via three hinges which allow pairwise packing of the correct immunoglobulin domains and the separa ⁇ tion of the two scAbs to allow their independent functioning.
- Similar multidomain proteins exist in nature, e.g., regulatory proteins and the use of the flexible secretable hinges described in this invention now makes possible the construction of novel multi ⁇ domain proteins by protein engineering.
- the genes coding for the desired proteins or protein domains to be linked are isolated and sequenced using standard recombinant-DNA methods.
- the cDNAs corresponding to the said proteins or protein domains are synthesized enzymatically using the specific mRNAs as templates also using standard recombinant-DNA techniques.
- the DNA sequences can also be synthesized chemically according to a pre-existing sequence or a sequence designed die novo.
- the DNA sequences of such proteins or protein domains can be synthesized in such a way as to optimize the codon usage to correspond to that preferred by the host organism used for expression.
- the isolated genes or synthesized cDNAs are linked to suitable expression vectors comprising: 1) an efficient promoter; 2) an efficient signal sequence; and 3) terminator sequences.
- E. coli many powerful promoters have been described, e.g., lac, trp or phage lambda Pr and Pi promoters as well as hybrid promoters such as tac.
- yeast powerful promoters such as the phosphoglycerokinase (PGK) or alcohol dehydrogenase (ADH) promoters have been used, and in filamentous fungi promoters of, e.g., hydro- lytic enzymes such as the cellulase, cellobiohydrolase I, of Trichoderma reesei have been used.
- PGK phosphoglycerokinase
- ADH alcohol dehydrogenase
- any strong promoter functional in the host organism used can in principle be used for the expression of the secretable fusion proteins. It is an advantage, although not an absolute requirement, for the promoter to be kept in a repressed state until it is activated (induced) at a given time point by, e.g., elevated temperature, as in the case of the lambda Pr and PI promoters, or by adding particular inducing agents such as IPTG in the case of the lac or tac promoters of E. coli.
- the signal sequences directing secretion to the periplasmic space or to the culture medium of E. coli include those of OmpA protein of E. coli or pectate lyase of Erwinia carotovora but a number of other signal sequences also can be used.
- secretion of foreign proteins can be achieved using the signal sequences of the yeast alpha factor, acid phosphatase or invertase or other signal sequences of the rare secreted proteins of yeast.
- signal sequences from fungal cellulases have worked in yeast (Penttila et al., Yeast , 3:75-83, 1987; Penttila et al., Gene , 63:103-112, 1988) and other eukaryotic and possibly prokaryotic signal sequences may be used in yeast as well.
- yeast Pentila et al., Yeast , 3:75-83, 1987; Penttila et al., Gene , 63:103-112, 1988
- the signal sequences of its cellulases have been shown to work with other proteins and signal sequences from other secreted fungal proteins can also be used.
- a signal sequence can be chosen from among those derived from homologous or heterologous secreted proteins.
- the terminator sequences are isolated from genes homologous to each production organism.
- the terminators of, e.g., rrnB from the ribosomal RNA transcription terminators work well.
- yeast terminators of yeast proteins or, e.g., terminators of filamentous fungal genes can be used.
- the 5' end of the gene or cDNA of the first protein or domain is cloned in frame with the 3 ' end of the signal sequence chosen to generate a proper signal sequence cleavage site. This may require alteration of the DNA and/or amino acid sequence of the first protein.
- the DNA corresponding to the secretable joining peptide is isolated from a gene or cDNA coding for a secretable protein or synthesized and joined at its 5* end to the 3 1 end of the DNA of the first protein.
- the DNA for the second protein or domain is cloned in between the 3 ' end of the linker DNA and the 5' end of the terminator.
- the plasmids used contain a replication origin functional in the production organism and selection markers for the selection of transformed cells from nontransformants.
- Examples of usable host organisms include bacteria, e.g., Escherichia coli MC1061, derivatives of Bacillus subtilis BRB1 (Sibakov.et al. , Eur. J. Biochem . , 145:567-572, 1984), Staphylococcus aureus SA1123 (Iordanescu, J. Bacterial . 12:597-601, 1975) or
- Streptococcus lividans Hopwood et al. , Genetic Manipulation of Streptomyces, a Laboratory Manual, The John Innes Foundation, Norwich 1985); yeasts, e.g., Saccharomvces cerevisiae AH 22 (Mellor et al., Gene. 24:1-14, 1983) and Schizosaccharomyces pombe; filamentous fungi, e.g., Aspercrillus nidulans, Aspercrillus awamori (Ward, Proc. Embo-Alko Workshop on Molecular Biology of Filamentous Fungi , Helsinki, pp. 119-128, 1989), Trichoderma reesei (Penttila et al., Gene 61:155-164, 1987; Harkki et al. , Bio /Technology ,
- Plasmids pLTl see Fig. 4
- pML2 see Fig. 1
- pML3 see Fig. 7
- pML4 see Fig. 7
- pML5 see Fig. 7
- pMLLl see Fig. 3
- pMLL2 see Fig. 4
- pTI8 see Fig. 5
- E. coli strains DH5 ⁇ (F-, endAl, hsdR17(r k -, m k ) , supE44, thi-l, ⁇ -, recAl, • g rA96, reLAl, ⁇ far ⁇ F- laczya)U169, ⁇ 80dJLacZ ⁇ M15) , MC1061 (F-,araD139, ⁇ (ara- leu)7696, ⁇ lacI74, galU-, galK-, hsr-, hsm+, strA) and RV308 (su-, ⁇ lacX74, galSII: :OP308, strA) are used as hosts in transformation and expression studies.
- Table I shows the primers annealing to the sense strand in their complementary form, the relevant amino acid residues incorporated to the primers and the restriction sites (bolded) used for the in-frame fusions. The restriction sites used for the subcloning of the amplified fragments are also indicated.
- Table II shows the production levels of the active single-chain antibody measured by ELISA.
- the oligomers were annealed in 10 ⁇ l of 30 mM Tris-HCl '
- the double-stranded hinge fragment was cloned into a pSP72 vector, from which the Xhol restriction site was deleted (Fig. 1) .
- the hinge linker contained an internal Xhol restriction site, which was used for the in-frame fusion of the V L fragment. Therefore the Xhol restriction site was deleted by cutting the pSP72 vector with Ndel and Xhol and the 5'-protruding ends were filled in with 2 U of Klenow fragment and 40 ⁇ M dNTPs at room temperature for 30 min.
- the reaction mixture was extracted with phenol and precipitated with ethanol.
- the deleted linear pSP72 ⁇ XhoI vector was separated from the short Xhol-Ndel fragment (87 bp) on 1% agarose gel and eluted using GeneClean kit (BIO 101) .
- the linear vector was recircularized with 1 U of T4 DNA ligase (Boehringer) in 10 ⁇ l containing 50 mM Tris-HCl (pH 7.5) , 10 mM MgCl 2/ 5 mM DDT, 100 ⁇ M sper idine, 100 ⁇ M EDTA and 10 mM ATP at 15°C overnight. An aliquot of 1 ⁇ l was transformed into E. coli DH5 ⁇ strain according to Hanahan (Hanahan, J * . Mol . Biol . , 166:557-580, 1983).
- Plasmid pSP72 ⁇ XhoI was cleaved with SacI and Xbal and ligated with 1 ⁇ l and 1 ⁇ l of 1:20 dilution of the double-stranded Sacl-Xbal hinge fragment.
- the authenticity of the cloned hinge insert was confirmed by sequencing using an oligonucleotide primer complementary to the T7 promoter of the vector using sequenase (USB) according to manufactures instructions.
- the hinge construction was designated pML2. Cloning of the Ox V H and V L Domains
- Plasmid pTI8 contains the cDNA fragments of the Fab coding regions of the heavy and light chains of an anti-2-phenyloxazolone binding antibody, IgGl subclass (OX-1 IgGl) .
- Plasmid pLTl contains the cDNA of the complete light ( K) chain coding region cloned into pSP73 vector. The oligonucleotide primers used for the amplifications of single-chain antibody are shown in Table I.
- the reaction mixtures were amplified with 25 repeating cycles of denaturation at 94°C for 1 min. , annealing at 55°C for 1 min. and polymerization at 72°C for 3 min.
- the reaction mixture was extracted with phenol and precipitated with ethanol.
- the amplified V H and V L fragments were cleaved with EcoRI and Hindlll and purified and ligated into pSP73 and pSP72 vectors. These constructions are designated pMLLl and pMLL2.
- the nucleotide sequences of the amplified DNA fragments were confirmed by sequencing using primers complementary to both T7 and SP6 promoters (Fig. 4).
- the final cloning procedure for the Ox scFv (pML5) is presented in Figure 5.
- the pML2 construction containing the CBH I hinge insert, was cleaved with Xhol and Hindlll and ligated with the Xhol-Hindlll V L fragment derived from the pMLL2 construction (Fig. 3b) .
- the V H fragment was cleaved from the pMLLl construction (Fig. 3a) with EcoRI and Xmal and ligated with the pML3 cleaved with the same restriction enzymes.
- the resulting construction, designed pML4, contains the signal sequence of Erwinia carotovora's pectate lyase (pelB) gene (Lei et al., J. Bact., 169: 4379-4383, 1987; Better et al., Science, 240: 1041-1043, 1988) and the whole coding region of the V H -hinge-V L fusion.
- the EcoRI-Hindlll insert of pML4 was ligated into the E. coli pKK233-3 expression vector (Pharmacia) ' under the control of the tac promoter (Fig. 5) . To achieve efficient repression of the tac promoter before induction with IPTG the lacl q gene fragment was cloned into the expression vector (Brosius, Gene, 27:161-172, 1984; Stark, Gene, 51:255-
- OX-1 single-chain antibody expression construction obtained by this procedure is designated pML5.
- Plasmid DNA from several clones of pML5 was transformed into E. coli strains RV308 and MC1061.
- transformant colonies were inoculated into LB medium containing 100 ⁇ g/ml ampicillin and the cultures were grown at 37°C overnight. After 1:50 dilution of a 50 ml culture was incubated at 30°C with shaking 115 rpm until the A ⁇ was between 0.5 and 1.5 before the induction with the
- IPTG isopropyl ⁇ -D-thiogalactopyranoside
- the proteins were transferred electrophoretic- ally onto nitrocellulose filters (Towbin et al., Proc. Natl . Acad . Sci . USA, 76:4350-4354, 1979) and the single-chain antibody was detected with a polyclonal antiserum raised in rabbits against the Fab-moiety of the oxazolone monoclonal antibody " (Ox IgG3 Fab antiserum, Fig. 6) .
- the ELISA assay was essentially performed as described by Hudson and Hay (1976) . Oxazolone coupled to BSA in a ration of 22:1 (Ox-BSA) was used in the assay to detect immunoreactive single- chain antibody.
- the antigen was adsorbed to 96-well microtiter plates (NUNC Immunoplate I) by incubating 100 ⁇ l/well of a Ox-BSA solution containing 46 ⁇ g/ml in 0.1 M carbonate buffer (pH 9.6) at 4°C overnight. Unless indicated otherwise in the text, all the following binding steps were carried out by applying 100 ⁇ l of the respective solution per well and shaking the plates (Wallac Plateshake) at room temperature for 1 h. After each binding step the plates were washed three times in a Platewash (Wallac) with PBS containing 50 mM phosphate buffer (pH 7.4) and 0.9% NaCl.
- VH 1 NH-,-terminal primer of VH. COMPLEMENTARY TO THE ptac PROMOTER REGION 5 ' GCGCCGACATCATAACGGTCC 3 '
- VH 2 COOH-terminal primer of VH
- VL 2 COOH-TERMINAL PRIMER OF VL
- Plasmids pTTc9 (Teeri et al., Gene , 51:43-52, 1987)
- pMA91 Mellor et al., Gene , 24:1-14, 1983
- pMLL4 see Example 1
- pSP72 Promega
- Saccharomvces cerevisiae strain AH 22 leu2-3 leu2-112 his4-519 canl gal2 cin+
- AH 22 leu2-3 leu2-112 his4-519 canl gal2 cin+
- Yeast transformation was carried out according to Keszenman-Pereyra et al. , Curr. Genet . , 13:21-23, 1988) and the transformants were grown in synthetic complete medium lacking leucine (Sherman et al., Methods in Yeast Genetics, A Laboratory Manual . Cold
- DNA manipulations are performed essentially as described in Maniati ⁇ (Maniatis et al., 1982, supra ) and Innis (Innis et al., PCR Protocols . A Guide for Methods and Application , Academic Press, San Diego, CA. , 1990) .
- the cloning procedure of the CBHII signal sequence is represented schematically in Figure 7.
- the DNA sequence for the first oligonucleotide primer for fragment amplification is derived from the region upstream of the EcoRI site of the cbh2 cDNA in pTTc9.
- the second oligonucleotide primer contains the DNA sequence coding for the last four amino acids at the CBHII signal sequence and the first four amino acids of the anti Ox L-chain as well as the Hindlll restriction enzyme recognition sequence as shown in Figure 7.
- the fragment is amplified using the protocols described in Example 1 and subsequently digested with the restriction enzymes EcoRI and Hindlll.
- the digested fragment is cloned into the plasmid pSP72 digested with EcoRI and Hindlll to generate the plasmid pDSl ( Figure 8) .
- the DNA fragment corresponding to the anti-Ox scAb preceded by the cBHII signal sequence is released from pDSl by cleavage with EcoRI and Hindlll, the protruding ends are blunted using SI nuclease, and the fragment is cloned in the yeast expression vector pMA9I cleaved with Bglll and treated with SI nuclease to remove the protruding ends.
- the resulting plasmid pDS2 (Fig. 9) is transformed to E. coli DH5 ⁇ .
- the plasmid DNA is isolated and the construction is checked by sequencing.
- the plasmid pDS2 is transformed into yeast.
- the transformants are grown as described by Penttila et al. (Penttila et al., Yeast , 3:75-83, 1987). After the growth the yeast cells are removed by centrifuga- tion and the activity of the anti-Ox scAbs assayed as in Example 1. Functional single chain antibody is found in the culture medium.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
A method for producing a single chain fusion protein, comprises the steps of: (a) constructing an rDNA molecule coding for a secretable single chain fusion protein having a plurality of functional proteins or protein domains joined by spacer peptides, wherein each such spacer peptide comprises a flexible hinge region of a naturally secreted multidomain protein or an analogue thereof, each adjacent pair of functional proteins or protein domains is joined by a spacer peptide which is not naturally continuous with either functional protein or protein domain to which it is joined, and at least two of the functional proteins or protein domains each independently exhibit physicochemical or biological activity or interact to form an assembly that exhibits physicochemical or biological activity; (b) inserting the rDNA molecule into an expression vector capable of transforming a host organism; (c) transforming a host organism with the resultant vector, and isolating transformants; and (d) culturing the transformants and recovering thesingle chain fusion protein.
Description
1 .
RECOMBINANT SECRETABLE FUSION PROTEINS
Background of the Invention
The present invention relates to a method for producing secretable, biologically active single chain antibodies (scAbs) and other secretable fusion proteins having at least two distinct functional proteins or protein domains linked to form a single polypeptide chain by a flexible linker peptide that is compatible with secretion. Also disclosed are reco binant DNA (rDNA) molecules coding for the fusion proteins, expression vectors and transformed hosts suitable for use in the method, as well as fusion proteins having known or new functions which can be made by the foregoing method. Proteins are the key elements in a wide variety of biotechnological applications today and their importance is expected to increase significantly in the future. Genetic and protein engineering are used widely to study and improve properties of proteins. The key techniques include site-directed and random mutagenesis of the isolated genes or cDNAs coding for the desired proteins and the design and construction of fusion proteins using genetic engineering. Increased understanding of protein properties and function will make possible the design and development of completely novel proteins with new activities.
Today, however, protein engineering and design are largely dependent on the building blocks of protein structures provided by nature. Fusion
proteins have been constructed by linking together two previously independent proteins or protein domains with varying degrees of success. Most commonly, an enzymatically detectable marker protein or an affinity tail facilitating protein purification has been linked to the N- or C-terminus of another protein. In most cases, the fusions have been accomplished by trial and error and only very rarely have attempts been made to design linkers or spacers for the optimization of the fusion point.
Immunoglobulins are large, complicated molecules consisting of two identical light chains and two identical heavy chains which pack together forming the complete immunoglobulin. Each of the heavy chains is composed of 4-5 domains and each of the light chains of 2 domains. Immunoglobulins are unique in their capacity to recognize and bind specifically a wide variety of molecular structures. This capability can be used in many applications for the detection or purification of compounds, for targeting of drugs or marker proteins to specific tissues and for other applications for which specific molecular recognition and binding is important.
In many biotechnical applications, e.g., diagnostics, only the specificity determining regions
(hypervariable domains) of the complete immunoglobulin molecule are required. Fab fragments have been produced proteolytically from intact immunoglobulin or genetically using truncated immunoglobulin genes expressed in bacteria (Better et al. , Science ,
240:10410-1043, 1988; Cabilly et al., Eur. Pat . Appln .
No . 0125023, published 11/14/84, corresponding to U.S.
Serial No . 483,457, filed 4/8/83). There are
published methods for the secretion of the two domains of Fab and Fv fragments as separate peptides in E. coli (Skerra et al., Science , 240:1938-1041, 1988; Better et al., 1988, supra) . Single chain antibodies (scAbs) , in which the antigen binding domains of an immunoglobulin have been linked to a single polypeptide chain using short flexible linkers or spacers have been constructed and produced in bacteria (Bird et al., Science , 242:423- 426, 1988; Ladner et al., PCT application WO 88/01649, 1988; Huston et al., Proc . Natl . Acad . Sci . USA, 85:5879-5883, 1988; Chaudhary et al. , Nature , 339:394- 397, 1989) . In all of the published cases, the single chain antibodies were produced inside of E. coli and required solubilization and subsequent renaturation to obtain active protein, even in cases where the linker peptides were designed for single chain antibodies using all available information of natural linker sequences (Bird et al., 1988, supra ; Ladner et al., 1988, supra) . However, for efficient and economically feasible exploitation of the engineered immuno¬ globulins their secretion to the culture medium of the production host is required.
In the literature there are few examples of linker sequences used in constructing novel fusion proteins. Three examples concern the construction of single chain antibodies using two different types of linker sequences. In two of the examples the linker is a thrice-repeated sequence of Ser-Gly-Gly-Gly-Gly, (Chaudhary et al., 1989, supra; Huston et al, 1988, supra) and in the third example an 18 residues long linker peptide was designed by computer aided modelling (Bird et al., 1988, supra; Ladner et al..
1988, supra) . As noted above, the proteins in each of these cases were produced inside of Escherichia coli cells and solubilization and renaturation were required in order to obtain active fusion protein. Computer-aided molecular modeling programs used today for protein design utilize protein structural information contained in data banks such as the Brookhaven protein databank. The data collected in these data banks are obtained almost exclusively from protein structures solved by x-ray crystallo¬ graphy. Only the structures of very few small proteins (less than 20 KDa) have been solved by nuclear magnetic resonance (NMR) , and these consist of only a single domain. In designing linker sequences suitable for scAbs, the data banks have been used as a course for generating the starting structures (Ladner et al., 1988, supra) . However, it has been observed, e.g., that intact fungal cellulases are not readily crystallizable while the core proteins without the hinge and the substrate recognition domain can be crystallized (Bergfors et al. , J. Mol . Biol . , 209:167- 169, 1989).
In a recent publication, Argos describes the properties and sequences of a number of natural linker sequences found at the domain interfaces of natural proteins included in the Brookhaven protein structural databank (Argos, J. Mol . Biol . , 211:943-958, 1990). These interdomain linkers are short segments containing an average of 5-6 amino acid residues which are mainly small polar amino acid residues with occasional charged residues in between, but which are not large basic and acidic residues like Arg and Glu.
It was suggested that the best general linker consists of Ser, Thr and Gly residues only.
It is likely that longer . hinges are not included since their intrinsic flexibility prevents crystallization. Therefore, important structural information concerning the architecture of flexible regions in proteins is likely to be missing from such data banks and for this reason, the Ladner et al. approach is not likely to be readily adapted to the design of linkers compatible with secretion.
In one recent example, polymers of human CuZn superoxide dismutase were constructed by linking the subunits together using the hinge peptide of an IgAl (Hallewell et al., J. Biol . Chem . , 264:5260-5258, 1989) . In this case, no signal sequence was included and the fusion proteins were produced inside E. coli or yeast cells and were not secreted.
Two recent papers disclose properties of the linker peptides found in bacterial multidomain regulatory proteins and multienzyme complexes (Wootton et al.. Protein Engineering, 2:535-543, 1989; Radford et al., J. Biol . Chem . , 264:767-775, 1989). The Q- linkers described by Wootton et al. are a class of interdomain sequences found in bacterial multidomain regulatory proteins. They are approximately 20 amino acid residues long, relatively rich in Glu, Arg, Gin, Ser and Pro residues and found primarily in prokary- otic two-component regulatory and signal transduction systems. Radford discloses that the interdomain linkers of about 30 amino acids long from E. coli pyruvate dehydrogenase multienzyme complex consist mainly of Ala and Pro residues with occasional Gin, Glu, Lys, Arg, Ser or Thr residues.
The proteins used as sources for the linker peptides are intracellular and there are no published reports of the analysis or design of secretable linker peptides. Yet in industrial applications, secretion of the product is in most cases desired because the recovery of the product is easier and thus cheaper from the culture media than from inside of the producer organism. Perhaps more importantly, the proportion of correctly folded and thus biologically active proteins is generally very low (less than 10%) after solubilization and subsequent renaturation of the proteins produced intracellularly while the major fraction of secreted proteins are as active as the wild type protein. Most if not all of the production organisms commonly used for expressing recombinant proteins are capable of secretion. These include many different bacteria, yeasts, fungi and plant and animal cells. A number of naturally secreted proteins have been cloned and successfully secreted from novel host organisms using signal sequences of naturally secreted proteins of the host or related organisms. However, there are no published examples to date of novel fusion proteins with added heterologous linker sequences which are secreted to the culture medium of the host. This is believed to be in part due to the fact that the rules for protein folding are still very poorly known and therefore it is generally not possible to predict the behavior of a protein or a part thereof based on its amino acid sequence.
Carbohydrate degrading enzymes, e.g., cellulases and glucoamylases are secreted proteins many of which consist of at least two domains; a large
catalytic core domain and a smaller substrate recognition domain (van Tilebeurgh et al., FEBS Lett . , 204:223-227, 1986; Teeri, Doctoral Thesis, Technical Research Centre of Finland, Publications 38, Espoo, 1987; Teeri et al., Gene , 51:43-52, 1987; Knowles et al., Tibtech , 5:255-261, 1987; Tomme et al. , Eur . J . Biochem . , 170:575-581, 1987; Ong et al., Tibtech , 7:239-243, 1989; Svensson et al., Biochem . J. , 264:309-311, 1989). The substrate recognition domains are generally linked to the catalytically active core proteins by linker peptides of about 30-40 amino acids.
In their natural environment the function of the hinge is apparently to act as a flexible spacer separating the two functional domains by a distance of over 100 angstroms. Low angle X-ray analysis of intact cellulases indicates that the hinge region has an almost fully extended conformation (Sch uck et al., Biotechnol . Lett . , 9:397-402, 1986). This hinge peptide sequence is not included in any of the above mentioned compilations of potential linker regions for the construction of novel fusion proteins.
Rational design of novel fusion proteins with two or more domains will require the availability of relatively short and flexible spacer or linker peptides for joining the two proteins or domains without interfering with their function or with their secretion. This will be particularly important for the design and development of fusion proteins in which the association and interaction of two domains is required for activity such * single chain antibodies and which are designed fo; ecretion from the host cells.
Objects of the Invention
One object of the present -invention is to provide a method for producing single chain functional proteins that can be secreted in active form into the periplasmic space of bacteria such as E. coli or into culture media.
Another object of the invention is to produce secretable single chain antibodies. Another object of the invention is to produce multi-functional secretable single chain proteins.
Another object of the invention is to provide isolated DNA sequences for flexible spacer peptides which are readily adaptable to recombinant methodology and easily used to construct rDNA sequences for single chain proteins whose linked functional domains can interact or assemble to form an active conformation of the protein.
Upon further study of the specification, drawings and appended claims, further objects and advantages of this invention will become apparent to those skilled in the art.
Brief Description of the Drawings
The figures of the constructions are not in scale.
Fig. 1 shows the schematic presentation of the CBH I hinge cloning. Fig. 2 shows the sequences of the original CBH
I hinge cDNA and the synthetic CBH I hinge linker used for the cloning of the single-chain antibodies.
Fig. 3A shows the construction of the plasmid pMLLl.
Fig. 3B shows the construction of the plasmid pMLL2. Fig. 4 shows the nucleotide and the amino acid sequence of the VH. linker-VL insert of the plasmid pML5.
Fig. 5 shows the construction of plasmid pML5 used for expression of the single-chain antibody binding oxazolone with CBH I hinge linker.
Fig. 6 shows the result of the western blot analysis after overnight induction of the pML5 in E. coli RV308 and MC1061 strains. Ox Fab has been used as a positive and pKK223-lacI as a negative control (s=supernatant and c=cells) . The difference between the samples si and s2 and also between the samples sll and s22 is only the dilution factor.
Fig. 7 shows the cloning procedure for the CBHII signal sequence. Fig. 8 shows the construction of plasmid pDSl containing the CBHII signal sequence preceding the anti-Ox scAb.
Fig. 9 shows the construction of the plasmid pDS2 in which the scAb construction of Fig. 8 is linked to the yeast expression vector pMA91.
Summary of the Invention
The aforementioned objects of the invention and other objects are achieved by providing a method for producing a single chain fusion protein, comprising the steps of:
(a) constructing an rDNA molecule coding for a secretable single chain fusion protein, said fusion protein comprising a plurality of functional proteins or protein domains joined by spacer peptides; wherein each said spacer peptide comprises a flexible hinge region of a naturally secreted multidomain protein or an analogue thereof; wherein each adjacent pair of functional proteins or protein domains is joined by a spacer peptide which is not naturally continuous with either functional protein or protein domain to which it is joined; and wherein at least two' of said functional proteins or protein domains each independently exhibit physicochemical or biological activity or interact to form an assembly that exhibits physicochemical or biological activity; (b) inserting said rDNA molecule into an expression vector capable of transforming a host organism, wherein said vector further comprises DNA sequences coding for a promoter sequence, a signal sequence, a terminator sequence, a replication origin and at least one selection marker for selecting
transformed host cells, and recovering the resultant vector;
(c) transforming a host organism with said resultant vector, and isolating transformants, said host being capable of expressing said single chain fusion protein and secreting it into the periplasmic space of a bacterial host or into the culture medium; and
(d) culturing said transformants in a culture medium, under secretory conditions, and recovering said single chain fusion protein from said periplasmic space or culture medium.
Single chain secretable functional fusion proteins are provided using the foregoing method, as are rDNA molecules, vectors and transformed host organisms suitable for use therein. In particular, single chain antibodies can be produced and secreted in active antigen-binding form using the method of the invention.
Detailed Description
As used herein, the terms, "immunoglobulin", "heavy chain", "light chain" and "Fab" are used in the same way as in Eur. Pat . Appln . No . 0125023, of Cabilly et al., published 11/14/84, corresponding to U.S . Serial No . 483,457, filed 4/8/83. "Fv" is used to describe those regions of the immunoglobulin molecule which are responsible for antigen or hapten binding. Usually these consist of approximately the first 100 amino acids of the N-termini of the light and the heavy chain of the immunoglobulin molecule.
"Domain" is used to describe an independently folding part of a protein which may or may not function independently. General structural definitions for domain borders in natural proteins are given in Argos, 1988 (Argos, Protein Engineering,
2:101-113, 1988) .
"Single chain" is used to define a molecule in which two or more proteins or protein domains of the same or different origin are joined together to form a continuous amino acid chain synthesized from a single mRNA molecule (transcript) .
"Linker" or "linker peptide" is used to describe an amino acid sequence that extends between adjacent domains in a natural or engineered protein. Linkers useful in the present invention are flexible so that they can be inserted between a variety of proteins or protein domains and used to link these together without interfering with their function. It will be appreciated by the skilled artisan that recombinant DNA techniques can be used to position any such linker peptide sequence between any two domains and that, in particular instances, the linker sequences may be augmented or truncated by a few amino acids in order to facilitate cloning procedures. "Hinge region" is a term used to describe a class of flexible linker peptide sequences that link functional domains in a protein chain, e.g., the peptide sequences that link the substrate binding domain to the catalytic domain in a glycosidase enzyme such as a cellulase or a glucoamylase.
A general method is presented herein for constructing fusion proteins in which two or more functional proteins or protein domains from the same
or different origin are joined together by a flexible linker peptide to form a single chain fusion protein which can be secreted from bacteria, yeasts and fungi and other eukaryotic hosts. Specifically a method is introduced for constructing single chain antibodies which can be secreted to the culture medium of the above mentioned hosts or to the periplasmic space of bacteria such as E. coli.
Linker peptides from naturally secreted multidomain proteins, e.g., glycosidases such as cellulases, glucoamylases, and other secreted proteins are used as spacers between the proteins or protein domains to be joined. The hinge peptide may be heterologous to the functional proteins or protein domains to which it is joined or homologous to one or both, but will not be the continuation of the natural amino acid sequence for the functional protein/domain into the sequence for the hinge region of the same naturally secreted protein. Naturally secreted multidomain proteins are understood to embrace any secretable protein having two or more structurally and/or functionally distinct domains. The domains can be biologically active, either individually, e.g., a substrate binding region of an enzyme, or by association of more than one domain, e.g., immunoglobulin (Ig) heavy and light chain variable regions. The activity can be physicochemical, e.g., conferring certain properties upon the protein such as solubility or adsorption onto a solid support or into a membrane or fiber matrix.
The present inventors have now found that these hinge regions are suitable for linking together proteins or protein domains even when close packing of
the two parts is required as in the case of scAbs. Use of the hinge regions of glycosidases or other secreted proteins to link together the hapten binding domains of immunoglobulins, according to the present invention, results in fusion proteins, secretable to the culture medium of various host organisms, and retaining their antigen-binding properties. It will be appreciated that minor variations in the structure of such linker peptides will normally not alter their basic characteristics and destroy their efficacy as linkers that are compatible both with function and secretion of the peptide of which they are a part.
The constructions are made using recombinant- DNA technology and genes and/or cDNAs of the relevant enzymes and other proteins. The DNA sequences coding for the desired fusion protein are inserted in an appropriate expression vector which subsequently transforms the host organism for expression and secretion. The functional portions of the constructs can be selected from among sequences taken from antibodies, enzymes, hormones, viral envelope glycoproteins or the like.
The linkers used in these constructs have as their common characteristic that they facilitate, or at least do not interfere with, secretory processes that transport transcribed proteins to the bacterial periplasmic space or entirely out of eukaryotic cells and into the surrounding medium. Illustrative linkers are the highly conserved regions of glycosidases that contain mainly proline and serine/threonine residues. The hinge regions of most bacterial cellulases and, e.g., IgA, consist of Thr and/or Ser and Pro residues exclusively while in the hinges of fungal cellulases
and glucoamylases, occasional Ala, Cyε, Gly, Gin or charged residues are permitted in addition to the Ser, Thr and Pro residues.
Without being bound by any theoretical rationale as to why such linkers are so useful for linking proteins and/or protein domains into single chain secretable proteins, it appears that such linker sequences share the properties of flexibility and substantial absence of secondary structure such as pleated sheet or helical segments. The Thr and Ser residues tend to be O-glycosylated even in the bacterial cellulase hinge regions while Asn residues, which are sites for N-glycosylation, are usually not found in these hinges. This suggests that the O- glycosylation might facilitate the secretion or otherwise be important for the function of these hinges. However, glycosylation of the hinge appears not to be an absolute requirement for the secretion as scAb produced in E. coli is secreted. Similarly the hinge regions can be used to construct many other types of fusion proteins such as enzyme-enzyme fusions (bifunctional enzymes) in which two or more enzymatic activities are required in a single polypeptide chain or enzyme-binding protein fusions in which the enzyme is bound to a carrier or support by a specialized binding domain in such a way that its biological activity is not destroyed. Such techniques are much wanted in biotechnology to facilitate protein purification or immobilization. Another example is a fusion protein in which the domains or proteins linked together interact or pack together, and this interaction is required for their correct folding, stability or activity. In
addition to scAbs, an example of such a fusion protein of practical importance is the production of the glycosylated envelope proteins of animal viruses to be used in diagnostics or as vaccines. These envelope glycoproteins contain important antigenic epitopes towards which neutralizing antibodies are elicited (Webster et al., Nature , 296:115-121, 1982; Bruck et al.. Virology, 122:352-362, 1982;- Ho-Terry et al. , Arch. Virol . , 90:145-152, 1986). For instance the two envelope proteins of rubella virus must form a heterodimer in order to be secreted efficiently. For efficient production of these two proteins for medical or diagnostic purposes it would clearly be an advantage if the two proteins involved in the dimerization could be produced as a single polypeptide chain rather than independently. In this way the proportions of the two counterparts would be automatically equal and their secretion and folding would be facilitated by the close presence of the counterpart. The use of the flexible hinge described in this invention also allows the construction of this kind of fusion protein.
It is also possible to make constructs in which more than two proteins or protein domains are linked to a single chain fusion protein by the flexible hinges. For instance a binding domain for cellulose can be linked to either terminus of a desired scAb, thereby facilitating its immobilization on cellulose. In linear form (from N-terminus to C- terminus) such a construct could, e.g., contain: one of the hypervariable domains of the antibody - hinge I - the other hypervariable domain of the antibody - hinge II - the cellulose binding domain. In such a
fusion protein the first hinge allows the packing of the two antibody hypervariable domains onto each other while the second hinge (the same or different from the first one) acts as a spacer to separate the binding domain from the scAb.
Furthermore, it is possible to construct fusion proteins consisting of even more than three domains. For instance bifunctional single chain antibodies could be constructed by linking together two domains of one antibody and two domains of another antibody and subsequently joining these together by yet another hinge. Such a construct thus contains four immunoglobulin domains linked to a single poly¬ peptide via three hinges which allow pairwise packing of the correct immunoglobulin domains and the separa¬ tion of the two scAbs to allow their independent functioning. Similar multidomain proteins exist in nature, e.g., regulatory proteins and the use of the flexible secretable hinges described in this invention now makes possible the construction of novel multi¬ domain proteins by protein engineering.
The genes coding for the desired proteins or protein domains to be linked are isolated and sequenced using standard recombinant-DNA methods. Alternatively the cDNAs corresponding to the said proteins or protein domains are synthesized enzymatically using the specific mRNAs as templates also using standard recombinant-DNA techniques. The DNA sequences can also be synthesized chemically according to a pre-existing sequence or a sequence designed die novo. The DNA sequences of such proteins or protein domains can be synthesized in such a way as
to optimize the codon usage to correspond to that preferred by the host organism used for expression.
The isolated genes or synthesized cDNAs are linked to suitable expression vectors comprising: 1) an efficient promoter; 2) an efficient signal sequence; and 3) terminator sequences. In E. coli, many powerful promoters have been described, e.g., lac, trp or phage lambda Pr and Pi promoters as well as hybrid promoters such as tac. In yeast, powerful promoters such as the phosphoglycerokinase (PGK) or alcohol dehydrogenase (ADH) promoters have been used, and in filamentous fungi promoters of, e.g., hydro- lytic enzymes such as the cellulase, cellobiohydrolase I, of Trichoderma reesei have been used. Any strong promoter functional in the host organism used can in principle be used for the expression of the secretable fusion proteins. It is an advantage, although not an absolute requirement, for the promoter to be kept in a repressed state until it is activated (induced) at a given time point by, e.g., elevated temperature, as in the case of the lambda Pr and PI promoters, or by adding particular inducing agents such as IPTG in the case of the lac or tac promoters of E. coli.
The signal sequences directing secretion to the periplasmic space or to the culture medium of E. coli include those of OmpA protein of E. coli or pectate lyase of Erwinia carotovora but a number of other signal sequences also can be used. In yeast, secretion of foreign proteins can be achieved using the signal sequences of the yeast alpha factor, acid phosphatase or invertase or other signal sequences of the rare secreted proteins of yeast. Also, signal sequences from fungal cellulases have worked in yeast
(Penttila et al., Yeast , 3:75-83, 1987; Penttila et al., Gene , 63:103-112, 1988) and other eukaryotic and possibly prokaryotic signal sequences may be used in yeast as well. In filamentous fungi, the signal sequences of its cellulases have been shown to work with other proteins and signal sequences from other secreted fungal proteins can also be used. Thus, for any selected organism, a signal sequence can be chosen from among those derived from homologous or heterologous secreted proteins.
The terminator sequences are isolated from genes homologous to each production organism. In E. coli, the terminators of, e.g., rrnB from the ribosomal RNA transcription terminators work well. In yeast, terminators of yeast proteins or, e.g., terminators of filamentous fungal genes can be used.
The 5' end of the gene or cDNA of the first protein or domain is cloned in frame with the 3 ' end of the signal sequence chosen to generate a proper signal sequence cleavage site. This may require alteration of the DNA and/or amino acid sequence of the first protein. The DNA corresponding to the secretable joining peptide is isolated from a gene or cDNA coding for a secretable protein or synthesized and joined at its 5* end to the 31 end of the DNA of the first protein. Finally, the DNA for the second protein or domain is cloned in between the 3 ' end of the linker DNA and the 5' end of the terminator.
The plasmids used contain a replication origin functional in the production organism and selection markers for the selection of transformed cells from nontransformants.
Examples of usable host organisms include bacteria, e.g., Escherichia coli MC1061, derivatives of Bacillus subtilis BRB1 (Sibakov.et al. , Eur. J. Biochem . , 145:567-572, 1984), Staphylococcus aureus SA1123 (Iordanescu, J. Bacterial . 12:597-601, 1975) or
Streptococcus lividans (Hopwood et al. , Genetic Manipulation of Streptomyces, a Laboratory Manual, The John Innes Foundation, Norwich 1985); yeasts, e.g., Saccharomvces cerevisiae AH 22 (Mellor et al., Gene. 24:1-14, 1983) and Schizosaccharomyces pombe; filamentous fungi, e.g., Aspercrillus nidulans, Aspercrillus awamori (Ward, Proc. Embo-Alko Workshop on Molecular Biology of Filamentous Fungi , Helsinki, pp. 119-128, 1989), Trichoderma reesei (Penttila et al., Gene 61:155-164, 1987; Harkki et al. , Bio /Technology ,
7:596-603, 1989) , the foregoing being illustrative but not limitative of the many possible host organisms know to the art. In principle, all hosts capable of secretion can be used whether prokaryotic or eukaryotic.
Without further elaboration, it is believed that one skilled in the art can, using the preceding description, utilize the present invention to its fullest extent. The following preferred specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever. In the following examples, all temperatures are set forth uncorrected in degrees Celsius; unless otherwise indicated, all parts and percentages are by weight.
Example 1
ScFv in E. coli
The following plas ids are used to make the construct: Plasmids pLTl (see Fig. 4) pML2 (see Fig. 1) pML3 (see Fig. 7) pML4 (see Fig. 7) pML5 (see Fig. 7) pMLLl (see Fig. 3) pMLL2 (see Fig. 4) pTI8 (see Fig. 5) pSP72 (Promega) pSP73 (Promega)
E. coli strains DH5α (F-, endAl, hsdR17(rk-, mk) , supE44, thi-l,λ-, recAl,• g rA96, reLAl, ΔfarσF- laczya)U169, ø80dJLacZΔM15) , MC1061 (F-,araD139, Δ(ara- leu)7696, ΔlacI74, galU-, galK-, hsr-, hsm+, strA) and RV308 (su-,ΔlacX74, galSII: :OP308, strA) are used as hosts in transformation and expression studies.
Table I shows the primers annealing to the sense strand in their complementary form, the relevant amino acid residues incorporated to the primers and the restriction sites (bolded) used for the in-frame fusions. The restriction sites used for the subcloning of the amplified fragments are also indicated. Table II shows the production levels of the active single-chain antibody measured by ELISA.
Cloning of the CBH I Hinge Linker
DNA manipulations were mainly performed according to standard protocols (Maniatis et al., Molecular cloning. A laboratory manual. Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY, 1982) . The cloning procedure for the CBH I hinge linker is schematically presented in Figure l. Oligomers (45- and 81-mers) were synthesized on an Applied Biosystems DNA synthesizer (Model 391 PCR MATE) using the phosphoramidite chemistry. The nucleotide sequence of the authentic CBHI hinge-region is shown in Fig. 2 as well as the nucleotide sequence of the synthetic CBHI hinge-linker. The synthesized oligonucleotides were provided with restriction enzyme cleavage sites used in the cloning steps. The partial overlapping oligomers were denatured, annealed and extended using Klenow polymerase as follows. The hinge oligomers, 100 p ol of the 45-mer and 100 pmol of the 81-mer, were denatured in 5 μl of water at 65°C for 5 min.
The oligomers were annealed in 10 μl of 30 mM Tris-HCl'
(pH 7.5), 150 mM NaCl, 30 mM MgCl2, 15 mM DTT and 0.1 mg/ml gelatin for 2 h at 37°C and then allowed to cool slowly to room temperature. For the extension at 37°C for 2 h the annealed oligo mixture was diluted to a final volume of 30 μl containing 300 μM deoxynucleoside triphosphates (dNTPs) and 5 U of Klenow fragment of E. coli DNA polymerase I (Boehringer) . The reaction mixture was extracted with phenol, ammonium acetate was added into the water phase to a final concentration of 2 M before ethanol precipitation. The double-stranded hinge fragment was cleaved with 30 units of Xbal and SacI at 37°C
overnight, phenol extracted, ethanol precipitated, lyophilized and dissolved into 10 μl of water.
The double-stranded hinge fragment was cloned into a pSP72 vector, from which the Xhol restriction site was deleted (Fig. 1) . The hinge linker contained an internal Xhol restriction site, which was used for the in-frame fusion of the VL fragment. Therefore the Xhol restriction site was deleted by cutting the pSP72 vector with Ndel and Xhol and the 5'-protruding ends were filled in with 2 U of Klenow fragment and 40 μM dNTPs at room temperature for 30 min. The reaction mixture was extracted with phenol and precipitated with ethanol. The deleted linear pSP72ΔXhoI vector was separated from the short Xhol-Ndel fragment (87 bp) on 1% agarose gel and eluted using GeneClean kit (BIO 101) . The linear vector was recircularized with 1 U of T4 DNA ligase (Boehringer) in 10 μl containing 50 mM Tris-HCl (pH 7.5) , 10 mM MgCl2/ 5 mM DDT, 100 μM sper idine, 100 μM EDTA and 10 mM ATP at 15°C overnight. An aliquot of 1 μl was transformed into E. coli DH5α strain according to Hanahan (Hanahan, J*. Mol . Biol . , 166:557-580, 1983).
Plasmid pSP72ΔXhoI was cleaved with SacI and Xbal and ligated with 1 μl and 1 μl of 1:20 dilution of the double-stranded Sacl-Xbal hinge fragment. The authenticity of the cloned hinge insert was confirmed by sequencing using an oligonucleotide primer complementary to the T7 promoter of the vector using sequenase (USB) according to manufactures instructions. The hinge construction was designated pML2.
Cloning of the Ox VH and VL Domains
For the in-frame fusions with the hinge coding region the 5' end of the Ox VH cDNA and the 3 ' end of the VL cDNA were modified by using the polymerase chain reaction (PCR) method (Saiki et al., Science , 239:487-
494, 1988; Table I). The VH region was amplified using pTI8 construction and the VL region using pLTl construction as templates. Plasmid pTI8 contains the cDNA fragments of the Fab coding regions of the heavy and light chains of an anti-2-phenyloxazolone binding antibody, IgGl subclass (OX-1 IgGl) . Plasmid pLTl contains the cDNA of the complete light ( K) chain coding region cloned into pSP73 vector. The oligonucleotide primers used for the amplifications of single-chain antibody are shown in Table I.
Recognition sequences for EcoRI and Hindlll were incorporated into the primers to allow the subcloning of the amplified fragments into EcoRI-Hindlll cleaved pSP72 and pSP73 vectors (Fig. 3a and Fig. 3b) . The amplification was done in 100 μl volume containing 50 ng of template DNA, 100 pmol of each primer, 200 μl dNTPs, 10 mM Tris-HCl (pH 8.0) , 10 mM MgCl2, 50 mM KCl, 1 mg/ml gelatin, 1 M β-ME and 4 U of Taq I polymerase (USB) . The reaction mixtures were amplified with 25 repeating cycles of denaturation at 94°C for 1 min. , annealing at 55°C for 1 min. and polymerization at 72°C for 3 min. The reaction mixture was extracted with phenol and precipitated with ethanol. The amplified VH and VL fragments were cleaved with EcoRI and Hindlll and purified and ligated into pSP73 and pSP72 vectors. These constructions are designated pMLLl and pMLL2. The nucleotide sequences of the amplified DNA fragments were confirmed by sequencing
using primers complementary to both T7 and SP6 promoters (Fig. 4).
Construction of the Ox VH-Hinge-VL E. coli Expression Vector
The final cloning procedure for the Ox scFv (pML5) is presented in Figure 5. The pML2 construction, containing the CBH I hinge insert, was cleaved with Xhol and Hindlll and ligated with the Xhol-Hindlll VL fragment derived from the pMLL2 construction (Fig. 3b) . The VH fragment was cleaved from the pMLLl construction (Fig. 3a) with EcoRI and Xmal and ligated with the pML3 cleaved with the same restriction enzymes. The resulting construction, designed pML4, contains the signal sequence of Erwinia carotovora's pectate lyase (pelB) gene (Lei et al., J. Bact., 169: 4379-4383, 1987; Better et al., Science, 240: 1041-1043, 1988) and the whole coding region of the VH-hinge-VL fusion. The EcoRI-Hindlll insert of pML4 was ligated into the E. coli pKK233-3 expression vector (Pharmacia) ' under the control of the tac promoter (Fig. 5) . To achieve efficient repression of the tac promoter before induction with IPTG the laclq gene fragment was cloned into the expression vector (Brosius, Gene, 27:161-172, 1984; Stark, Gene, 51:255-
267, 1987; Amann et al., Gene 69: 301-315, 1988). The OX-1 single-chain antibody expression construction obtained by this procedure is designated pML5.
Expression of Single-Chain Antibody Genes
Plasmid DNA from several clones of pML5 was transformed into E. coli strains RV308 and MC1061. For expression studies transformant colonies were
inoculated into LB medium containing 100 μg/ml ampicillin and the cultures were grown at 37°C overnight. After 1:50 dilution of a 50 ml culture was incubated at 30°C with shaking 115 rpm until the A^ was between 0.5 and 1.5 before the induction with the
1 mM isopropyl β-D-thiogalactopyranoside (IPTG) . As a negative control the expression vector without the insert was induced identically. -The growth of the induced culture was followed by measuring the cell densities at different time points after the induction.
For the activity analysis of the OX-1 single- chain antibody, aliquots of 2 ml of the culture were centrifuged. The culture media were concentrated approximately 20 times using Centricon 10 (Amicon) concentrator units and an equal volume of 2x Laemmli's (1970) sample buffer was added. The cell pellets were resuspended into the same sample buffer and the samples were boiled for 3 min and 15 μl of them were subjected to 15% SDS-PAGE analysis according to
Laemmli (1970) .
The proteins were transferred electrophoretic- ally onto nitrocellulose filters (Towbin et al., Proc. Natl . Acad . Sci . USA, 76:4350-4354, 1979) and the single-chain antibody was detected with a polyclonal antiserum raised in rabbits against the Fab-moiety of the oxazolone monoclonal antibody" (Ox IgG3 Fab antiserum, Fig. 6) . The ELISA assay was essentially performed as described by Hudson and Hay (1976) . Oxazolone coupled to BSA in a ration of 22:1 (Ox-BSA) was used in the assay to detect immunoreactive single- chain antibody. The antigen was adsorbed to 96-well microtiter plates (NUNC Immunoplate I) by incubating
100 μl/well of a Ox-BSA solution containing 46 μg/ml in 0.1 M carbonate buffer (pH 9.6) at 4°C overnight. Unless indicated otherwise in the text, all the following binding steps were carried out by applying 100 μl of the respective solution per well and shaking the plates (Wallac Plateshake) at room temperature for 1 h. After each binding step the plates were washed three times in a Platewash (Wallac) with PBS containing 50 mM phosphate buffer (pH 7.4) and 0.9% NaCl. Free adsorption sites in the wells were blocked by incubating 250 μl/well of 0.5% BSA in PBS (BSA- PBS) . Supernatant samples were taken from cultures of E. coli RV308/pML5 and MC1061/pML5 after different times of induction and appropriate dilutions in BSA- PBS were dispersed in duplicate into the antigen- coated wells and incubated for 2 h. Bound single- chain antibody was detected by subsequent incubations with first Ox IgG3 Fab antiserum in BSA-PBS and then with alkaline phosphatase-conjugated goat anti-rabbit igG (H+L) (BioRad) in BSA-PBS. After addition of substrate solution containing 2 mg/ml p- nitrophenylphosphate in diethanolamine-MgCl2-buffer
(Orion) the absorbance in each well was read using a
Multiskan MCC/340 (Labsystems) with an interference filter of 405 nm. The amount of immunoreactive single-chain antibody in each well was determined by comparing the absorbances derived from the samples to a standard curve obtained by incubating known concentrations of the purified isotype monoclonal antibody on the same microtiter plate (Table II) .
TABLE I
VH 1 : NH-,-terminal primer of VH. COMPLEMENTARY TO THE ptac PROMOTER REGION 5 ' GCGCCGACATCATAACGGTCC 3 '
VH 2 : COOH-terminal primer of VH
5 ' ACTTGAAGCTTCCCGGGTGCAGAGACAGTGACCAGAGTCCC 3 '
3 ' TGAACTTCGAAGGGCCCACGTCTCTGTCACTGGTCTCAGGG 5'
VH * — > Hinge G T L V T V S A P G
5 ' GGGACTCTGGTCACTGTCTCTGCACCCGGGAAGCTTCAAGT 3 '
Xmal
VL 2 : COOH-TERMINAL PRIMER OF VL
5 ' ACATCAAGCTTCTATTTCAGCTCCAGCTTG 3 •
3 ' TGTAGTTCGAAGATAAAGTCGAGGTCGAAC 5 " L ^ K L E L K * [
5 ' CAAGCTGGAGCTGAAATAGAAGCTTGATGT 3 '
Hindlll
TABLE II Production levels of active anti Ox ScFv in the culture medium of E. coli as measured by ELISA
plasmid/ strain anti Ox ScFv fm l^
pML5/ RV308 3.4 pML5/ MC1061 0.3 pKK223-lacI9/ RV308 0
Example 2 Anti Ox ScAb in yeast
The following plasmids were used to make the construct: Plasmids pTTc9 (Teeri et al., Gene , 51:43-52, 1987) pMA91 (Mellor et al., Gene , 24:1-14, 1983) pMLL4 (see Example 1) pSP72 (Promega)
The Saccharomvces cerevisiae strain AH 22 (leu2-3 leu2-112 his4-519 canl gal2 cin+) (Mellor et al., 1983, supra) was used as host for transformation and expression studies.
Yeast transformation was carried out according to Keszenman-Pereyra et al. , Curr. Genet . , 13:21-23, 1988) and the transformants were grown in synthetic complete medium lacking leucine (Sherman et al., Methods in Yeast Genetics, A Laboratory Manual . Cold
Spring Harbor, USA, 1983)
Cloning of the Signal Sequence
DNA manipulations are performed essentially as described in Maniatiε (Maniatis et al., 1982, supra ) and Innis (Innis et al., PCR Protocols . A Guide for Methods and Application , Academic Press, San Diego, CA. , 1990) . The cloning procedure of the CBHII signal sequence is represented schematically in Figure 7. The DNA sequence for the first oligonucleotide primer for fragment amplification is derived from the region upstream of the EcoRI site of the cbh2 cDNA in pTTc9. The second oligonucleotide primer contains the DNA
sequence coding for the last four amino acids at the CBHII signal sequence and the first four amino acids of the anti Ox L-chain as well as the Hindlll restriction enzyme recognition sequence as shown in Figure 7. The fragment is amplified using the protocols described in Example 1 and subsequently digested with the restriction enzymes EcoRI and Hindlll. The digested fragment is cloned into the plasmid pSP72 digested with EcoRI and Hindlll to generate the plasmid pDSl (Figure 8) .
Construction of the Yeast Expression Vector
The DNA fragment corresponding to the anti-Ox scAb preceded by the cBHII signal sequence is released from pDSl by cleavage with EcoRI and Hindlll, the protruding ends are blunted using SI nuclease, and the fragment is cloned in the yeast expression vector pMA9I cleaved with Bglll and treated with SI nuclease to remove the protruding ends. The resulting plasmid pDS2 (Fig. 9) is transformed to E. coli DH5α. The plasmid DNA is isolated and the construction is checked by sequencing.
Expression of the ScAbs in veast The plasmid pDS2 is transformed into yeast.
The transformants are grown as described by Penttila et al. (Penttila et al., Yeast , 3:75-83, 1987). After the growth the yeast cells are removed by centrifuga- tion and the activity of the anti-Ox scAbs assayed as in Example 1. Functional single chain antibody is found in the culture medium.
From the foregoing description, one of ordinary skill in the art can easily ascertain the
essential characteristics of this invention and, without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions.
Claims
1. A recombinant DNA (rDNA)' molecule coding for a secretable single chain fusion protein, said fusion protein comprising a plurality of functional proteins or protein domains joined by spacer peptides; wherein each said spacer peptide comprises a flexible hinge region of a naturally secreted multidomain protein or an analogue thereof; wherein each adjacent pair of functional proteins or protein domains is joined by a spacer peptide which is not naturally continuous with either functional protein or protein domain to which it is joined; and wherein at least two of said functional proteins or protein domains each independently exhibit a physicochemical or biological activity or interact to form an assembly that exhibits a physicochemical or biological activity.
2. The rDNA molecule of claim* 1, coding for a secretable single chain fusion protein comprising two functional proteins or protein domains joined by a single spacer peptide.
3. The rDNA molecule of claim 2, wherein said fusion protein is a single chain antibody.
4. The rDNA molecule of claim 3, wherein the functional domains of said fusion protein comprise the antigen-binding regions of the heavy and light chains of an antibody or antibody fragment.
5. The rDNA molecule of claim 1, wherein one or more of the peptide spacers of said fusion protein is the hinge region of a glycosidase enzyme.
6. The rDNA molecule of claim 5, wherein said glycosidase is a cellulase or a glucoamylase.
7. The rDNA molecule of claim 6, wherein said glycosidase is cellobiohydrolase I.
8. The rDNA molecule of claim 1, wherein one or more of the peptide spacers of said fusion protein is the hinge region of an immunoglobulin.
9. The rDNA molecule of claim 1, wherein each of said peptide spacers of said fusion protein is heterologous to each of the proteins or protein domains to which it is joined.
10. The rDNA molecule of claim 1, wherein the functional proteins or protein domains of said fusion protein are selected from the group consisting of antibody, enzyme, hormone and viral envelope glycoprotein functional proteins or protein domains.
11. An isolated single chain fusion protein coded for by the rDNA molecule of any of claims 1-10.
12. An elongated rDNA molecule, comprising the rDNA sequence of the rDNA molecule of claim 1 coding for said single chain fusion protein, and further comprising one or more sequences selected from the group consisting of a promoter sequence, a signal sequence and a terminator sequence.
13. A recombinant expression vector capable of transforming a host organism, wherein said vector comprises the elongated rDNA molecule of claim 12, including said rDNA sequence coding for said single chain fusion protein and DNA sequences coding for a promoter sequence, a signal sequence and a terminator sequence, and wherein said vector further comprises a replication origin and at least one selection marker for selecting transformed host cells.
14. A host transformant transformed by the expression vector of claim 13 and capable of expressing and secreting said single chain fusion protein.
15. The transformed host of claim 14 which is selected from the group consisting of bacteria, yeast and filamentous fungi.
16. The transformed host of claim 14 which is a eukaryote.
17. The transformed host of claim 14, wherein said host is selected from the group consisting of Escherichia coli. Bacillus subtilis, Staphylococcus aureus, Streptococcus lividans," Saccharomyces cerevisiae, Schizosaccharomvces pombe, Aspergillus nidulans, Aspergillus awamori and Trichoder a reesei.
18. The transformed host of claim 17, wherein said host is E.coli transformed by an expression vector comprising a promoter selected from the group consisting of lac, trp, phage lambda Pr, phage lambda PI and tac promoters.
19. The transformed host of claim 17, wherein said host is E. coli transformed by an expression vector comprising the signal sequence for OmpA protein or the signal sequence for the pectate lyase of Erwinia carotovora.
20. The transformed host of claim 15, wherein said host is yeast transformed by an expression vector comprising the PGK or ADH promoter.
21. The transformed host of claim 15, wherein said host is yeast transformed by an expression vector comprising the yeast alpha factor, acid phosphatase or invertase signal sequence.
22. The transformed host of claim 15, wherein said host is yeast transformed by an expression vector comprising the fungal cellulase signal sequence.
23. The transformed host of claim 15, wherein said host is a filamentous fungus transformed by an expression vector comprising a cellulase promoter.
24. The transformed host of claim 23, wherein said cellulase is the cellobiohydrolase I of Trichoderma reesei.
25. The transformed host of claim 15, wherein said host is a filamentous fungus transformed by an expression vector comprising a cellulase signal sequence.
26. A method for producing a single chain fusion protein, comprising the steps of:
(a) constructing an rDNA molecule coding for a secretable single chain fusion protein, said fusion protein comprising a plurality of functional proteins or protein domains joined by spacer peptides; wherein each said spacer peptide comprises a flexible hinge region of a naturally secreted multidomain protein or an analogue thereof; wherein each adjacent pair of functional proteins or protein domains is joined by a spacer peptide which is not naturally continuous with either functional protein or protein domain to which it is joined; and wherein at least two of said functional proteins or protein domains each independently exhibit physicochemical or biological activity or interact to form an assembly that exhibits physicochemical or biological activity;
(b) inserting said rDNA molecule into an expression vector capable of transforming a host organism, wherein said vector further comprises DNA sequences coding for a promoter sequence, a signal sequence, a terminator sequence, a replication origin and at least one selection marker for selecting transformed host cells, and recovering the resultant vector;
(c) transforming a host organism with said resultant vector, and isolating transformants, said host being capable of expressing said single chain fusion protein and secreting it into the periplasmic space of a bacterial host or into the culture medium; and
(d) culturing said transformants in a culture medium, under secretory conditions, and recovering said single chain fusion protein from said periplasmic space or culture medium.
27. The method of claim 26, wherein in step (a) (1) , said rDNA molecule codes for a fusion protein which is a single chain antibody.
28. The method of claim 27, wherein the functional domains of said fusion protein comprise the antigen-binding regions of the heavy and light chains of an antibody or antibody fragment.
29. The method of claim 26, wherein in step (a) , said rDNA molecule codes for a fusion protein, a peptide spacer of which is the hinge region of a glycosidase enzyme.
30. The method claim 29, wherein said glycosidase is a cellulase or a glucoamylase.
31. The method of claim 30, wherein said glycosidase is cellobiohydrolase I.
32. The method of claim 26, wherein in step (a) , said rDNA molecule codes for a fusion protein, a peptide spacer of which is the hinge region of an immunoglobulin.
33. The method of claim 26, wherein in step (a) , said rDNA molecule codes for a fusion protein, the functional proteins or protein domains of which are selected from the group consisting of antibody, enzyme, hormone and viral envelope glycoprotein functional proteins or protein domains.
34. The method of claim 26, wherein in step (c) , said transformed host is selected from the group consisting of bacteria, yeast and filamentous fungi.
35. The method of claim 26, wherein in step (c) , said transformed host is a eukaryote.
36. The method of claim 26, wherein said host is selected from the group consisting of Escherichia coli. Bacillus subtilis, Staphylococcus aureusf Streptococcus lividans, Saccharomyces cerevisiae, Schizosaccharo yces pombe, Aspergillus nidulansr
Aspergillus awamori and Trichoderma reesei.
37. The method of claim 36, wherein said host is E. coli, and wherein in step (b) , said expression vector comprises a promoter selected from the group consisting of lac, trp, phage lambda Pr, phage lambda PI and tac promoters.
38. The method of claim 36, wherein said host is E. coli, and wherein in step (b) , said expression vector comprises the signal sequence for OmpA protein or the signal sequence for the pectate lyase of Erwinia carotovora.
39. The method of claim 34, wherein in step
(c) , said host is yeast transformed in step (b) by an expression vector comprising the PGK or ADH promoter.
40. The method of claim 34, wherein in step (c) , said host is yeast transformed in step (b) by an expression vector comprising the yeast alpha factor, acid phosphatase or invertase signal sequence.
41. The method of claim 34, wherein in step (c) , said host is yeast transformed in step (b) by an expression vector comprising the fungal cellulase signal sequence.
42. The method of claim 34, wherein in step (c) , said host is a filamentous fungus transformed in step (b) by an expression vector comprising a cellulase promoter.
43. The method of claim 42, wherein said cellulase is the cellobiohydrolase I of Trichoderma reesei.
44. The method of claim 34, wherein in step
(c) , said host is a filamentous fungus transformed in step (b) by an expression vector comprising a cellulase signal sequence.
45. An isolated single chain fusion protein produced by the method of any of claims 26-44.
46. An isolated, purified DNA fragment coding for a hinge region peptide of a glycosidase, and optionally further comprising short flanking sequences adapted for use in inserting said fragment into a recombinant DNA construct.
47. The DNA fragment of claim 46, wherein said hinge region peptide is the hinge region of a cellulase or glucoamylase.
48. The DNA fragment of claim 46, wherein said hinge region peptide is the hinge region of an immunoglobulin.
49. A recombinant DNA cloning vector, comprising a promoter, a signal sequence, one or more sequences each coding for a hinge region peptide of a glycosidase flanked at either end by unique multicloning sequences, a transcription terminator, an origin for replication and a selectable marker.
50. The vector of claim 49, wherein said hinge region peptide is the hinge region of a cellulase or glucoamylase.
51. The vector of claim 49, wherein said hinge region peptide is the hinge region of an immunoglobulin.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FI913434A FI913434A7 (en) | 1990-07-16 | 1991-07-16 | RECOMBINANT FUSIONSPROTEINER VILKA UTSOENDRAS. |
| PCT/FI1991/000225 WO1993002198A1 (en) | 1990-07-16 | 1991-07-23 | Recombinant secretable fusion proteins |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US55275190A | 1990-07-16 | 1990-07-16 | |
| PCT/FI1991/000225 WO1993002198A1 (en) | 1990-07-16 | 1991-07-23 | Recombinant secretable fusion proteins |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO1993002198A1 true WO1993002198A1 (en) | 1993-02-04 |
Family
ID=8532898
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/FI1991/000225 WO1993002198A1 (en) | 1990-07-16 | 1991-07-23 | Recombinant secretable fusion proteins |
Country Status (2)
| Country | Link |
|---|---|
| FI (1) | FI913434A7 (en) |
| WO (1) | WO1993002198A1 (en) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5801064A (en) * | 1995-12-04 | 1998-09-01 | Foresman; Mark D. | Assay methods and reagents for detecting autoantibodies |
| EP1002861A1 (en) * | 1998-10-26 | 2000-05-24 | Unilever Plc | Antigen-binding proteins comprising a linker which confers restricted conformational flexibility |
| US6103521A (en) * | 1995-02-06 | 2000-08-15 | Cell Genesys, Inc. | Multispecific chimeric receptors |
| US6838254B1 (en) | 1993-04-29 | 2005-01-04 | Conopco, Inc. | Production of antibodies or (functionalized) fragments thereof derived from heavy chain immunoglobulins of camelidae |
| EP1486566A4 (en) * | 2002-03-01 | 2005-10-12 | Japan Enviro Chemicals Ltd | Proteins capable of binding to female sex hormones and process for producing the same |
| US7833752B2 (en) | 2007-01-31 | 2010-11-16 | Pfenex, Inc. | Bacterial leader sequences for increased expression |
| US7985564B2 (en) | 2003-11-21 | 2011-07-26 | Pfenex, Inc. | Expression systems with sec-system secretion |
-
1991
- 1991-07-16 FI FI913434A patent/FI913434A7/en not_active Application Discontinuation
- 1991-07-23 WO PCT/FI1991/000225 patent/WO1993002198A1/en active Application Filing
Non-Patent Citations (6)
| Title |
|---|
| BIO/TECHNOLOGY, Vol. 9, 1991, GEORGE T. DAVIS et al.: "Single Chain (SCA) Encoding Genes: One-Step Construction and Expression in Eukaryotic Cells", see page 165 - page 169. * |
| DIALOG INFORMATION SERVICES, File 55, Biosis 85-92, Dialog Accession No. 9031001, Biosis Accession No. 93016001, TAKKINEN K et al.: "An Acitve Single-Chain Antibody Containing a Cellulase Linker Domain is Secreted by Escherichia-Coli"; & PROTEIN ENG 4 (7), 1991, 837-842 * |
| GENE, Vol. 93, 1990, PETRA MARKMEYER et al.: "The pAX Plasmids: New Gene-Fusion Vectors for Sequencing, Mutagenesis and Expression of Proteins in Escherichia Coli", see page 129 - page 134. * |
| KEYSTONE SYMPOSIA ON MOLECULAR & CELLULAR BIOLOGY 20, 8-14 April 1991, (Journal of Cellular Biochemistry Supplement O (15 part G)), 1991, TUULA T. TEERI et al.: "Strategies for Secretion of Antibody Fragments and Single Chain Antibodies in Escherichia Coli", p. 217, Abstract R 344. * |
| NATURE, Vol. 339, 1989, VIJAY K. CHAUDHARY et al.: "A Recombinant Immunotoxin Consisting of Two Antibody Variable Domains Fused to Pseudomonas Exotoxin", see page 394 - page 397. * |
| SCIENCE, Vol. 242, 1988, ROBERT E. BIRD et al.: "Single-Chain Antigen-Binding Proteins", see page 423 - page 426. * |
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6838254B1 (en) | 1993-04-29 | 2005-01-04 | Conopco, Inc. | Production of antibodies or (functionalized) fragments thereof derived from heavy chain immunoglobulins of camelidae |
| US7794981B2 (en) | 1993-04-29 | 2010-09-14 | Bac Ip B.V. | Prosuction of antibodies or (functionalized) fragments thereof derived from heavy chain immunoglobulins of camelidae |
| US6103521A (en) * | 1995-02-06 | 2000-08-15 | Cell Genesys, Inc. | Multispecific chimeric receptors |
| US5801064A (en) * | 1995-12-04 | 1998-09-01 | Foresman; Mark D. | Assay methods and reagents for detecting autoantibodies |
| EP1002861A1 (en) * | 1998-10-26 | 2000-05-24 | Unilever Plc | Antigen-binding proteins comprising a linker which confers restricted conformational flexibility |
| WO2000024884A3 (en) * | 1998-10-27 | 2000-08-03 | Unilever Plc | Antigen-binding proteins comprising a linker which confers restricted conformational flexibility |
| EP1486566A4 (en) * | 2002-03-01 | 2005-10-12 | Japan Enviro Chemicals Ltd | Proteins capable of binding to female sex hormones and process for producing the same |
| US7985564B2 (en) | 2003-11-21 | 2011-07-26 | Pfenex, Inc. | Expression systems with sec-system secretion |
| US7833752B2 (en) | 2007-01-31 | 2010-11-16 | Pfenex, Inc. | Bacterial leader sequences for increased expression |
Also Published As
| Publication number | Publication date |
|---|---|
| FI913434A7 (en) | 1992-01-17 |
| FI913434A0 (en) | 1991-07-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5863765A (en) | Production in yeasts of stable antibody fragments | |
| JP6184463B2 (en) | Proteins having affinity for immunoglobulin and immunoglobulin binding affinity ligands | |
| JP3691055B2 (en) | Immobilized proteins with specific binding force and their use in various processes and articles | |
| Pack et al. | Improved bivalent miniantibodies, with identical avidity as whole antibodies, produced by high cell density fermentation of Escherichia coli | |
| AU729449B2 (en) | Materials and methods relating to the attachment and display of substances on cell surfaces | |
| KR20130028083A (en) | Proteins that specifically bind to immunoglobulins and affinity ligands of immunoglobulin binding | |
| ZA200703235B (en) | Polypeptides with a signal sequence comprising an fivsi motif and olynucleotides encoding therefor | |
| SK776486A3 (en) | Dna construct, a transformed cell and method for producing an extraneous protein in the transformed cell | |
| JPH07110233B2 (en) | Releasable East Promoter | |
| WO2017143839A1 (en) | Method for synthesis of pharmaceutical recombinant protein based on intein | |
| WO1993002198A1 (en) | Recombinant secretable fusion proteins | |
| Einhauer et al. | Expression and purification of homogenous proteins in Saccharomyces cerevisiae based on ubiquitin-FLAG fusion | |
| Sano et al. | [19] Streptavidin-containing chimeric proteins: design and production | |
| JP2842693B2 (en) | Methods and DNA expression systems for overexpression of proteins in host cells | |
| JP6465794B2 (en) | Fd chain gene or L chain gene capable of increasing secretion amount of Fab type antibody | |
| WO2004096859A1 (en) | Chaperonine-target protein complex, method of producing the same, method of stabilizing target protein, method of immobilizing target protein, method of analyzing the structure of target protein, sustained-release preparation and method of producing antibody against target protein | |
| WO2011046218A1 (en) | Hansenula polymorpha capable of producing antibody, process for production of antibody utilizing same, and antibody produced from same | |
| Paluh et al. | Characterization of Neurospora CPC1, a bZIP DNA-binding protein that does not require aligned heptad leucines for dimerization | |
| US5646037A (en) | Yeast vectors | |
| JP7349684B2 (en) | Novel cells and methods for producing target proteins using them | |
| WO2008023689A1 (en) | Method of producing fused protein | |
| AU4300300A (en) | Microbial protein expression system | |
| EP3438259A1 (en) | Methanol-utilizing enzyme-derived novel protein and method for producing intended protein using same | |
| JPS6156078A (en) | Secretion manifestation vector having yeast as host | |
| JP7012663B2 (en) | New host cell and method for producing the target protein using it |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AT AU BB BG BR CA CH CS DE DK ES GB HU JP KP KR LK LU MC MG MN MW NL NO PL RO SD SE SU US |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IT LU NL SE BF BJ CF CG CI CM GA GN ML MR SN TD TG |
|
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| NENP | Non-entry into the national phase |
Ref country code: CA |
|
| 122 | Ep: pct application non-entry in european phase |