CA2322105A1 - Antibiotic production (ii) - Google Patents
Antibiotic production (ii) Download PDFInfo
- Publication number
- CA2322105A1 CA2322105A1 CA002322105A CA2322105A CA2322105A1 CA 2322105 A1 CA2322105 A1 CA 2322105A1 CA 002322105 A CA002322105 A CA 002322105A CA 2322105 A CA2322105 A CA 2322105A CA 2322105 A1 CA2322105 A1 CA 2322105A1
- Authority
- CA
- Canada
- Prior art keywords
- nucleic acid
- acid sequence
- ala
- polypeptide
- accb
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 30
- 230000003115 biocidal effect Effects 0.000 title claims description 16
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 99
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 56
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 56
- 241000187432 Streptomyces coelicolor Species 0.000 claims abstract description 49
- 230000000694 effects Effects 0.000 claims abstract description 43
- 230000014509 gene expression Effects 0.000 claims abstract description 40
- 238000000034 method Methods 0.000 claims abstract description 40
- 241000187747 Streptomyces Species 0.000 claims abstract description 39
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 claims abstract description 36
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 claims abstract description 36
- 108010018763 Biotin carboxylase Proteins 0.000 claims abstract description 36
- OFOBLEOULBTSOW-UHFFFAOYSA-L Malonate Chemical compound [O-]C(=O)CC([O-])=O OFOBLEOULBTSOW-UHFFFAOYSA-L 0.000 claims abstract description 30
- 229930001119 polyketide Natural products 0.000 claims abstract description 20
- 230000001965 increasing effect Effects 0.000 claims abstract description 19
- 238000012258 culturing Methods 0.000 claims abstract description 7
- 239000003242 anti bacterial agent Substances 0.000 claims abstract description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 20
- 101100000438 Mus musculus Acacb gene Proteins 0.000 claims description 57
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 55
- 229920001184 polypeptide Polymers 0.000 claims description 53
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 53
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 38
- 230000015572 biosynthetic process Effects 0.000 claims description 21
- 239000013598 vector Substances 0.000 claims description 13
- 230000001105 regulatory effect Effects 0.000 claims description 9
- 241000350481 Pterogyne nitens Species 0.000 claims description 8
- 241001446311 Streptomyces coelicolor A3(2) Species 0.000 claims description 8
- 230000001939 inductive effect Effects 0.000 claims description 8
- 241000894007 species Species 0.000 claims description 8
- 230000004071 biological effect Effects 0.000 claims description 7
- 239000002253 acid Substances 0.000 claims description 6
- 230000000295 complement effect Effects 0.000 claims description 6
- 241000187176 Streptomyces violaceoruber Species 0.000 claims description 3
- 238000010348 incorporation Methods 0.000 claims description 3
- 238000003786 synthesis reaction Methods 0.000 claims description 3
- 241000187398 Streptomyces lividans Species 0.000 claims description 2
- 241000892502 Streptomyces lividans 1326 Species 0.000 claims description 2
- 239000011148 porous material Substances 0.000 claims description 2
- 150000003881 polyketide derivatives Chemical class 0.000 claims 13
- 239000006143 cell culture medium Substances 0.000 claims 2
- 241001312734 Streptomyces parvulus Species 0.000 claims 1
- 150000001412 amines Chemical class 0.000 claims 1
- 229930000044 secondary metabolite Natural products 0.000 abstract description 19
- 229940088710 antibiotic agent Drugs 0.000 abstract description 5
- 125000000830 polyketide group Chemical group 0.000 abstract description 4
- 239000000463 material Substances 0.000 abstract description 3
- 125000002252 acyl group Chemical group 0.000 abstract 1
- 101150013885 accB gene Proteins 0.000 description 58
- 108090000623 proteins and genes Proteins 0.000 description 56
- 108010064926 acyl-CoA carboxylase Proteins 0.000 description 37
- 239000012634 fragment Substances 0.000 description 37
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 34
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 34
- 102000004190 Enzymes Human genes 0.000 description 30
- 108090000790 Enzymes Proteins 0.000 description 30
- 101100135725 Haloferax mediterranei (strain ATCC 33500 / DSM 1411 / JCM 8866 / NBRC 14739 / NCIMB 2177 / R-4) pccA gene Proteins 0.000 description 30
- 101100322107 Neisseria meningitidis serogroup B (strain MC58) accA2 gene Proteins 0.000 description 30
- 239000000284 extract Substances 0.000 description 26
- 150000001413 amino acids Chemical group 0.000 description 25
- 239000000523 sample Substances 0.000 description 25
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 24
- 230000012010 growth Effects 0.000 description 24
- 238000002474 experimental method Methods 0.000 description 21
- 108091034117 Oligonucleotide Proteins 0.000 description 19
- 239000002609 medium Substances 0.000 description 19
- 239000013612 plasmid Substances 0.000 description 18
- 102000004169 proteins and genes Human genes 0.000 description 18
- 101710180724 Propionyl-CoA carboxylase beta chain Proteins 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 16
- 241000588724 Escherichia coli Species 0.000 description 13
- 230000005526 G1 to G0 transition Effects 0.000 description 13
- 101710163270 Nuclease Proteins 0.000 description 13
- 101150036991 pccB gene Proteins 0.000 description 13
- 238000000338 in vitro Methods 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 102000007132 Carboxyl and Carbamoyl Transferases Human genes 0.000 description 11
- 108010072957 Carboxyl and Carbamoyl Transferases Proteins 0.000 description 11
- 230000007704 transition Effects 0.000 description 11
- 238000011144 upstream manufacturing Methods 0.000 description 11
- 238000007792 addition Methods 0.000 description 10
- 244000005700 microbiome Species 0.000 description 10
- 230000037361 pathway Effects 0.000 description 10
- 108091026890 Coding region Proteins 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 8
- 101150006252 accA-1 gene Proteins 0.000 description 8
- 229910052799 carbon Inorganic materials 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 230000010354 integration Effects 0.000 description 8
- 238000013507 mapping Methods 0.000 description 8
- 238000002156 mixing Methods 0.000 description 8
- 230000035899 viability Effects 0.000 description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 108020004999 messenger RNA Proteins 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 230000002103 transcriptional effect Effects 0.000 description 7
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 6
- 239000000287 crude extract Substances 0.000 description 6
- 235000014113 dietary fatty acids Nutrition 0.000 description 6
- 229930195729 fatty acid Natural products 0.000 description 6
- 239000000194 fatty acid Substances 0.000 description 6
- 150000004665 fatty acids Chemical group 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 5
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 5
- 108700028369 Alleles Proteins 0.000 description 5
- 241000589194 Rhizobium leguminosarum Species 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- 230000006698 induction Effects 0.000 description 5
- QAQREVBBADEHPA-IEXPHMLFSA-N propionyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QAQREVBBADEHPA-IEXPHMLFSA-N 0.000 description 5
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 5
- 241000186361 Actinobacteria <class> Species 0.000 description 4
- VTIKDEXOEJDMJP-UHFFFAOYSA-N Actinorhodine Natural products CC1OC(CC(=O)O)CC2=C1C(=O)c3c(O)c(cc(O)c3C2=O)c4cc(O)c5C(=O)C6=C(C(C)OC(CC(=O)O)C6)C(=O)c5c4O VTIKDEXOEJDMJP-UHFFFAOYSA-N 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- 102100026665 Malonate-CoA ligase ACSF3, mitochondrial Human genes 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 4
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- HIYSWASSDOXZLC-UHFFFAOYSA-N Undecylprodigiosin Natural products CCCCCCCCCCCc1ccc(C=C2N=C(C=C2OC)c2ccc[nH]2)[nH]1 HIYSWASSDOXZLC-UHFFFAOYSA-N 0.000 description 4
- 101150099430 accD5 gene Proteins 0.000 description 4
- VTIKDEXOEJDMJP-WYUUTHIRSA-N actinorhodin Chemical compound C([C@@H](CC(O)=O)O[C@@H]1C)C(C(C2=C(O)C=3)=O)=C1C(=O)C2=C(O)C=3C(C(=C1C2=O)O)=CC(O)=C1C(=O)C1=C2[C@@H](C)O[C@H](CC(O)=O)C1 VTIKDEXOEJDMJP-WYUUTHIRSA-N 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 230000003698 anagen phase Effects 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108091006004 biotinylated proteins Proteins 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 230000002759 chromosomal effect Effects 0.000 description 4
- 230000021615 conjugation Effects 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 108010089734 malonyl-CoA synthetase Proteins 0.000 description 4
- 239000002207 metabolite Substances 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- -1 polyketide compounds Chemical class 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 230000028070 sporulation Effects 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- HIYSWASSDOXZLC-HKOYGPOVSA-N undecylprodigiosin Chemical compound N1C(CCCCCCCCCCC)=CC=C1\C=C\1C(OC)=CC(C=2NC=CC=2)=N/1 HIYSWASSDOXZLC-HKOYGPOVSA-N 0.000 description 4
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 3
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 3
- 101100256938 Streptomyces griseus sigA gene Proteins 0.000 description 3
- NSFFHOGKXHRQEW-UHFFFAOYSA-N Thiostrepton B Natural products N1C(=O)C(C)NC(=O)C(=C)NC(=O)C(C)NC(=O)C(C(C)CC)NC(C(C2=N3)O)C=CC2=C(C(C)O)C=C3C(=O)OC(C)C(C=2SC=C(N=2)C2N=3)NC(=O)C(N=4)=CSC=4C(C(C)(O)C(C)O)NC(=O)C(N=4)CSC=4C(=CC)NC(=O)C(C(C)O)NC(=O)C(N=4)=CSC=4C21CCC=3C1=NC(C(=O)NC(=C)C(=O)NC(=C)C(N)=O)=CS1 NSFFHOGKXHRQEW-UHFFFAOYSA-N 0.000 description 3
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 3
- 108700009124 Transcription Initiation Site Proteins 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 150000007942 carboxylates Chemical class 0.000 description 3
- 230000021523 carboxylation Effects 0.000 description 3
- 238000006473 carboxylation reaction Methods 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 230000009918 complex formation Effects 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 101150086151 hrdB gene Proteins 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 230000002018 overexpression Effects 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 229930188070 thiostrepton Natural products 0.000 description 3
- NSFFHOGKXHRQEW-AIHSUZKVSA-N thiostrepton Chemical compound C([C@]12C=3SC=C(N=3)C(=O)N[C@H](C(=O)NC(/C=3SC[C@@H](N=3)C(=O)N[C@H](C=3SC=C(N=3)C(=O)N[C@H](C=3SC=C(N=3)[C@H]1N=1)[C@@H](C)OC(=O)C3=CC(=C4C=C[C@H]([C@@H](C4=N3)O)N[C@H](C(N[C@@H](C)C(=O)NC(=C)C(=O)N[C@@H](C)C(=O)N2)=O)[C@@H](C)CC)[C@H](C)O)[C@](C)(O)[C@@H](C)O)=C\C)[C@@H](C)O)CC=1C1=NC(C(=O)NC(=C)C(=O)NC(=C)C(N)=O)=CS1 NSFFHOGKXHRQEW-AIHSUZKVSA-N 0.000 description 3
- 229940063214 thiostrepton Drugs 0.000 description 3
- NSFFHOGKXHRQEW-OFMUQYBVSA-N thiostrepton A Natural products CC[C@H](C)[C@@H]1N[C@@H]2C=Cc3c(cc(nc3[C@H]2O)C(=O)O[C@H](C)[C@@H]4NC(=O)c5csc(n5)[C@@H](NC(=O)[C@H]6CSC(=N6)C(=CC)NC(=O)[C@@H](NC(=O)c7csc(n7)[C@]8(CCC(=N[C@@H]8c9csc4n9)c%10nc(cs%10)C(=O)NC(=C)C(=O)NC(=C)C(=O)N)NC(=O)[C@H](C)NC(=O)C(=C)NC(=O)[C@H](C)NC1=O)[C@@H](C)O)[C@](C)(O)[C@@H](C)O)[C@H](C)O NSFFHOGKXHRQEW-OFMUQYBVSA-N 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- WDVIDPRACNGFPP-QWRGUYRKSA-N (2s)-2-[[(2s)-6-amino-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NCC(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WDVIDPRACNGFPP-QWRGUYRKSA-N 0.000 description 2
- 241000023308 Acca Species 0.000 description 2
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 2
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- 108090000489 Carboxy-Lyases Proteins 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 2
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- 101710088194 Dehydrogenase Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 240000001680 Eleocharis parvula Species 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 101100135734 Haloferax mediterranei (strain ATCC 33500 / DSM 1411 / JCM 8866 / NBRC 14739 / NCIMB 2177 / R-4) pccB gene Proteins 0.000 description 2
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- 108010081805 Malonyl-CoA decarboxylase Proteins 0.000 description 2
- 102100029461 Malonyl-CoA decarboxylase, mitochondrial Human genes 0.000 description 2
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- 108010030975 Polyketide Synthases Proteins 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- SAQSTQBVENFSKT-UHFFFAOYSA-M TCA-sodium Chemical compound [Na+].[O-]C(=O)C(Cl)(Cl)Cl SAQSTQBVENFSKT-UHFFFAOYSA-M 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 101150046124 accA gene Proteins 0.000 description 2
- 101150008263 accD gene Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 230000008848 allosteric regulation Effects 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- XZNUGFQTQHRASN-XQENGBIVSA-N apramycin Chemical compound O([C@H]1O[C@@H]2[C@H](O)[C@@H]([C@H](O[C@H]2C[C@H]1N)O[C@@H]1[C@@H]([C@@H](O)[C@H](N)[C@@H](CO)O1)O)NC)[C@@H]1[C@@H](N)C[C@@H](N)[C@H](O)[C@H]1O XZNUGFQTQHRASN-XQENGBIVSA-N 0.000 description 2
- 229950006334 apramycin Drugs 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 108700023293 biotin carboxyl carrier Proteins 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 239000013065 commercial product Substances 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- KHPXUQMNIQBQEV-UHFFFAOYSA-N oxaloacetic acid Chemical compound OC(=O)CC(=O)C(O)=O KHPXUQMNIQBQEV-UHFFFAOYSA-N 0.000 description 2
- JTJMJGYZQZDUJJ-UHFFFAOYSA-N phencyclidine Chemical compound C1CCCCN1C1(C=2C=CC=CC=2)CCCCC1 JTJMJGYZQZDUJJ-UHFFFAOYSA-N 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000019525 primary metabolic process Effects 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 201000008827 tuberculosis Diseases 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 1
- XWJBVGZSIAZDKJ-FXQIFTODSA-N 2-[3-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]propyl]propanedioic acid Chemical compound N1C(=O)N[C@@H]2[C@H](CCCC(C(=O)O)C(O)=O)SC[C@@H]21 XWJBVGZSIAZDKJ-FXQIFTODSA-N 0.000 description 1
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 101710149790 64 kDa protein Proteins 0.000 description 1
- 101710122462 65 kDa protein Proteins 0.000 description 1
- 101150099190 ARR3 gene Proteins 0.000 description 1
- 240000004507 Abelmoschus esculentus Species 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical group NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 1
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- GXAJEYWSNDOXFA-UHFFFAOYSA-N Asp Thr His Gly Chemical compound OC(=O)CC(N)C(=O)NC(C(O)C)C(=O)NC(C(=O)NCC(O)=O)CC1=CN=CN1 GXAJEYWSNDOXFA-UHFFFAOYSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000272470 Circus Species 0.000 description 1
- 108091027551 Cointegrate Proteins 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- BUXAPSQPMALTOY-WHFBIAKZSA-N Cys-Glu Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BUXAPSQPMALTOY-WHFBIAKZSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 108010092160 Dactinomycin Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241001522878 Escherichia coli B Species 0.000 description 1
- 239000004606 Fillers/Extenders Substances 0.000 description 1
- 102100040004 Gamma-glutamylcyclotransferase Human genes 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical group C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- 101000886680 Homo sapiens Gamma-glutamylcyclotransferase Proteins 0.000 description 1
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108091029795 Intergenic region Proteins 0.000 description 1
- 241000186984 Kitasatospora aureofaciens Species 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- 101001090725 Leuconostoc gelidum Bacteriocin leucocin-A Proteins 0.000 description 1
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- 101100063932 Micromonospora echinospora gacH gene Proteins 0.000 description 1
- 101100293261 Mus musculus Naa15 gene Proteins 0.000 description 1
- 241000186362 Mycobacterium leprae Species 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 101150082943 NAT1 gene Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101150089617 PCCA gene Proteins 0.000 description 1
- 239000007990 PIPES buffer Substances 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- 108091000041 Phosphoenolpyruvate Carboxylase Proteins 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- 108030004154 Propionyl-CoA carboxylases Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241001486234 Sciota Species 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- 241000187180 Streptomyces sp. Species 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- KIMOCKLJBXHFIN-YLVFBTJISA-N Trp-Ile-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O)=CNC2=C1 KIMOCKLJBXHFIN-YLVFBTJISA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- MKUXAQIIEYXACX-UHFFFAOYSA-N aciclovir Chemical compound N1C(N)=NC(=O)C2=C1N(COCCO)C=N2 MKUXAQIIEYXACX-UHFFFAOYSA-N 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229930183665 actinomycin Natural products 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000003281 allosteric effect Effects 0.000 description 1
- 108010065885 aminoglycoside N(3')-acetyltransferase Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 230000002141 anti-parasite Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003096 antiparasitic agent Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- VTTONGPRPXSUTJ-UHFFFAOYSA-N bufotenin Chemical compound C1=C(O)C=C2C(CCN(C)C)=CNC2=C1 VTTONGPRPXSUTJ-UHFFFAOYSA-N 0.000 description 1
- CRFNGMNYKDXRTN-CITAKDKDSA-N butyryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CRFNGMNYKDXRTN-CITAKDKDSA-N 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 229940068840 d-biotin Drugs 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 101150046913 ecpA gene Proteins 0.000 description 1
- 101150105575 ecpB gene Proteins 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 229960003444 immunosuppressant agent Drugs 0.000 description 1
- 230000001861 immunosuppressant effect Effects 0.000 description 1
- 239000003018 immunosuppressive agent Substances 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 101150062334 int gene Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000010983 kinetics study Methods 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 101150093139 ompT gene Proteins 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 230000027086 plasmid maintenance Effects 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000017363 positive regulation of growth Effects 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 239000008057 potassium phosphate buffer Substances 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 229930010796 primary metabolite Natural products 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000001814 protein method Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 230000024053 secondary metabolic process Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/06—Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/10—Nitrogen as only ring hetero atom
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The nucleic acid and amino acid sequences of .alpha.1, .alpha.2, .beta. and .epsilon.
subunits of acetyl-CoA carboxylase (ACCase) from Streptomyces coelicolor are provided. This subunit structure differs from that of known acyl carboxylases. Materials and methods are provided of increasing ACCase activity and production of secondary metabolites (such as polyketides and especially antibiotics) by causing expression in Streptomyces of such nucleic acid.
Also provided are methods of increasing ACCase activity and production of secondary metabolites (such as polyketides and especially antibiotics) by culturing Streptomyces in the presence of exogenous malonate.
subunits of acetyl-CoA carboxylase (ACCase) from Streptomyces coelicolor are provided. This subunit structure differs from that of known acyl carboxylases. Materials and methods are provided of increasing ACCase activity and production of secondary metabolites (such as polyketides and especially antibiotics) by causing expression in Streptomyces of such nucleic acid.
Also provided are methods of increasing ACCase activity and production of secondary metabolites (such as polyketides and especially antibiotics) by culturing Streptomyces in the presence of exogenous malonate.
Description
ANTIBIOTIC PRODUCTION (II) Introduction Malonyl-CoA is essential as a metabolic substrate in all living organisms studied so far and it also plays a role as a modulator of specific protein activity (for a review see Brownsey et al., 1997). Malonyl-CoA is a substrate for fatty acid synthase (FAS) (Bloch and Vance, 1977), for polyketide synthases (PKS) in plants, fungi and bacteria (Hopwood & Sherman, 1990) and for fatty acid chain-elongation systems (Saggerson, et al., 1992). Understanding the pathways) that lead to the biosynthesis of malonyl-CoA
in Streptomyces might have an outstanding interest, since these micro-organisms are well known to have the ability to synthesize a vast array of pharmaceutically important polyketide compounds (such as antibiotic, antiparasitic, antifungal, immunosuppressant and/or antitumour polyketides), where malonyl-CoA is used as the most common extender unit (Hopwood & Sherman, 1990). Therefore, information gained on the enzymes) involved in the supply of this key metabolite will be relevant, not only for a better understanding of the primary metabolism of Streptomyces, but for improving production of many useful secondary metabolites.
Biosynthesis of malonyl-CoA occurs in most species through the ATP-dependent carboxylation of acetyl-CoA by an acetyl-CoA carboxylase (ACCase) (Bloch & Vance, 1977; Harwood, 1988). The overall reaction catalyzed by ACCase is a two step process that involves ATP-dependent formation of carboxybiotin followed by transfer of the carboxyl moiety to acetyl-CoA. The importance of this biosynthetic pathway is most directly reflected by the fact that ACCase expression is essential for normal growth of bacteria (Perez, et al., 1998; Li and Cronan, 1993), yeast (Hasselacher, et al., 1993) and isolated animal cells in culture (Pfizer, et al. , 1996) .
Several complexes with ACCase activity have been purified from various actinomycetes. Interestingly, these complexes have also shown the ability to carboxylate other substrates like propionyl- and butyryl-CoA (Erfle, 1973; Henrikson and Allen, 1979; Huanaiti and Kolattukudy, 1982). This property has led to these enzyme being called acyl-CoA
carboxylases, and all of them have been shown to consist of two subunits, a larger one (a-chain) with the ability to carboxylate its covalently bound biotin group, and a smaller sub-unit ((3-chain) bearing the carboxyl transferase activity. However, there is no information gained, so far, regarding the physiological role of these enzymes.
In Streptomyces the purification of a complex with ACCase activity has proved to be unsuccessful, probably due to its high instability (Bramwell et al., 1996). However ACCase activity has been readily measured in crude extracts of S.
coelicolor (Bramwell et al., 1996; Rodriguez and Gramajo, 1999), indicating that this enzyme complex was present in this micro-organism.
A pathway for the biosynthesis of malonyl-CoA in S.
aureofaciens has been described that does not involve ACCase (Behal et al., 1977; Laakel et al., 1994). This route involves the anaplerotic enzymes phosphoenolpyruvate carboxylase and oxaloacetate dehydrogenase. In S.
coelicolor A3(2), no evidence for the presence of oxaloacetate dehydrogenase has been found (Bramwell et al., 1993); thus, biosynthesis of malonyl-CoA in this organism seemed to occur exclusively through the ACCase enzyme activity.
in Streptomyces might have an outstanding interest, since these micro-organisms are well known to have the ability to synthesize a vast array of pharmaceutically important polyketide compounds (such as antibiotic, antiparasitic, antifungal, immunosuppressant and/or antitumour polyketides), where malonyl-CoA is used as the most common extender unit (Hopwood & Sherman, 1990). Therefore, information gained on the enzymes) involved in the supply of this key metabolite will be relevant, not only for a better understanding of the primary metabolism of Streptomyces, but for improving production of many useful secondary metabolites.
Biosynthesis of malonyl-CoA occurs in most species through the ATP-dependent carboxylation of acetyl-CoA by an acetyl-CoA carboxylase (ACCase) (Bloch & Vance, 1977; Harwood, 1988). The overall reaction catalyzed by ACCase is a two step process that involves ATP-dependent formation of carboxybiotin followed by transfer of the carboxyl moiety to acetyl-CoA. The importance of this biosynthetic pathway is most directly reflected by the fact that ACCase expression is essential for normal growth of bacteria (Perez, et al., 1998; Li and Cronan, 1993), yeast (Hasselacher, et al., 1993) and isolated animal cells in culture (Pfizer, et al. , 1996) .
Several complexes with ACCase activity have been purified from various actinomycetes. Interestingly, these complexes have also shown the ability to carboxylate other substrates like propionyl- and butyryl-CoA (Erfle, 1973; Henrikson and Allen, 1979; Huanaiti and Kolattukudy, 1982). This property has led to these enzyme being called acyl-CoA
carboxylases, and all of them have been shown to consist of two subunits, a larger one (a-chain) with the ability to carboxylate its covalently bound biotin group, and a smaller sub-unit ((3-chain) bearing the carboxyl transferase activity. However, there is no information gained, so far, regarding the physiological role of these enzymes.
In Streptomyces the purification of a complex with ACCase activity has proved to be unsuccessful, probably due to its high instability (Bramwell et al., 1996). However ACCase activity has been readily measured in crude extracts of S.
coelicolor (Bramwell et al., 1996; Rodriguez and Gramajo, 1999), indicating that this enzyme complex was present in this micro-organism.
A pathway for the biosynthesis of malonyl-CoA in S.
aureofaciens has been described that does not involve ACCase (Behal et al., 1977; Laakel et al., 1994). This route involves the anaplerotic enzymes phosphoenolpyruvate carboxylase and oxaloacetate dehydrogenase. In S.
coelicolor A3(2), no evidence for the presence of oxaloacetate dehydrogenase has been found (Bramwell et al., 1993); thus, biosynthesis of malonyl-CoA in this organism seemed to occur exclusively through the ACCase enzyme activity.
Attempts carried on in S. coelicolor to characterize enzymes with carboxylase activity, have led to the characterization of two complexes exhibiting exclusively PCCase activity. The PCCase purified by Bramwell et al., (1996) comprises a biotinylated protein of 88 kDa, PccA, and a non-biotinylated component, the carboxyl transferase, of 66 kDa. More recently the inventors have also.
characterized at both the genetic and biochemical levels, the components of a second PCCase in this bacterium. In vitro reconstitution experiments have shown that an active complex could be obtained by mixing a carboxyl transferase component of 59 kDa (deduced MW, though it runs anomalously in SDS-PAGE, with an apparent MW of 65 kDa), PccB, with either of the two almost identical biotinylated components named AccAl and AccA2 (Rodriguez and Gramajo, 1999).
Recently a gene cluster encoding malonyl-CoA decarboxylase (MatA), malonyl-CoA synthetase (MatB) and a putative decarboxylate carrier protein (MatC) has been proposed as the pathway for malonate metabolism in Rhizobium trifolii (An and Kim, 1998). After the transport of the malonate by MatC, the malonate is converted into malonyl-CoA by MatB
and finally decarboxylated to acetyl-CoA by MatA. However, the fact that the F~, of the malonyl-CoA decarboxylase for malonyl-CoA is relatively high has led the inventors to speculate that malonyl-CoA synthesised from malonate by malonyl-CoA synthetase (rather than malonyl CoA synthesised by ACCase) is the major source for fatty acid biosynthesis in the bacterioid R. trifolii. Interestingly, genes with very high identity to MatC and MatB have been recently reported in the S. coelicolor genome project, suggesting that malonyl-CoA could also be synthesized from malonate in this micro-organism.
characterized at both the genetic and biochemical levels, the components of a second PCCase in this bacterium. In vitro reconstitution experiments have shown that an active complex could be obtained by mixing a carboxyl transferase component of 59 kDa (deduced MW, though it runs anomalously in SDS-PAGE, with an apparent MW of 65 kDa), PccB, with either of the two almost identical biotinylated components named AccAl and AccA2 (Rodriguez and Gramajo, 1999).
Recently a gene cluster encoding malonyl-CoA decarboxylase (MatA), malonyl-CoA synthetase (MatB) and a putative decarboxylate carrier protein (MatC) has been proposed as the pathway for malonate metabolism in Rhizobium trifolii (An and Kim, 1998). After the transport of the malonate by MatC, the malonate is converted into malonyl-CoA by MatB
and finally decarboxylated to acetyl-CoA by MatA. However, the fact that the F~, of the malonyl-CoA decarboxylase for malonyl-CoA is relatively high has led the inventors to speculate that malonyl-CoA synthesised from malonate by malonyl-CoA synthetase (rather than malonyl CoA synthesised by ACCase) is the major source for fatty acid biosynthesis in the bacterioid R. trifolii. Interestingly, genes with very high identity to MatC and MatB have been recently reported in the S. coelicolor genome project, suggesting that malonyl-CoA could also be synthesized from malonate in this micro-organism.
The inventors have identified an essential acyl-CoA
carboxylase of S. coelicolor, and provide detailed genetic and biochemical characterization. The enzyme complex contains a unique sub-unit composition and appears to be the main pathway for the biosynthesis of malonyl-CoA, one of the key metabolites in the linkage between primary and secondary metabolism. An alternative pathway for the biosynthesis of malonyl-CoA is also proposed for cultures growing in malonate, and it most probably involves the matB
and matC homologues of R. trifolii. However, even in these growing conditions, the acyl-CoA carboxylase seems to be essential for the viability of the micro-organism.
Suaanary of invention Two genes accB and accE, forming a single operon, have been cloned from Streptomyces coelicolor A(3)2. The deduced amino acid sequence of AccB showed high similarity to carboxyl transferase of several propionyl- or acyl-CoA
carboxylases of different actinomycetes. By contrast, AccE
did not show any significant homology with protein sequences deposited in the GenBank data base. Heterologous expression of accB and accE in Escherichia coli and in vitro reconstitution of enzyme activity in the presence of the biotinylated component AccAl or AccA2 confirmed that AccB was the carboxyl transferase subunit of an acyl-CoA
carboxylase.
These experiments also established that AccE was a necessary component to obtain a fully active enzyme complex, whose subunit composition seems to be unique within this type of carboxylase. Gene disruption experiments clearly determined that AccB was essential for S. coelicolor viability. This protein together with AccA2, a biotinylated component essential for the .viability of this micro-organism (Rodriguez and Gramajo, 1999), are the best candidates to form an acyl-CoA carboxylase, whose main physiological role is, most probably, the biosynthesis of malonyl-CoA.
Transcriptional studies of accBE, accA2 and acct have shown that accBE and accA2 are mainly expressed during vegetative and transition phase of growth, although some expression of these genes also occurred during stationary phase where they should provide the malonyl-CoA subunits for secondary metabolites biosynthesis. accA1 is only expressed during the transition phase of growth and its role in the formation of a carboxylase complex involved in providing the substrate for polyketide compounds of S.
coelicolor is discussed.
Finally, an alternative route for the biosynthesis of malonyl-CoA is proposed when malonate is used as a carbon source. However, this route seems unable to substitute the main one, determined by the acyl-CoA carboxylase.
Accordingly, in a first aspect, the present invention provides a nucleic acid comprising a nucleic acid sequence which encodes an AccB polypeptide and/or an AccE
polypeptide, or a nucleic acid sequence complementary thereto.
In a second aspect, the present invention provides a nucleic acid comprising a nucleic acid sequence which encodes an AccAl and/or AccA2 polypeptide, or a nucleic acid sequence complementary thereto. It is believed that such nucleic acid was not made available to the public S
before 24 October 1999, when the amino acid sequences of these polypeptides were disclosed in an oral presentation.
Exemplary nucleic acid sequences encoding the AccB, AccE, AccAl and AccA2 polypeptides are given herein. Preferred embodiments of the invention include such sequences.
However, it would be a matter of routine for the skilled person to obtain other nucleic acid sequences encoding these polypeptides, e.g. by introducing mutations which do not alter the encoded amino acid sequence, by virtue of the degeneracy of the genetic code, or by introducing mutations which alter the encoded amino acid sequence, within limits as defined below. Moreover, nucleic acids encoding variants of the polypeptides may be obtained e.g. by screening different strains of S. coelicolor or closely related species of Streptomyces using degenerate probes based on the sequences given herein.
Preferred nucleic acids of the first and second aspects encode AccB and AccE polypeptides along with an AccAl and/or an AccA2 polypeptide (preferably AccA2).
The nucleic acid sequences encoding Acc polypeptides are preferably in operative association with regulatory sequences, e.g. sequences which enable constitutive or inducible expression in Streptomyces species. Examples of plasmids which include such regulatory sequences and of suitable promoters are given herein. A suitable inducible promoter is tipA (inducible by thiostrepton); suitable constitutive promoters are ermE and the optimised ermE*.
Alternatively, naturally occurring nucleic acid sequences may be in operative association with the regulatory sequences with which they are normally associated, or corresponding regulatory sequences from homologous genes in other strains or species. For example, the nucleic acid sequences may be in operative association with the corresponding regulatory (e. g. promoter) sequences defined herein.
For detailed protocols relevant to this and other aspects, see standard reference texts, such as Sambrook et al.
(1989) and Hopwood et al. (1985).
In a third aspect, the present invention separately provides AccB, AccE, AccAl and AccA2 polypeptides having amino acid sequences encoded or encodable by the respective nucleic acid sequences referred to in the first and second aspects.
In a fourth aspect, the present invention provides: vectors containing the nucleic acids of the first and second aspects (preferably vectors, e.g. plasmids, suitable for transforming Streptomyces species for expression therein) and cells, particularly Streptomyces cells, transformed with such vectors. Furthermore, the present invention provides a method of producing a secondary metabolite of a Streptomyces species, the method comprising culturing such transformed Streptomyces cells and extracting the secondary metabolite from the cell culture. The metabolite may be purified and/or formulated as a commercial product according to standard procedures.
In a fifth aspect, the invention provides a method of modifying a secondary metabolite-producing strain of a Streptomyces species to increase production of said secondary metabolite, the method comprising modifying said strain to express, or to increase expression of, nucleic acid encoding one or more polypeptides selected from the group consisting of AccB, AccE, AccAl and AccA2.
In a sixth aspect, the present invention provides a method of modifying a strain of a Streptomyces species to increase ACCase and/or PCCase activity, the method comprising modifying said strain to express, or to increase expression of, nucleic acid encoding one or more polypeptides selected from the group consisting of AccB, AccE, AccAl and AccA2.
In a seventh aspect, the present invention provides a modified strain of a Streptomyces species, produced or producible according to the method of the fifth or sixth aspect. Also provided are cells of said strain, methods of producing secondary metabolites comprising culturing said cells and extracting the secondary metabolite, which may be purified and/or formulated as a commercial product.
In an eighth aspect, the invention provides a method of increasing production of a secondary metabolite in cells of a Streptomyces species, the method comprising culturing said cells in the presence of exogenous malonate, preferably at a concentration of at least about 0.1%, more preferably at least about 0.2%, 0.4%, 0.5%, 0.75% or 1%, though higher concentrations may be used. 1% represents 1g per 100 ml of medium.
Detailed Description In relation to the fifth and sixth aspects, the modification preferably provides for increased expression of nucleic acid encoding more than one of AccB, AccE, AccAl and AccA2, more preferably at least AccB and AccE or at least AccB and either AccAl or AccA2, more preferably AccB, AccE and either AccAl or AccA2. Of AccAl and AccA2, AccA2 is preferred. Increased expression of nucleic acid encoding both AccAl and AccA2 (usually in combination with AccB and optionally AccE) is also contemplated.
The methods of the fifth and sixth aspects preferably include a step of transforming a Streptomyces ce~.l with a said nucleic acid under the control of a constitutive or inducible promoter, preferably a strong promoter. However, the expression of existing said nucleic acid could be increased, e.g. by placing them under the control of a stronger promoter sequence or sequences.
Exogenous said nucleic acid can replace existing said nucleic acid in the cell, or can be added without removing or functionally deleting existing said nucleic acid.
Acc polypeptides and acc genes In the definitions herein of the invention, and of the scope of protection (but not, except where the context requires otherwise, in the experimental sections), the term AccB is intended to include not only a polypeptide having the deduced amino acid sequence encoded by the nucleic acid sequence of Fig. 12 (though this is a preferred embodiment), but also a polypeptide which is a variant (e.g. an allelic or isoallelic variant) or a derivative of said polypeptide, having at least about 60% amino acid identity with said polypeptide, preferably at least about 65%, 70% or 75%, especially preferably (in view of the similarity of AccB as disclosed herein to another protein of unconfirmed function) at least about 80%, 85%, 90%, 92%, 94%, 96%, 98% or 99% identity. Such a variant or derivative may possess any one or more of the biological properties of the wild-type AccB protein, as disclosed herein, e.g. complex formation with AccAl, .AccA2 and/or AccE (or allosteric regulation by AccE), ACCase and/or PCCase activity when AccB is co-expressed with AccAl, AccA2 and/or AccE, or increased secondary metabolite production when AccB is overexpressed in Streptomyces species (preferably in conjuction with AccAl, AccA2 and/or AccE).
Similarly, the term AccE is intended to include not only a polypeptide having the deduced amino acid sequence encoded by the nucleic acid sequence of Fig. 13 (though this is a preferred embodiment), but also a polypeptide which is a variant (e.g. an allelic or isoallelic variant) or a derivative of said polypeptide, having at least about 40%
amino acid identity with said polypeptide, preferably at least about 50%, 60%, 70%, 80%, 85%, 90%, 95% or 99%
identity. Such a variant or derivative may possess any one or more of the biological properties of the wild-type AccE
protein, as demonstrated herein, e.g. complex formation with AccAl, AccA2 and/or AccB (or allosteric regulation of AccB), ACCase and/or PCCase activity when AccE is co-expressed with AccB, or increased secondary metabolite production when AccE is overexpressed in Streptomyces species (preferably in conjuction with AccB).
Similarly, the terms AccAl and AccA2 are intended to include not only the polypeptides having the amino acid sequences shown in Fig. 11 (though these are respective preferred embodiments), but also polypeptides which are variants (e.g. allelic or isoallelic variants) or are derivatives of said polypeptides, having at least about 75%
amino acid identity with said polypeptide, preferably at least about 80%, 85%, 90%, 92%, 94%, 96%, 98% or 99%
identity. Such variants or derivatives may possess any one or more of the biological properties of the wild-type AccAl or AccA2 polypeptides, as disclosed herein, e.g. complex formation with AccB and/or AccE, ACCase and/or PCCase activity when AccAl or AccA2 is co-expressed with AccB
and/or AccE, or increased secondary metabolite production when AccB is overexpressed in Streptomyces species (preferably in conjuction with AccB and/or AccE).
A variant or a derivative of a given peptide may have one or more of internal deletions, internal insertions, terminal truncations, terminal additions, or substitutions of one or more amino acids, compared to the given peptide.
References to nucleic acid encoding AccAl, AccA2, AccB
and/or AccE (or to accAl, accA2, accB and/or accE genes) should be interpreted accordingly.
In relation to the first aspect, preferred nucleic acids comprise a nucleic acid sequence having at least about 50%, preferably at least about 60%, 70%, 80%, 85%, 90%, 95%, 98%
or 99% nucleic acid sequence identity with the accB nucleic acid sequence shown in Fig. 12. Other preferred nucleic acids comprise a nucleic acid sequence having at least about 40%, preferably at least about 50%, 60%, 70%, 80%, 85%, 90%, 95%, 98% or 99% nucleic acid sequence identity with the accE nucleic acid sequence shown in Fig. 13.
Similarly, in relation to the second aspect, preferred nucleic acids comprise a nucleic acid sequence having at least about 50%, preferably at least about 60%, 70%, 80%, 85%, 90%, 95%, 98% or 99% nucleic acid sequence identity with the accAl or accA2 nucleic acid sequence shown in Fig.
11.
Secondary metabolites and Streptomyces species While the experimental disclosure herein relates to the production of Act (actinomycin) and Red (undecylprodigiosin) in S. coelicolor A3(2) (strain M145), it is thought that the teaching is applicable to other strains of Streptomyces in particular, it is thought that overexpession of all three Acc polypeptides (i.e. AccB, AccE and AccAl and/or AccA2) will lead to increased malonyl-CoA production in substantially any Streptomyces species or even in other actinomycetes or in fungi (which also produce polyketide compounds). Since malonyl-CoA is an essential metabolic substrate, it is thought that this will lead to greater yield of desired secondary metabolites (for which see page 1), e.g. polyketides (including antibiotic polyketidss) and fatty acids.
Preferred secondary metabolites are, however, antibiotics, especially Act and Red.
Preferred Streptomyces species are the closely related species S. coelicolor, S. violaceoruber, S. lividans and S.
parvulus, especially S. coelicolor. Strains of such species are commonly available, e.g. from the ATCC, for example under ATCC deposit numbers 12434 for S. parvulus and 19832 for S. violaceoruber. S. coelicolor A3(2) and S.
lividans 66 are available from the John Innes Culture Collection (Norwich, UK) under JICC deposit numbers 1147 and 1326, respectively. However, the invention is not limited to such particular strains.
Acetyl-CoA
In preferred embodiments, present invention further provides for the increased production in Streptomyces of acetyl-CoA, since it is thought that when ACCase activity is increased by the methods and means of the present invention, production of malonyl-CoA may become limited by the availability of the substrate acetyl-CoA. It is proposed that increased acetyl-CoA production could then lead to a further increased rate of malonyl-CoA production and hence secondary metabolite production. For example, oils or fatty acids could be used as the carbon ~urce (together with glucose); fatty acids are degraded by b-oxidation giving high levels of acetyl-CoA.
Sequence identity "Percent (%) amino acid sequence identity" is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the sequence with which it is being compared, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. The % identity values used herein are generated by WU-BLAST-2 which was obtained from Altschul et al. (1996); http://blast.wustl/edu/blast/README.html. WU-BLAST-2 uses several search parameters, most of which are set to the default values. The adjustable parameters are set with the following values: overlap span =1, overlap fraction = 0.125, word threshold (T) - 11. The HSPS and HSPS2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched; however, the values may be adjusted to increase sensitivity. A % amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the "longer"
sequence in the aligned region, multiplied by 100. The "longer" sequence is the one having the most actual residues in the aligned region (gaps introduced by WU-BLAST-2 to maximize the alignment score are ignored).
"Percent (%) nucleic acid sequence identity" is defined as the percentage of nucleotide residues in a candidate sequence that are identical with the nucleotide residues in the sequence under comparison. The identity values used herein were generated by the BLASTN module of WU BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively.
The work underlying the invention will now be described in detail, by way of example only, with reference to the accompanying figures.
Figures Fig. 1 Organization of the genomic region of S.
coelicolor M145 chromosome harbouring accB and accE
genes. A. Genetic and physical map of the 6.2 kb insert in pRM08. The secondary structure downstream accE represents a rho-independent transcriptional terminator. Fragments I and II were amplified by PCR
with the pair of oligos accBup-accBdown and accBEup-accBEdown respectively, uniquely labelled at the 5'-end (*) and used as probes in transcriptional analysis of the accBE operon. B. Physical map of the DNA
fragments cloned in pET22b(+) and used for the heterologous expression of accB and/or accE. Only the most relevant restriction sites are shown: B, BamHI;
Bc, BclI; E, EcoRI; K, KpnI; Nd, NdeI; N, NotI; S, SpHI .
Fig. 2 Attempted d:i.sruption of accB. A. Diagram showing the integration of pTR124 through one of the accBE
flanking regions and the resolution of the cointegrate by a second event of homologous recombination. The crossed out arrow indicates the impossibility of obtaining the reF>lacement of the wild-type accB by the HygR mutant allele. B. The integration of a second copy of the accB~; genes in the ~C31 att site of T'124 (to yield strain T149) allowed the replacement of the wild-type accB by a mutant allele containing the Hyg resistance cassette.
Fig. 3 Growth~phasF~ dependent expression and transcription start site of the accBE operon. A. S1 nuclease mappin:.~ of accB, actII-ORF4 and hrdB, using RNA isolated from a liquid time course of S.
coelicolor M145. Exp, Trans and Stat indicate the exponential, transition and stationary phase of growth, respective:Ly.B.rsEQ.ID.NO~s.:l8-2o) The nucleotide sequences of both strands from the accB promoter region are shown.
The arrow indicates the mcst likely transcription.
start point for the accBE promoter, as determined. by Sl nuclease mapp:i.ng. The potential -10 and -35 regions for the accBEp ar-a underlined. C. S1 nuclease mapping of the accB accE intergenic region using a 563 nt probe. FLP repre~~ents the full-length RNA-protected fragment that is ~.3 nt shorter than the probe.
Fig. 4 Growth-phase dependent expression of accA2 and accA2. S1 nuclease mapping of accA2 (A) and accAl. (B), using RNA isolated from a liquid time course of S.
coelicolor M145.
Fig. S Mapping of t:he accA2 and accAl transcription start point. A. H:i.gh resolution S1 nuclease mapping of the Send of the accA2 transcript. S1, RNA-protected products of the S1 nuclease protection assay. Lanes labelled A, C, G and T indi;~ate a dideoxy sequencing ladder using the same oligonucleotide that was used to make the S1 probe (accA2down). B. High resolution S1 nuclease mapping of the S' end of the accA1 transcript. S1, RNA-protected products of the S1 nuclease protect::ion assay. Lanes labelled T, G, C and A indicate a dideoxy sequencing ladder using the same oligonucleotide that was used to make the S1 probe (accAldown).C.(sEQ.I~.NGS.:21-23) Sequence of the accA2 and accAl upstream regions, indicating the most likely transcript;~on st::art points for the promoters of each of the accAl and accA2 genes (bent arrows). The potential -10 arid. -35 sequences for the accAl and accA2 promoters are underlined. The potential ribosomal bindi::ca sites (rbs) are highlighted with bold letters. The 16 nt direct repeats (DR) found upstream o~ the t:a~anscription start point of accAlpl are indicated with straight arrows.
Fig. 6 Construct.ioh. and analysis of the accBE
conditional mutant:. A. Diagram showing the integration of ~>IJ8600 in strain M86 and the expected organisation of t:3ne Campbell integration of pTR94 in M94. Restriction sites: B. BamHI; N, NotI; Nd, NdeI;
S, SacI; Sp, Sphl; Xb, XbaI. B. Hybridisation analysis of Southern blot. of SacI-digested DNAs from M145, M86 and M94. The prcbe was the internal NdeI-XbaI
fragment of accB shown in A (see Fig. 10).
Fig. 7 Expression of the acyl-CoA components in M86 and M94. A. SDS-PAGE of cell-free extracts of S.
coelicolor M86 and. M94 strains grown in YEME medium containing 10 ~g/m.l Am with or without the addition of S ~g/ml Th. B. A duplicate of t:he SDS-PAGE gel shown in A was subjected to Western blotting and stained for biotinylated prouteins by using alkaline phosphatase-streptavidin conjugate.
Fig. 8 Growth curves of M145, M86 and M94 strains. 108 spores of strain's M86 and M94 were inoculated in YEME
medium containing 10 ~g of Am or 10 ~tg/ml Am and 5 ~g/ml of Th'. 108 ,pores of M145 were inoculated in YEME. The growth was followed by measuring OD4sown.
Fig. 9 Morphologic:a.l and physi-ological differentiation of M86 and M94 i.n the presence of Th. Spores of M86 and M94 were spread in R2 or R~s medium containing 10~g/ml Am. A drop containing 1 ~g of Th was spotted in the centre of each plate. The picture shows the results obtained after the incubation of the plate at 30°C for 48h.
Flg. 10 (sEQ.ID.NO.:24) 'he sequence of the amplification product obtained from accB using primers TC16 and TC17. NdeI
(CATATG) and XbaI (TCTAGA) :sites introduced into the accB by the primers are shown in bold. The 1 kb NdeI-XbaI fragment was cloned into pIJ8600.
Fig. 11 A. (SEQ.ID.NOS.:25--26) Amino acid sequences and B. iSEQ.ID.N05.:27-2ii Nucleic acid sequences of accA1 and accA2.
Fig. 12 (SEQ.ID.NOS.:29 and 31) Nucleic acid sequence of accB.
Fig. 13 (SEQ.ID.NOS.:30 and 32) Nucleic acid sequence of accE.
Fig. 14 Plasmid map for the construction of an expression vector for accA, accB and accE.
Example 1: Cloning of accBE genes pccB of S. coelicolor (Rodriguez and Gramajo, 1999) was used as an heterologaus probe in Southern blot experiments.
When a BamHI digest af: S. coelicalor DNA was probed with pccB and washed under low stringent conditions, a second, low hybridising, band was readily detected (data not shown). The target sequence was cloned from a size-enriched library as a 2.5 kb BamHl fragment and sequenced as described in Experimental Procedures (below). The sequence revealed the presence of an in'omplete ORF with high homology to pccB. The complete gene was finally cloned as a 6 kb SstI fragment yielding pR~I0~3 (:Fig. 1). Sequencing and analysis of this DNA 1_z:agment revealed the presence of an ORF that exhibited enca.-tc-end similarity with a putative decarboxylase (though the real functicn is unknown) of S.
cyanogenous (Westri~:t,. et al . , 1999) , with the S.
coelicolor PccB (Rodriguez and Gramajc, 1999) and with the (3-subunit (PccB! of tree Sac. erythraea PCCase (Donadio, et al., 1996). The leve~~~~r of identity were 76%, 57% and 56%, respectively. The gene encoding this new putative carboxyl transferase was callec_~ accB.
Surprisingly, the sequence also revealed the presence of a small ORF, designated accE, whose start codon is only 17 by downstream of the termination codon of acc8. A 17 nt inverted repeat, which could function as a factor-independent bidirectional transcriptional terminator (reviewed in Lewin, 1994), separates accE from three convergent ORFs with homology to putative proteins of M.
tuberculosis with unknown functions. The putative AccE
polypeptide has a deduced molecular mass of 7.07 kDa and no significant homology to this polypeptide was found in a search of sequences deposited in the GenBank database.
Upstream of accB there is an ORF highly homologous to several known hialuronidases.
Example 2: accB is essential for S. coelicolor viability An accB mutant was constructed by gene replacement (Fig.
2A). A Hyg-resistant cassette was cloned in the unique BamHI site present in the coding sequence of accB, contained in pTR80. After an intermediate construction in pIJ2925, a BglII fragment containing the mutated allele was finally cloned in the conjugative vector pSETl5l. The resulting plasmid, pTR124, was cloned into the E. coli donor strain ET12567/pUZ8002 and transferred by conjugation into M145. Exconjugants were selected for ThR HygR for a simple crossover event. One of the exconjugants, named T124, was taken through four rounds of non-selective growth (SFM Hyg) to promote homologous recombination for the second crossover. Spores were plated to give single colonies and several thousands screened for Th sensitivity (which would have reflected successful gene replacement), but no Ths isolates were obtained. This result suggested that accB is essential for S. coelicolor viability.
The inventors proposed, however, that if a second copy of accB were present in the chromosome of T124, a second crossover event (leading to the replacement of the wild type gene by the HygR mutant allele) would then be allowed.
To confirm this hypothesis, pTR149, which contains a copy of the accBE genes under its own promoter (see Experimental procedures, Fig. 2B), was first integrated in the ~C31 attB
site of T124. (The introduction of a second copy of both genes into the chromosome was prompted by the probability of a polar effect on accE taking place after the gene replacement event and because AccE is important for the recovery of a fully active acyl-CoA carboxylase complex -see in vitro reconstitution experiments below). The resultant strain T149 (HygR, ThR, AmR) was passed through three rounds of sporulation on SFM Hyg Am and after the screening of approximately 500 colonies, 20 were found to be AmR HygR Ths. The final chromosomal organization of accB
in each of the strains constructed (T124, T149 and T149A), was analyzed by Southern blots using an internal fragment of accB as a probe.
Example 3: Heterologous expression of accB, accE and in vitro reconstitution of an acyl-CoA carboxylase complex.
Since accB proved to be essential for S. coelicolor viability, we could not clearly evaluate in vivo the physiological function of this gene product.
In order to study if AccB and AccE were components of an acyl-CoA carboxylase complex, we attempted in vitro reconstitution of the enzyme activity by mixing E. coli cell-free extracts containing the AccB and AccE with cell-free extracts containing the biotinylated sub-units AccAl or AccA2. E. coli does not contain an ACCase enzyme, so ACCase activity cannot be assayed directly by carboxylation of acetyl-CoA (Polakis et al., 1972); therefore, the acyl-CoA carboxylase activity measured in these crude extracts exclusively represents the activity of the heterologous complexes reconstituted in vitro.
Heterologous expression of accB and accE was att~npted by introducing a NdeI site at the ATG start codon of accB;
after an intermediate construction (see Experimental procedures), accBE was cloned as a NdeI-SacI fragment into pET22(b), yielding pTR88 (Fig. 1). Transformation of E.
coli BL21(DE3) with this plasmid yielded strain RG8 (Table 1). Crude extracts of RG8, prepared from IPTG-induced cultures, showed a clear over-expression of a 64 kDa protein in a 15 s SDS-PAGE, corresponding to AccB; by contrast, AccE was not clearly visualised by Coomassie blue staining of the same gel (data not shown). In vitro reconstitution of an acyl-CoA carboxylase was then attempted my mixing crude extracts prepared from IPTG-induced cultures of RG8 with cell-free extracts of the E.
coli strains RG7, which overproduces the biotinylated protein AccAl. After incubation for 1 h at 4 °C, the mixture was assayed for ACCase and PCCase activity. As shown in Table 2 an enzyme complex showing high levels of both ACCase and PCCase activities was successfully reconstituted To study if cell-free extracts containing AccB but not AccE
were capable of reconstituting an active acyl-CoA
carboxylase complex when mixed with cell-free extracts containing AccAl, we constructed a new pET22(b) derivative that only expresses accB. For this we took advantage of the NotI site present approximately in the middle of the coding sequence of accE and cloned the NdeI-NotI fragment from pTR88 into the expression vector, yielding pTR90 (Fig. 1).
Cell-free extracts of RG9, obtained by transformation of BL21(DE3) with pTR90, showed high levels of soluble AccB
after IPTG induction. However, the acyl-CoA carboxylase complex reconstituted in vitro, after mixing cell-free extracts of RG9 (AccB) and RG7 (AccAl), showed much lower levels (approximately 10~) of ACCase and PCCase activities than the acyl-CoA carboxylase previously obtained by mixing RG8 with RG7 cell-free extracts (Table 2). Since the levels of AccB in cell-free extracts of RG8 and RG9 were essentially the same, we inferred from these experiments that AccE was necessary in order to obtain a fully active acyl-CoA carboxylase complex.
To confirm that the absence of AccE was the responsible of the lower acyl-CoA carboxylase activities, we studied the effect that the addition of cell-free extract containing AccE, had on the crude extracts containing AccB and AccAl proteins. For this we first constructed strain RG10 (BL21(DE3) containing pTR107) that expresses high levels of soluble AccE (data not shown).
When cell-free extracts of RG10 where mixed with those of RG9 (AccB) and RG7 (AccAl) and incubated for 1h on ice, the levels of enzyme activity where at least five times higher than in the control experiment, without the addition of AccE (Table 2). Although the results presented in this section clearly show that AccE is a functional part of the acyl-CoA carboxylase, enzyme kinetics studies with purified components will be necessary to understand more precisely the role of this protein in the enzyme complex activity.
Similar results were obtained in all the reconstitution experiments mentioned above when AccAl was replaced by AccA2 as the biotinylated component of the acyl-CoA
carboxylase, indicating that either AccAl or AccA2 can be efficiently used as the a-subunit of this enzyme complex.
Example 4: Transcriptional analysis of accBF~,-accA1 and accA2 At least four combinations that resulted in active carboxylase complexes have been reconstituted by mixing the ~-subunits PccB (Rodriguez and Gramajo, 1999) or AccB (this work) with either of the two almost identical a-subunits, AccAl or AccA2. In any of these complexes the carboxyl transferase subunit seems to dictate the substrate specificity; thus, PccB seems to recognize only propionyl-CoA, while AccB has a broader substrate specificity, which allows the enzyme to recognize either acetyl- or propionyl-CoA. Moreover, a third.complex with PCCase activity has also being described in S. coelicolor (Bramwell, et al., 1996). These findings show a remarkable overlapping of gene function in Streptomyces species. We followed two different approaches to gain more information on this; one was the generation of mutants and the second the study of the mRNA
levels of some of these four genes throughout the different growth stages by using S1 nuclease protection.
S. coelicolor A3(2) strain M145 was grown in SMM medium and RNA extracted at exponential, transition and stationary phase. S1 nuclease protection of accB was performed by using a 483 by PCR product, uniquely labelled at the 5'end of the downstream oligo. Transcription of accB occurs primarily during active growth (exponential and transition phases), while its level of expression decayed significantly after entering into stationary phase (Fig 3A). The transcripts of the major essential sigma factor hrdB and of the pathway-specific activator gene for acitnorhodin biosynthesis, actII-ORF4, were also studied as positive controls for the RNAs used in these experiments.
As expected from previous results, hrdB was expressed constantly throughout growth (Buttner, M.J., 1990), while actII-ORF4 had a peak of expression during transition phase that shut off in stationary phase (Gramajo, et al., 1993).
The RNA-protected fragments found for accB corresponded to a transcription start site 1 by upstream, or in the adenine, of the most likely translation start site of accB.
Upstream of the transcription initiation site we found a putative -10 and -35 promoter regions with a high consensus sequences of promoters recognised by the vegetative okras (Strohl, 1991) (Fig. 3B).
In order to find out if accB and accE were co-transcribed as a unique bi-cistronic mRNA, a new 563 by probe was obtained by PCR. For this we used a 5'oligo corresponding to a sequence within the coding region of accB and a 3'oligo corresponding to a sequence within accE. The full-length RNA-protected fragment was easily differentiated from the probe-probe re-annealing due to the addition of a 13 nt tail to the 5'oligonucleotide (Experimental Procedures). The results obtained in this experiment clearly showed that accB and accE were part of the same transcript, confirming that these two genes form a single-copy operon (Fig. 3C). Moreover, the expression of accBE
during the different growth phases as detected with this new probe followed the same profile as the expression observed with the probe used for accB.
The levels of accA2 and accA1 mRNA present throughout growth were also studied by S1 protection experiments (Fig.
4). The probe used for accA2 was a 766 by DNA fragment generated by PCR and uniquely labelled on the 5'end of the oligo corresponding to the sequence within accA2. This experiment showed the existence of three mRNA-protected fragments. The growth phase-dependent expression of two of them, accA2p1 and accA2p2, resemble very much that of the accBE operon. Thus, a constant and high level of expression occurs during exponential and transition phase (TP), while the transcription shuts down when the cultures reach.
stationary phase (Fig. 4A).
Considering that the nucleotide sequences of accAl and accA2 are identical from the first two nucleotides upstream of the most probable GTG translation start sites down to the end of the probe (Rodriguez and Gramajo, 1999), it is important to note that a fragment of 185 by of the accA2 probe could also be protected by the accA1 mRNA. Since the lowest RNA-protected fragment observed in Fig. 4A shows a different pattern of expression with respect to accA2p1 and p2, and considering that the size of the band corresponds to a 185 by fragment, we believe that this band might represent the level of expression of accA1 (although we cannot rule out the existence of a third promoter for accA2, regulated in a different manner).
S1 nuclease protection of accAl mRNA was performed by using a 563 by PCR product, uniquely labelled at the 5'end of the downstream oligo, corresponding to a sequence within accAl.
As shown in Fig. 4B, the expression of this gene occurs from at least three different putative promoters, and all of them showed a clear burst of expression during the first hours of the TP, which rapidly shut down during late TP.
This pattern of transcription resembled very much the one observed for the third RNA-protected band found for accA2.
The transcription starts sites for the accA2p1 and p2 were mapped by high resolution S1 mapping (Fig. 5A and B). The transcription start points and the putative -10 and -35 promoter regions of these two promoters are shown in Fig.
5C. A certain degree of homology was found between the -10 consensus sequence of accA2p1 and p2 and the promoters recognised by the vegetative ahras (Strohl, 1992). High resolution S1 mapping of accAl revealed that the transcription start point of the most abundant mRNA species starts 88 by upstream of the GTG initiation codon of AccAl and the putative -10 regions resemble, in some extent, the consensus sequences of promoters recognised by ahr~.
Interestingly, two direct repeat (DR) sequences of 16 bp, containing only two mismatches, were found flanking the putative -35 region of accAlpl and the transcription start point of accAlp2 (Fig. 5C). These DRs could represent DNA
binding sites recognised by a putative regulator. A third putative promoter, accAlp3, was also detected in longer exposures and the most probable nucleotide start sites are also indicated in Fig. 5C.
Example 5: accBE genes are essential in the presence of malonate The presence of MatC and MatB homologues in S. coelicolor suggested that this micro-organism was potentially capable of transporting malonate within the cell through the MatC
transporter, and then activating malonate to malonyl-CoA
with the putative malonyl-CoA synthetase MatB. To test whether S. coelicolor was able to utilize malonate as a sole carbon and energy source, we grew S. coelicolor in a modified SMM medium with no casamino-acids and containing 0.4 % malonate instead of glucose as a sole carbon source.
In this medium S. coelicolor M145 was able to grow, indicating that MatC and MatB could be the proteins involved in the transport and activation of malonate to malonyl-CoA, and suggesting that a decarboxylase that could convert malonyl- into acetyl-CoA should also be present in this bacterium, to allow the use of malonate as a carbon and energy source.
This result encouraged us to test whether this route could also be an alternative pathway to provide malonyl-CoA to the cell. To prove this hypothesis we tried to obtain an acyl-CoA carboxylase minus mutant in the presence of malonate. For this we took spores of strain T124 and grew them in liquid MM containing 0.4 % of malonate instead of glucose. After 36 h of growth we sonicated the mycelia and spread them in SFM medium containing 0.4 % of malonate and incubated until sporulation. Spores were collected and treated in the same way one more time. Finally, spores harvested after the second round of sporulation were diluted out, inoculated in SFM malonate to give aprox. 500 colonies per plate and replica plated in SFM medium with or without Th. After analyzing approximately 5000 isolated colonies, no Ths were obtained. This result indicates that although malonate can be efficiently used as a sole carbon and energy source, the pathway involved in its catabolism can not fulfill the malonyl-CoA requirements of the cell.
Example 6: Construction of a strain with the accBE
operon under the control of a tipA promoter As shown above, the accBE operon, which encodes the carboxyl-transferase and a previously unidentified ~ sub-unit of an acyl-CoA carboxylase, is essential for the viability of S. coelicolor A3(2). In order to regulate the expression of this operon and study its effect on the physiology of this microorganism, we constructed a conditional mutant strain where the expression of the accBE
operon was under the control of the thiostrepton-inducible tipA promoter (Murakami, et al., 1989).
A 947 by fragment containing a modified 5'end of the accB
gene was cloned under the tipA promoter in pIJ8600 (Sun et al (1999) supra) to yield pTR93. After removal of the ~C31 integration components (att and int) present in pTR93 we obtained pTR94, which was transformed into the E. coli strain ET12567/pUZ8002 (MacNeil et al (1992)/Paget et al (1999)). Conjugation of pTR94 into the S. coelicolor strain M145 gave several exconjugants ThR. One of these exconjugants, designated M94, was purified in SFM medium for further analysis. Integration of pTR94 could only take place by Campbell recombination through the accBE
homologous sequences, and this event should leave a complete copy of the accBE operon under the tipA promoter (Fig. 6A). To confirm that this event had occurred in M94, we performed Southern blot experiments of DNA samples prepared from strains M145, M94 and M86. The last strain (M86) was obtained by integration of pIJ8600 in the ~C31 att site of the chromosome and used as the best isogenic control for M94 (Fig. 6A). As shown in Fig. 6B, a Sacl digested DNA from M145 and M86 lights up a unique hybridisation band of 5.94 kb that contains the accBE
operon. DNA from M94, instead, lights up two hybridising bands corresponding to the expected sizes for the integration of pTR94 in the accBE operon (Fig. 6A and B).
Example 7: Acyl-CoA carboxylase levels in M94 and M86 Cultures of the conditional accBE mutant M94 grew normally in YEME medium containing 5 ~g of Th. Interestingly, in the absence of the antibiotic, the cultures were still able to grow, although at much lower rate. This experiment re-confirms the leakiness of the tipA promoter (M. J. Bibb, personal communication). In order to determine the levels of the acyl-CoA carboxylase in conditions of induction or non-induction we carried out the following protocol. YEME
medium containing 10 ~g of Am was inoculated with spores of M94 (or M86) to give and initial OD4so= 0.1. Cultures were grown for 12 h at 30 °C and after that time 5 ~g of Th was added to a half of each culture, keeping the other half as a control. Both flasks were then incubated for additional 24 h at 30 °C. The harvested mycelia were disrupted by sonication and cell debris removed by centrifugation. Cell-free extracts were finally analysed by SDS-PAGE and used for enzyme assays. Fig. 7A shows a 60 kDa protein that is only induced in cultures of M94 grown in the presence of Th; the size of this protein corresponded to the molecular mass of AccB. We were not able to detect an inducible band corresponding to AccE. The levels of the biotinylated components (AccAl or AccA2) of the acyl-CoA carboxylase, in each of the cell-free extracts, were analysed by a modified Western Blotting procedure (Fig. 7B). As shown in this figure the levels of AccAl and/or AccA2 were not modified by presence of Th. However, cell free-extracts of M94 do contain a slightly higher amount of the 65 kDa protein compared to M86.
ACCase and PCCase activities were assayed in cell-free extracts of M94 and M86. The levels of both enzyme activities were similar in cell-free extracts prepared from cultures of M86 grown in the presence or in the absence of Th (Table 3). Cell-free extracts prepared from induced cultures of M94 show instead a remarkable increase in both ACCase (11.5 fold) and PCCase (3.5 fold) activities, compared with the levels found in non-induced cultures of the same strain or in M86. Moreover, if the enzyme levels found in the wild type strain M145 (Rodriguez and Gramajo, 1999) are compared with those found for M94, the increase in ACCase and PCCase levels were still 4- and 2-fold, respectively (Table 3). These results indicate that by overproducing only two (~ and e) of the three sub-units that form the acyl-CoA carboxylase of S. coelicolor we can increase significantly the levels of this enzyme activity.
Example 8: Influence of the acyl-CoA carboxylase levels in the physiological properties of M94 Growth curves were determined for the conditional mutant M94 and for M86 by inoculating a spore suspension in YEME
medium supplemented with 10 ~g of Am, with or without the addition of 5 ~g of Th. For M145, YEME medium without the addition of any antibiotic was used. M94 supplemented with the inducer (Th) showed a growth rate during exponential phase very similar to M145, judged from the slope of the curves. However, the initiation of growth for M94 seems to occur sooner than in M145, reaching the stationary phase earlier than the wild type strain. When the cultures were not supplemented with Th, M94 grew considerably slower, reaching stationary phase several hours latter than in the presence of Th. Also, the final OD reached by M94 in the presence of Th and by M145 were very similar (OD4so= 3) after 60 h of growth. Cultures of M86 grew very slowly compared with M94 and M145, independently of the presence or not of Th. However, these cultures levelled off at the final OD reached by M145 and M94 after 50 h of growth.
Actinorhodin and undecylprodigiosin were also quantitated throughout growth. Table 4 shows that antibiotic production was only detected in cultures of M94 grown in the presence of 1 or 5 ug of Th. No antibiotic production was observed in cultures of M145 or M94 without Th, at least until after 60 h of growth. No antibiotic production was detected in M86.
To determine the effect of Th induction in M86 and M94, 1 ~g of the antibiotic was spotted to a confluent lawn of these strains in R2 and R5 medium supplemented with 10 ~g of Am. A striking stimulatory effect in both sporulation and antibiotic production was observed in M94 after 48 h.
No stimulation of growth or antibiotic production was observed in M86.
Example 9: Co-expression of accA, accB and accE in S.
coelicolor The NdeI-XbaI fragment of pTR154 (Fig. 14) is introduced into pIJ8600 and then transformed into S. coelicolor M145 (Fig. 14). Transformants are selected with apramycin and thiostrepton. Overexpression of the three components accA2, accB and accE results in increased ACCase activity and antibiotic production compared to the wild type M145 strain.
Discussion The use of pccB (Rodriguez and Gramajo, 1999) as an heterologous probe, allowed the successful isolation of a chromosomal DNA fragment containing accB, a gene encoding for a putative new carboxyl transferase of S. coelicolor.
This predicted function was based on the high percentage of identity that AccB showed not only to the S. coelicolor PccB, but to several others biochemical and/or genetically characterized carboxyl transferases reported for actinomycetes, such as the PccB of Sac. erythraea (Donadio, et al., 1996) and to a less extent to the AccD5 of M.
tuberculosis (Cole, et al., 1998) and PccB of M. leprae (Doukhan, 1995). An interesting finding from the analysis of the cloned sequence was the presence of a very small ORF, named accE, immediately downstream of accB.
The successful expression of accB, accE and the BC-BCCP-(biotin carboxylase- and biotin carboxylase carrier protein-)encoding genes accAl and accA2 in E. coli allowed in vitro studies to be performed in order to understand the role of the corresponding encoded proteins as components of a previously uncharacterized acyl-CoA carboxylase. The reconstitution, by mixing cell-free extracts of E. coli containing AccB and AccAl (or AccA2), of an active enzyme with the ability to carboxylate either acetyl- or propionyl-CoA clearly established that AccB was the carboxyl transferase component of an acyl-CoA carboxylase complex. Interestingly, the small polypeptide, AccE, also showed to play an important role in the reconstitution of a fully active enzyme complex (Table 2). It remains to be elucidated whether this protein plays a role as an allosteric regulator of the enzyme or whether it is a structural component of the complex. Thus, our results represent the first characterization, at both the genetic and biochemical levels, of a prokaryotic acyl-CoA
carboxylase.
All the acyl-CoA carboxylases studied so far contain the three functional domains in two individual polypeptides (for a review see Brownsey et al., 1997 ), and none of the purified complexes have shown the presence of a small component equivalent to AccE. Therefore, this might be a distinctive feature for Streptomyces sp. In addition, no AccE homologues have been found in any of the bacteria genomes sequenced so far, an observation that could also support this hypothesis.
Malonyl-CoA is an essential component of all living organisms, since it is the main elongation unit for fatty acid biosynthesis (Brownsey et al., 1997). This primary metabolite is synthesised in most species through the carboxylation of acetyl-CoA by an ACCase (Bloch and Vance, 1977). If this was also the case for S. coelicolor and, if AccB was the component of an essential acyl-CoA
carboxylase, mutation of this gene should be lethal for the micro-organism. Replacement of the wild-type accB for the HygR mutant allele prove to be unsuccessful, and it only occurred when a second copy of the accBE genes was present in the chromosome (Fig. 2B).
These experiments clearly indicated that at least accB was essential for S. coelicolor viability. The fact that both AccA2 (Rodriguez and Gramajo, 1999) and AccB have proved to be essential, along with the fact that acyl-CoA carboxylase reconstituted in vitro with these two sub-units has the ability to recognise either acetyl- or propionyl-CoA as substrates, strongly suggests that AccA2 and AccB are the a and ~ components of an essential acyl-CoA carboxylase, whose main physiological role should be the biosynthesis of malonyl-CoA. The transcriptional levels of accB and accA2 throughout growth (Fig.3A and 4A) also support this interpretation, since both genes are principally transcribed during exponential and transition phase.
Moreover, ACCase and PCCase activities also showed the highest and constant levels of activities during exponential and transition phase while in stationary phase the activities were low but readily measurable.
In S. coelicolor, besides the obvious need for malonyl-CoA
biosynthesis during vegetative growth, there is also a requirement for this metabolite during transition and stationary phase, since at least two secondary metabolites (undecylprodigiosin and actinorhodin) are synthesised during these growth-phases and they both require malonyl-CoA for their biosynthesis. Hence, if the ACCase is the only enzyme that synthesises malonyl-CoA in this bacterium, its presence will be also required during the idiophase.
According to the proposed composition of this enzyme complex and based on the transcriptional studies, we propose that the low level of expression of accA2 and accBE
during stationary phase is sufficient to produce enough of the a and ~ components for an active acyl-CoA carboxylase.
From the observation that accA1 mRNA peaks during transition phase, we propose that enough AccAl might be present in the cytoplasm to compete with AccA2 as the main a sub-unit of this enzyme complex in the stationary phase.
However, no difference in antibiotic production has been found between M145 and the isogenic accA1 mutant MA4 (Rodriguez and Gramajo, 1999).
We have clearly demonstrated the ability of S. coelicolor to efficiently utilize malonate as a sole carbon and energy source. A putative pathway for the utilization of this substrate could involve the R. trifolii MatC and MatB
homologues which are found in the genome of S. coelicolor.
The biochemical characterization of MatB in R. trifolii demonstrated that this protein is a malonyl-CoA
synthetase, which catalyzes the formation of malonyl-CoA
directly from malonate and CoA. MatC, instead, has not been characterized biochemically but computer analysis indicate that it is a transmembrane protein that could function as a dicarboxylate (malonate for example) carrier (An and Kim, 1998). If these enzymes were part of the pathway that allows S. coelicolor to utilize malonate as a sole carbon source, one could also presume that the malonyl-CoA
synthesized by MatB should fulfill the malonyl-CoA
requirements of Ithe micro-organism. However, we could not show that under these conditions the essential acyl-CoA
carboxylase becomes dispensable.
Interestingly, the addition of 0.4% malonate to SFM and glucose-MM media produced a clear stimulation of actinorhodin production (data not shown). From this we propose that higher levels of malonyl-CoA were probably available under this growth conditions. From this, and the observation that even the limited levels of the ACCase activity found during the stationary phase of growth of this bacterium are sufficient to allowed regular levels of antibiotic production, the inventors propose that increasing the expression of the ACCase components will probably lead to an improved production of antibiotics.
Experimental Procedures Bacterial strains, cultures and transformation conditions S. coelicolor A3 (2) strain M145 (SCP1- SCP2-) was manipulated as described by Hopwood et al. (1985). The strain was grown on various agar media - SFM (Rodriguez and Gramajo, 1999), R2 and R5 - or in 50 ml SMM or YEME liquid media (Hopwood et a1 (1985) supra). Escherichia coli strain DHSa (Hanahan 1983) was used for routine subcloning and was transformed according to Sambrook et a1. (1989).
Transformants were selected on media supplemented with the appropiate antibiotics . ampicillin (Ap) 100 ~tg/ml;
apramaycin (Am) 100 ~tg/ml; chloramphenicol (Cm) 25 ~g/ml or kanamycin (Km) 30 ~,g/ml. Strain BL21(DE3) is an E. coli B
strain [F- ompT (r$- m$-) (DE3) ] lysogenized with 1DE3, a prophage that expresses the T7 RNA polymerase downstream of the IPTG-inducible lacUVS promoter (Studier & Moffat, 1986). ET12567/pUZ8002 (MacNeil et al (1992)/Paget et al (1999)) was used for E. coli - S. coelicolor conjugation experiments (Bierman, 1992). For selection of Streptomyces transformants and exconjugants, media were overlayed with thiostrepton (Th) (300 ~g per plate), hygromycin (Hyg) (1 mg per plate) or apramycin (Am) (1 mg per plate). Strains and recombinant plasmids are listed in Table 1.
Growth conditions, protein expression and preparation of cell-free extracts S. coelicolor M145 was grown at 30°C in shake flasks in YEME medium for 24-48 h. When necessary, 10 mg Am ml-1 or 5 mg Th ml-1 were added to the medium. Mycelia were harvested by centrifugation at 5000 x g for 10 min at 4 °C, washed in 100 mM potassium phosphate buffer pH 8 containing 0.1 mM
DTT, 1 mM EDTA, 1 mM PMSF and 10% glycerol (buffer A) and resuspended in 1 ml of the same buffer. The cells were disrupted by sonic treatment (4 or 5 s bursts) using a VibraCell Ultrasonic Processor (Sonics & Materials, Inc.).
Cell debris was removed by centrifugation and the supernatant used as cell-free extract. For the expression of heterologous proteins, E. coli strain BL21(DE3) harbouring the appropriate plasmids were grown at 37°C in shake flasks in LB medium in the presence of 25 ~g Cm ml-1 or 100 ~g Ap ml-1 for plasmid maintenance. For the expression of biotinylated proteins, 10 ~M d-biotin was supplemented to the medium. Overnight cultures were diluted 1:10 in fresh medium and grown to A6oo 0.4-0.5 before the addition of IPTG to a final concentration of 0.1 mM.
Induction was allowed to proceed for 4 h. The cells were then harvested, washed and resuspended in 1 ml buffer A.
Cell-free extracts were prepared as described above.
Protein methods Cell-free extracts were analysed by denaturing (SDS)-PAGE
(Laemmli, 1970) using the Bio Rad mini-gel apparatus. The final acrylamide monomer concentration was 12% (w/v) for the separating gel and 5% for the stacking gel. Coomassie brilliant blue was used to stain protein bands. The biotinylated proteins were detected by a modification of the Western blotting procedure described by Nikolau et a1.
(1985). After electrophoretic separation, proteins were electro-blotted onto nitrocellulose membranes (Bio-Rad) and probed with alkaline phosphatase-streptavidin conjugate (Bio-Rad) diluted 1:10000. Protein content was determined by the method of Bradford (1976) with BSA as standard.
In vitro reconstitution and assay of the acyl-CoA
carboxylase complex In vitro reconstitution of the enzyme complex was carried out by mixing 100 ~g of each of the cell-free extracts shown in Table 2 in a final volume of 300 ~1. When AccE was not included in the incubation mix, 100 ~tg of BSA were added instead. The mixes were incubated for 1 h at 4 °C and 100 ~.g of each used for enzyme assay.
ACCase and PCCase activities in cell-free extracts were measured following the incorporation of H14C03- into acid non-volatile material (Huanaiti & Kolattukudy, 1982;
Bramwell et al., 1996). The reaction mixture contained 100 mM potassium phosphate pH 8.0, 300 ~g BSA, 3 mM ATP, 5 mM
MgCl2, 50 mM NaH14C03 [specific activity 200 ~tCi mmol-1 (740 kBq mmol-1)], 1 mM substrate (acetyl-CoA or propionyl-CoA) and 100 ~g cell-free protein extract in a total reaction volume of 100 ~1. The reaction was initiated by the addition of NaH14C03, allowed to proceed at 30 °C for 15 min and stopped with 200 ~.1 6 M HC1. The contents of the tubes were then evaporated to dryness at 95 °C. The residue was resuspended in 100 ~l water, 1 ml of Optiphase liquid scintillation (Wallac Oy) was added and 14C radioactivity determined in a Beckman scintillation liquid counter. Non-specific C02 fixation by crude extracts was assayed in the absence of substrate. One unit of enzyme activity catalysed the incorporation of 1 ~mol 14C into acid-stable products per min.
DNA manipulations Isolation of chromosomal and plasmid DNA, restriction enzyme digestion and agarose gel electrophoresis were carried out by conventional methods (Sambrook et al., 1989;
Hopwood et al., 1985). Southern analyses were performed by using P-labelled probes made by random oligonucleotide priming (Prime-a-gene kit; Promega).
Gene cloning and plasmid construction The synthetic oligonucleotides TC1, 5'-CAGAATTCAAGCAGCACGCCAA(3GGC AAG ( :3EQ . :ID . N0 . : 1 ) , and TC2 , 5 ' -CAGAATTCGATGCCGTCGTGCTCCTGGTC,(sEQ.ID.NO.:2) were used to amplify an internal fragment of the S. coelicolor pccB gene. The reaction mixture contained 10 mM Tris-HCl pH 8.3, 50 mM
KCl, 1 mM MgCl2 , 6% glycerol, 25 ~M of each of tile four dNTPs, 2.5 U Taq DNApo_Lymerase, 20 pmol of each primer and 50 ng of S. coelicolor chromosomal DNA in a final volume of 100 ~l. Samples were subjected to 30 cycles of denaturation (95°C, 30 s), annealing (65°C, 30 s) and extension (72°C, min) . A 1 kb PCR fragmE=nt was used as a 32P-labelled probe to screen a size-enriched library. A 2.7 kb BamHI fragment containing an incomplete accB gene was cloned in BamHI-cleaved pBluescript SK(+), yielding pTR62.
The synthetic oligonuc.Leotide TC16 (5' TATTCTAGACATATGACCGTTT'rGGATGAGG ( sEQ . ID. No . : 3 ) used to introduce an Ndel site at the translational start codon of the S. coelicolor accB gene ) and TC17 ( 5 ' -.~1CCTCTAGACAACGCTCGTGGACC , ( SEQ . ID . NO . : 4 ) used to introduce an Xbal site in the accB coding sequence) were used to amplify an internal fragment of S. coelicolor accB
gene, having the sequence shown in Fig. 10. The reaction mixture was the same as the one indicated above. Samples were subjected to 30 o:r 35 cycles of: denaturation (95°C, 30 s) , annealing (65°C, 30 s) and extension ('72°C, 1 min) . The 1 kb PCR product was digested with NdeI and XbaI (these sites were introduced in the 5~ ends of the oligos TC16 and TC17 and are shown in bold in Fig. 10) and cloned in XbaI-cleaved pBluescript SK.(+) in E. coli DHSa, yielding pTR82.
This plasmid was digested with BstEII and SacI, ligated with a BstEII-SacI fragment cleaved from pRM08 and introduced by transformation into E. coli DHSa, yielding pTR87.
An NdeI-XbaI fragment :From the plasmid pTR82 was cloned in NdeI-XbaI-cleaved pIJ8f;00 (Sun et al (1999)), yielding pTR93. In order to place the chromosomal copy of accBE
operon under the tipA promoter we removed from pTR93 a HindIII fragment containing the int gene and att of ~C31, yielding pTR94. Plasmid pTR94 was transformed into strain ET12567/pUZ8002 and transferred by conjugation to S.
coelicolor M145 (Hopwoc~d et al (1985)).
A NdeI-SacI fragment f:rom the plasmid pTR87 was cloned in NdeI-Sacl-cleaved pET22b(+) (Novagen) (pTR88), thus placing the accBE operon under the control of the powerful T7 promoter and ribosome-binding sequences. The synthetic 011.goriuCleotldeS NaCCE, 5'-TTATCTAGACATATGTCCCCTGCCGAC, (SEQ.ID.N0.:5) used to introduce an NcleI site at the translational start codon of the S. coelicc~lor accE gene, and CaccE, 5'-ATGAATTCTATGCATCGGGTCAGCGCC.~GCTG,(SEQ.ID.N0.:6) were used to amplify the accE gene of S. coelicc~lor. The reaction mixture was the same as the one indicated above. Samples were subjected to 35 cycles of denaturat:ion (95°C, 30 s), annealing (65°C, 30 s) and extension (72°C, 30 s). The PCR product was cloned using pGEM-T easy vector (Promega) in E. coli DHSa, yielding pTR106. A Ndel-EcoRI fragment from the plasmid pTR106 was cloned in NdeI-EcoRI-cleaved pET22(b) (Novagen) yielding the plasmid p'rR107, thus placing the accE gene under the control of the powerful T7 promoter and ribosome-binding sequences.
Plasmid pIJ8600 was digested with. BglII and EcoRI and the fragment containing or:i~' RK2, on pUClB, attP site, int ~C31 and aac (3) IV (AmR cassette) genes was ligated with a linker containing the following enzymes (Mike Butler personal comunication): BglII, AseI, EcoRI, BglII, NdeI, KpnI, XbaI, PstI, HindIII, BamHI, SstI, NotI and EcoRI, yielding pTR141. A 4.0 kb KpnI fragment containing the complete accBE operon from pRM08 was cloned into KpnI-cleaved pTR141, yieldi:c~g pTR149.
For an efficient over-expression in S. coelicolor of the three components of the acyl-CoA carboxylase complex of this micro-organism, we carried out the construction of pTR156 through the fol=Lowing steps. First we did a PCR.
amplification of the chromosomal accBE operon using the:
O Z 1g0 TC 16 ( 5 ' -TATTCTAGAC:ATATGACCGTTTTGGATGAGG 3 ' ) ( SEQ . ID . NO .
: 3 ) to introduce a NdeI site at the translation start codon c>f accB, and the oligo C--accE (5'ATG AAT TCT ATG CAT CGG GTC
AGC GCC AGC 3')(SEQ.ID.N0.:7) to introduce a Nsil restriction site at the 3' end of accE. The amp:Lified DNA, was then cloned into pGEM-T (Promega), to g_Lve pTR99. To introduce a NsiI site upstream of the RBS of accA2 we amplified this gene using the oligo N-accA2 (5' ATG AAT TCA TGC ATG AGG GAG CCT CAA
TCG 3' ) , (SEQ.ID.NO. :8> , for ~.I-;e 5' end and the oligo C-accA2 (5' AGA
TCT
AGA TCA GTC CTT GAT CTC GC 3')(SEQ.ID.N0.:9) containing a Xbal and a EcoRI site, for the 3' end of the gene. The amplified DNA
was cloned in pGEM-T to give pTR112. The NdeI-NsiI DNA
fragment from pTR99 and the NsiI-EcoRI fragment isolated from pTR112 were final=Ly cloned into pET22(b) (Stratagene), previously digested with NdeI and EcoRI, to yield pTR154.
In order to introduce these genes in S. coelicolor we sub-cloned the NdeI-XbaI fragment, containing aceBE and accA2, from pTR154 to pIJ8600 digested with the same enzymes to give pTRl56. See Fig. 14 for plasmid constructions.
Nucleotide sequencing The sequence of the SphI original fragment was performed from plasmids DNA constructed by subcloning ApaI DNA
fragments from pRM08 into pSKBluescri.be SK(+). Synthetic oligonucleotides were ~zsed to complete the sequence. The nucleotide sequence of the accBE region was determined by dideoxy sequencing (Sanger et al., 1977) using the Promega TaqTrack sequencing kit. and double-stranded DNA templates.
The complete sequence <.~f the 1C2 cosmid, that includes the SphI fragment harbouring accBE, is available from the S.
coelicolor genome sequE~ncing project.
S1 nuclease mapping For each S1 nuclease rE~action, 30 ~g of RNA were hybridized in NaTCA buffer (Murray, 1986); solid NaTCA (Aldrich) was dissolved to 3M in 50mM PIPES (pH 7.0), 5mM EDTA, to about 0.002 pmol (approximatf=_:ly 104 cpm) of the following probes.
For accA2 the synthet:Lc: oligonucleotide 5'-GCTTTGAGGACCTTGGCGATG (SEQ.I:D.NO.:10) (AccA2down), corresponding to the sequence within the coding region of accA2, was uniquely labelled at the 5' end of the oligonucleotide with [32P]-ATP using T4 polynucleotide kinase. The labelled oligo was then used in the PCR :reaction with the unlabelled OllgOnuCleOtlde (aCCA2~Llp) 5'-GAAGTAC.'AGGCCGAAGACCAC (SEQ.ID.N0.:11), which corresponds to a regioxi upstream of the accA2 promoter region, to generate a '766 by probe. For accAl the synthetic oligonucleotide (accAldown) 5'-GCGATTTCGCCACGATTGGCG, (SEQ.ID.NO.: 12), corresponding to the region within the coding region of accAl, was uniquely labelled with [3~P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide.
The accAldown oligo was later used in the PCR reaction with the unlabelled oligonuc;leotide (accAl.up) 5'-CCGATATCAGCCCCTGATGAC,(SEQ.ID..N0.:13), which corresponds to a region upstream of the accAl promoter to generate a 563 by probe.
For accB the synthetic oligonucleotide (accBdown) 5'-CGTCAGCTTGCCCTTGGCGTG,(SEQ.ID.N0.:14), corresponding to the region within the coding region of accB, was uniquely labelled with [32P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide. accBdown waa, then used in the PCR
reaction with the unlabelled oligonucleotide (accBup) 5'-CTACGCTCCGGGTGAGCGAAC,(SEQ.ID.t~0.:15), which corresponds to a region upstream of the accB promoter, to generate a 483 by probe.
For accBE the synthetic oligonucleotide (accBEdown) 5'-GGAGGGCCGTGATGGCGGCGACTTCCTCGGG,(SEQ.ID.N0.:16), corresponding to the region within the coding region of accE was uniquely labelled with [32P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide. The accBEdown oligo was then used in the PCR reaction with the unlabelled oligonucleotide (accBEup) 5'-GAGGAACTGGTACGCGCGGGCG (GTACAe~.GCAAGCT),(SEQ.ID.N0.:17), which corresponds to a region within the coding region of accB Cbracketed oligonucleotides are a tail added to the probe to differentiate probe re<~nnealing from fully protected DNA-RNA complexes), to genE~rate a 563 by probe. Subsequent steps were as described by Strauch et al. (1991).
Determination oflactinorhodin 1 ml of whole broth wars mixed with 0.5 ml of 3N KOH to give a final concentration of 1N KOH. The solutions were mixed vigorously and centrifuge at 4000 x g for 5 minutes. The supernatant was collected and measured at A64onm.
Actinorhodin concentration was calculated using the molar extinction coefficient (in 1 N KOH) at 640 nm of 25.320 (Bystrykh et al., 1996?.
Determination of undecylprodigiosin This was carried out a~~cording to the procedure of Hobbs et al. (1990) .
References An, J. H. et al. (1998) Eur J Biochem 257: 395-402.
Behal, V. et al. (1977) Phytochemistry 16: 347-350.
Bierman, M. et al. (1992) Gene 116: 43-49.
Block, K. and Vance, D. (1977) Annu Rev Biochem 46: 263-298.
Bradford, M. (1976) Anal Biochem 72:248.
Bramwell, H. et al. (1996) Microbiol 142: 649-655.
Bramwell, H. et al. (1993) Biochem J 293:131-136.
Brownsey, R. W. et al. (1997) Biochem Soc Trans 25: 1232-1238.
Buttner, M.J. et al. (1990) J Bacteriol 172: 3367-3378.
L. V. Bystrykh, M. A. et al. (1996). J Bacteriol. 187:
2238-2344.
Cole, S.T. et al. (1998) Nature 393: 537-544.
Donadio, S. et al. (1996) Mo1 Microbiol 19: 977-984.
Doukhan, L. et al. (1995) Gene 165: 67-70 Erfle, J. D. (1973) Biocim Biophys Acta 316: 143-155.
Gramajo, H.C. et al. (1993) Molecular Microbiology 7: 837-845.
Hanahan (1983) J. Mol. Biol. 166: 557-580.
Harwood, J.L. (1988) Annu Rev Plant Physiol Plant Mol Bio1 39: 101-138.
Hasslacher, M. et al. (1993) J. Biol. Chem. 268: 10946-10952.
Henrikson, K. P. and Allen, S. H. G. (1979) J Bio1 Chem 254:5888-5891.
Hobbs, G. et al. (1990) J Gen Microbiol 136:2291-2296.
Hopwood, D. A. et al. (1985) Genetic manipulation of Streptomyces: A laboratory manual. John Innes Foundation, Norwich.
Hopwood, D.A. and Sherman, D.H. (1990) Ann Rev Genet 24:37-66.
Hunaiti, A. R. and Kolattukudy, P. E. (1982) Arch Biochem Biophys 216:362-371.
Laakel, M. et al. (1994) Microbiol 140: 1451-1456.
Laemmli, U. K. (1970) Nature 227:680-685.
Li, S. J. and Cronan. J. E., Jr. (1993) J Bacteriol 175:
332-340.
MacNeil et al. (1992) Gene 115: 119-125 Murakami, T. et al. (1989) J Bacteriol 171: 1459-1466.
Nikolau, B.J. et al. (1981) Arch Biochem Biophys 211: 605-612.
Paget et al. (1999) Mol. Microbiol. 3: 97-107 Perez, C.A. et al. (1998) Microbiology 144: 895-903.
Pfizer, E.S. et al. (1996) Cancer Res. 56: 2745-7.
Polakis, S. et al. (1972) J Bio1 Chem 247: 1335-1337.
Redenbach, M. etlal. (1996) Mol Microbiol 21: 77-95.
Rodriguez, E. and Gramajo, H. (1999) Microbiology 145:
3109-3119.
Saggerson, E. D. et al. (1992) Adv Enzyme Regul 32: 285-306.
Sambrook, J. et al. (1989) Molecular Cloning: a Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
Sanger, F. et al. (1977) Proc Nat1 Acad Sci USA 74: 5463-5467.
Strauch, E. et al. (1991) Mol Microbiol 5: 289-298.
Strohl, W.R. (1992) Nucleic Acids Res. 20: 961-974.
Sun et al. (1999) Microbiology 145: 2221-2227 Westrich, L. et al. (1999) FEMS Microbiol. Lett. 170: 381-387.
All the above references are hereby incorporated by reference in their entirety, individually and for all purposes.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: Plant Bioscience Limited (B) STREET: Norwich Research Park, Colney Lane (C) CITY: Norwich (D) STATE: Norfolk (E) COUNTRY: UK
(F) POSTAL CODE (ZIP): NR4 7UH
(ii) TITLE OF INVENTION: Ant:.ibiotic Production (II) (iii) NUMBER OF SEQUENCES: 32 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Bereskin & Parr (B) STREET: Suite 4000, Scotia P7.aza, Box 401, 40 King Street West (C) CITY: Toronto (D) STATE: Ontario (E) COUNTRY: Canada (F) POSTAL CODE (ZIP).- M5H 3Y2 (v) COMPUTER READABLE FORM:
(A) COMPUTER: IBM PC' compatible (B) OPERATING SYSTEM: PC-DOS/MS-DOS
(C) SOFTWARE: Patent:In Release #1.0, Version #1.25 (EPO) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,322,105 (B) FILING DATE: 23-OC:T-;?000 (viii) PATENT AGENT INFORMATION:
(A) NAME: David W. R. Langton (B) REGISTRATION N0: :?800 (C) REFERENCE NUMBER: 420-360 (2) INFORMATION FOR SEQ ID N0: :L:
( i ) SEQUENCE CHARACTERISTIc_:S
(A) LENGTH: 29 base p<zi.rs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
(2) INFORMATION FOR SEQ :CD N0: 2:
(i) SEQUENCE CHARACTERISTI~::S:
(A) LENGTH: 29 base pairs (B) TYPE: nucleic a~~id (C) STRANDEDNESS: single (D) TOPOLOGY: .Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TATTCTAGAC ATATGACCGT TTT(3GATGAG C; 31 (2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
(2) INFORMATION FOR SEQ ID NO: 'i:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: si.nc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 5:
TTATCTAGAC ATATGTCCCC TGC:CGAC 27 (2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTIC'.S:
(A) LENGTH: 32 base p<~irs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
ATGAATTCTA TGCATCGGGT CAGCGCCi~GC TG 32 (2) INFORMATION FOR SEQ :CD NO: '7:
( i ) SEQUENCE CHARACTERIS'PIC:>
(A) LENGTH: 30 base pairs (B) TYPE: nucl~=_ic acid (C) STRANDEDNESS: single (D) TOPOLOGY: .Linea:r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
(2) INFORMATION FOR SEQ ID N0: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>EQ ID N0: 8:
ATGAATTCAT GCATGAGGGA GCC'PCAATCC.; 30 (2) INFORMATION FOR SEQ ID N0: ~!:
(i) SEQUENCE CHARACTERISTIC'S:
(A) LENGTH: 26 base p<xirs (B) TYPE: nucleic acid (C) STRANDEDNESS: sinc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>EQ ID NO: 9:
(2) INFORMATION FOR SEQ ID NO: :LO:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE: nucleic aci<L
(C) STRANDEDNESS: sinc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
(2) INFORMATION FOR SEQ ID N0: 'L1.:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base p<~irs (B) TYPE: nucleic acid (C) STRANDEDNESS: s:inc~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
(2) INFORMATION FOR SEQ ID NO: 12:
( i ) SEQUENCE CHARACTERIS'rI~::
(A) LENGTH: 21 base pairs (B) TYPE: nucleic aci3 (C) STRANDEDNESS: single (D) TOPOLOGY: Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
(2) INFORMATION FOR SEQ ID NO: 13 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ::EQ ID NO: 13:
(2) INFORMATION FOR SEQ ID N0: 1.4:
(i) SEQUENCE CHARACTERISTIC:'S:
(A) LENGTH: 21 base pair's (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :3EQ ID N0: 14:
(2) INFORMATION FOR SEQ ID NO: :1.5:
(i) SEQUENCE CHARACTERISTI(:~S:
(A) LENGTH: 21 base pe~irs (B) TYPE: nuclE:ic ac:ic:i (C) STRANDEDNESS: si_n~xle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 15:
(2) INFORMATION FOR SEQ ID N0: 16:
( i ) SEQUENCE CHARACTERIS'CI~S
(A) LENGTH: 31 base p<~irs (B) TYPE: nucleic acir.~
(C) STRANDEDNESS: s:ingl.e (D) TOPOLOGY: .Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGAGGGCCGT GATGGCGGCG ACTTCCT~~GG G 31 (2) INFORMATION FOR SEQ ID N0: 17:
(i) SEQUENCE CHARACTERISTIC:>:
(A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: sin gle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GAGGAACTGG TACGCGCGGG CGGTACAAE'~C AAGCT 35 (2) INFORMATION FOR SEQ ID NO: 18 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: :~EQ ID N0: 18:
AAACGGGCCG GAGACTGTAC GGAGTCGACC; GCTCGCAATC CTTGCTCGGC TTCGTAGAGT 60 CGCTACATGA CCGTTTTGGA TGAGGCGCCC:~ C~GCGAGCCGA CGGACGCGCG CGGGCGGc3TG 120 (2) INFORMATION FOR SEQ ID NO: 1.9:
(i) SEQUENCE CHARACTERISTI<:'S:
(A) LENGTH: 120 base C:>airs (B) TYPE: nucleic acic:3 (C) STRANDEDNESS: douk:Re (D) TOPOLOGY: circul.axv (xi) SEQUENCE DESCRIPTION: >EQ ID NO: .L9:
CACCCGCCCG CGCGCGTCCG TCGGCTCGCce CGGCGCCTCA TCCAAAACGG TCATGTAGCG 60 ACTCTACGAA GCCGAGCAAG GATTGCGAGC_' CGTCGACTCC GTAC'AGTC'rC CGGCCCGTTT 120 (2 ) INFORMATION FOR SEQ 7:D N0: :?0 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: sin<!le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 20:
Met Thr Val Leu Asp Glu Ala Pro Gly Glu Pro Thr Asp Ala Arg Gly Arg Val (2) INFORMATION FOR SEQ :CD N0: ',?1.:
( i ) SEQUENCE CHARACTERIS'PI~~S
(A) LENGTH: 240 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 21:
ACCGGCCGAT ACGTTCTGCG CATGACTGTT CCGAACAGGG CC>GCGTGAAT TCCAACCGTT 180 GGCCGTCGGC GAGCCCCGAT CAGTAATCGA. GTGAGTGAGG A(sAATCT'PCG TGCGCAAGGT 240 (2) INFORMATION FOR SEQ ID NO: 22 (i1 SEQUENCE CHARACTERISTIC."S:
(A) LENGTH: 240 base x>airs (B) TYPE: nucleic acid (C) STRANDEDNESS: dout:~le (D) TOPOLOGY: circular:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CCACAAAGAA ACCGCGTGGC CCGCAGCACG CC'CTTACAGA GACCTTGACC ACACAGGAGG 60 GCTAGGGTTT CCCCCAGGAG TCCTGCGTA<.: CGCGGTACTA CAAGGGCT'PT CGGGGGTCGA 120 GCGAGCCTCG AATCACGCTC CGTGTGGGCh AGCTCACCAT TGGGGACGGG 'PCGAAGTGCC 180 GTGTCGGCAG TCCCTAAACT CGGCTTGTT~I' CAAGGAGGGA GCCTCAATCG TGCGCAAGGT 240 (2) INFORMATION FOR SEQ ID NO: 23 (i) SEQUENCE CHARACTERISTIC=S:
(A) LENGTH: 4 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: sincFle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ;vEQ ID N0: 23:
Val Arg Val Lys (2) INFORMATION FOR SEQ :
ID NO: '..24 (i) SEQUENCE CHARACTERISTIC~~:
(A) LENGTH: 968 base irs .pa (B) TYPE: nucleic aci:l (C) STRANDEDNESS: doubl e (D) TOPOLOGY: circu.Lar (xi) SEQUENCE DESCRIPTION:Q ID
SE: N0:
24:
TATTCTAGAC ATATGACCGT GCGCCGGGCGAGCCGAC'GGA CGCGCGCGGG60 TTTGGATGA~~
CGGGTGGCCG AGCTGCACGG GCGGC;GCTCGCC.'Gt3GCC'GAG 120 GATCCGTGCA TGAGAAGGCG
ACGGCGGCGC AGCACGCCAA ACGG(~ACGTGAGCGCATCGA GCTGCTCCTG180 GGc3CAAGCTG
GGTCGAGCAG
CTGGAGGCCA AGAAGCCGTA GTCATCACCGGC:TGGGGCAC GGTCGAGGGC300 CACCGACGGT
CCACGAC'rTC
CACGCCACGA AGATCCACAA ATGGC_CATCGCGGCCGC>TGC CCCGCTGGTG420 GA'I'CATGGAC
CGCCCGTATC
CACCAAGGCG
CGCGGCCTAC
GGCCCGGACCi GAGGAGATCACGCAGAACGG TCTGGGCGGC' ACcsCCGAGAC GTCCGGCGTG720 GC'CGAC:GTGC'.
TGCCACTTCGCCTACGACGA CGAGGAGACC:' AGGTCCGCTA CCTCCTCTCC780 TGCCTCGCCG
CTCCTCCCGCAGAACAACCG GGAGAACCCG AG'PCCTCCGA CCCCGTGGAC840 C.'CCCGCGCCG
CGCCGCTCGGACACCCTCCT CGACCTGGTC' GCAA.CCGCCC GTACGACATG900 CCGGCGGACG
(2) INFORMATION
FOR
SEQ
ID
NO:
25:
(i) SEQUENCE
CHARACTERISTIC:.'S:
(A) LENGTH: 590 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: si.nc:~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
Val Arg Lys Val Leu Ile Ala Asn Arg Gly Glu Ile Ala Val Arg V<~1 Ala Arg Ala Cys Arg Asp Ala Gly Ile A:La Sex' Val Ala Val Tyr Ala Asp Pro Asp Arg Asp Ala Le~.i His Val Arg Ala Ala Asp Glu Ala Phe Ala Leu Gly Gly Asp Thr Pro Ala Thr Ser Tyr Leu Asp Ile Ala Lys 50 'i5 60 Val Leu Lys Ala Ala Arg (~lu Ser Gly Ala Asp Ala Ile His Pro Gly Tyr Gly Phe Leu Ser Glu Asn Ala Glu Phe A1<~ Gln Ala Val Leu Asp Ala Gly Leu Ile Trp Ile Gly Pro Pro Pro Hi;> Ala Ile Arg Asp Arg Gly Glu Lys Val Ala Ala Arcfi His 11e Ala Gln Arg Ala G.Ly Ala Pro Leu Val Ala Gly Thr Pro Asp Pro Val Ser Gly Ala Asp Glu Val Val Ala Phe Ala Lys Glu His Gl;y Leu Pro Ile A1<~ Ile Lys Ala Ala Phe Gly Gly Gly Gly Arg Gly :Leu Lys Val Ala Ary Thr Leu Glu Glu Val 165 1.70 175 Pro Glu Leu Tyr Asp Ser Ala Val Arg Glu Ala Va.L Ala Ala Phe Gly Arg Gly Glu Cys Phe Val ~Jlu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Thr Gln Cys Leu Ala Asp Thr His Gly Asn Val Val Val Val Ser 210 215 2,20 Thr Arg Asp Cys Ser Leu Gln Arg Arg His Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Ser Glu Ala Gln Thr G1u Gln Leu Tyr Ser Ser Ser Lys Ala Ile Leu L~y:-, Glu Ala Gly 'ryr Gly Gly Ala Gly Thr Val Glu Phe Leu Val Gly Met: Asp Gly Thr Ile Phe Phe Leu Glu Val Asn Thr Arg Leu Gln Val Glu His Pro Val T.hr Glu Glu Val Ala Gly Ile Asp Leu Val Arg Glu stet: Phe Arg Ile Ala Asp Gly Glu Glu Leu 305 310 315 3'<:0 Gly Tyr Asp Asp Pro Ala Lei, Arg Gly His Ser Phe Glu Phe Arg Ile Asn Gly Glu Asp Pro Gly Arcl Gly Phe Leu Pro Ala Pro Gly Thr Val Thr Leu Phe Asp Ala Pro Th::' Gl.y Pro Gly Val. Arg Leu A:~p Ala Gly Val Glu Ser Gly Ser Val Ile Gly Pro Ala Trp Asp Ser Leu Leu Aia 370 =.75 380 Lys Leu Ile Val Thr Gly Arg Thr Arg Ala Glu Ala Leu Gln Arg A.La Ala Arg Ala Leu Asp Glu Phf=_ Thr Val Glu Gly Met Ala Thr Ala I.Le Pro Phe His Arg Thr Val Va.L Arg Asp Pro Ala Phe Ala Pro G1u Leu Thr Gly Ser Thr Asp Pro Phe Thr Val His Thr Arg Trp Ile Glu Thr Glu Phe Val Asn Glu Ile Ly,s Pro Phe Thr Thr Pro Ala Asp Thr Glu Thr Asp Glu Glu Ser Gly Arg G:Lu Thr Val Va:L Val Glu Val Gly Gly Lys Arg Leu Glu Val Ser Leu Pro Ser Ser Leu Gly Met Ser Leu Ala Arg Thr Gly Leu Ala Ala Gl:y Ala Arg Pro Lys Arg Arg Ala Ala Lys Lys Ser Gly Pro Ala Ala Ser Gly Asp Thr Leu Ala Ser Pro Met Gln Gly Thr Ile Val Lys Ile Ala Val C~lu Glu C~ly Gln Glu Val Gln Glu Gly Asp Leu Ile Val Val Leu Glu Ala Met Lys Met Glu Gln Pro Leu Asn Ala His Arg Ser Gly 'rhr Ile Lys Gly Leu Thr Ala Glu Val Gly Ala Ser Leu Thr Ser Gly Ala Ala I.le Cys Glu I.Le Lys Asp (2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTIC:'S.
(A) LENGTH: 590 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ;.E(~ ID NO: 26:
Val Arg Lys Val Leu Ile Ala ~~sn Arg Gly Glu Ile Ala Val Arg Val Ala Arg Ala Cys Arg Asp Ala Giy Ile A.La Ser Val Ala Val Tyr Ala Asp Pro Asp Arg Asp Ala Leu liis Val Arg Ala Ala Asp Glu Ala Phe Ala Leu Gly Gly Asp Thr Pro ~~:a Thr Ser Tyr Leu Asp Ile Ala Lys Val Leu Lys Ala Ala Arg Gl:~ 3er Gly Ala Asp Ala Ile His Pro Gly Tyr Gly Phe Leu Ser Glu Asn ,Ala Asp Phe Ala Gln Ala Val Leu Asp Ala Gly Leu Ile Trp Ile G1;~ Pr_o Pro Pro His Ala Ile Arg Asp Arg Gly Glu Lys Val Ala Ala Arg l-Lis I1e Ala Gln Arg Ala Gly Ala Pro Leu Val Ala Gly Thr Pro Asp :Pro Val Ser Gly A1<r Asp Glu Val Val 130 13':. 140 Ala Phe Ala Lys Glu His Gly Leu Pro Ile Ala Ile Lys Ala Ala Phe 145 150 1.55 160 Gly Gly Gly Gly Arg Gly Leu C~ys Val Ala Ard Thr Leu Glu Glu Val Pro Glu Leu Tyr Asp Ser Ala 'Val Arg Glu Ala Val Ala Ala Phe Gly 180 1.85 190 Arg Gly Glu Cys Phe Val Glnr.Arg Tyr Leu Asp Lys Pro Arg His Val Glu Thr Gln Cys Leu Ala Asp 'rhr His Gly Asn Val Val Val Val Ser Thr Arg Asp Cys Ser Leu Gln Arg Arg His Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Ser :,1u Ala Gln Thr Glu Gln Leu Tyr Ser Ser Ser Lys Ala Ile Leu Lys Glu Ala Gly Tyr Va.1 Gly Ala Gly Thr Val Glu Phe Leu Val Gly Met Asp Gly Thr I:le Ser Phe Leu Glu Val 275 2.30 285 Asn Thr Arg Leu Gln Val Glu His Pro Val Thr Glu Glu Val Ala Gly 290 29'> 300 Ile Asp Leu Val Arg Glu Met Phe Arg Ile Ala Asp Gly Glu Glu Leu Gly Tyr Asp Asp Pro Ala Leu Arg Gly His Ser Phe Glu Phe Arg Ile Asn Gly Asp His Pro Gly Arg (31y Phe Leu Pro Ala Pro Gly Thr Val Thr Leu Phe Asp Ala Pro Thr Gly Pro Gly Val Arg Leu Asp Ala Gly 355 :3fi0 365 Val Glu Ser Gly Ser Val Ile tziy Pro Ala Trp Asp Ser Leu Leu Ala Lys Leu Ile Val Thr Gly Arc7 'Phr Arg Ala Glu Ala Leu Gln Arg Ala Ala Arg Ala Leu Asp Glu Phe 'Phr Val Glu Gly Met, Ala Thr Ala Ile Pro Phe His Arg Thr Val Val.:~lrg Asp Pro Ala Phe Ala Pro Glu Leu Thr Gly Ser Thr Asp Pro Phe 'Phr Val His Thr Arc) Trp Ile Glu Thr 435 ~140 445 Glu Phe Val Asn Glu Ile Lys 'Pro Phe Thr Thr Pro Ala Asp Thr Glu 450 45'. 46!) Thr Asp Glu Glu Ser Gly Arc)G1u Thr Val Val Val Glu Val Gly Gly Lys Arg Leu Glu Val Ser Lei:WPro Ser Ser Leu Gly Met Ser Leu Ala Arg Thr Gly Leu Ala Ala G1;~,~11a Arg Pro Lys Arg Arg Ala Ala Lys Lys Ser Gly Pro Ala Ala Ser Gly Asp Thr Leu Ala Ser Pro Met Gln 515 !p20 525 Gly Thr Ile Val Lys Ile Ala 'J<~1 Glu Glu Gly Gln Glu Val Gln Glu 530 53'i 54() Gly Asp Leu Ile Val Val Le~.i G.Lu Ala Met Lys Met= Glu Gln Pro Leu Asn Ala His Arg Ser Gly Thr I.Le Lys Gly Leu Thr Ala Glu Val Gly Ala Ser Leu Thr Ser Gly Alai .~la Ile Cys Glu Ile Lys Asp (2) INFORMATION FOR SEQ TD N0: 27:
(i) SEQUENCE CHARACTERISTIi'S':
(A) LENGTH: 1773 base p~sirs (B) TYPE: nucleic acid (C) STRANDEDNESS: doubl,=_ (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GTGCGCAAGG TGCTCATCGC CAATCGTGGC'.GAAATCGCTG TCCC>CGTGGC CCGGGCCTGC 60 CGGGACGCCG GGATCGCGAG CGTGGCCGTt:'Pi~CGCGGATC CGGACCGGGA CGCGTTGCAC 120 GTCCGTGCCG CTGATGAGGC GTTCGCCCT(>Gt3TGGTGACA CCCt~CGCGAC CAGCTATCTG 180 TACGGATTCCTCTCGGAGAA TCCTGGAC;GC CGGCCTGATC300 CGCCGAGTTC.' GC;GCAGGCGG
TGGATCGGCCCGCCCCCGCACGCCATCCGC'(iACCGTGGCGAAAAGGTCGC CGCCCGCCAC360 ATCGCCCAGCGGGCCGGCGCCCCCCTGGTC.' CCGACCCCGT CTCCGGCGCG420 GCCGGCACCC
GACGAGGTCGTCGCCTTCGCCAAGGAGCAC GC=CCTGCCCATCGCCATCAA GGCCGCCTTC480 GGCGGCGGCGGGCGCGGCCTCAAGGTCGCC:'(:GCACCCTCGAAGAGGTC;CC GGAGCTGTAC540 GACTCCGCCGTCCGCGAGGCCGTGGCCGCC' 'f7.'CGGCCGCGGGGAGTGC;TT CGTCGAGCGC600 TACCTCGACAAGCCCCGCCACGTGGAGACC:'(:AGTGCCTGGCCGF~CACC.'CA 660 CGGCAACGTG
GTCGTCGTCTCCACCCGCGACTGCTCCCTC C:AGCGCCGCCACCAAAAGCT CGTCGAGGAG720 GCCCCCGCGCCCTTTCTCTCCGAGGCCCAC~~~C:GGAGCAGCTGTACTCATC CTCCAAGGCC;780 ATCCTGAAGGAGGCCGGCTACGGCGGCGCC:'(zC=CACCGTGGAGTTCCTC;GT CGGCATGGAC840 GGCACGATCTTCTTCCTGGAGGTCAACACC" (~CiCCTCCAGGTCGAGCAC;CC GGTCACCGAG900 GAAGTCGCCGGCATCGACTTGGTCCGCGAG i~TGTTCCGCATCGCCGACGG CGAGGAACTC960 GGTTACGACGACCCCGCCCTGCGCGGCCAC"CC:CTTCGAGTTCCG1CATCAA CGGCGAGGAC1020 CCCGGCCGCGGCTTCCTGCCCGCCCCCGGC' ACCGTCACCCTCTTCGACGC GCCCACCGGC1080 CCCGGCGTCCGCCTGGACGCCGGCGTCGAG ',CCCGGCTCCGTCATCGGC;CC CGCCTGGGAC1140 TCCCTCCTCGCCAAACTGATCGTCACCGGC:'(~GCACCCGCGCCGAGGCACT CCAGCGCGCG1200 GCCCGCGCCCTGGACGAGTTCACCGTCGAG t~GCATGGCCACCGC;CATC;CC CTTCCACCGC1260 ACGGTCGTCCGCGACCCGGCCTTCGCCCCC' GAACTCACCGGCTCCACGGA CCCCTTCACC1320 GTCCACACCCGGTGGATCGAGACGGAGTTC' GTCAACGAGATCAAGCCC;TT CACCACGCCC1380 GCCGACACCGAGACGGACGAGGAGTCGGG<:' TCGTCGTC:GA GGTCGGCGGC1440 (:GGGAGACGG
AAGCGCCTGGAAGTCTCCCTCCCCTCCAGC:' CCCi'GGCC;CG CACCGGCCTG1500 t:'l.'GGGCATGT
GCCGCCGGGGCCCGCCCCAAGCGCCGCGCG (3C;CAAGAAGTCCGC~CCCC:GC CGCCTCGGGC1560 GACACCCTCGCCTCCCCGATGCAGGGCACG ~~TCGTCAAGATCGCCGTCGA GGAAGGCCAG1620 GAAGTCCAGGAAGGCGACCTCATCGTCGTA CTCGAGGCGATGAAGATC~GA ACAGCCCCTC1680 AACGCCCACAGGTCCGGCACCATCAAGGGC' CTCACCGCCGAGGTCGGC;GC CTCCCTCACC1740 TCCGGCGCCGCCATCTGCGAGATCAAGGAC:' 1773 'PGA
(2) INFORMATION
FOR
SEQ
ID
NO:
:8:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:1773 base pairs (B) TYPE:cleic aci<a nu (C) STRANDEDNESS:
douk>l~=_ (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: S'EQ ID NO: 28:
GTGCGCAAGG TGCTCATCGC CAATCGTGGt:; GAAATCGCTG TCCC>CGTGGC CCGGGCCTGC 60 CGGGACGCCG GGATCGCGAG CGTGGCCGTC''P~1CGCGGATC CGGACCGGGA CGCGTTGCAC 120 GTCCGTGCCG CTGATGAGGC GTTCGCCCT(.~ GGTGGTGACA CCCCCGCGAC CAGCTATCTG 180 GACATCGCCAAGGTCCTCAA CGGACGCC.'AT CCACCCCGGC240 AGCCGCGCGC:'(;~AGTCGGGCG
TACGGATTCCTCTCGGAGAACGCCGAGTTC." TCCTGGAC:GC CGGCCTGATC300 (3C:GCAGGCGG
TGGATCGGCCCGCCCCCGCACGCCATCCGC'(~ACCGTGGCGAAAAGGTC.'GC CGCCCGCCAC360 ATCGCCCAGCGGGCCGGCGCCCCCCTGGTC (3CCGGCACCCCCGACCCC:GT CTCCGGCGCG420 GACGAGGTCGTCGCCTTCGCCAAGGAGCAC' GGCCTGCCCATCGC:CATC'AA GGCC:GCCTTC480 GGCGGCGGCGGGCGCGGCCTCAAGGTCGC<:'(:GCACCCTCGAAGAGGTGCC GGAGCTGTAC:540 GACTCCGCCGTCCGCGAGGCCGTGGCCGCC' TTCGGCCGCGGGGAGTGC:TT CGTCGAGCGC600 TACCTCGACAAGCCCCGCCACGTGGAGACC: CAGTGCCTGGCCGACACC:CA CGGCAACGTG660 GTCGTCGTCTCCACCCGCGACTGCTCCCTC'(:AGCGCCGCCACCAAAAGCT CGTCGAGGAG720 GCCCCCGCGCCCTTTCTCTCCGAGGCCCACI AC:GGAGCAGCTGTACTCATC CTCCAAGGCC780 ATCCTGAAGGAGGCCGGCTACGGCGGCGCC'..'(~GCACCGTGGAGTTCCTC:GT CGGCATGGAC840 GGCACGATCTTCTTCCTGGAGGTCAACACC (:GCCTCCAGGTCGAGCAC:CC GGTCACCGAG900 GAAGTCGCCGGCATCGACTTGGTCCGCGAG i~TGTTCCGCATCGCCGACGG CGAGGAACTC960 GGTTACGACGACCCCGCCCTGCGCGGCCA(:''PCCTTCGAGTTCCC~CATCAA CGGCGAGGAC1020 CCCGGCCGCGGCTTCCTGCCCGCCCCCGGC:' TCTTCGACGC GCCCACCGGC1080 AC:CGTCACCC
CCCGGCGTCCGCCTGGACGCCGGCGTCGAC~ 'L'C:CGGCTCCGTCATCGGC:CC CGCCTGGGAC1140 TCCCTCCTCGCCAAACTGATCGTCACCGGC (:CiCACCCGCGCCGAGGCACT CCAGCGCGCG1200 GCCCGCGCCCTGGACGAGTTCACCGTCGAC~(3GCATGGCCACCGC:CATCCC CTTCCACCGC1260 ACGGTCGTCCGCGACCCGGCCTTCGCCCC<'(>AACTCACCGGCT(:CACGGA CCCCTTCACC:1320 GTCCACACCCGGTGGATCGAGACGGAGTTC:(sTCAACGAGATCAAGCCC:TT CACCACGCCC1380 GCCGACACCGAGACGGACGAGGAGTCGGGC:.' TCG'7.'CGTC:GA 1440 CGGGAGACGG GGTCGGCGGC
AAGCGCCTGGAAGTCTCCCTCCCCTCCAGC'c:TGGGCATGTCCC'TGGCC:CG CACCGGCCTG1500 GCCGCCGGGGCCCGCCCCAAGCGCCGCGCC:; CCGGCCCC:GC CGCCTCGGGC1560 GC:CAAGAAGT
GACACCCTCGCCTCCCCGATGCAGGGCACC; ATCGTCAAGATCGC:CGTC:GA GGAAGGCCAG1620 GAAGTCCAGGAAGGCGACCTCATCGTCGTI, CTCGAGGCGATGAAGATGGA ACAGCCCCTC1680 AACGCCCACAGGTCCGGCACCATCAAGGG(:: AGG7.'CGGC:GC 1740 t:TCACCGCCG CTCCCTCACC
TCCGGCGCCGCCATCTGCGAGATCAAGGA(:: 1773 'Pt:3A
(2) INFORMATION
FOR
SEQ
ID
N0:
29:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:1584 base pairs (B) TYPE:
nucleic acid (C) STRANDEDNESS:
double (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: ~E',t;~ ID N0: 29:
ATGACCGTTT TGGATGAGGC GCCGGGCGAC> CCGACGGACG CGCGCGGGCG GGTGGCCGAG 60 CTGCACGGGA TCCGTGCAGC GGCGCTCGCC: GGGCCGAGTG AGAAGGCGAC GGCGGCGCAG 120 CACGCCAAGGGCAAGCTGACGGCACGTGAG (;GCATCGAGCTGCTCCTGGA CCCCGGCTCC180 TTCCGCGAGGTCGAGCAGCTGCGCCGGCAC'(~GGGCGACCGGGTTCGGCCT GGAGGCCAAG240 AAGCCGTACACCGACGGTGTCATCACCGGC.' TCGAGGGCCG CACGGTCTTC300 'CGGGGCACGG
GTCTACGCCCACGACTTCCGGATCTTCGGC'(~GCGCGCTGC,GCGAGGCC'.CA CGCCACGAAG360 ATCCACAAGATCATGGACATGGCCATCGCG GC:CGGTGCCCCGCTGGTGTC GCTGAACGAC420 GGCGCCGGCGCCCGTATCCAGGAGGGCGTC.'i~GCGCGCTCGCCGC:GTAC.'GG 480 CGGCATCTTC
CAGCGCAACACCAAGGCGTCCGGCGTCATC'(~CGCAGATCAGCGTGATGCT CGGCCCCTGC540 GCGGGCGGCGCGGCCTACAGCCCCGCCCTC' ACCGACTTCGTCTTCATGGT CCGCGACACC600 TCGCAGATGTTCATCACGGGCCCGGACGTC' (~l'CAAGGCGGTCAC'.CGGC'.GA 660 GGAGATCACG
CAGAACGGTCTGGGCGGCGCCGACGTGCAC (s(:.CGAGACGTCCGC>CGTC~TG CCACTTCGCC720 TACGACGACGAGGAGACCTGCCTCGCCGAO (37.'CCGCTACCTCCTCTCCCT CCTCCCGCAG780 AACAACCGGGAGAACCCGCCCCGCGCCGAi'I'CCTCCGACCCCGTGGAC:CG CCGCTCGGAC840 ACCCTCCTCGACCTGGTCCCGGCGGACGG(:' ACGACATGAC CAAGGTCATC900 AACCGCCCGT
GAGGAACTCGTCGACGAGGGCGAGTACCTi:~(3AGGTCCACGAGCC~TTGGGC CCGCAACATC960 ATCTGCGCGCTGGCCCGTCTCGACGGGCGG GTCGTGGGCATCGTCGCC'.AA CCAGCCGCAG1020 GCCCTGGCCGGTGTCCTGGACATCGAGGCi3 'PC'.GGAGAAGGCGGC:CCGCTT CGTCCAGATG1080 TGCGACGCCTTCAACATCCCGATCATCACT CTTCTGGACGTACC'CGGCTT CCTGCCCGGC1140 GTCGACCAGGAGCACGGCGGGATCATCCG<' CACGGCGCCAAGCTGCTCTA CGCGTACTGC1200 AACGCGACCGTGCCCCGGATCTCGCTGATC' CTGCGCAAGGCGTACGGAGG TGCTTACATC1260 GTCATGGACAGCCAGTCCATCGGCGCCGAC'(:TCACCTACGCCTC~GCCGAC CAACGAGATC1320 GCCGTCATGGGCGCGGAAGGTGC'CGCGAAC.'(~TCATCTTCCGCCGGCAGAT CGCCGACGCC1380 GAGGACCCCGAGGCCATGCGGGCGCGCATC~ G7.'CAAGGAGTACAAGTCCGA GCTGATGCAC1440 CCCTACTACGCGGCCGAACGCGGTCTGGTC'(~ACGACGTCATCGACCCC:GC CGAAACCCGC1500 GAGGTGCTGATCACGTCCCTGGCGATGCTC.' ACGCCGACCT GCCCTCCCGC1560 CACACCAAGC
(2) INFORMATION
FOR
SEQ
ID
N0:
.?0:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:198 base pairs (B) TYPE:cleic acid nu (C) STRANDEDNESS:
doux:>le (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: ~Et~ ID NO: 30:
ATGTCCCCTG CCGACATCCG CGTCGAGAA<> GGCCACGCCG AGCCCGAGGA AGTCGCCGCC 60 ATCACGGCCC TCCTCCTGGC CCGCGCCGC(:'GC:CCGCCCCG CCGAGATCGC GCCGACCCAC 120 GGCGGCGGCC GCGCCCGCGC CGGCTGGCGC' CGCCTGGAAC GCGAGCCGGG CTTCCGCGCC 180 (2) INFORMATION FOR SEQ ID NO: ~1:
(i) SEQUENCE CHARACTERISTI(:S:
(A) LENGTH: 527 amino ac:Lds (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: 3.inear (xi) SEQUENCE DESCRIPTION: SE~~ ID NO: 31:
Met Thr Val Leu Asp Glu Ala Pro Gly Glu Pro Thr Asp Ala Arg Gly Arg Val Ala Glu Leu His Gly I.Le Arg Ala Ala Ala Leu Ala Gly Pro Ser Glu Lys Ala Thr Ala Ala Gln His Ala Lys Gly Lys Leu Thr Ala Arg Glu Arg Ile Glu Leu Le~.x Leu Asp Pro Gly Ser Phe Arg Glu Val Glu Gln Leu Arg Arg His Arr3 .Ala Thr Gly Phe Gl.y Leu Glu Ala Lys Lys Pro Tyr Thr Asp Gly Va'~ I.Le Thr Gly Trp Gly Thr Val Glu Gly Arg Thr Val Phe Val Tyr Al<i:H:is Asp Phe Arch Ile Phe Gly Gly Ala Leu Gly Glu Ala His Ala Th:r Lys Ile His Lys Il.e Met Asp Met Ala Ile Ala Al.a Gly Ala Pro Le~.z Val Ser Leu Asn Asp Gly Ala Gly Ala Arg Ile Gln Glu Gly Val Sea- A=La Leu Ala Gly Tyr Gly Gly Ile Phe 145 150 15'> 160 Gln Arg Asn Thr Lys Ala Ser 31y Val Ile Pro Gln Ile Ser Val Met Leu Gly Pro Cys Ala Gly Gly Ala Ala Tyr Sex Pro Ala Leu Thr Asp Phe Val Phe Met Val Arg Asp 'rhr Ser Gln Met Phe Ile Thr Gly Pro Asp Val Val Lys Ala Val Thr G.Ly Glu Glu Ile Thr Gln Asn Gly Leu Gly Gly Ala Asp Val His Ala 31u Thr Ser Gly Va.L Cys His Phe Ala Tyr Asp Asp Glu Glu Thr Cys Leu Ala Glu Va1 Arg Tyr Leu Leu Ser Leu Leu Pro Gln Asn Asn Arg Glu Asn Pro Pro Arg Ala Glu Ser Ser Asp Pro Val Asp Arg Arg Se:r Asp Thr Leu Leu Asp Leu Val Pro Ala Asp Gly Asn Arg Pro Tyr Asp Met Thr Lys Va.L Ile Glu Glu Leu Val Asp Glu Gly Glu Tyr Leu Glu 'J<~l His Glu Arg Trp Ala Arg Asn Ile Ile Cys Ala Leu Ala Arg Le~.i.~sp Gly Arg Val Val Gly Ile Val Ala Asn Gln Pro Gln Ala Leu Ala G.Ly Val Leu Asp Ile Glu A1a Ser Glu Lys Ala Ala Arg Phe Val Gln Met Cys Asp Al<~ Phe Asn Ile Pro Ile Ile Thr Leu Leu Asp Val Pro ~3.Ly Phe Leu Pro Gly Val Asp Gln Glu His Gly Gly Ile Ile Arg Hi:~ Gly Ala Lys Leu Leu Tyr Ala Tyr Cys 385 390 39'_i 400 Asn Ala Thr Val Pro Arg Ile Ser Leu Ile Leu Arg Lys Ala Tyr Gly Gly Ala Tyr Ile Val Met Asp Ser Gln Ser Ile Gly Ala Asp Leu Thr Tyr Ala Trp Pro Thr Asn Glu I.Le Ala Val Met Gly Ala Glu Gly Ala Ala Asn Val Ile Phe Arg Arc) G.Ln Ile Ala Asp Ala Glu Asp Pro Glu Ala Met Arg Ala Arg Met Va.I. Lys Glu Tyr Ly:~ Ser Glu Leu Met His Pro Tyr Tyr Ala Ala Glu Arg G.Ly Leu Val Asp As,p Val Ile Asp Pro Ala Glu Thr Arg Glu Val Leu I.Le Thr Ser Leu Ala Met Leu His Thr Lys His Ala Asp Leu Pro Se:r.Arg Lys His Gly Asn Pro Pro Gln (2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino <acids (B) TYPE: amino acid (C) STRANDEDNESS: sin:)1.2 (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>E?~ ID N0: 32:
Met Ser Pro Ala Asp Ile Arch Val Glu Lys G'ly Hi.s Ala Glu Pro Glu Glu Val Ala Ala Ile Thr Ala Leu Leu Leu A1<~ Arg Ala Ala Ala Arg Pro Ala Glu Ile Ala Pro Th:v His Gly Gly Gly Arg Ala Arg Ala Gly Trp Arg Arg Leu Glu Arg Glu Pro Gly Phe Arg Ala Pro His Ser Trp Arg
carboxylase of S. coelicolor, and provide detailed genetic and biochemical characterization. The enzyme complex contains a unique sub-unit composition and appears to be the main pathway for the biosynthesis of malonyl-CoA, one of the key metabolites in the linkage between primary and secondary metabolism. An alternative pathway for the biosynthesis of malonyl-CoA is also proposed for cultures growing in malonate, and it most probably involves the matB
and matC homologues of R. trifolii. However, even in these growing conditions, the acyl-CoA carboxylase seems to be essential for the viability of the micro-organism.
Suaanary of invention Two genes accB and accE, forming a single operon, have been cloned from Streptomyces coelicolor A(3)2. The deduced amino acid sequence of AccB showed high similarity to carboxyl transferase of several propionyl- or acyl-CoA
carboxylases of different actinomycetes. By contrast, AccE
did not show any significant homology with protein sequences deposited in the GenBank data base. Heterologous expression of accB and accE in Escherichia coli and in vitro reconstitution of enzyme activity in the presence of the biotinylated component AccAl or AccA2 confirmed that AccB was the carboxyl transferase subunit of an acyl-CoA
carboxylase.
These experiments also established that AccE was a necessary component to obtain a fully active enzyme complex, whose subunit composition seems to be unique within this type of carboxylase. Gene disruption experiments clearly determined that AccB was essential for S. coelicolor viability. This protein together with AccA2, a biotinylated component essential for the .viability of this micro-organism (Rodriguez and Gramajo, 1999), are the best candidates to form an acyl-CoA carboxylase, whose main physiological role is, most probably, the biosynthesis of malonyl-CoA.
Transcriptional studies of accBE, accA2 and acct have shown that accBE and accA2 are mainly expressed during vegetative and transition phase of growth, although some expression of these genes also occurred during stationary phase where they should provide the malonyl-CoA subunits for secondary metabolites biosynthesis. accA1 is only expressed during the transition phase of growth and its role in the formation of a carboxylase complex involved in providing the substrate for polyketide compounds of S.
coelicolor is discussed.
Finally, an alternative route for the biosynthesis of malonyl-CoA is proposed when malonate is used as a carbon source. However, this route seems unable to substitute the main one, determined by the acyl-CoA carboxylase.
Accordingly, in a first aspect, the present invention provides a nucleic acid comprising a nucleic acid sequence which encodes an AccB polypeptide and/or an AccE
polypeptide, or a nucleic acid sequence complementary thereto.
In a second aspect, the present invention provides a nucleic acid comprising a nucleic acid sequence which encodes an AccAl and/or AccA2 polypeptide, or a nucleic acid sequence complementary thereto. It is believed that such nucleic acid was not made available to the public S
before 24 October 1999, when the amino acid sequences of these polypeptides were disclosed in an oral presentation.
Exemplary nucleic acid sequences encoding the AccB, AccE, AccAl and AccA2 polypeptides are given herein. Preferred embodiments of the invention include such sequences.
However, it would be a matter of routine for the skilled person to obtain other nucleic acid sequences encoding these polypeptides, e.g. by introducing mutations which do not alter the encoded amino acid sequence, by virtue of the degeneracy of the genetic code, or by introducing mutations which alter the encoded amino acid sequence, within limits as defined below. Moreover, nucleic acids encoding variants of the polypeptides may be obtained e.g. by screening different strains of S. coelicolor or closely related species of Streptomyces using degenerate probes based on the sequences given herein.
Preferred nucleic acids of the first and second aspects encode AccB and AccE polypeptides along with an AccAl and/or an AccA2 polypeptide (preferably AccA2).
The nucleic acid sequences encoding Acc polypeptides are preferably in operative association with regulatory sequences, e.g. sequences which enable constitutive or inducible expression in Streptomyces species. Examples of plasmids which include such regulatory sequences and of suitable promoters are given herein. A suitable inducible promoter is tipA (inducible by thiostrepton); suitable constitutive promoters are ermE and the optimised ermE*.
Alternatively, naturally occurring nucleic acid sequences may be in operative association with the regulatory sequences with which they are normally associated, or corresponding regulatory sequences from homologous genes in other strains or species. For example, the nucleic acid sequences may be in operative association with the corresponding regulatory (e. g. promoter) sequences defined herein.
For detailed protocols relevant to this and other aspects, see standard reference texts, such as Sambrook et al.
(1989) and Hopwood et al. (1985).
In a third aspect, the present invention separately provides AccB, AccE, AccAl and AccA2 polypeptides having amino acid sequences encoded or encodable by the respective nucleic acid sequences referred to in the first and second aspects.
In a fourth aspect, the present invention provides: vectors containing the nucleic acids of the first and second aspects (preferably vectors, e.g. plasmids, suitable for transforming Streptomyces species for expression therein) and cells, particularly Streptomyces cells, transformed with such vectors. Furthermore, the present invention provides a method of producing a secondary metabolite of a Streptomyces species, the method comprising culturing such transformed Streptomyces cells and extracting the secondary metabolite from the cell culture. The metabolite may be purified and/or formulated as a commercial product according to standard procedures.
In a fifth aspect, the invention provides a method of modifying a secondary metabolite-producing strain of a Streptomyces species to increase production of said secondary metabolite, the method comprising modifying said strain to express, or to increase expression of, nucleic acid encoding one or more polypeptides selected from the group consisting of AccB, AccE, AccAl and AccA2.
In a sixth aspect, the present invention provides a method of modifying a strain of a Streptomyces species to increase ACCase and/or PCCase activity, the method comprising modifying said strain to express, or to increase expression of, nucleic acid encoding one or more polypeptides selected from the group consisting of AccB, AccE, AccAl and AccA2.
In a seventh aspect, the present invention provides a modified strain of a Streptomyces species, produced or producible according to the method of the fifth or sixth aspect. Also provided are cells of said strain, methods of producing secondary metabolites comprising culturing said cells and extracting the secondary metabolite, which may be purified and/or formulated as a commercial product.
In an eighth aspect, the invention provides a method of increasing production of a secondary metabolite in cells of a Streptomyces species, the method comprising culturing said cells in the presence of exogenous malonate, preferably at a concentration of at least about 0.1%, more preferably at least about 0.2%, 0.4%, 0.5%, 0.75% or 1%, though higher concentrations may be used. 1% represents 1g per 100 ml of medium.
Detailed Description In relation to the fifth and sixth aspects, the modification preferably provides for increased expression of nucleic acid encoding more than one of AccB, AccE, AccAl and AccA2, more preferably at least AccB and AccE or at least AccB and either AccAl or AccA2, more preferably AccB, AccE and either AccAl or AccA2. Of AccAl and AccA2, AccA2 is preferred. Increased expression of nucleic acid encoding both AccAl and AccA2 (usually in combination with AccB and optionally AccE) is also contemplated.
The methods of the fifth and sixth aspects preferably include a step of transforming a Streptomyces ce~.l with a said nucleic acid under the control of a constitutive or inducible promoter, preferably a strong promoter. However, the expression of existing said nucleic acid could be increased, e.g. by placing them under the control of a stronger promoter sequence or sequences.
Exogenous said nucleic acid can replace existing said nucleic acid in the cell, or can be added without removing or functionally deleting existing said nucleic acid.
Acc polypeptides and acc genes In the definitions herein of the invention, and of the scope of protection (but not, except where the context requires otherwise, in the experimental sections), the term AccB is intended to include not only a polypeptide having the deduced amino acid sequence encoded by the nucleic acid sequence of Fig. 12 (though this is a preferred embodiment), but also a polypeptide which is a variant (e.g. an allelic or isoallelic variant) or a derivative of said polypeptide, having at least about 60% amino acid identity with said polypeptide, preferably at least about 65%, 70% or 75%, especially preferably (in view of the similarity of AccB as disclosed herein to another protein of unconfirmed function) at least about 80%, 85%, 90%, 92%, 94%, 96%, 98% or 99% identity. Such a variant or derivative may possess any one or more of the biological properties of the wild-type AccB protein, as disclosed herein, e.g. complex formation with AccAl, .AccA2 and/or AccE (or allosteric regulation by AccE), ACCase and/or PCCase activity when AccB is co-expressed with AccAl, AccA2 and/or AccE, or increased secondary metabolite production when AccB is overexpressed in Streptomyces species (preferably in conjuction with AccAl, AccA2 and/or AccE).
Similarly, the term AccE is intended to include not only a polypeptide having the deduced amino acid sequence encoded by the nucleic acid sequence of Fig. 13 (though this is a preferred embodiment), but also a polypeptide which is a variant (e.g. an allelic or isoallelic variant) or a derivative of said polypeptide, having at least about 40%
amino acid identity with said polypeptide, preferably at least about 50%, 60%, 70%, 80%, 85%, 90%, 95% or 99%
identity. Such a variant or derivative may possess any one or more of the biological properties of the wild-type AccE
protein, as demonstrated herein, e.g. complex formation with AccAl, AccA2 and/or AccB (or allosteric regulation of AccB), ACCase and/or PCCase activity when AccE is co-expressed with AccB, or increased secondary metabolite production when AccE is overexpressed in Streptomyces species (preferably in conjuction with AccB).
Similarly, the terms AccAl and AccA2 are intended to include not only the polypeptides having the amino acid sequences shown in Fig. 11 (though these are respective preferred embodiments), but also polypeptides which are variants (e.g. allelic or isoallelic variants) or are derivatives of said polypeptides, having at least about 75%
amino acid identity with said polypeptide, preferably at least about 80%, 85%, 90%, 92%, 94%, 96%, 98% or 99%
identity. Such variants or derivatives may possess any one or more of the biological properties of the wild-type AccAl or AccA2 polypeptides, as disclosed herein, e.g. complex formation with AccB and/or AccE, ACCase and/or PCCase activity when AccAl or AccA2 is co-expressed with AccB
and/or AccE, or increased secondary metabolite production when AccB is overexpressed in Streptomyces species (preferably in conjuction with AccB and/or AccE).
A variant or a derivative of a given peptide may have one or more of internal deletions, internal insertions, terminal truncations, terminal additions, or substitutions of one or more amino acids, compared to the given peptide.
References to nucleic acid encoding AccAl, AccA2, AccB
and/or AccE (or to accAl, accA2, accB and/or accE genes) should be interpreted accordingly.
In relation to the first aspect, preferred nucleic acids comprise a nucleic acid sequence having at least about 50%, preferably at least about 60%, 70%, 80%, 85%, 90%, 95%, 98%
or 99% nucleic acid sequence identity with the accB nucleic acid sequence shown in Fig. 12. Other preferred nucleic acids comprise a nucleic acid sequence having at least about 40%, preferably at least about 50%, 60%, 70%, 80%, 85%, 90%, 95%, 98% or 99% nucleic acid sequence identity with the accE nucleic acid sequence shown in Fig. 13.
Similarly, in relation to the second aspect, preferred nucleic acids comprise a nucleic acid sequence having at least about 50%, preferably at least about 60%, 70%, 80%, 85%, 90%, 95%, 98% or 99% nucleic acid sequence identity with the accAl or accA2 nucleic acid sequence shown in Fig.
11.
Secondary metabolites and Streptomyces species While the experimental disclosure herein relates to the production of Act (actinomycin) and Red (undecylprodigiosin) in S. coelicolor A3(2) (strain M145), it is thought that the teaching is applicable to other strains of Streptomyces in particular, it is thought that overexpession of all three Acc polypeptides (i.e. AccB, AccE and AccAl and/or AccA2) will lead to increased malonyl-CoA production in substantially any Streptomyces species or even in other actinomycetes or in fungi (which also produce polyketide compounds). Since malonyl-CoA is an essential metabolic substrate, it is thought that this will lead to greater yield of desired secondary metabolites (for which see page 1), e.g. polyketides (including antibiotic polyketidss) and fatty acids.
Preferred secondary metabolites are, however, antibiotics, especially Act and Red.
Preferred Streptomyces species are the closely related species S. coelicolor, S. violaceoruber, S. lividans and S.
parvulus, especially S. coelicolor. Strains of such species are commonly available, e.g. from the ATCC, for example under ATCC deposit numbers 12434 for S. parvulus and 19832 for S. violaceoruber. S. coelicolor A3(2) and S.
lividans 66 are available from the John Innes Culture Collection (Norwich, UK) under JICC deposit numbers 1147 and 1326, respectively. However, the invention is not limited to such particular strains.
Acetyl-CoA
In preferred embodiments, present invention further provides for the increased production in Streptomyces of acetyl-CoA, since it is thought that when ACCase activity is increased by the methods and means of the present invention, production of malonyl-CoA may become limited by the availability of the substrate acetyl-CoA. It is proposed that increased acetyl-CoA production could then lead to a further increased rate of malonyl-CoA production and hence secondary metabolite production. For example, oils or fatty acids could be used as the carbon ~urce (together with glucose); fatty acids are degraded by b-oxidation giving high levels of acetyl-CoA.
Sequence identity "Percent (%) amino acid sequence identity" is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the sequence with which it is being compared, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. The % identity values used herein are generated by WU-BLAST-2 which was obtained from Altschul et al. (1996); http://blast.wustl/edu/blast/README.html. WU-BLAST-2 uses several search parameters, most of which are set to the default values. The adjustable parameters are set with the following values: overlap span =1, overlap fraction = 0.125, word threshold (T) - 11. The HSPS and HSPS2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched; however, the values may be adjusted to increase sensitivity. A % amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the "longer"
sequence in the aligned region, multiplied by 100. The "longer" sequence is the one having the most actual residues in the aligned region (gaps introduced by WU-BLAST-2 to maximize the alignment score are ignored).
"Percent (%) nucleic acid sequence identity" is defined as the percentage of nucleotide residues in a candidate sequence that are identical with the nucleotide residues in the sequence under comparison. The identity values used herein were generated by the BLASTN module of WU BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively.
The work underlying the invention will now be described in detail, by way of example only, with reference to the accompanying figures.
Figures Fig. 1 Organization of the genomic region of S.
coelicolor M145 chromosome harbouring accB and accE
genes. A. Genetic and physical map of the 6.2 kb insert in pRM08. The secondary structure downstream accE represents a rho-independent transcriptional terminator. Fragments I and II were amplified by PCR
with the pair of oligos accBup-accBdown and accBEup-accBEdown respectively, uniquely labelled at the 5'-end (*) and used as probes in transcriptional analysis of the accBE operon. B. Physical map of the DNA
fragments cloned in pET22b(+) and used for the heterologous expression of accB and/or accE. Only the most relevant restriction sites are shown: B, BamHI;
Bc, BclI; E, EcoRI; K, KpnI; Nd, NdeI; N, NotI; S, SpHI .
Fig. 2 Attempted d:i.sruption of accB. A. Diagram showing the integration of pTR124 through one of the accBE
flanking regions and the resolution of the cointegrate by a second event of homologous recombination. The crossed out arrow indicates the impossibility of obtaining the reF>lacement of the wild-type accB by the HygR mutant allele. B. The integration of a second copy of the accB~; genes in the ~C31 att site of T'124 (to yield strain T149) allowed the replacement of the wild-type accB by a mutant allele containing the Hyg resistance cassette.
Fig. 3 Growth~phasF~ dependent expression and transcription start site of the accBE operon. A. S1 nuclease mappin:.~ of accB, actII-ORF4 and hrdB, using RNA isolated from a liquid time course of S.
coelicolor M145. Exp, Trans and Stat indicate the exponential, transition and stationary phase of growth, respective:Ly.B.rsEQ.ID.NO~s.:l8-2o) The nucleotide sequences of both strands from the accB promoter region are shown.
The arrow indicates the mcst likely transcription.
start point for the accBE promoter, as determined. by Sl nuclease mapp:i.ng. The potential -10 and -35 regions for the accBEp ar-a underlined. C. S1 nuclease mapping of the accB accE intergenic region using a 563 nt probe. FLP repre~~ents the full-length RNA-protected fragment that is ~.3 nt shorter than the probe.
Fig. 4 Growth-phase dependent expression of accA2 and accA2. S1 nuclease mapping of accA2 (A) and accAl. (B), using RNA isolated from a liquid time course of S.
coelicolor M145.
Fig. S Mapping of t:he accA2 and accAl transcription start point. A. H:i.gh resolution S1 nuclease mapping of the Send of the accA2 transcript. S1, RNA-protected products of the S1 nuclease protection assay. Lanes labelled A, C, G and T indi;~ate a dideoxy sequencing ladder using the same oligonucleotide that was used to make the S1 probe (accA2down). B. High resolution S1 nuclease mapping of the S' end of the accA1 transcript. S1, RNA-protected products of the S1 nuclease protect::ion assay. Lanes labelled T, G, C and A indicate a dideoxy sequencing ladder using the same oligonucleotide that was used to make the S1 probe (accAldown).C.(sEQ.I~.NGS.:21-23) Sequence of the accA2 and accAl upstream regions, indicating the most likely transcript;~on st::art points for the promoters of each of the accAl and accA2 genes (bent arrows). The potential -10 arid. -35 sequences for the accAl and accA2 promoters are underlined. The potential ribosomal bindi::ca sites (rbs) are highlighted with bold letters. The 16 nt direct repeats (DR) found upstream o~ the t:a~anscription start point of accAlpl are indicated with straight arrows.
Fig. 6 Construct.ioh. and analysis of the accBE
conditional mutant:. A. Diagram showing the integration of ~>IJ8600 in strain M86 and the expected organisation of t:3ne Campbell integration of pTR94 in M94. Restriction sites: B. BamHI; N, NotI; Nd, NdeI;
S, SacI; Sp, Sphl; Xb, XbaI. B. Hybridisation analysis of Southern blot. of SacI-digested DNAs from M145, M86 and M94. The prcbe was the internal NdeI-XbaI
fragment of accB shown in A (see Fig. 10).
Fig. 7 Expression of the acyl-CoA components in M86 and M94. A. SDS-PAGE of cell-free extracts of S.
coelicolor M86 and. M94 strains grown in YEME medium containing 10 ~g/m.l Am with or without the addition of S ~g/ml Th. B. A duplicate of t:he SDS-PAGE gel shown in A was subjected to Western blotting and stained for biotinylated prouteins by using alkaline phosphatase-streptavidin conjugate.
Fig. 8 Growth curves of M145, M86 and M94 strains. 108 spores of strain's M86 and M94 were inoculated in YEME
medium containing 10 ~g of Am or 10 ~tg/ml Am and 5 ~g/ml of Th'. 108 ,pores of M145 were inoculated in YEME. The growth was followed by measuring OD4sown.
Fig. 9 Morphologic:a.l and physi-ological differentiation of M86 and M94 i.n the presence of Th. Spores of M86 and M94 were spread in R2 or R~s medium containing 10~g/ml Am. A drop containing 1 ~g of Th was spotted in the centre of each plate. The picture shows the results obtained after the incubation of the plate at 30°C for 48h.
Flg. 10 (sEQ.ID.NO.:24) 'he sequence of the amplification product obtained from accB using primers TC16 and TC17. NdeI
(CATATG) and XbaI (TCTAGA) :sites introduced into the accB by the primers are shown in bold. The 1 kb NdeI-XbaI fragment was cloned into pIJ8600.
Fig. 11 A. (SEQ.ID.NOS.:25--26) Amino acid sequences and B. iSEQ.ID.N05.:27-2ii Nucleic acid sequences of accA1 and accA2.
Fig. 12 (SEQ.ID.NOS.:29 and 31) Nucleic acid sequence of accB.
Fig. 13 (SEQ.ID.NOS.:30 and 32) Nucleic acid sequence of accE.
Fig. 14 Plasmid map for the construction of an expression vector for accA, accB and accE.
Example 1: Cloning of accBE genes pccB of S. coelicolor (Rodriguez and Gramajo, 1999) was used as an heterologaus probe in Southern blot experiments.
When a BamHI digest af: S. coelicalor DNA was probed with pccB and washed under low stringent conditions, a second, low hybridising, band was readily detected (data not shown). The target sequence was cloned from a size-enriched library as a 2.5 kb BamHl fragment and sequenced as described in Experimental Procedures (below). The sequence revealed the presence of an in'omplete ORF with high homology to pccB. The complete gene was finally cloned as a 6 kb SstI fragment yielding pR~I0~3 (:Fig. 1). Sequencing and analysis of this DNA 1_z:agment revealed the presence of an ORF that exhibited enca.-tc-end similarity with a putative decarboxylase (though the real functicn is unknown) of S.
cyanogenous (Westri~:t,. et al . , 1999) , with the S.
coelicolor PccB (Rodriguez and Gramajc, 1999) and with the (3-subunit (PccB! of tree Sac. erythraea PCCase (Donadio, et al., 1996). The leve~~~~r of identity were 76%, 57% and 56%, respectively. The gene encoding this new putative carboxyl transferase was callec_~ accB.
Surprisingly, the sequence also revealed the presence of a small ORF, designated accE, whose start codon is only 17 by downstream of the termination codon of acc8. A 17 nt inverted repeat, which could function as a factor-independent bidirectional transcriptional terminator (reviewed in Lewin, 1994), separates accE from three convergent ORFs with homology to putative proteins of M.
tuberculosis with unknown functions. The putative AccE
polypeptide has a deduced molecular mass of 7.07 kDa and no significant homology to this polypeptide was found in a search of sequences deposited in the GenBank database.
Upstream of accB there is an ORF highly homologous to several known hialuronidases.
Example 2: accB is essential for S. coelicolor viability An accB mutant was constructed by gene replacement (Fig.
2A). A Hyg-resistant cassette was cloned in the unique BamHI site present in the coding sequence of accB, contained in pTR80. After an intermediate construction in pIJ2925, a BglII fragment containing the mutated allele was finally cloned in the conjugative vector pSETl5l. The resulting plasmid, pTR124, was cloned into the E. coli donor strain ET12567/pUZ8002 and transferred by conjugation into M145. Exconjugants were selected for ThR HygR for a simple crossover event. One of the exconjugants, named T124, was taken through four rounds of non-selective growth (SFM Hyg) to promote homologous recombination for the second crossover. Spores were plated to give single colonies and several thousands screened for Th sensitivity (which would have reflected successful gene replacement), but no Ths isolates were obtained. This result suggested that accB is essential for S. coelicolor viability.
The inventors proposed, however, that if a second copy of accB were present in the chromosome of T124, a second crossover event (leading to the replacement of the wild type gene by the HygR mutant allele) would then be allowed.
To confirm this hypothesis, pTR149, which contains a copy of the accBE genes under its own promoter (see Experimental procedures, Fig. 2B), was first integrated in the ~C31 attB
site of T124. (The introduction of a second copy of both genes into the chromosome was prompted by the probability of a polar effect on accE taking place after the gene replacement event and because AccE is important for the recovery of a fully active acyl-CoA carboxylase complex -see in vitro reconstitution experiments below). The resultant strain T149 (HygR, ThR, AmR) was passed through three rounds of sporulation on SFM Hyg Am and after the screening of approximately 500 colonies, 20 were found to be AmR HygR Ths. The final chromosomal organization of accB
in each of the strains constructed (T124, T149 and T149A), was analyzed by Southern blots using an internal fragment of accB as a probe.
Example 3: Heterologous expression of accB, accE and in vitro reconstitution of an acyl-CoA carboxylase complex.
Since accB proved to be essential for S. coelicolor viability, we could not clearly evaluate in vivo the physiological function of this gene product.
In order to study if AccB and AccE were components of an acyl-CoA carboxylase complex, we attempted in vitro reconstitution of the enzyme activity by mixing E. coli cell-free extracts containing the AccB and AccE with cell-free extracts containing the biotinylated sub-units AccAl or AccA2. E. coli does not contain an ACCase enzyme, so ACCase activity cannot be assayed directly by carboxylation of acetyl-CoA (Polakis et al., 1972); therefore, the acyl-CoA carboxylase activity measured in these crude extracts exclusively represents the activity of the heterologous complexes reconstituted in vitro.
Heterologous expression of accB and accE was att~npted by introducing a NdeI site at the ATG start codon of accB;
after an intermediate construction (see Experimental procedures), accBE was cloned as a NdeI-SacI fragment into pET22(b), yielding pTR88 (Fig. 1). Transformation of E.
coli BL21(DE3) with this plasmid yielded strain RG8 (Table 1). Crude extracts of RG8, prepared from IPTG-induced cultures, showed a clear over-expression of a 64 kDa protein in a 15 s SDS-PAGE, corresponding to AccB; by contrast, AccE was not clearly visualised by Coomassie blue staining of the same gel (data not shown). In vitro reconstitution of an acyl-CoA carboxylase was then attempted my mixing crude extracts prepared from IPTG-induced cultures of RG8 with cell-free extracts of the E.
coli strains RG7, which overproduces the biotinylated protein AccAl. After incubation for 1 h at 4 °C, the mixture was assayed for ACCase and PCCase activity. As shown in Table 2 an enzyme complex showing high levels of both ACCase and PCCase activities was successfully reconstituted To study if cell-free extracts containing AccB but not AccE
were capable of reconstituting an active acyl-CoA
carboxylase complex when mixed with cell-free extracts containing AccAl, we constructed a new pET22(b) derivative that only expresses accB. For this we took advantage of the NotI site present approximately in the middle of the coding sequence of accE and cloned the NdeI-NotI fragment from pTR88 into the expression vector, yielding pTR90 (Fig. 1).
Cell-free extracts of RG9, obtained by transformation of BL21(DE3) with pTR90, showed high levels of soluble AccB
after IPTG induction. However, the acyl-CoA carboxylase complex reconstituted in vitro, after mixing cell-free extracts of RG9 (AccB) and RG7 (AccAl), showed much lower levels (approximately 10~) of ACCase and PCCase activities than the acyl-CoA carboxylase previously obtained by mixing RG8 with RG7 cell-free extracts (Table 2). Since the levels of AccB in cell-free extracts of RG8 and RG9 were essentially the same, we inferred from these experiments that AccE was necessary in order to obtain a fully active acyl-CoA carboxylase complex.
To confirm that the absence of AccE was the responsible of the lower acyl-CoA carboxylase activities, we studied the effect that the addition of cell-free extract containing AccE, had on the crude extracts containing AccB and AccAl proteins. For this we first constructed strain RG10 (BL21(DE3) containing pTR107) that expresses high levels of soluble AccE (data not shown).
When cell-free extracts of RG10 where mixed with those of RG9 (AccB) and RG7 (AccAl) and incubated for 1h on ice, the levels of enzyme activity where at least five times higher than in the control experiment, without the addition of AccE (Table 2). Although the results presented in this section clearly show that AccE is a functional part of the acyl-CoA carboxylase, enzyme kinetics studies with purified components will be necessary to understand more precisely the role of this protein in the enzyme complex activity.
Similar results were obtained in all the reconstitution experiments mentioned above when AccAl was replaced by AccA2 as the biotinylated component of the acyl-CoA
carboxylase, indicating that either AccAl or AccA2 can be efficiently used as the a-subunit of this enzyme complex.
Example 4: Transcriptional analysis of accBF~,-accA1 and accA2 At least four combinations that resulted in active carboxylase complexes have been reconstituted by mixing the ~-subunits PccB (Rodriguez and Gramajo, 1999) or AccB (this work) with either of the two almost identical a-subunits, AccAl or AccA2. In any of these complexes the carboxyl transferase subunit seems to dictate the substrate specificity; thus, PccB seems to recognize only propionyl-CoA, while AccB has a broader substrate specificity, which allows the enzyme to recognize either acetyl- or propionyl-CoA. Moreover, a third.complex with PCCase activity has also being described in S. coelicolor (Bramwell, et al., 1996). These findings show a remarkable overlapping of gene function in Streptomyces species. We followed two different approaches to gain more information on this; one was the generation of mutants and the second the study of the mRNA
levels of some of these four genes throughout the different growth stages by using S1 nuclease protection.
S. coelicolor A3(2) strain M145 was grown in SMM medium and RNA extracted at exponential, transition and stationary phase. S1 nuclease protection of accB was performed by using a 483 by PCR product, uniquely labelled at the 5'end of the downstream oligo. Transcription of accB occurs primarily during active growth (exponential and transition phases), while its level of expression decayed significantly after entering into stationary phase (Fig 3A). The transcripts of the major essential sigma factor hrdB and of the pathway-specific activator gene for acitnorhodin biosynthesis, actII-ORF4, were also studied as positive controls for the RNAs used in these experiments.
As expected from previous results, hrdB was expressed constantly throughout growth (Buttner, M.J., 1990), while actII-ORF4 had a peak of expression during transition phase that shut off in stationary phase (Gramajo, et al., 1993).
The RNA-protected fragments found for accB corresponded to a transcription start site 1 by upstream, or in the adenine, of the most likely translation start site of accB.
Upstream of the transcription initiation site we found a putative -10 and -35 promoter regions with a high consensus sequences of promoters recognised by the vegetative okras (Strohl, 1991) (Fig. 3B).
In order to find out if accB and accE were co-transcribed as a unique bi-cistronic mRNA, a new 563 by probe was obtained by PCR. For this we used a 5'oligo corresponding to a sequence within the coding region of accB and a 3'oligo corresponding to a sequence within accE. The full-length RNA-protected fragment was easily differentiated from the probe-probe re-annealing due to the addition of a 13 nt tail to the 5'oligonucleotide (Experimental Procedures). The results obtained in this experiment clearly showed that accB and accE were part of the same transcript, confirming that these two genes form a single-copy operon (Fig. 3C). Moreover, the expression of accBE
during the different growth phases as detected with this new probe followed the same profile as the expression observed with the probe used for accB.
The levels of accA2 and accA1 mRNA present throughout growth were also studied by S1 protection experiments (Fig.
4). The probe used for accA2 was a 766 by DNA fragment generated by PCR and uniquely labelled on the 5'end of the oligo corresponding to the sequence within accA2. This experiment showed the existence of three mRNA-protected fragments. The growth phase-dependent expression of two of them, accA2p1 and accA2p2, resemble very much that of the accBE operon. Thus, a constant and high level of expression occurs during exponential and transition phase (TP), while the transcription shuts down when the cultures reach.
stationary phase (Fig. 4A).
Considering that the nucleotide sequences of accAl and accA2 are identical from the first two nucleotides upstream of the most probable GTG translation start sites down to the end of the probe (Rodriguez and Gramajo, 1999), it is important to note that a fragment of 185 by of the accA2 probe could also be protected by the accA1 mRNA. Since the lowest RNA-protected fragment observed in Fig. 4A shows a different pattern of expression with respect to accA2p1 and p2, and considering that the size of the band corresponds to a 185 by fragment, we believe that this band might represent the level of expression of accA1 (although we cannot rule out the existence of a third promoter for accA2, regulated in a different manner).
S1 nuclease protection of accAl mRNA was performed by using a 563 by PCR product, uniquely labelled at the 5'end of the downstream oligo, corresponding to a sequence within accAl.
As shown in Fig. 4B, the expression of this gene occurs from at least three different putative promoters, and all of them showed a clear burst of expression during the first hours of the TP, which rapidly shut down during late TP.
This pattern of transcription resembled very much the one observed for the third RNA-protected band found for accA2.
The transcription starts sites for the accA2p1 and p2 were mapped by high resolution S1 mapping (Fig. 5A and B). The transcription start points and the putative -10 and -35 promoter regions of these two promoters are shown in Fig.
5C. A certain degree of homology was found between the -10 consensus sequence of accA2p1 and p2 and the promoters recognised by the vegetative ahras (Strohl, 1992). High resolution S1 mapping of accAl revealed that the transcription start point of the most abundant mRNA species starts 88 by upstream of the GTG initiation codon of AccAl and the putative -10 regions resemble, in some extent, the consensus sequences of promoters recognised by ahr~.
Interestingly, two direct repeat (DR) sequences of 16 bp, containing only two mismatches, were found flanking the putative -35 region of accAlpl and the transcription start point of accAlp2 (Fig. 5C). These DRs could represent DNA
binding sites recognised by a putative regulator. A third putative promoter, accAlp3, was also detected in longer exposures and the most probable nucleotide start sites are also indicated in Fig. 5C.
Example 5: accBE genes are essential in the presence of malonate The presence of MatC and MatB homologues in S. coelicolor suggested that this micro-organism was potentially capable of transporting malonate within the cell through the MatC
transporter, and then activating malonate to malonyl-CoA
with the putative malonyl-CoA synthetase MatB. To test whether S. coelicolor was able to utilize malonate as a sole carbon and energy source, we grew S. coelicolor in a modified SMM medium with no casamino-acids and containing 0.4 % malonate instead of glucose as a sole carbon source.
In this medium S. coelicolor M145 was able to grow, indicating that MatC and MatB could be the proteins involved in the transport and activation of malonate to malonyl-CoA, and suggesting that a decarboxylase that could convert malonyl- into acetyl-CoA should also be present in this bacterium, to allow the use of malonate as a carbon and energy source.
This result encouraged us to test whether this route could also be an alternative pathway to provide malonyl-CoA to the cell. To prove this hypothesis we tried to obtain an acyl-CoA carboxylase minus mutant in the presence of malonate. For this we took spores of strain T124 and grew them in liquid MM containing 0.4 % of malonate instead of glucose. After 36 h of growth we sonicated the mycelia and spread them in SFM medium containing 0.4 % of malonate and incubated until sporulation. Spores were collected and treated in the same way one more time. Finally, spores harvested after the second round of sporulation were diluted out, inoculated in SFM malonate to give aprox. 500 colonies per plate and replica plated in SFM medium with or without Th. After analyzing approximately 5000 isolated colonies, no Ths were obtained. This result indicates that although malonate can be efficiently used as a sole carbon and energy source, the pathway involved in its catabolism can not fulfill the malonyl-CoA requirements of the cell.
Example 6: Construction of a strain with the accBE
operon under the control of a tipA promoter As shown above, the accBE operon, which encodes the carboxyl-transferase and a previously unidentified ~ sub-unit of an acyl-CoA carboxylase, is essential for the viability of S. coelicolor A3(2). In order to regulate the expression of this operon and study its effect on the physiology of this microorganism, we constructed a conditional mutant strain where the expression of the accBE
operon was under the control of the thiostrepton-inducible tipA promoter (Murakami, et al., 1989).
A 947 by fragment containing a modified 5'end of the accB
gene was cloned under the tipA promoter in pIJ8600 (Sun et al (1999) supra) to yield pTR93. After removal of the ~C31 integration components (att and int) present in pTR93 we obtained pTR94, which was transformed into the E. coli strain ET12567/pUZ8002 (MacNeil et al (1992)/Paget et al (1999)). Conjugation of pTR94 into the S. coelicolor strain M145 gave several exconjugants ThR. One of these exconjugants, designated M94, was purified in SFM medium for further analysis. Integration of pTR94 could only take place by Campbell recombination through the accBE
homologous sequences, and this event should leave a complete copy of the accBE operon under the tipA promoter (Fig. 6A). To confirm that this event had occurred in M94, we performed Southern blot experiments of DNA samples prepared from strains M145, M94 and M86. The last strain (M86) was obtained by integration of pIJ8600 in the ~C31 att site of the chromosome and used as the best isogenic control for M94 (Fig. 6A). As shown in Fig. 6B, a Sacl digested DNA from M145 and M86 lights up a unique hybridisation band of 5.94 kb that contains the accBE
operon. DNA from M94, instead, lights up two hybridising bands corresponding to the expected sizes for the integration of pTR94 in the accBE operon (Fig. 6A and B).
Example 7: Acyl-CoA carboxylase levels in M94 and M86 Cultures of the conditional accBE mutant M94 grew normally in YEME medium containing 5 ~g of Th. Interestingly, in the absence of the antibiotic, the cultures were still able to grow, although at much lower rate. This experiment re-confirms the leakiness of the tipA promoter (M. J. Bibb, personal communication). In order to determine the levels of the acyl-CoA carboxylase in conditions of induction or non-induction we carried out the following protocol. YEME
medium containing 10 ~g of Am was inoculated with spores of M94 (or M86) to give and initial OD4so= 0.1. Cultures were grown for 12 h at 30 °C and after that time 5 ~g of Th was added to a half of each culture, keeping the other half as a control. Both flasks were then incubated for additional 24 h at 30 °C. The harvested mycelia were disrupted by sonication and cell debris removed by centrifugation. Cell-free extracts were finally analysed by SDS-PAGE and used for enzyme assays. Fig. 7A shows a 60 kDa protein that is only induced in cultures of M94 grown in the presence of Th; the size of this protein corresponded to the molecular mass of AccB. We were not able to detect an inducible band corresponding to AccE. The levels of the biotinylated components (AccAl or AccA2) of the acyl-CoA carboxylase, in each of the cell-free extracts, were analysed by a modified Western Blotting procedure (Fig. 7B). As shown in this figure the levels of AccAl and/or AccA2 were not modified by presence of Th. However, cell free-extracts of M94 do contain a slightly higher amount of the 65 kDa protein compared to M86.
ACCase and PCCase activities were assayed in cell-free extracts of M94 and M86. The levels of both enzyme activities were similar in cell-free extracts prepared from cultures of M86 grown in the presence or in the absence of Th (Table 3). Cell-free extracts prepared from induced cultures of M94 show instead a remarkable increase in both ACCase (11.5 fold) and PCCase (3.5 fold) activities, compared with the levels found in non-induced cultures of the same strain or in M86. Moreover, if the enzyme levels found in the wild type strain M145 (Rodriguez and Gramajo, 1999) are compared with those found for M94, the increase in ACCase and PCCase levels were still 4- and 2-fold, respectively (Table 3). These results indicate that by overproducing only two (~ and e) of the three sub-units that form the acyl-CoA carboxylase of S. coelicolor we can increase significantly the levels of this enzyme activity.
Example 8: Influence of the acyl-CoA carboxylase levels in the physiological properties of M94 Growth curves were determined for the conditional mutant M94 and for M86 by inoculating a spore suspension in YEME
medium supplemented with 10 ~g of Am, with or without the addition of 5 ~g of Th. For M145, YEME medium without the addition of any antibiotic was used. M94 supplemented with the inducer (Th) showed a growth rate during exponential phase very similar to M145, judged from the slope of the curves. However, the initiation of growth for M94 seems to occur sooner than in M145, reaching the stationary phase earlier than the wild type strain. When the cultures were not supplemented with Th, M94 grew considerably slower, reaching stationary phase several hours latter than in the presence of Th. Also, the final OD reached by M94 in the presence of Th and by M145 were very similar (OD4so= 3) after 60 h of growth. Cultures of M86 grew very slowly compared with M94 and M145, independently of the presence or not of Th. However, these cultures levelled off at the final OD reached by M145 and M94 after 50 h of growth.
Actinorhodin and undecylprodigiosin were also quantitated throughout growth. Table 4 shows that antibiotic production was only detected in cultures of M94 grown in the presence of 1 or 5 ug of Th. No antibiotic production was observed in cultures of M145 or M94 without Th, at least until after 60 h of growth. No antibiotic production was detected in M86.
To determine the effect of Th induction in M86 and M94, 1 ~g of the antibiotic was spotted to a confluent lawn of these strains in R2 and R5 medium supplemented with 10 ~g of Am. A striking stimulatory effect in both sporulation and antibiotic production was observed in M94 after 48 h.
No stimulation of growth or antibiotic production was observed in M86.
Example 9: Co-expression of accA, accB and accE in S.
coelicolor The NdeI-XbaI fragment of pTR154 (Fig. 14) is introduced into pIJ8600 and then transformed into S. coelicolor M145 (Fig. 14). Transformants are selected with apramycin and thiostrepton. Overexpression of the three components accA2, accB and accE results in increased ACCase activity and antibiotic production compared to the wild type M145 strain.
Discussion The use of pccB (Rodriguez and Gramajo, 1999) as an heterologous probe, allowed the successful isolation of a chromosomal DNA fragment containing accB, a gene encoding for a putative new carboxyl transferase of S. coelicolor.
This predicted function was based on the high percentage of identity that AccB showed not only to the S. coelicolor PccB, but to several others biochemical and/or genetically characterized carboxyl transferases reported for actinomycetes, such as the PccB of Sac. erythraea (Donadio, et al., 1996) and to a less extent to the AccD5 of M.
tuberculosis (Cole, et al., 1998) and PccB of M. leprae (Doukhan, 1995). An interesting finding from the analysis of the cloned sequence was the presence of a very small ORF, named accE, immediately downstream of accB.
The successful expression of accB, accE and the BC-BCCP-(biotin carboxylase- and biotin carboxylase carrier protein-)encoding genes accAl and accA2 in E. coli allowed in vitro studies to be performed in order to understand the role of the corresponding encoded proteins as components of a previously uncharacterized acyl-CoA carboxylase. The reconstitution, by mixing cell-free extracts of E. coli containing AccB and AccAl (or AccA2), of an active enzyme with the ability to carboxylate either acetyl- or propionyl-CoA clearly established that AccB was the carboxyl transferase component of an acyl-CoA carboxylase complex. Interestingly, the small polypeptide, AccE, also showed to play an important role in the reconstitution of a fully active enzyme complex (Table 2). It remains to be elucidated whether this protein plays a role as an allosteric regulator of the enzyme or whether it is a structural component of the complex. Thus, our results represent the first characterization, at both the genetic and biochemical levels, of a prokaryotic acyl-CoA
carboxylase.
All the acyl-CoA carboxylases studied so far contain the three functional domains in two individual polypeptides (for a review see Brownsey et al., 1997 ), and none of the purified complexes have shown the presence of a small component equivalent to AccE. Therefore, this might be a distinctive feature for Streptomyces sp. In addition, no AccE homologues have been found in any of the bacteria genomes sequenced so far, an observation that could also support this hypothesis.
Malonyl-CoA is an essential component of all living organisms, since it is the main elongation unit for fatty acid biosynthesis (Brownsey et al., 1997). This primary metabolite is synthesised in most species through the carboxylation of acetyl-CoA by an ACCase (Bloch and Vance, 1977). If this was also the case for S. coelicolor and, if AccB was the component of an essential acyl-CoA
carboxylase, mutation of this gene should be lethal for the micro-organism. Replacement of the wild-type accB for the HygR mutant allele prove to be unsuccessful, and it only occurred when a second copy of the accBE genes was present in the chromosome (Fig. 2B).
These experiments clearly indicated that at least accB was essential for S. coelicolor viability. The fact that both AccA2 (Rodriguez and Gramajo, 1999) and AccB have proved to be essential, along with the fact that acyl-CoA carboxylase reconstituted in vitro with these two sub-units has the ability to recognise either acetyl- or propionyl-CoA as substrates, strongly suggests that AccA2 and AccB are the a and ~ components of an essential acyl-CoA carboxylase, whose main physiological role should be the biosynthesis of malonyl-CoA. The transcriptional levels of accB and accA2 throughout growth (Fig.3A and 4A) also support this interpretation, since both genes are principally transcribed during exponential and transition phase.
Moreover, ACCase and PCCase activities also showed the highest and constant levels of activities during exponential and transition phase while in stationary phase the activities were low but readily measurable.
In S. coelicolor, besides the obvious need for malonyl-CoA
biosynthesis during vegetative growth, there is also a requirement for this metabolite during transition and stationary phase, since at least two secondary metabolites (undecylprodigiosin and actinorhodin) are synthesised during these growth-phases and they both require malonyl-CoA for their biosynthesis. Hence, if the ACCase is the only enzyme that synthesises malonyl-CoA in this bacterium, its presence will be also required during the idiophase.
According to the proposed composition of this enzyme complex and based on the transcriptional studies, we propose that the low level of expression of accA2 and accBE
during stationary phase is sufficient to produce enough of the a and ~ components for an active acyl-CoA carboxylase.
From the observation that accA1 mRNA peaks during transition phase, we propose that enough AccAl might be present in the cytoplasm to compete with AccA2 as the main a sub-unit of this enzyme complex in the stationary phase.
However, no difference in antibiotic production has been found between M145 and the isogenic accA1 mutant MA4 (Rodriguez and Gramajo, 1999).
We have clearly demonstrated the ability of S. coelicolor to efficiently utilize malonate as a sole carbon and energy source. A putative pathway for the utilization of this substrate could involve the R. trifolii MatC and MatB
homologues which are found in the genome of S. coelicolor.
The biochemical characterization of MatB in R. trifolii demonstrated that this protein is a malonyl-CoA
synthetase, which catalyzes the formation of malonyl-CoA
directly from malonate and CoA. MatC, instead, has not been characterized biochemically but computer analysis indicate that it is a transmembrane protein that could function as a dicarboxylate (malonate for example) carrier (An and Kim, 1998). If these enzymes were part of the pathway that allows S. coelicolor to utilize malonate as a sole carbon source, one could also presume that the malonyl-CoA
synthesized by MatB should fulfill the malonyl-CoA
requirements of Ithe micro-organism. However, we could not show that under these conditions the essential acyl-CoA
carboxylase becomes dispensable.
Interestingly, the addition of 0.4% malonate to SFM and glucose-MM media produced a clear stimulation of actinorhodin production (data not shown). From this we propose that higher levels of malonyl-CoA were probably available under this growth conditions. From this, and the observation that even the limited levels of the ACCase activity found during the stationary phase of growth of this bacterium are sufficient to allowed regular levels of antibiotic production, the inventors propose that increasing the expression of the ACCase components will probably lead to an improved production of antibiotics.
Experimental Procedures Bacterial strains, cultures and transformation conditions S. coelicolor A3 (2) strain M145 (SCP1- SCP2-) was manipulated as described by Hopwood et al. (1985). The strain was grown on various agar media - SFM (Rodriguez and Gramajo, 1999), R2 and R5 - or in 50 ml SMM or YEME liquid media (Hopwood et a1 (1985) supra). Escherichia coli strain DHSa (Hanahan 1983) was used for routine subcloning and was transformed according to Sambrook et a1. (1989).
Transformants were selected on media supplemented with the appropiate antibiotics . ampicillin (Ap) 100 ~tg/ml;
apramaycin (Am) 100 ~tg/ml; chloramphenicol (Cm) 25 ~g/ml or kanamycin (Km) 30 ~,g/ml. Strain BL21(DE3) is an E. coli B
strain [F- ompT (r$- m$-) (DE3) ] lysogenized with 1DE3, a prophage that expresses the T7 RNA polymerase downstream of the IPTG-inducible lacUVS promoter (Studier & Moffat, 1986). ET12567/pUZ8002 (MacNeil et al (1992)/Paget et al (1999)) was used for E. coli - S. coelicolor conjugation experiments (Bierman, 1992). For selection of Streptomyces transformants and exconjugants, media were overlayed with thiostrepton (Th) (300 ~g per plate), hygromycin (Hyg) (1 mg per plate) or apramycin (Am) (1 mg per plate). Strains and recombinant plasmids are listed in Table 1.
Growth conditions, protein expression and preparation of cell-free extracts S. coelicolor M145 was grown at 30°C in shake flasks in YEME medium for 24-48 h. When necessary, 10 mg Am ml-1 or 5 mg Th ml-1 were added to the medium. Mycelia were harvested by centrifugation at 5000 x g for 10 min at 4 °C, washed in 100 mM potassium phosphate buffer pH 8 containing 0.1 mM
DTT, 1 mM EDTA, 1 mM PMSF and 10% glycerol (buffer A) and resuspended in 1 ml of the same buffer. The cells were disrupted by sonic treatment (4 or 5 s bursts) using a VibraCell Ultrasonic Processor (Sonics & Materials, Inc.).
Cell debris was removed by centrifugation and the supernatant used as cell-free extract. For the expression of heterologous proteins, E. coli strain BL21(DE3) harbouring the appropriate plasmids were grown at 37°C in shake flasks in LB medium in the presence of 25 ~g Cm ml-1 or 100 ~g Ap ml-1 for plasmid maintenance. For the expression of biotinylated proteins, 10 ~M d-biotin was supplemented to the medium. Overnight cultures were diluted 1:10 in fresh medium and grown to A6oo 0.4-0.5 before the addition of IPTG to a final concentration of 0.1 mM.
Induction was allowed to proceed for 4 h. The cells were then harvested, washed and resuspended in 1 ml buffer A.
Cell-free extracts were prepared as described above.
Protein methods Cell-free extracts were analysed by denaturing (SDS)-PAGE
(Laemmli, 1970) using the Bio Rad mini-gel apparatus. The final acrylamide monomer concentration was 12% (w/v) for the separating gel and 5% for the stacking gel. Coomassie brilliant blue was used to stain protein bands. The biotinylated proteins were detected by a modification of the Western blotting procedure described by Nikolau et a1.
(1985). After electrophoretic separation, proteins were electro-blotted onto nitrocellulose membranes (Bio-Rad) and probed with alkaline phosphatase-streptavidin conjugate (Bio-Rad) diluted 1:10000. Protein content was determined by the method of Bradford (1976) with BSA as standard.
In vitro reconstitution and assay of the acyl-CoA
carboxylase complex In vitro reconstitution of the enzyme complex was carried out by mixing 100 ~g of each of the cell-free extracts shown in Table 2 in a final volume of 300 ~1. When AccE was not included in the incubation mix, 100 ~tg of BSA were added instead. The mixes were incubated for 1 h at 4 °C and 100 ~.g of each used for enzyme assay.
ACCase and PCCase activities in cell-free extracts were measured following the incorporation of H14C03- into acid non-volatile material (Huanaiti & Kolattukudy, 1982;
Bramwell et al., 1996). The reaction mixture contained 100 mM potassium phosphate pH 8.0, 300 ~g BSA, 3 mM ATP, 5 mM
MgCl2, 50 mM NaH14C03 [specific activity 200 ~tCi mmol-1 (740 kBq mmol-1)], 1 mM substrate (acetyl-CoA or propionyl-CoA) and 100 ~g cell-free protein extract in a total reaction volume of 100 ~1. The reaction was initiated by the addition of NaH14C03, allowed to proceed at 30 °C for 15 min and stopped with 200 ~.1 6 M HC1. The contents of the tubes were then evaporated to dryness at 95 °C. The residue was resuspended in 100 ~l water, 1 ml of Optiphase liquid scintillation (Wallac Oy) was added and 14C radioactivity determined in a Beckman scintillation liquid counter. Non-specific C02 fixation by crude extracts was assayed in the absence of substrate. One unit of enzyme activity catalysed the incorporation of 1 ~mol 14C into acid-stable products per min.
DNA manipulations Isolation of chromosomal and plasmid DNA, restriction enzyme digestion and agarose gel electrophoresis were carried out by conventional methods (Sambrook et al., 1989;
Hopwood et al., 1985). Southern analyses were performed by using P-labelled probes made by random oligonucleotide priming (Prime-a-gene kit; Promega).
Gene cloning and plasmid construction The synthetic oligonucleotides TC1, 5'-CAGAATTCAAGCAGCACGCCAA(3GGC AAG ( :3EQ . :ID . N0 . : 1 ) , and TC2 , 5 ' -CAGAATTCGATGCCGTCGTGCTCCTGGTC,(sEQ.ID.NO.:2) were used to amplify an internal fragment of the S. coelicolor pccB gene. The reaction mixture contained 10 mM Tris-HCl pH 8.3, 50 mM
KCl, 1 mM MgCl2 , 6% glycerol, 25 ~M of each of tile four dNTPs, 2.5 U Taq DNApo_Lymerase, 20 pmol of each primer and 50 ng of S. coelicolor chromosomal DNA in a final volume of 100 ~l. Samples were subjected to 30 cycles of denaturation (95°C, 30 s), annealing (65°C, 30 s) and extension (72°C, min) . A 1 kb PCR fragmE=nt was used as a 32P-labelled probe to screen a size-enriched library. A 2.7 kb BamHI fragment containing an incomplete accB gene was cloned in BamHI-cleaved pBluescript SK(+), yielding pTR62.
The synthetic oligonuc.Leotide TC16 (5' TATTCTAGACATATGACCGTTT'rGGATGAGG ( sEQ . ID. No . : 3 ) used to introduce an Ndel site at the translational start codon of the S. coelicolor accB gene ) and TC17 ( 5 ' -.~1CCTCTAGACAACGCTCGTGGACC , ( SEQ . ID . NO . : 4 ) used to introduce an Xbal site in the accB coding sequence) were used to amplify an internal fragment of S. coelicolor accB
gene, having the sequence shown in Fig. 10. The reaction mixture was the same as the one indicated above. Samples were subjected to 30 o:r 35 cycles of: denaturation (95°C, 30 s) , annealing (65°C, 30 s) and extension ('72°C, 1 min) . The 1 kb PCR product was digested with NdeI and XbaI (these sites were introduced in the 5~ ends of the oligos TC16 and TC17 and are shown in bold in Fig. 10) and cloned in XbaI-cleaved pBluescript SK.(+) in E. coli DHSa, yielding pTR82.
This plasmid was digested with BstEII and SacI, ligated with a BstEII-SacI fragment cleaved from pRM08 and introduced by transformation into E. coli DHSa, yielding pTR87.
An NdeI-XbaI fragment :From the plasmid pTR82 was cloned in NdeI-XbaI-cleaved pIJ8f;00 (Sun et al (1999)), yielding pTR93. In order to place the chromosomal copy of accBE
operon under the tipA promoter we removed from pTR93 a HindIII fragment containing the int gene and att of ~C31, yielding pTR94. Plasmid pTR94 was transformed into strain ET12567/pUZ8002 and transferred by conjugation to S.
coelicolor M145 (Hopwoc~d et al (1985)).
A NdeI-SacI fragment f:rom the plasmid pTR87 was cloned in NdeI-Sacl-cleaved pET22b(+) (Novagen) (pTR88), thus placing the accBE operon under the control of the powerful T7 promoter and ribosome-binding sequences. The synthetic 011.goriuCleotldeS NaCCE, 5'-TTATCTAGACATATGTCCCCTGCCGAC, (SEQ.ID.N0.:5) used to introduce an NcleI site at the translational start codon of the S. coelicc~lor accE gene, and CaccE, 5'-ATGAATTCTATGCATCGGGTCAGCGCC.~GCTG,(SEQ.ID.N0.:6) were used to amplify the accE gene of S. coelicc~lor. The reaction mixture was the same as the one indicated above. Samples were subjected to 35 cycles of denaturat:ion (95°C, 30 s), annealing (65°C, 30 s) and extension (72°C, 30 s). The PCR product was cloned using pGEM-T easy vector (Promega) in E. coli DHSa, yielding pTR106. A Ndel-EcoRI fragment from the plasmid pTR106 was cloned in NdeI-EcoRI-cleaved pET22(b) (Novagen) yielding the plasmid p'rR107, thus placing the accE gene under the control of the powerful T7 promoter and ribosome-binding sequences.
Plasmid pIJ8600 was digested with. BglII and EcoRI and the fragment containing or:i~' RK2, on pUClB, attP site, int ~C31 and aac (3) IV (AmR cassette) genes was ligated with a linker containing the following enzymes (Mike Butler personal comunication): BglII, AseI, EcoRI, BglII, NdeI, KpnI, XbaI, PstI, HindIII, BamHI, SstI, NotI and EcoRI, yielding pTR141. A 4.0 kb KpnI fragment containing the complete accBE operon from pRM08 was cloned into KpnI-cleaved pTR141, yieldi:c~g pTR149.
For an efficient over-expression in S. coelicolor of the three components of the acyl-CoA carboxylase complex of this micro-organism, we carried out the construction of pTR156 through the fol=Lowing steps. First we did a PCR.
amplification of the chromosomal accBE operon using the:
O Z 1g0 TC 16 ( 5 ' -TATTCTAGAC:ATATGACCGTTTTGGATGAGG 3 ' ) ( SEQ . ID . NO .
: 3 ) to introduce a NdeI site at the translation start codon c>f accB, and the oligo C--accE (5'ATG AAT TCT ATG CAT CGG GTC
AGC GCC AGC 3')(SEQ.ID.N0.:7) to introduce a Nsil restriction site at the 3' end of accE. The amp:Lified DNA, was then cloned into pGEM-T (Promega), to g_Lve pTR99. To introduce a NsiI site upstream of the RBS of accA2 we amplified this gene using the oligo N-accA2 (5' ATG AAT TCA TGC ATG AGG GAG CCT CAA
TCG 3' ) , (SEQ.ID.NO. :8> , for ~.I-;e 5' end and the oligo C-accA2 (5' AGA
TCT
AGA TCA GTC CTT GAT CTC GC 3')(SEQ.ID.N0.:9) containing a Xbal and a EcoRI site, for the 3' end of the gene. The amplified DNA
was cloned in pGEM-T to give pTR112. The NdeI-NsiI DNA
fragment from pTR99 and the NsiI-EcoRI fragment isolated from pTR112 were final=Ly cloned into pET22(b) (Stratagene), previously digested with NdeI and EcoRI, to yield pTR154.
In order to introduce these genes in S. coelicolor we sub-cloned the NdeI-XbaI fragment, containing aceBE and accA2, from pTR154 to pIJ8600 digested with the same enzymes to give pTRl56. See Fig. 14 for plasmid constructions.
Nucleotide sequencing The sequence of the SphI original fragment was performed from plasmids DNA constructed by subcloning ApaI DNA
fragments from pRM08 into pSKBluescri.be SK(+). Synthetic oligonucleotides were ~zsed to complete the sequence. The nucleotide sequence of the accBE region was determined by dideoxy sequencing (Sanger et al., 1977) using the Promega TaqTrack sequencing kit. and double-stranded DNA templates.
The complete sequence <.~f the 1C2 cosmid, that includes the SphI fragment harbouring accBE, is available from the S.
coelicolor genome sequE~ncing project.
S1 nuclease mapping For each S1 nuclease rE~action, 30 ~g of RNA were hybridized in NaTCA buffer (Murray, 1986); solid NaTCA (Aldrich) was dissolved to 3M in 50mM PIPES (pH 7.0), 5mM EDTA, to about 0.002 pmol (approximatf=_:ly 104 cpm) of the following probes.
For accA2 the synthet:Lc: oligonucleotide 5'-GCTTTGAGGACCTTGGCGATG (SEQ.I:D.NO.:10) (AccA2down), corresponding to the sequence within the coding region of accA2, was uniquely labelled at the 5' end of the oligonucleotide with [32P]-ATP using T4 polynucleotide kinase. The labelled oligo was then used in the PCR :reaction with the unlabelled OllgOnuCleOtlde (aCCA2~Llp) 5'-GAAGTAC.'AGGCCGAAGACCAC (SEQ.ID.N0.:11), which corresponds to a regioxi upstream of the accA2 promoter region, to generate a '766 by probe. For accAl the synthetic oligonucleotide (accAldown) 5'-GCGATTTCGCCACGATTGGCG, (SEQ.ID.NO.: 12), corresponding to the region within the coding region of accAl, was uniquely labelled with [3~P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide.
The accAldown oligo was later used in the PCR reaction with the unlabelled oligonuc;leotide (accAl.up) 5'-CCGATATCAGCCCCTGATGAC,(SEQ.ID..N0.:13), which corresponds to a region upstream of the accAl promoter to generate a 563 by probe.
For accB the synthetic oligonucleotide (accBdown) 5'-CGTCAGCTTGCCCTTGGCGTG,(SEQ.ID.N0.:14), corresponding to the region within the coding region of accB, was uniquely labelled with [32P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide. accBdown waa, then used in the PCR
reaction with the unlabelled oligonucleotide (accBup) 5'-CTACGCTCCGGGTGAGCGAAC,(SEQ.ID.t~0.:15), which corresponds to a region upstream of the accB promoter, to generate a 483 by probe.
For accBE the synthetic oligonucleotide (accBEdown) 5'-GGAGGGCCGTGATGGCGGCGACTTCCTCGGG,(SEQ.ID.N0.:16), corresponding to the region within the coding region of accE was uniquely labelled with [32P]-ATP using T4 polynucleotide kinase at the 5' end of the oligonucleotide. The accBEdown oligo was then used in the PCR reaction with the unlabelled oligonucleotide (accBEup) 5'-GAGGAACTGGTACGCGCGGGCG (GTACAe~.GCAAGCT),(SEQ.ID.N0.:17), which corresponds to a region within the coding region of accB Cbracketed oligonucleotides are a tail added to the probe to differentiate probe re<~nnealing from fully protected DNA-RNA complexes), to genE~rate a 563 by probe. Subsequent steps were as described by Strauch et al. (1991).
Determination oflactinorhodin 1 ml of whole broth wars mixed with 0.5 ml of 3N KOH to give a final concentration of 1N KOH. The solutions were mixed vigorously and centrifuge at 4000 x g for 5 minutes. The supernatant was collected and measured at A64onm.
Actinorhodin concentration was calculated using the molar extinction coefficient (in 1 N KOH) at 640 nm of 25.320 (Bystrykh et al., 1996?.
Determination of undecylprodigiosin This was carried out a~~cording to the procedure of Hobbs et al. (1990) .
References An, J. H. et al. (1998) Eur J Biochem 257: 395-402.
Behal, V. et al. (1977) Phytochemistry 16: 347-350.
Bierman, M. et al. (1992) Gene 116: 43-49.
Block, K. and Vance, D. (1977) Annu Rev Biochem 46: 263-298.
Bradford, M. (1976) Anal Biochem 72:248.
Bramwell, H. et al. (1996) Microbiol 142: 649-655.
Bramwell, H. et al. (1993) Biochem J 293:131-136.
Brownsey, R. W. et al. (1997) Biochem Soc Trans 25: 1232-1238.
Buttner, M.J. et al. (1990) J Bacteriol 172: 3367-3378.
L. V. Bystrykh, M. A. et al. (1996). J Bacteriol. 187:
2238-2344.
Cole, S.T. et al. (1998) Nature 393: 537-544.
Donadio, S. et al. (1996) Mo1 Microbiol 19: 977-984.
Doukhan, L. et al. (1995) Gene 165: 67-70 Erfle, J. D. (1973) Biocim Biophys Acta 316: 143-155.
Gramajo, H.C. et al. (1993) Molecular Microbiology 7: 837-845.
Hanahan (1983) J. Mol. Biol. 166: 557-580.
Harwood, J.L. (1988) Annu Rev Plant Physiol Plant Mol Bio1 39: 101-138.
Hasslacher, M. et al. (1993) J. Biol. Chem. 268: 10946-10952.
Henrikson, K. P. and Allen, S. H. G. (1979) J Bio1 Chem 254:5888-5891.
Hobbs, G. et al. (1990) J Gen Microbiol 136:2291-2296.
Hopwood, D. A. et al. (1985) Genetic manipulation of Streptomyces: A laboratory manual. John Innes Foundation, Norwich.
Hopwood, D.A. and Sherman, D.H. (1990) Ann Rev Genet 24:37-66.
Hunaiti, A. R. and Kolattukudy, P. E. (1982) Arch Biochem Biophys 216:362-371.
Laakel, M. et al. (1994) Microbiol 140: 1451-1456.
Laemmli, U. K. (1970) Nature 227:680-685.
Li, S. J. and Cronan. J. E., Jr. (1993) J Bacteriol 175:
332-340.
MacNeil et al. (1992) Gene 115: 119-125 Murakami, T. et al. (1989) J Bacteriol 171: 1459-1466.
Nikolau, B.J. et al. (1981) Arch Biochem Biophys 211: 605-612.
Paget et al. (1999) Mol. Microbiol. 3: 97-107 Perez, C.A. et al. (1998) Microbiology 144: 895-903.
Pfizer, E.S. et al. (1996) Cancer Res. 56: 2745-7.
Polakis, S. et al. (1972) J Bio1 Chem 247: 1335-1337.
Redenbach, M. etlal. (1996) Mol Microbiol 21: 77-95.
Rodriguez, E. and Gramajo, H. (1999) Microbiology 145:
3109-3119.
Saggerson, E. D. et al. (1992) Adv Enzyme Regul 32: 285-306.
Sambrook, J. et al. (1989) Molecular Cloning: a Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
Sanger, F. et al. (1977) Proc Nat1 Acad Sci USA 74: 5463-5467.
Strauch, E. et al. (1991) Mol Microbiol 5: 289-298.
Strohl, W.R. (1992) Nucleic Acids Res. 20: 961-974.
Sun et al. (1999) Microbiology 145: 2221-2227 Westrich, L. et al. (1999) FEMS Microbiol. Lett. 170: 381-387.
All the above references are hereby incorporated by reference in their entirety, individually and for all purposes.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: Plant Bioscience Limited (B) STREET: Norwich Research Park, Colney Lane (C) CITY: Norwich (D) STATE: Norfolk (E) COUNTRY: UK
(F) POSTAL CODE (ZIP): NR4 7UH
(ii) TITLE OF INVENTION: Ant:.ibiotic Production (II) (iii) NUMBER OF SEQUENCES: 32 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Bereskin & Parr (B) STREET: Suite 4000, Scotia P7.aza, Box 401, 40 King Street West (C) CITY: Toronto (D) STATE: Ontario (E) COUNTRY: Canada (F) POSTAL CODE (ZIP).- M5H 3Y2 (v) COMPUTER READABLE FORM:
(A) COMPUTER: IBM PC' compatible (B) OPERATING SYSTEM: PC-DOS/MS-DOS
(C) SOFTWARE: Patent:In Release #1.0, Version #1.25 (EPO) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,322,105 (B) FILING DATE: 23-OC:T-;?000 (viii) PATENT AGENT INFORMATION:
(A) NAME: David W. R. Langton (B) REGISTRATION N0: :?800 (C) REFERENCE NUMBER: 420-360 (2) INFORMATION FOR SEQ ID N0: :L:
( i ) SEQUENCE CHARACTERISTIc_:S
(A) LENGTH: 29 base p<zi.rs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
(2) INFORMATION FOR SEQ :CD N0: 2:
(i) SEQUENCE CHARACTERISTI~::S:
(A) LENGTH: 29 base pairs (B) TYPE: nucleic a~~id (C) STRANDEDNESS: single (D) TOPOLOGY: .Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TATTCTAGAC ATATGACCGT TTT(3GATGAG C; 31 (2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
(2) INFORMATION FOR SEQ ID NO: 'i:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: si.nc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 5:
TTATCTAGAC ATATGTCCCC TGC:CGAC 27 (2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTIC'.S:
(A) LENGTH: 32 base p<~irs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
ATGAATTCTA TGCATCGGGT CAGCGCCi~GC TG 32 (2) INFORMATION FOR SEQ :CD NO: '7:
( i ) SEQUENCE CHARACTERIS'PIC:>
(A) LENGTH: 30 base pairs (B) TYPE: nucl~=_ic acid (C) STRANDEDNESS: single (D) TOPOLOGY: .Linea:r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
(2) INFORMATION FOR SEQ ID N0: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>EQ ID N0: 8:
ATGAATTCAT GCATGAGGGA GCC'PCAATCC.; 30 (2) INFORMATION FOR SEQ ID N0: ~!:
(i) SEQUENCE CHARACTERISTIC'S:
(A) LENGTH: 26 base p<xirs (B) TYPE: nucleic acid (C) STRANDEDNESS: sinc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>EQ ID NO: 9:
(2) INFORMATION FOR SEQ ID NO: :LO:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE: nucleic aci<L
(C) STRANDEDNESS: sinc;~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
(2) INFORMATION FOR SEQ ID N0: 'L1.:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base p<~irs (B) TYPE: nucleic acid (C) STRANDEDNESS: s:inc~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
(2) INFORMATION FOR SEQ ID NO: 12:
( i ) SEQUENCE CHARACTERIS'rI~::
(A) LENGTH: 21 base pairs (B) TYPE: nucleic aci3 (C) STRANDEDNESS: single (D) TOPOLOGY: Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
(2) INFORMATION FOR SEQ ID NO: 13 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ::EQ ID NO: 13:
(2) INFORMATION FOR SEQ ID N0: 1.4:
(i) SEQUENCE CHARACTERISTIC:'S:
(A) LENGTH: 21 base pair's (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :3EQ ID N0: 14:
(2) INFORMATION FOR SEQ ID NO: :1.5:
(i) SEQUENCE CHARACTERISTI(:~S:
(A) LENGTH: 21 base pe~irs (B) TYPE: nuclE:ic ac:ic:i (C) STRANDEDNESS: si_n~xle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 15:
(2) INFORMATION FOR SEQ ID N0: 16:
( i ) SEQUENCE CHARACTERIS'CI~S
(A) LENGTH: 31 base p<~irs (B) TYPE: nucleic acir.~
(C) STRANDEDNESS: s:ingl.e (D) TOPOLOGY: .Linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGAGGGCCGT GATGGCGGCG ACTTCCT~~GG G 31 (2) INFORMATION FOR SEQ ID N0: 17:
(i) SEQUENCE CHARACTERISTIC:>:
(A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: sin gle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GAGGAACTGG TACGCGCGGG CGGTACAAE'~C AAGCT 35 (2) INFORMATION FOR SEQ ID NO: 18 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: :~EQ ID N0: 18:
AAACGGGCCG GAGACTGTAC GGAGTCGACC; GCTCGCAATC CTTGCTCGGC TTCGTAGAGT 60 CGCTACATGA CCGTTTTGGA TGAGGCGCCC:~ C~GCGAGCCGA CGGACGCGCG CGGGCGGc3TG 120 (2) INFORMATION FOR SEQ ID NO: 1.9:
(i) SEQUENCE CHARACTERISTI<:'S:
(A) LENGTH: 120 base C:>airs (B) TYPE: nucleic acic:3 (C) STRANDEDNESS: douk:Re (D) TOPOLOGY: circul.axv (xi) SEQUENCE DESCRIPTION: >EQ ID NO: .L9:
CACCCGCCCG CGCGCGTCCG TCGGCTCGCce CGGCGCCTCA TCCAAAACGG TCATGTAGCG 60 ACTCTACGAA GCCGAGCAAG GATTGCGAGC_' CGTCGACTCC GTAC'AGTC'rC CGGCCCGTTT 120 (2 ) INFORMATION FOR SEQ 7:D N0: :?0 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: sin<!le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 20:
Met Thr Val Leu Asp Glu Ala Pro Gly Glu Pro Thr Asp Ala Arg Gly Arg Val (2) INFORMATION FOR SEQ :CD N0: ',?1.:
( i ) SEQUENCE CHARACTERIS'PI~~S
(A) LENGTH: 240 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 21:
ACCGGCCGAT ACGTTCTGCG CATGACTGTT CCGAACAGGG CC>GCGTGAAT TCCAACCGTT 180 GGCCGTCGGC GAGCCCCGAT CAGTAATCGA. GTGAGTGAGG A(sAATCT'PCG TGCGCAAGGT 240 (2) INFORMATION FOR SEQ ID NO: 22 (i1 SEQUENCE CHARACTERISTIC."S:
(A) LENGTH: 240 base x>airs (B) TYPE: nucleic acid (C) STRANDEDNESS: dout:~le (D) TOPOLOGY: circular:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CCACAAAGAA ACCGCGTGGC CCGCAGCACG CC'CTTACAGA GACCTTGACC ACACAGGAGG 60 GCTAGGGTTT CCCCCAGGAG TCCTGCGTA<.: CGCGGTACTA CAAGGGCT'PT CGGGGGTCGA 120 GCGAGCCTCG AATCACGCTC CGTGTGGGCh AGCTCACCAT TGGGGACGGG 'PCGAAGTGCC 180 GTGTCGGCAG TCCCTAAACT CGGCTTGTT~I' CAAGGAGGGA GCCTCAATCG TGCGCAAGGT 240 (2) INFORMATION FOR SEQ ID NO: 23 (i) SEQUENCE CHARACTERISTIC=S:
(A) LENGTH: 4 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: sincFle (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ;vEQ ID N0: 23:
Val Arg Val Lys (2) INFORMATION FOR SEQ :
ID NO: '..24 (i) SEQUENCE CHARACTERISTIC~~:
(A) LENGTH: 968 base irs .pa (B) TYPE: nucleic aci:l (C) STRANDEDNESS: doubl e (D) TOPOLOGY: circu.Lar (xi) SEQUENCE DESCRIPTION:Q ID
SE: N0:
24:
TATTCTAGAC ATATGACCGT GCGCCGGGCGAGCCGAC'GGA CGCGCGCGGG60 TTTGGATGA~~
CGGGTGGCCG AGCTGCACGG GCGGC;GCTCGCC.'Gt3GCC'GAG 120 GATCCGTGCA TGAGAAGGCG
ACGGCGGCGC AGCACGCCAA ACGG(~ACGTGAGCGCATCGA GCTGCTCCTG180 GGc3CAAGCTG
GGTCGAGCAG
CTGGAGGCCA AGAAGCCGTA GTCATCACCGGC:TGGGGCAC GGTCGAGGGC300 CACCGACGGT
CCACGAC'rTC
CACGCCACGA AGATCCACAA ATGGC_CATCGCGGCCGC>TGC CCCGCTGGTG420 GA'I'CATGGAC
CGCCCGTATC
CACCAAGGCG
CGCGGCCTAC
GGCCCGGACCi GAGGAGATCACGCAGAACGG TCTGGGCGGC' ACcsCCGAGAC GTCCGGCGTG720 GC'CGAC:GTGC'.
TGCCACTTCGCCTACGACGA CGAGGAGACC:' AGGTCCGCTA CCTCCTCTCC780 TGCCTCGCCG
CTCCTCCCGCAGAACAACCG GGAGAACCCG AG'PCCTCCGA CCCCGTGGAC840 C.'CCCGCGCCG
CGCCGCTCGGACACCCTCCT CGACCTGGTC' GCAA.CCGCCC GTACGACATG900 CCGGCGGACG
(2) INFORMATION
FOR
SEQ
ID
NO:
25:
(i) SEQUENCE
CHARACTERISTIC:.'S:
(A) LENGTH: 590 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: si.nc:~le (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
Val Arg Lys Val Leu Ile Ala Asn Arg Gly Glu Ile Ala Val Arg V<~1 Ala Arg Ala Cys Arg Asp Ala Gly Ile A:La Sex' Val Ala Val Tyr Ala Asp Pro Asp Arg Asp Ala Le~.i His Val Arg Ala Ala Asp Glu Ala Phe Ala Leu Gly Gly Asp Thr Pro Ala Thr Ser Tyr Leu Asp Ile Ala Lys 50 'i5 60 Val Leu Lys Ala Ala Arg (~lu Ser Gly Ala Asp Ala Ile His Pro Gly Tyr Gly Phe Leu Ser Glu Asn Ala Glu Phe A1<~ Gln Ala Val Leu Asp Ala Gly Leu Ile Trp Ile Gly Pro Pro Pro Hi;> Ala Ile Arg Asp Arg Gly Glu Lys Val Ala Ala Arcfi His 11e Ala Gln Arg Ala G.Ly Ala Pro Leu Val Ala Gly Thr Pro Asp Pro Val Ser Gly Ala Asp Glu Val Val Ala Phe Ala Lys Glu His Gl;y Leu Pro Ile A1<~ Ile Lys Ala Ala Phe Gly Gly Gly Gly Arg Gly :Leu Lys Val Ala Ary Thr Leu Glu Glu Val 165 1.70 175 Pro Glu Leu Tyr Asp Ser Ala Val Arg Glu Ala Va.L Ala Ala Phe Gly Arg Gly Glu Cys Phe Val ~Jlu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Thr Gln Cys Leu Ala Asp Thr His Gly Asn Val Val Val Val Ser 210 215 2,20 Thr Arg Asp Cys Ser Leu Gln Arg Arg His Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Ser Glu Ala Gln Thr G1u Gln Leu Tyr Ser Ser Ser Lys Ala Ile Leu L~y:-, Glu Ala Gly 'ryr Gly Gly Ala Gly Thr Val Glu Phe Leu Val Gly Met: Asp Gly Thr Ile Phe Phe Leu Glu Val Asn Thr Arg Leu Gln Val Glu His Pro Val T.hr Glu Glu Val Ala Gly Ile Asp Leu Val Arg Glu stet: Phe Arg Ile Ala Asp Gly Glu Glu Leu 305 310 315 3'<:0 Gly Tyr Asp Asp Pro Ala Lei, Arg Gly His Ser Phe Glu Phe Arg Ile Asn Gly Glu Asp Pro Gly Arcl Gly Phe Leu Pro Ala Pro Gly Thr Val Thr Leu Phe Asp Ala Pro Th::' Gl.y Pro Gly Val. Arg Leu A:~p Ala Gly Val Glu Ser Gly Ser Val Ile Gly Pro Ala Trp Asp Ser Leu Leu Aia 370 =.75 380 Lys Leu Ile Val Thr Gly Arg Thr Arg Ala Glu Ala Leu Gln Arg A.La Ala Arg Ala Leu Asp Glu Phf=_ Thr Val Glu Gly Met Ala Thr Ala I.Le Pro Phe His Arg Thr Val Va.L Arg Asp Pro Ala Phe Ala Pro G1u Leu Thr Gly Ser Thr Asp Pro Phe Thr Val His Thr Arg Trp Ile Glu Thr Glu Phe Val Asn Glu Ile Ly,s Pro Phe Thr Thr Pro Ala Asp Thr Glu Thr Asp Glu Glu Ser Gly Arg G:Lu Thr Val Va:L Val Glu Val Gly Gly Lys Arg Leu Glu Val Ser Leu Pro Ser Ser Leu Gly Met Ser Leu Ala Arg Thr Gly Leu Ala Ala Gl:y Ala Arg Pro Lys Arg Arg Ala Ala Lys Lys Ser Gly Pro Ala Ala Ser Gly Asp Thr Leu Ala Ser Pro Met Gln Gly Thr Ile Val Lys Ile Ala Val C~lu Glu C~ly Gln Glu Val Gln Glu Gly Asp Leu Ile Val Val Leu Glu Ala Met Lys Met Glu Gln Pro Leu Asn Ala His Arg Ser Gly 'rhr Ile Lys Gly Leu Thr Ala Glu Val Gly Ala Ser Leu Thr Ser Gly Ala Ala I.le Cys Glu I.Le Lys Asp (2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTIC:'S.
(A) LENGTH: 590 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: ;.E(~ ID NO: 26:
Val Arg Lys Val Leu Ile Ala ~~sn Arg Gly Glu Ile Ala Val Arg Val Ala Arg Ala Cys Arg Asp Ala Giy Ile A.La Ser Val Ala Val Tyr Ala Asp Pro Asp Arg Asp Ala Leu liis Val Arg Ala Ala Asp Glu Ala Phe Ala Leu Gly Gly Asp Thr Pro ~~:a Thr Ser Tyr Leu Asp Ile Ala Lys Val Leu Lys Ala Ala Arg Gl:~ 3er Gly Ala Asp Ala Ile His Pro Gly Tyr Gly Phe Leu Ser Glu Asn ,Ala Asp Phe Ala Gln Ala Val Leu Asp Ala Gly Leu Ile Trp Ile G1;~ Pr_o Pro Pro His Ala Ile Arg Asp Arg Gly Glu Lys Val Ala Ala Arg l-Lis I1e Ala Gln Arg Ala Gly Ala Pro Leu Val Ala Gly Thr Pro Asp :Pro Val Ser Gly A1<r Asp Glu Val Val 130 13':. 140 Ala Phe Ala Lys Glu His Gly Leu Pro Ile Ala Ile Lys Ala Ala Phe 145 150 1.55 160 Gly Gly Gly Gly Arg Gly Leu C~ys Val Ala Ard Thr Leu Glu Glu Val Pro Glu Leu Tyr Asp Ser Ala 'Val Arg Glu Ala Val Ala Ala Phe Gly 180 1.85 190 Arg Gly Glu Cys Phe Val Glnr.Arg Tyr Leu Asp Lys Pro Arg His Val Glu Thr Gln Cys Leu Ala Asp 'rhr His Gly Asn Val Val Val Val Ser Thr Arg Asp Cys Ser Leu Gln Arg Arg His Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Ser :,1u Ala Gln Thr Glu Gln Leu Tyr Ser Ser Ser Lys Ala Ile Leu Lys Glu Ala Gly Tyr Va.1 Gly Ala Gly Thr Val Glu Phe Leu Val Gly Met Asp Gly Thr I:le Ser Phe Leu Glu Val 275 2.30 285 Asn Thr Arg Leu Gln Val Glu His Pro Val Thr Glu Glu Val Ala Gly 290 29'> 300 Ile Asp Leu Val Arg Glu Met Phe Arg Ile Ala Asp Gly Glu Glu Leu Gly Tyr Asp Asp Pro Ala Leu Arg Gly His Ser Phe Glu Phe Arg Ile Asn Gly Asp His Pro Gly Arg (31y Phe Leu Pro Ala Pro Gly Thr Val Thr Leu Phe Asp Ala Pro Thr Gly Pro Gly Val Arg Leu Asp Ala Gly 355 :3fi0 365 Val Glu Ser Gly Ser Val Ile tziy Pro Ala Trp Asp Ser Leu Leu Ala Lys Leu Ile Val Thr Gly Arc7 'Phr Arg Ala Glu Ala Leu Gln Arg Ala Ala Arg Ala Leu Asp Glu Phe 'Phr Val Glu Gly Met, Ala Thr Ala Ile Pro Phe His Arg Thr Val Val.:~lrg Asp Pro Ala Phe Ala Pro Glu Leu Thr Gly Ser Thr Asp Pro Phe 'Phr Val His Thr Arc) Trp Ile Glu Thr 435 ~140 445 Glu Phe Val Asn Glu Ile Lys 'Pro Phe Thr Thr Pro Ala Asp Thr Glu 450 45'. 46!) Thr Asp Glu Glu Ser Gly Arc)G1u Thr Val Val Val Glu Val Gly Gly Lys Arg Leu Glu Val Ser Lei:WPro Ser Ser Leu Gly Met Ser Leu Ala Arg Thr Gly Leu Ala Ala G1;~,~11a Arg Pro Lys Arg Arg Ala Ala Lys Lys Ser Gly Pro Ala Ala Ser Gly Asp Thr Leu Ala Ser Pro Met Gln 515 !p20 525 Gly Thr Ile Val Lys Ile Ala 'J<~1 Glu Glu Gly Gln Glu Val Gln Glu 530 53'i 54() Gly Asp Leu Ile Val Val Le~.i G.Lu Ala Met Lys Met= Glu Gln Pro Leu Asn Ala His Arg Ser Gly Thr I.Le Lys Gly Leu Thr Ala Glu Val Gly Ala Ser Leu Thr Ser Gly Alai .~la Ile Cys Glu Ile Lys Asp (2) INFORMATION FOR SEQ TD N0: 27:
(i) SEQUENCE CHARACTERISTIi'S':
(A) LENGTH: 1773 base p~sirs (B) TYPE: nucleic acid (C) STRANDEDNESS: doubl,=_ (D) TOPOLOGY: circular (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GTGCGCAAGG TGCTCATCGC CAATCGTGGC'.GAAATCGCTG TCCC>CGTGGC CCGGGCCTGC 60 CGGGACGCCG GGATCGCGAG CGTGGCCGTt:'Pi~CGCGGATC CGGACCGGGA CGCGTTGCAC 120 GTCCGTGCCG CTGATGAGGC GTTCGCCCT(>Gt3TGGTGACA CCCt~CGCGAC CAGCTATCTG 180 TACGGATTCCTCTCGGAGAA TCCTGGAC;GC CGGCCTGATC300 CGCCGAGTTC.' GC;GCAGGCGG
TGGATCGGCCCGCCCCCGCACGCCATCCGC'(iACCGTGGCGAAAAGGTCGC CGCCCGCCAC360 ATCGCCCAGCGGGCCGGCGCCCCCCTGGTC.' CCGACCCCGT CTCCGGCGCG420 GCCGGCACCC
GACGAGGTCGTCGCCTTCGCCAAGGAGCAC GC=CCTGCCCATCGCCATCAA GGCCGCCTTC480 GGCGGCGGCGGGCGCGGCCTCAAGGTCGCC:'(:GCACCCTCGAAGAGGTC;CC GGAGCTGTAC540 GACTCCGCCGTCCGCGAGGCCGTGGCCGCC' 'f7.'CGGCCGCGGGGAGTGC;TT CGTCGAGCGC600 TACCTCGACAAGCCCCGCCACGTGGAGACC:'(:AGTGCCTGGCCGF~CACC.'CA 660 CGGCAACGTG
GTCGTCGTCTCCACCCGCGACTGCTCCCTC C:AGCGCCGCCACCAAAAGCT CGTCGAGGAG720 GCCCCCGCGCCCTTTCTCTCCGAGGCCCAC~~~C:GGAGCAGCTGTACTCATC CTCCAAGGCC;780 ATCCTGAAGGAGGCCGGCTACGGCGGCGCC:'(zC=CACCGTGGAGTTCCTC;GT CGGCATGGAC840 GGCACGATCTTCTTCCTGGAGGTCAACACC" (~CiCCTCCAGGTCGAGCAC;CC GGTCACCGAG900 GAAGTCGCCGGCATCGACTTGGTCCGCGAG i~TGTTCCGCATCGCCGACGG CGAGGAACTC960 GGTTACGACGACCCCGCCCTGCGCGGCCAC"CC:CTTCGAGTTCCG1CATCAA CGGCGAGGAC1020 CCCGGCCGCGGCTTCCTGCCCGCCCCCGGC' ACCGTCACCCTCTTCGACGC GCCCACCGGC1080 CCCGGCGTCCGCCTGGACGCCGGCGTCGAG ',CCCGGCTCCGTCATCGGC;CC CGCCTGGGAC1140 TCCCTCCTCGCCAAACTGATCGTCACCGGC:'(~GCACCCGCGCCGAGGCACT CCAGCGCGCG1200 GCCCGCGCCCTGGACGAGTTCACCGTCGAG t~GCATGGCCACCGC;CATC;CC CTTCCACCGC1260 ACGGTCGTCCGCGACCCGGCCTTCGCCCCC' GAACTCACCGGCTCCACGGA CCCCTTCACC1320 GTCCACACCCGGTGGATCGAGACGGAGTTC' GTCAACGAGATCAAGCCC;TT CACCACGCCC1380 GCCGACACCGAGACGGACGAGGAGTCGGG<:' TCGTCGTC:GA GGTCGGCGGC1440 (:GGGAGACGG
AAGCGCCTGGAAGTCTCCCTCCCCTCCAGC:' CCCi'GGCC;CG CACCGGCCTG1500 t:'l.'GGGCATGT
GCCGCCGGGGCCCGCCCCAAGCGCCGCGCG (3C;CAAGAAGTCCGC~CCCC:GC CGCCTCGGGC1560 GACACCCTCGCCTCCCCGATGCAGGGCACG ~~TCGTCAAGATCGCCGTCGA GGAAGGCCAG1620 GAAGTCCAGGAAGGCGACCTCATCGTCGTA CTCGAGGCGATGAAGATC~GA ACAGCCCCTC1680 AACGCCCACAGGTCCGGCACCATCAAGGGC' CTCACCGCCGAGGTCGGC;GC CTCCCTCACC1740 TCCGGCGCCGCCATCTGCGAGATCAAGGAC:' 1773 'PGA
(2) INFORMATION
FOR
SEQ
ID
NO:
:8:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:1773 base pairs (B) TYPE:cleic aci<a nu (C) STRANDEDNESS:
douk>l~=_ (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: S'EQ ID NO: 28:
GTGCGCAAGG TGCTCATCGC CAATCGTGGt:; GAAATCGCTG TCCC>CGTGGC CCGGGCCTGC 60 CGGGACGCCG GGATCGCGAG CGTGGCCGTC''P~1CGCGGATC CGGACCGGGA CGCGTTGCAC 120 GTCCGTGCCG CTGATGAGGC GTTCGCCCT(.~ GGTGGTGACA CCCCCGCGAC CAGCTATCTG 180 GACATCGCCAAGGTCCTCAA CGGACGCC.'AT CCACCCCGGC240 AGCCGCGCGC:'(;~AGTCGGGCG
TACGGATTCCTCTCGGAGAACGCCGAGTTC." TCCTGGAC:GC CGGCCTGATC300 (3C:GCAGGCGG
TGGATCGGCCCGCCCCCGCACGCCATCCGC'(~ACCGTGGCGAAAAGGTC.'GC CGCCCGCCAC360 ATCGCCCAGCGGGCCGGCGCCCCCCTGGTC (3CCGGCACCCCCGACCCC:GT CTCCGGCGCG420 GACGAGGTCGTCGCCTTCGCCAAGGAGCAC' GGCCTGCCCATCGC:CATC'AA GGCC:GCCTTC480 GGCGGCGGCGGGCGCGGCCTCAAGGTCGC<:'(:GCACCCTCGAAGAGGTGCC GGAGCTGTAC:540 GACTCCGCCGTCCGCGAGGCCGTGGCCGCC' TTCGGCCGCGGGGAGTGC:TT CGTCGAGCGC600 TACCTCGACAAGCCCCGCCACGTGGAGACC: CAGTGCCTGGCCGACACC:CA CGGCAACGTG660 GTCGTCGTCTCCACCCGCGACTGCTCCCTC'(:AGCGCCGCCACCAAAAGCT CGTCGAGGAG720 GCCCCCGCGCCCTTTCTCTCCGAGGCCCACI AC:GGAGCAGCTGTACTCATC CTCCAAGGCC780 ATCCTGAAGGAGGCCGGCTACGGCGGCGCC'..'(~GCACCGTGGAGTTCCTC:GT CGGCATGGAC840 GGCACGATCTTCTTCCTGGAGGTCAACACC (:GCCTCCAGGTCGAGCAC:CC GGTCACCGAG900 GAAGTCGCCGGCATCGACTTGGTCCGCGAG i~TGTTCCGCATCGCCGACGG CGAGGAACTC960 GGTTACGACGACCCCGCCCTGCGCGGCCA(:''PCCTTCGAGTTCCC~CATCAA CGGCGAGGAC1020 CCCGGCCGCGGCTTCCTGCCCGCCCCCGGC:' TCTTCGACGC GCCCACCGGC1080 AC:CGTCACCC
CCCGGCGTCCGCCTGGACGCCGGCGTCGAC~ 'L'C:CGGCTCCGTCATCGGC:CC CGCCTGGGAC1140 TCCCTCCTCGCCAAACTGATCGTCACCGGC (:CiCACCCGCGCCGAGGCACT CCAGCGCGCG1200 GCCCGCGCCCTGGACGAGTTCACCGTCGAC~(3GCATGGCCACCGC:CATCCC CTTCCACCGC1260 ACGGTCGTCCGCGACCCGGCCTTCGCCCC<'(>AACTCACCGGCT(:CACGGA CCCCTTCACC:1320 GTCCACACCCGGTGGATCGAGACGGAGTTC:(sTCAACGAGATCAAGCCC:TT CACCACGCCC1380 GCCGACACCGAGACGGACGAGGAGTCGGGC:.' TCG'7.'CGTC:GA 1440 CGGGAGACGG GGTCGGCGGC
AAGCGCCTGGAAGTCTCCCTCCCCTCCAGC'c:TGGGCATGTCCC'TGGCC:CG CACCGGCCTG1500 GCCGCCGGGGCCCGCCCCAAGCGCCGCGCC:; CCGGCCCC:GC CGCCTCGGGC1560 GC:CAAGAAGT
GACACCCTCGCCTCCCCGATGCAGGGCACC; ATCGTCAAGATCGC:CGTC:GA GGAAGGCCAG1620 GAAGTCCAGGAAGGCGACCTCATCGTCGTI, CTCGAGGCGATGAAGATGGA ACAGCCCCTC1680 AACGCCCACAGGTCCGGCACCATCAAGGG(:: AGG7.'CGGC:GC 1740 t:TCACCGCCG CTCCCTCACC
TCCGGCGCCGCCATCTGCGAGATCAAGGA(:: 1773 'Pt:3A
(2) INFORMATION
FOR
SEQ
ID
N0:
29:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:1584 base pairs (B) TYPE:
nucleic acid (C) STRANDEDNESS:
double (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: ~E',t;~ ID N0: 29:
ATGACCGTTT TGGATGAGGC GCCGGGCGAC> CCGACGGACG CGCGCGGGCG GGTGGCCGAG 60 CTGCACGGGA TCCGTGCAGC GGCGCTCGCC: GGGCCGAGTG AGAAGGCGAC GGCGGCGCAG 120 CACGCCAAGGGCAAGCTGACGGCACGTGAG (;GCATCGAGCTGCTCCTGGA CCCCGGCTCC180 TTCCGCGAGGTCGAGCAGCTGCGCCGGCAC'(~GGGCGACCGGGTTCGGCCT GGAGGCCAAG240 AAGCCGTACACCGACGGTGTCATCACCGGC.' TCGAGGGCCG CACGGTCTTC300 'CGGGGCACGG
GTCTACGCCCACGACTTCCGGATCTTCGGC'(~GCGCGCTGC,GCGAGGCC'.CA CGCCACGAAG360 ATCCACAAGATCATGGACATGGCCATCGCG GC:CGGTGCCCCGCTGGTGTC GCTGAACGAC420 GGCGCCGGCGCCCGTATCCAGGAGGGCGTC.'i~GCGCGCTCGCCGC:GTAC.'GG 480 CGGCATCTTC
CAGCGCAACACCAAGGCGTCCGGCGTCATC'(~CGCAGATCAGCGTGATGCT CGGCCCCTGC540 GCGGGCGGCGCGGCCTACAGCCCCGCCCTC' ACCGACTTCGTCTTCATGGT CCGCGACACC600 TCGCAGATGTTCATCACGGGCCCGGACGTC' (~l'CAAGGCGGTCAC'.CGGC'.GA 660 GGAGATCACG
CAGAACGGTCTGGGCGGCGCCGACGTGCAC (s(:.CGAGACGTCCGC>CGTC~TG CCACTTCGCC720 TACGACGACGAGGAGACCTGCCTCGCCGAO (37.'CCGCTACCTCCTCTCCCT CCTCCCGCAG780 AACAACCGGGAGAACCCGCCCCGCGCCGAi'I'CCTCCGACCCCGTGGAC:CG CCGCTCGGAC840 ACCCTCCTCGACCTGGTCCCGGCGGACGG(:' ACGACATGAC CAAGGTCATC900 AACCGCCCGT
GAGGAACTCGTCGACGAGGGCGAGTACCTi:~(3AGGTCCACGAGCC~TTGGGC CCGCAACATC960 ATCTGCGCGCTGGCCCGTCTCGACGGGCGG GTCGTGGGCATCGTCGCC'.AA CCAGCCGCAG1020 GCCCTGGCCGGTGTCCTGGACATCGAGGCi3 'PC'.GGAGAAGGCGGC:CCGCTT CGTCCAGATG1080 TGCGACGCCTTCAACATCCCGATCATCACT CTTCTGGACGTACC'CGGCTT CCTGCCCGGC1140 GTCGACCAGGAGCACGGCGGGATCATCCG<' CACGGCGCCAAGCTGCTCTA CGCGTACTGC1200 AACGCGACCGTGCCCCGGATCTCGCTGATC' CTGCGCAAGGCGTACGGAGG TGCTTACATC1260 GTCATGGACAGCCAGTCCATCGGCGCCGAC'(:TCACCTACGCCTC~GCCGAC CAACGAGATC1320 GCCGTCATGGGCGCGGAAGGTGC'CGCGAAC.'(~TCATCTTCCGCCGGCAGAT CGCCGACGCC1380 GAGGACCCCGAGGCCATGCGGGCGCGCATC~ G7.'CAAGGAGTACAAGTCCGA GCTGATGCAC1440 CCCTACTACGCGGCCGAACGCGGTCTGGTC'(~ACGACGTCATCGACCCC:GC CGAAACCCGC1500 GAGGTGCTGATCACGTCCCTGGCGATGCTC.' ACGCCGACCT GCCCTCCCGC1560 CACACCAAGC
(2) INFORMATION
FOR
SEQ
ID
N0:
.?0:
(i) EQUENCE
S CHARACTERISTIC'S:
(A) LENGTH:198 base pairs (B) TYPE:cleic acid nu (C) STRANDEDNESS:
doux:>le (D) TOPOLOGY:
circular (xi) SEQUENCE DESCRIPTION: ~Et~ ID NO: 30:
ATGTCCCCTG CCGACATCCG CGTCGAGAA<> GGCCACGCCG AGCCCGAGGA AGTCGCCGCC 60 ATCACGGCCC TCCTCCTGGC CCGCGCCGC(:'GC:CCGCCCCG CCGAGATCGC GCCGACCCAC 120 GGCGGCGGCC GCGCCCGCGC CGGCTGGCGC' CGCCTGGAAC GCGAGCCGGG CTTCCGCGCC 180 (2) INFORMATION FOR SEQ ID NO: ~1:
(i) SEQUENCE CHARACTERISTI(:S:
(A) LENGTH: 527 amino ac:Lds (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: 3.inear (xi) SEQUENCE DESCRIPTION: SE~~ ID NO: 31:
Met Thr Val Leu Asp Glu Ala Pro Gly Glu Pro Thr Asp Ala Arg Gly Arg Val Ala Glu Leu His Gly I.Le Arg Ala Ala Ala Leu Ala Gly Pro Ser Glu Lys Ala Thr Ala Ala Gln His Ala Lys Gly Lys Leu Thr Ala Arg Glu Arg Ile Glu Leu Le~.x Leu Asp Pro Gly Ser Phe Arg Glu Val Glu Gln Leu Arg Arg His Arr3 .Ala Thr Gly Phe Gl.y Leu Glu Ala Lys Lys Pro Tyr Thr Asp Gly Va'~ I.Le Thr Gly Trp Gly Thr Val Glu Gly Arg Thr Val Phe Val Tyr Al<i:H:is Asp Phe Arch Ile Phe Gly Gly Ala Leu Gly Glu Ala His Ala Th:r Lys Ile His Lys Il.e Met Asp Met Ala Ile Ala Al.a Gly Ala Pro Le~.z Val Ser Leu Asn Asp Gly Ala Gly Ala Arg Ile Gln Glu Gly Val Sea- A=La Leu Ala Gly Tyr Gly Gly Ile Phe 145 150 15'> 160 Gln Arg Asn Thr Lys Ala Ser 31y Val Ile Pro Gln Ile Ser Val Met Leu Gly Pro Cys Ala Gly Gly Ala Ala Tyr Sex Pro Ala Leu Thr Asp Phe Val Phe Met Val Arg Asp 'rhr Ser Gln Met Phe Ile Thr Gly Pro Asp Val Val Lys Ala Val Thr G.Ly Glu Glu Ile Thr Gln Asn Gly Leu Gly Gly Ala Asp Val His Ala 31u Thr Ser Gly Va.L Cys His Phe Ala Tyr Asp Asp Glu Glu Thr Cys Leu Ala Glu Va1 Arg Tyr Leu Leu Ser Leu Leu Pro Gln Asn Asn Arg Glu Asn Pro Pro Arg Ala Glu Ser Ser Asp Pro Val Asp Arg Arg Se:r Asp Thr Leu Leu Asp Leu Val Pro Ala Asp Gly Asn Arg Pro Tyr Asp Met Thr Lys Va.L Ile Glu Glu Leu Val Asp Glu Gly Glu Tyr Leu Glu 'J<~l His Glu Arg Trp Ala Arg Asn Ile Ile Cys Ala Leu Ala Arg Le~.i.~sp Gly Arg Val Val Gly Ile Val Ala Asn Gln Pro Gln Ala Leu Ala G.Ly Val Leu Asp Ile Glu A1a Ser Glu Lys Ala Ala Arg Phe Val Gln Met Cys Asp Al<~ Phe Asn Ile Pro Ile Ile Thr Leu Leu Asp Val Pro ~3.Ly Phe Leu Pro Gly Val Asp Gln Glu His Gly Gly Ile Ile Arg Hi:~ Gly Ala Lys Leu Leu Tyr Ala Tyr Cys 385 390 39'_i 400 Asn Ala Thr Val Pro Arg Ile Ser Leu Ile Leu Arg Lys Ala Tyr Gly Gly Ala Tyr Ile Val Met Asp Ser Gln Ser Ile Gly Ala Asp Leu Thr Tyr Ala Trp Pro Thr Asn Glu I.Le Ala Val Met Gly Ala Glu Gly Ala Ala Asn Val Ile Phe Arg Arc) G.Ln Ile Ala Asp Ala Glu Asp Pro Glu Ala Met Arg Ala Arg Met Va.I. Lys Glu Tyr Ly:~ Ser Glu Leu Met His Pro Tyr Tyr Ala Ala Glu Arg G.Ly Leu Val Asp As,p Val Ile Asp Pro Ala Glu Thr Arg Glu Val Leu I.Le Thr Ser Leu Ala Met Leu His Thr Lys His Ala Asp Leu Pro Se:r.Arg Lys His Gly Asn Pro Pro Gln (2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino <acids (B) TYPE: amino acid (C) STRANDEDNESS: sin:)1.2 (D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: :>E?~ ID N0: 32:
Met Ser Pro Ala Asp Ile Arch Val Glu Lys G'ly Hi.s Ala Glu Pro Glu Glu Val Ala Ala Ile Thr Ala Leu Leu Leu A1<~ Arg Ala Ala Ala Arg Pro Ala Glu Ile Ala Pro Th:v His Gly Gly Gly Arg Ala Arg Ala Gly Trp Arg Arg Leu Glu Arg Glu Pro Gly Phe Arg Ala Pro His Ser Trp Arg
Claims (31)
1. A nucleic acid comprising a nucleic acid sequence which encodes AccB polypeptide, or a nucleic acid sequence complementary thereto, wherein said AccB polypeptide:
(a) has the amino acid sequence set out in Fig. 12A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 12B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties cf the polypeptide having the amino acid sequence of (a).
(a) has the amino acid sequence set out in Fig. 12A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 12B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties cf the polypeptide having the amino acid sequence of (a).
2. The nucleic acid of claim 1, further comprising a nucleic acid sequence which encodes AccE polypeptide, or a nucleic acid sequence complementary thereto, wherein said AccE
polypeptide:
(a) has the amino acid sequence set out in Fig. 13A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 13B; or (b) has an amine acid sequence which at least about 80% identical with the amino acid sequence of (a), and has any one or pore of the biological properties of the polypeptide having the amino acid sequence of (a).
polypeptide:
(a) has the amino acid sequence set out in Fig. 13A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 13B; or (b) has an amine acid sequence which at least about 80% identical with the amino acid sequence of (a), and has any one or pore of the biological properties of the polypeptide having the amino acid sequence of (a).
3. The nucleic acid of claim 1 or claim 2, further comprising a nucleic acid sequence which encodes AccA1 polypeptide, or a nucleic acid sequence complementary thereto, wherein said AccA1 polypetide:
(a) has the amino acid sequence set out in Fig. 11A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 11B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties of the polypeptide having the amino acid sequence of (a).
(a) has the amino acid sequence set out in Fig. 11A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 11B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties of the polypeptide having the amino acid sequence of (a).
4. The nucleic acid of any preceding claim, further comprising a nucleic acid sequence which encodes AccA2 polypeptide, or a nucleic acid sequence complementary thereto, wherein said AccA2 polypeptide:
(a) has the amino acid sequence set out in Fig. 11A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 11B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties of the polypeptide having the amino acid sequence of (a).
(a) has the amino acid sequence set out in Fig. 11A
and/or the amino acid sequence encoded by the nucleic acid sequence set out in Fig. 11B; or (b) has an amino acid sequence which is at least about 80% identical with the amino acid sequence of (a), and has any one or more of the biological properties of the polypeptide having the amino acid sequence of (a).
5. The nucleic acid of any preceding claim, wherein the level of identity is at least about 95%.
6. The nucleic acid of any preceding claim, wherein the level of identity is at least about 90%.
7. the nucleic acid of any preceding claim, wherein:
(a) the nucleic acid sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 12B; and/or (b) the nucleic acid sequence which encodes the AccE
polypeptide is at least about 80% identical with the nucleic acid of Fig. 13B; and/or (c) the nucleic and sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 11B; and/or (d) the nucleic acid sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 11B.
(a) the nucleic acid sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 12B; and/or (b) the nucleic acid sequence which encodes the AccE
polypeptide is at least about 80% identical with the nucleic acid of Fig. 13B; and/or (c) the nucleic and sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 11B; and/or (d) the nucleic acid sequence which encodes the AccB
polypeptide is at least about 80% identical with the nucleic acid of Fig. 11B.
8. The nucleic acid cf whim 7, wherein the level of identity is at least about 90%.
9. The nucleic acid of claim 8, wherein the level of identity is at least abort 95%.
10. The nucleic acid of claim 9, wherein the level of identity is at least about 99%.
11. The nucleic acid of any preceding claim wherein said nucleic acid sequence which encodes AccB polypeptide is in operative association with a regulatory sequence for constitutive or inducible expression of said AccB polypeptide in Streptomyces species.
12. The nucleic acid cf claim 11 wherein each said nucleic acid sequence which encodes AccB, AccE, AccA1 and/or AccA2 polypeptide is in operative association with a regulatory sequence for constitutive or inducible expression of said AccB
polypeptide in Streptomyces species.
polypeptide in Streptomyces species.
13. The nucleic acid of claim 11 or claim 12 wherein said regulatory sequence comprises the tipA inducible promoter, the ermE constitutive promoter, or the ermE* constitutive promoter.
14. A vector comprising the nucleic acid sequence of any one of claims 11 to 13, whereby said vector is capable, after incorporation into a Streptomyces species, of causing or increasing expression of AccB polypeptide as defined in claim 1, and optionally also AccE and/or AccA1 and/or AccA2 polypeptide as defined in claims 2 to 4, respectively.
15. A cell of a Streptomyces species, into which cell the vector of claim 14 has been introduced.
16. A method of producing a polyketide, the method comprising:
providing a cell of a polyketide-producing strain of a Streptomyces species into which cell the vector of claim 14 has been introduced;
culturing said cell under conditions suitable for polyketide synthesis; and extracting said polyketide from the cell culture medium.
providing a cell of a polyketide-producing strain of a Streptomyces species into which cell the vector of claim 14 has been introduced;
culturing said cell under conditions suitable for polyketide synthesis; and extracting said polyketide from the cell culture medium.
17. A method of modifying a polyketide-producing strain of a Streptomyces species to increase production of said polyketide, the method comprising modifying said strain to express, or to increase expression of, nucleic acid according to any one of claims 1 to 13.
18. A modified strain of a Streptomyces species, produced according to the method of claim 17.
19. A method of producing a polyketide, the method comprising:
providing a cell of the modified Streptomyces strain of claim 18;
culturing said cell under conditions suitable for polyketide synthesis; and extracting said polyketide from the cell culture medium.
providing a cell of the modified Streptomyces strain of claim 18;
culturing said cell under conditions suitable for polyketide synthesis; and extracting said polyketide from the cell culture medium.
20. A method of increasing acetyl-CoA carboxylase (ACCase) activity in a strain of a Streptomyces species, the method comprising modifying said strain to express, or to increase expression of, nucleic ac.id according to any one of claims 1 to 13.
21. A method of increasing production of a polyketide in a Streptomyces species, the method comprising culturing cells of said species in the presence of exogenous malonate.
22. The method of claim 21, wherein the malonate is present at a concentration of at least about 0.1%.
23. The method of claim 22, wherein the malonate is present at a concentration of at least about 0.5%.
24. The method of claim 23, wherein the malonate is present at a concentration of at least about 1%.
25. The method of any one of claims 16, 19 or 21-24, further comprising the step of purifying said polyketide.
26. The method of claim 25, further comprising the step of formulating said polyketide as a pharmaceutical.
27. The method or strain, of any one of claims 16 to 19 or 21-26 wherein said polyketide is an antibiotic.
28. The vector, cell, method, or strain of any one of claims 14 to 27, wherein said Streptomyces species is selected from the group consisting of S. coelicolor, S. violaceoruber, S.
lividans and S. parvulus.
lividans and S. parvulus.
29. The vector, cell, method, or strain of claim 28, wherein said Streptomyces species is of a strain selected from the group consisting of ATCC: 12434, ATCC 19832, S. coelicolor A3(2) and S. lividans 66.
30. The vector, cell, method, or strain of claim 29, wherein said species is S. coelicolor.
31. The vector, cell, method, or strain of claim 30, wherein said strain is S. coelicolor A3(2).
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA002322105A CA2322105A1 (en) | 2000-10-23 | 2000-10-23 | Antibiotic production (ii) |
| PCT/GB2001/004709 WO2002034784A2 (en) | 2000-10-23 | 2001-10-23 | An acyl coenzyme a carboxylase from streptomyces |
| EP01976514A EP1328551A2 (en) | 2000-10-23 | 2001-10-23 | An acyl coenzyme a carboxylase from streptomyces |
| AU2001295782A AU2001295782A1 (en) | 2000-10-23 | 2001-10-23 | An acyl coenzyme a carboxylase from streptomyces |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA002322105A CA2322105A1 (en) | 2000-10-23 | 2000-10-23 | Antibiotic production (ii) |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA2322105A1 true CA2322105A1 (en) | 2002-04-23 |
Family
ID=4167299
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002322105A Abandoned CA2322105A1 (en) | 2000-10-23 | 2000-10-23 | Antibiotic production (ii) |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP1328551A2 (en) |
| AU (1) | AU2001295782A1 (en) |
| CA (1) | CA2322105A1 (en) |
| WO (1) | WO2002034784A2 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112410274A (en) * | 2019-08-23 | 2021-02-26 | 上海医药工业研究院 | A kind of genetically engineered bacteria for producing ascomycin and its preparation method and use |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU2010298004B2 (en) | 2009-09-27 | 2016-02-25 | Opx Biotechnologies, Inc. | Method for producing 3-hydroxypropionic acid and other products |
| US20130122541A1 (en) * | 2010-01-27 | 2013-05-16 | The Regents Of The University Of Colorado | Microorganism production of high-value chemical products, and related compositions, methods and systems |
| KR20150040359A (en) | 2012-08-10 | 2015-04-14 | 오피엑스 바이오테크놀로지스, 인크. | Microorganisms and methods for the production of fatty acids and fatty acid derived products |
| CA2905602A1 (en) | 2013-03-15 | 2014-09-18 | Sarah M. Hoyt | Flash evaporation for product purification and recovery |
| US20150119601A1 (en) | 2013-03-15 | 2015-04-30 | Opx Biotechnologies, Inc. | Monofunctional mcr + 3-hp dehydrogenase |
| US11408013B2 (en) | 2013-07-19 | 2022-08-09 | Cargill, Incorporated | Microorganisms and methods for the production of fatty acids and fatty acid derived products |
| JP6603658B2 (en) | 2013-07-19 | 2019-11-06 | カーギル インコーポレイテッド | Microorganisms and methods for the production of fatty acids and fatty acid derivatives |
| EP2993228B1 (en) | 2014-09-02 | 2019-10-09 | Cargill, Incorporated | Production of fatty acid esters |
| EP3577227A4 (en) | 2017-02-02 | 2020-12-30 | Cargill Inc. | GENETICALLY MODIFIED CELLS THAT PRODUCE C6-C10 Fatty Acid Derivatives |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5164305A (en) * | 1990-01-18 | 1992-11-17 | Cetus Oncology Corporation | Streptomyces promoter and method of use thereof |
| US5639949A (en) * | 1990-08-20 | 1997-06-17 | Ciba-Geigy Corporation | Genes for the synthesis of antipathogenic substances |
| US5622866A (en) * | 1994-06-23 | 1997-04-22 | Merck & Co., Inc. | Expression cassettes useful in construction of integrative and replicative expression vectors for Streptomyces |
-
2000
- 2000-10-23 CA CA002322105A patent/CA2322105A1/en not_active Abandoned
-
2001
- 2001-10-23 AU AU2001295782A patent/AU2001295782A1/en not_active Abandoned
- 2001-10-23 EP EP01976514A patent/EP1328551A2/en not_active Withdrawn
- 2001-10-23 WO PCT/GB2001/004709 patent/WO2002034784A2/en not_active Ceased
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112410274A (en) * | 2019-08-23 | 2021-02-26 | 上海医药工业研究院 | A kind of genetically engineered bacteria for producing ascomycin and its preparation method and use |
| CN112410274B (en) * | 2019-08-23 | 2023-01-24 | 上海医药工业研究院 | Genetic engineering bacterium for producing ascomycin and preparation method and application thereof |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2002034784A3 (en) | 2003-01-16 |
| AU2001295782A1 (en) | 2002-05-06 |
| WO2002034784A2 (en) | 2002-05-02 |
| EP1328551A2 (en) | 2003-07-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP2342335B1 (en) | Novel gene cluster | |
| US20080131943A1 (en) | Polynucleotides for production of farnesyl dibenzodiazepinones | |
| Hayashi et al. | Cloning of the gene cluster responsible for the biosynthesis of brasilicardin A, a unique diterpenoid | |
| EP2356211A1 (en) | Improved production of riboflavin | |
| CN106687577A (en) | Microorganisms and methods for producing vanillin | |
| CA2322105A1 (en) | Antibiotic production (ii) | |
| CN101363022B (en) | Biological synthesis gene cluster of tetrokacin A and use thereof | |
| US11858967B2 (en) | Compositions and methods for enhanced production of enduracidin in a genetically engineered strain of streptomyces fungicidicus | |
| Fu et al. | Biosynthesis of 3-hydroxy-5-methyl-O-methyltyrosine in the saframycin/safracin biosynthetic pathway | |
| Zhang et al. | Cloning and characterization of a gene (msdA) encoding methylmalonic acid semialdehyde dehydrogenase from Streptomyces coelicolor | |
| PL181916B1 (en) | Method of obtaining natural avermectin | |
| Hijarrubia et al. | Characterization of the lys2 gene of Acremonium chrysogenum encoding a functional α-aminoadipate activating and reducing enzyme | |
| JP2005512504A (en) | Reinamycin biosynthetic gene cluster and its components and their use | |
| KR20230121065A (en) | Amicolatopsis strain with inhibited vanillic acid formation for vanillin production | |
| KR102159415B1 (en) | Uk-2 biosynthetic genes and method for improving uk-2 productivity using the same | |
| WO2002068613A1 (en) | Biosynthesis of polyketide synthase substrates | |
| CA2391131C (en) | Genes and proteins for rosaramicin biosynthesis | |
| AU9421998A (en) | Genes encoding branched-chain alpha-ketoacid dehydrogenase complex from streptomyces avermitilis | |
| CN102517309A (en) | Manumycin-class antibiotic biosynthetic gene cluster | |
| US20030157652A1 (en) | Antibiotic production (II) | |
| KR20020064788A (en) | Transformant producing secondary metabolite modified with functional group and novel biosynthesis genes | |
| CN101153274A (en) | Method for increasing production of rifamycin | |
| JP5641232B2 (en) | Ogonori-derived cyclooxygenase gene and method for producing prostaglandins using the gene | |
| US20020192767A1 (en) | Biosynthesis of polyketide synthase substrates | |
| WO2001064943A1 (en) | Method of screening substance specifically inhibiting non-mevalonate pathway |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FZDE | Dead |